List of Data Sources
The following datasets are used throughout this textbook and are available in the data/ directory on GitHub.
Baseball Data
- fences.csv - Major League Baseball stadium fence heights
- outfields.csv - Stadium outfield dimensions
- team_idx.csv - Team name mapping
- xruns_table.csv - Expected runs matrix
Crime and Urban Data
- lacrime.csv - Los Angeles crime incidents
- lacrime_yoy_changes.csv - LA crime year-over-year changes
- california_crime_2012_2013.csv - California crime statistics
- california_cities_2013.csv - California city demographics
- trees.csv - NYC street tree census
Transportation and Weather
- rides_weather.csv - Citi Bike trips with weather data
- weather.csv - Historical weather data
- knaflic-ticket-volume.csv - Ticket volume dashboard data
Other Datasets
- nobel.csv - Nobel Prize laureates
- bea_gdp.csv - US GDP by state
- nato.csv - NATO member states
- probly.csv - Probability examples
- numberly.csv - Number examples