NCAA March Madness - Results from all NCAA Men’s and Women’s Regular Seasons and Tournaments from 1985-2019 for men and 1998-2019 for women in a Postgres database.

brendagutman/March_Madness

March 2021 Data Visualization Challenge: March Madness

For this month's challenge, you have been given a database containing information about all regular season and NCAA Tournament games going back to 1985 for men's teams and 1998 for women's teams. This data is split out into a number of tables (see the included ERD).

For detailed information on the variables contained in each table, see https://www.kaggle.com/c/ncaaw-march-mania-2021/data and https://www.kaggle.com/c/ncaam-march-mania-2021/data. You will need to sign up for a Kaggle account to view this information.

Each team is identified by its teamid value. Warning: the men's and women's teams for a given school have different teamid values. For example, Belmont University's men's team has teamid 1125, whereas the women's team has teamid 3125.

Data on each game is included in the *regularseasoncompactresults and *ncaatourneycompactresults tables, where * is m for the men's tables and w for the women's tables. In each row, the two teams are identified as the winning team (wteamid) and the losing team (lteamid).

For example, the 2008 women's NCAA championship game appears in the table wncaatourneycompactresults as

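| season | daynum | wteamid | wscore | lteamid | lscore | wloc | numot |
|--------|--------|---------|--------|---------|--------|------|-------|
| 2008   | 155    | 3397    | 64     | 3390    | 48     | N    | 0     |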

The winning team (teamid 3397) was Tennessee, with a score of 64, and the losing team (teamid 3390) was Stanford, with a score of 48. The wloc value of N indicates that the game was played on a neutral court.
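Because the results tables store only teamid values, you will usually want to join them to a teams table to get school names. The sketch below shows one way to do that. It uses an in-memory SQLite database as a stand-in for the Postgres database so that it runs anywhere, and it assumes a lookup table named wteams with teamid and teamname columns (check the ERD for the actual table and column names). The only data loaded is the championship row shown above.

```python
import sqlite3

# In-memory SQLite stands in for the Postgres database (assumption for this sketch).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical lookup table; the real name and columns may differ (see the ERD).
    CREATE TABLE wteams (teamid INTEGER PRIMARY KEY, teamname TEXT);
    CREATE TABLE wncaatourneycompactresults (
        season INTEGER, daynum INTEGER,
        wteamid INTEGER, wscore INTEGER,
        lteamid INTEGER, lscore INTEGER,
        wloc TEXT, numot INTEGER
    );
    INSERT INTO wteams VALUES (3397, 'Tennessee'), (3390, 'Stanford');
    INSERT INTO wncaatourneycompactresults
        VALUES (2008, 155, 3397, 64, 3390, 48, 'N', 0);
""")

# Join the teams table twice: once for the winner, once for the loser.
query = """
    SELECT r.season,
           w.teamname AS winner, r.wscore,
           l.teamname AS loser,  r.lscore,
           r.wscore - r.lscore AS margin
    FROM wncaatourneycompactresults AS r
    JOIN wteams AS w ON w.teamid = r.wteamid
    JOIN wteams AS l ON l.teamid = r.lteamid
    WHERE r.season = 2008 AND r.daynum = 155
"""
rows = conn.execute(query).fetchall()
print(rows)  # [(2008, 'Tennessee', 64, 'Stanford', 48, 16)]
```

The SELECT statement itself should carry over to Postgres largely unchanged; only the connection code would differ.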

If you want to include each team's seed, you will need to pull it from the *ncaatourneyseeds tables. Each seed value combines a region letter with a seed number. For example, in 2019 the Tennessee men's team's seed was "Z02", meaning that they were the number 2 seed in the "Z" region (the South region). This appears in the mncaatourneyseeds table as

| season | seed | teamid |
|--------|------|--------|
| 2019   | Z02  | 1397   |

Play-in teams have an a or b after their seed number. For example, in 2014 the Tennessee men's team was a play-in team, and their seed is listed as "Y11b", meaning that they were an 11 seed that first had to play a play-in game.

| season | seed | teamid |
|--------|------|--------|
| 2014   | Y11b | 1397   |
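To compare seeds numerically (for example, to find upsets), you will need to split the seed string into its parts. Here is a minimal sketch of one way to do that in Python. The parse_seed helper and the Seed type are hypothetical names introduced for illustration, and the pattern assumes every seed is one uppercase region letter, two digits, and an optional a or b, as in the two examples above.

```python
import re
from typing import NamedTuple, Optional


class Seed(NamedTuple):
    region: str              # region letter, e.g. "Z"
    number: int              # numeric seed, e.g. 2
    play_in: Optional[str]   # "a" or "b" for play-in teams, otherwise None


# Assumed format: one uppercase letter, two digits, optional "a"/"b" suffix.
_SEED_RE = re.compile(r"^([A-Z])(\d{2})([ab]?)$")


def parse_seed(seed: str) -> Seed:
    """Split a seed string such as 'Z02' or 'Y11b' into its components."""
    match = _SEED_RE.match(seed.strip())
    if match is None:
        raise ValueError(f"unrecognized seed format: {seed!r}")
    region, number, suffix = match.groups()
    return Seed(region, int(number), suffix or None)


print(parse_seed("Z02"))   # Seed(region='Z', number=2, play_in=None)
print(parse_seed("Y11b"))  # Seed(region='Y', number=11, play_in='b')
```

Raising an error on unexpected input, rather than guessing, makes it easier to notice any seed values that do not follow the assumed pattern.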

Recommendation: Do not try to create a visualization showing the whole history of NCAA basketball. Instead, decide on a single specific question that you would like to investigate and see what you can find in the data related to that question.
