This will hone my data engineering skills while exploring some of my greatest passions in life - the NFL and advanced metrics.
This is still a work in progress, more coming soon...
- DBeaver or another universal database manager of your choice
- A local PostgreSQL installation (or another SQL database of your choice, SQLite is another good option)
- Python
- SQLAlchemy Python package for SQL connection to PostgreSQL database, Pandas for data manipulation, PyQt and Matplotlib for vizualization
- You will need to load the CSVs into SQL tables
- You will then need to establish a connection to your local PostgreSQL database, using a connection string. An example can be found at the top of the data_ingestion.py file
All data currently in the database contributed from the CSVs found from this GitHub profile: nflverse. This repository contains the source code responsible for scraping data from multiple sources