Skip to content

Latest commit

 

History

History
18 lines (13 loc) · 1 KB

README.md

File metadata and controls

18 lines (13 loc) · 1 KB

NFL Advanced Statistics Data Pipeline Project

This will hone my data engineering skills while exploring some of my greatest passions in life - the NFL and advanced metrics.

This is still a work in progress, more coming soon...

Anticipated Requirements

  • DBeaver or another universal database manager of your choice
  • A local PostgreSQL installation (or another SQL database of your choice, SQLite is another good option)
  • Python
  • SQLAlchemy Python package for SQL connection to PostgreSQL database, Pandas for data manipulation, PyQt and Matplotlib for vizualization

Run Instructions

  • You will need to load the CSVs into SQL tables
  • You will then need to establish a connection to your local PostgreSQL database, using a connection string. An example can be found at the top of the data_ingestion.py file

Citations

All data currently in the database contributed from the CSVs found from this GitHub profile: nflverse. This repository contains the source code responsible for scraping data from multiple sources