GitHub - eloiup/belugadb: case study data scientist

belugadb case study

This is a solution for the data scientist case study from BelugaDB. The exercise details can be found on https://belugadb.github.io/challenges/data-science-challenge

The analysis, feature engineering and model creation and evaluation were done through the use of jupyter notebooks.

The proper order to check the solution should be:

EDA.ipynb
FE.ipynb
Model.ipynb

Main conclusions were that data features seem relevant and should be further explored. Given the nature of the business problem presented, it would be relevant to know if any of the features are the equipment/team ids. That would allow for new feature engineering, such as time from last maintenance, avg true labels by equipment/team so far, and so on.

Model used was a lightgbm. Parameter tunning was done manually, but alternatives were proposed. No attempt of enseemble/stacking was done.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
notebooks		notebooks
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

belugadb case study

About

Releases

Packages

Languages

eloiup/belugadb

Folders and files

Latest commit

History

Repository files navigation

belugadb case study

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages