- 👋 I’m Charles-Meldhine Madi Mnemoi. I am a Data Scientist in Co-op by day and a full-stack developper for eMush by night.
- 🛠️ Skills
- proficient in data analysis (Pandas, Matplotlib, Seaborn, Plotly) and machine learning (Scikit-learn, PyTorch) with Python and SQL ;
- familiar with DevOps/MLOps (Docker, CI/CD with GitHub Actions, GitLab CI and Docker Swarm, unit testing with pytest), API development (FastAPI), GCP cloud (Big Query, Cloud Run, Vertex AI) and agile development methods (Scrum, Kanban) ;
- acculturated to Large Language Models (LLM) and Retrieval-Augmented Generation (RAG).
- 📫 Reach me by mail or Linkedin
Below are some projects I've worked on.
A chatbot web application which can answer question about eMush with Retrieval-Augmented Generation (RAG) from curated documents.
cmnemoi-learn
is a Python package which reimplements machine learning algorithms from scratch (using only numpy
) with high quality development practices :
- unit testing with
pytest
- code quality checking with
black
,pylint
andmypy
- CI/CD pipeline with GitHub Actions to version and publish the package automatically to PyPI
Stack : PHP 8.3 (Symfony 6.4, PHPUnit, Codeception), Vue.js 3, PostgreSQL, GitLab, Docker, GitLab CI
eMush is an open source remake of Mush: the greatest space opera epic of Humanity, directly on your browser!
I am a full-stack developer for the project since July 2022.
KPIs :
- 1500+ users (100+ daily)
- contribution to 100 000+ lines of code
Missions :
- feature development, bugfixes and testing
- enhancement of CI pipelines
- implementing good practices (TDD, BDD, Clean Architecture)
- participation in discussions on project direction and features to be developed
- writing monthly news and patchnotes
- animating alpha tests
I've done the projects below when I was starting in Data Science and software engineering, they deserve a reboot now...
Data Science project of Lille's Bachelor of Economics, which consists of participating in the Kaggle competition New York City Taxi Fare Prediction.
- Developed a web application that estimates the price of a ride within a $1.4 range
- Cleaned and analyzed a dataset with 340,000+ rows to remove outliers and noise from data with normalization
- Created new variables based on ride duration and destinations
- Built the web application using Streamlit
- Quality "CI" pipeline with git hooks and Github Actions (lint with Ruff, test with Pytest)