The project classifies the given real time dataset of Mushroom as poisonous or edible . It uses python , MongoDB , AWS and Apache Airflow to accomplish this.
Documentation :
1. High Level Document (HLD) : https://drive.google.com/file/d/14prkMF4hC6p4roQruxpTSPbAWUBLGc4z/view?usp=share_link
2. Low Level Document (LLD) : https://drive.google.com/file/d/1pRCyxh_-6gzE9r5YEVX0sQSPL1UExD6L/view?usp=share_link
3. Architecture : https://drive.google.com/file/d/1vCMnj9Il1AFOQ5eimC1mL5PuGzMRZ-iN/view?usp=sharing
4. Wireframe Document : https://drive.google.com/file/d/1i5KBpkuqi0qn2fUL6OsR1zuRvkQMgYhb/view?usp=sharing
5. Detailed Project Report (DPR) : https://docs.google.com/presentation/d/1CNH4xVWLbv2C_wn37MA2JpHcfHBB5pAv/edit?usp=share_link&ouid=114650827502086972412&rtpof=true&sd=true
ABSTRACT : Mushrooms have been consumed since earliest history. The word Mushroom is derived from the French word for Fungi and Mold. Now-a-days, Mushroom are popular valuable food because they are low in calories, carbohydrate, Fat, sodium and also cholesterol free. Besides this, Mushroom provides important nutrients, including selenium, potassium, riboflavin, niacin, Vitamin D, proteins and fiber. All together with a long history as food source. Mushroom are important for their healing capacity and properties in traditional medicine. It has reported beneficial effects for health and treatment of some disease. Many nutraceutical properties are described in Mushroom like cancer and antitumor attributes. Mushroom act as antibacterial, immune system enhancer and cholesterol lowering Agent. Additionally, they are important source of bio-active compounds. This work is a machine learning model that classifies mushrooms into 2 classes: Poisonous and Edible depending on the features of the mushroom. During this machine learning implementation, we are going to see which features are important to predict whether a mushroom is poisonous or edible.
Problem Statement : The Audubon Society Field Guide to North American Mushrooms contains descriptions of hypothetical samples corresponding to 23 species of gilled mushrooms in the Agaricus and Lepiota Family Mushroom (1981). Each species is labelled as either definitely edible, definitely poisonous, or maybe edible but not recommended. This last category was merged with the toxic category. The Guide asserts unequivocally that there is no simple rule for judging a mushroom's edibility, such as "leaflets three, leave it be" for Poisonous Oak and Ivy.
The main goal is to predict which mushroom is poisonous & which is edible.
Tools Used :
Architecture :
Apache Airflow Interface :
Conclusion :
- The classification system is successfull able to classify the real-time data wit an accuracy of ~100%.
- The 'poisonous' mushrooms do not have Population Type as Numerous and Abundant according to our data.
- The 'poisonous' mushrooms do not have Habitat Type as Waste according to our data.
- The XGBoost Classifier model has 100% accuracy on both training data and test data.