Embark on the fascinating journey of the Book Genre Classifier, a project immersing you in the intricate world of categorizing books. This adventure goes beyond conventional methods to ensure precise predictions of book genres through various approaches.
Fine-Tuning with Huggingface Transformer LLM: In the first phase of our adventure, experience the unveiling of the nuanced art of predicting book genres. We fine-tune a Huggingface Transformer Language Model (LLM), incorporating cutting-edge technology for enhanced accuracy.
PEFT/LoRA Technique Unveiled: Delve into an additional fine-tuning technique—the PEFT/LoRA method. Discover its potential in genre classification, adding sophistication to our model and broadening the scope of precision.
Scikit-Learn Classification Odyssey: Join us on an engaging exploration of scikit-learn's classification saga. Immerse yourself in the world of Support Vector Classifier (SVC), Naive Bayes (NB), and Random Forest (RF) classifiers. Highlighting the distinct strengths of each model, this journey showcases the blend of classical and modern approaches.
Flask App for Seamless Interaction: Beyond technical complexities, experience the user-friendly side of the Book Genre Classifier. Our Flask app provides a smooth interface for users to interact with the classification system, making the exploration of genres an enjoyable and intuitive process.
Docker Deployment for Efficiency: Witness the efficiency of our deployment through Docker. We encapsulate the Book Genre Classifier into a Docker container, ensuring portability and easy replication across different environments.
Docker Image on the Horizon: Explore the encapsulated magic with our Docker image, capturing the essence of precision in book genre classification. This image represents the culmination of our efforts, ready to be shared, tested, and deployed.
Scaling Up with AWS EC2: For those aiming for a broader horizon, we guide you through deploying the Book Genre Classifier on AWS EC2. Scale up the reach of your classifier, making it accessible to a wider audience.
-
Transformer LLM Fine-Tuning: Delve into the sophistication as we fine-tune a Huggingface Transformer Language Model (LLM) to unveil the nuanced art of book genre prediction. Find detailed insights in the notebook file Book_Genre_Prediction.ipynb.
-
PEFT/LoRA Technique: Explore an additional fine-tuning technique, the PEFT/LoRA method, shedding light on its potential in genre classification. Refer to Book_Genre_Prediction.ipynb for a comprehensive understanding.
-
Scikit-Learn Classification Odyssey: Immerse yourself in a captivating exploration as we leverage scikit-learn's Support Vector Classifier (SVC), Naive Bayes (NB), and Random Forest (RF) classifiers, highlighting the unique strengths of each model. Find detailed insights in the notebook file Book_Genre_Prediction.ipynb.
It's challenging to determine a book's genre based on its title and summary, but it's essential for several reasons:
-
Enhanced Recommendations: Elevate book recommendations with a classifier that comprehends genres at a nuanced level, offering users personalized and accurate suggestions.
-
Insights into Literary Trends: Uncover hidden patterns in literary success by exploring the relationship between book genres and their portrayal in summaries.
-
Recommendation Systems: A robust genre classifier assists recommendation systems in providing readers with more personalized book recommendations, leading to increased reader engagement on digital platforms.
-
Literary Success: Understand the correlation between genres and their portrayal in summaries to gain insights into what makes books commercially successful or where improvements can be made.
-
Book Themes and Trends: Study book genres to better understand the prevailing themes and trends in the publishing industry, providing valuable insights for cultural and societal studies.
Clone the Book Genre Classifier repository and explore the precision of diverse approaches to revolutionize book genre classification.
git clone https://github.com/adinmg/book_genre_classifier.git
Ensure a seamless setup by installing the required packages listed in the requirements.txt
file.
pip install -r requirements.txt
-
Navigate to the project directory:
cd book_genre_classifier
-
Run the Flask app:
python app.py
-
Access the Book Genre Classifier at
http://127.0.0.1:5000/
in your web browser to interact with the classification system.
-
Ensure Docker is installed on your machine.
-
Build the Docker image:
docker build -t book_classifier-app .
-
Run the Docker container:
docker run -p 5000:5000 book_classifier-app
-
Access the Book Genre Classifier at
http://127.0.0.1:5000/
in your web browser to experience the precision of diverse approaches in book genre classification.
-
To test the Book Genre Classifier using the image from Docker Hub, execute the following command:
docker pull adinmg/book_classifier-app:v1
-
Run the Docker Container:
docker run -p 5000:5000 adinmg/book_classifier-app:v1
This command assumes that the application inside the container exposes port 5000 and maps it to the same port on your host machine.
- Accessing the Classifier:
After the container is running, you might access the Book Genre Classifier by opening your web browser and navigating to http://127.0.0.1:5000/
.
-
Create an AWS EC2 Instance:
- Log in to your AWS Management Console.
- Navigate to EC2 and launch a new instance.
- Choose an appropriate Amazon Machine Image (AMI) and configure instance details.
-
Configure Security Groups:
- Set up security groups to allow inbound traffic on the necessary ports (e.g., 5000 for Flask).
-
SSH into the EC2 Instance:
- Use the provided key pair to SSH into your EC2 instance.
-
Clone the Repository:
- Clone the Book Genre Classifier repository into your EC2 instance:
git clone https://github.com/adinmg/book_genre_classifier.git
-
Install Dependencies:
- Install the required packages:
pip install -r requirements.txt
Unicorn is already in the requirements.
-
Test the App in Gunicorn:
- Run the app using Gunicorn:
gunicorn -b 0.0.0.0:8000 app:app
(Make sure you are in the app folder)
-
Configure Gunicorn Service:
- Create a Gunicorn service configuration file:
sudo nano /etc/systemd/system/myapp.service
"myapp".service:
[Unit] Description=gunicorn instance for a simple flask app After=network.target [Service] User=ubuntu Group=www-data WorkingDirectory=/home/ubuntu/book_genre_classifier ExecStart=/home/ubuntu/myenv/bin/gunicorn -b localhost:8000 app:app Restart=always [Install] WantedBy=multi-user.target
-
Restart Daemon:
- Reload the systemd daemon to apply the changes:
sudo systemctl daemon-reload
-
Start the Gunicorn Service:
- Start the Gunicorn service:
sudo systemctl start myapp
-
Enable the Gunicorn Service:
- Enable the Gunicorn service to start on boot:
sudo systemctl enable myapp
-
Install Nginx:
- Install the Nginx web server:
sudo apt install nginx
-
Start Nginx:
- Start the Nginx service:
sudo systemctl start nginx
-
Enable Nginx:
- Enable Nginx to start on boot:
sudo systemctl enable nginx
-
Edit Nginx Default File:
- Open the default Nginx configuration file for editing:
sudo nano /etc/nginx/sites-available/default
-
Modify Default File:
- Add the following upstream block and modify the location block:
# Default server configuration (add) upstream myapp { server 127.0.0.1:8000; } # modify location location / { ... proxy_pass http://myapp; }
-
Restart Nginx:
- Restart the Nginx service to apply the changes:
sudo systemctl restart nginx
-
Debug NGINX if necessary:
- Check the Nginx error log for debugging:
sudo tail /var/log/nginx/error.log
Now, your Book Genre Classifier is deployed on AWS EC2 with Gunicorn and Nginx for efficient and reliable web service.