Web-Scraping-Demo

This is just a demo of "Web Scraping" with Python.

Usage

python3 scrapper.py visual - To scrape visually with selenium
python3 scrapper.py headless - To scrape in shell/commandline only with requests

This script with will scrape https://blog.scrapinghub.com/ recursively.
First the script will scrape home page for every blog "Post Title", "Post Date", "Post Author", "Post Link".
After successfully scrape the first/home page it will check if there is any second page, if any it will go further and do the same thing.
It will keep dooing the same thing until it reaches the last page of the blog.
After collecting all data it will store those data locally in a CSV file called result.csv.
Then it will create a directory named data/ under the current directory.
Then it will take all the links for the blog posts from result.csv file and scrape for the actual "Blog Article" and save it
in data/ directory with a name "ACTUAL-POST-TITLE".txt

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
poc.png		poc.png
scrapper.py		scrapper.py
scrapperMajor.py		scrapperMajor.py
scrapperMinor.py		scrapperMinor.py
visualMajor.py		visualMajor.py
visualMinor.py		visualMinor.py