Skip to content

Latest commit

 

History

History
31 lines (17 loc) · 2.61 KB

README.md

File metadata and controls

31 lines (17 loc) · 2.61 KB

HashKeyTwitterBot

This twitter bot uses selenium web driver to scrape twitter timelines and obtain account, tweet, date, and engagement data. This is facilitated by ChromeDriver, a web interaction tool that enables automated interaction of the chrome browser.

  1. Clone this GitHub repo.

  2. Install Dependencies:

    1. Install python at https://www.python.org/downloads/ if not installed already.

    2. Run "pip install selenium". You may need to install using "pip2" or "pip3" depending on your python version.

    3. Run "pip install python-dotenv".

    4. Install ChromeDriver at https://sites.google.com/chromium.org/driver. Install the webdriver that is compatible with your version of chrome. You can check your chrome version by going to the three dots at the top right corner of your browser Help --> About Google Chrome.

  3. Access the ChromeDriver file and move chromedriver to your IDE.

  4. Make a new Twitter account to serve as your bot. It is preferable to make a new email for this, otherwise using a spare email/phone number is fine.

  5. Follow relevant accounts through twitter. An inital list was provided that are considered "high signal". High Signal Accounts: @ljxie, @tongnk, @kankanivishal, @zaki, @gavofyork, @cburniske, @delitzer, @wyatt_benmo, @divine_economy, @dynamo_patri

  6. Insert your username and password credentials into the .env file in your IDE.

  7. Run the program. You should see a new chrome window open. If this fails, you may have been signed into twitter already. In this case, sign out, kill the program, and rerun. If you were already signed out and it still fails, you may have used the wrong chromedriver version.

  8. Once the program has finished running, you should see a csv file that has been downloaded to your system. Running the program may take a long time depending on how many tweets the program is scraping. You can adjust the amount of tweets you want per csv by modifying the 'if' statement in scrape.py, but it is currently set to 30 tweets. The csv file does not change name after the program is completed so make sure to rename the file before rerunning.

WARNING: This bot has not been thoroughly tested for avoiding twitter's newly improved bot detection algorithms. Only primitive avoidance measures are used. Excessive use of the bot may result in IP blocking by Twitter, which may prevent your ability to use the bot or even access Twitter on your device. Use at your own discretion.

NOTE: Twitter's login pages, bot detection algorithms, and rate limits have all been changing recently. As a result, this bot may become incompatible/lose functionality once new changes are implemented.