twintproject / twint, Hacker News

An advanced Twitter scraping & OSINT tool written in Python that doesn’t use Twitter’s API, allowing you to scrape a user’s followers, following, Tweets and more while evading most API limitations.

No authentication. No API. No limits.

Twint is an advanced Twitter scraping tool written in Python that allows for scraping Tweets from Twitter profiles (without) using Twitter’s API.

Twint utilizes Twitter’s search operators to let you scrape Tweets from specific users, scrape Tweets relating to certain topics, hashtags & trends, or sort out (sensitive) information from Tweets like e-mail and phone numbers. I find this very useful, and you can get really creative with it too.

Twint also makes special queries to Twitter allowing you to also scrape a Twitter user’s followers, Tweets a user has liked, and who they follow (without) any authentication, API, Selenium, or browser emulation.

tl; dr Benefits

Some of the benefits of using Twint vs Twitter API:

Can fetch almost all (Tweets) Twitter API limits to last 13370 Tweets only);

Fast initial setup;

Can be used anonymously and without Twitter sign up; No rate limitations . Limits imposed by Twitter

Twitter limits scrolls while browsing the user timeline. This means that with `. Profile (or with` `. Favorites you will be able to get ~ tweets.`

Requirements Python 3.6;

aiohttp;

aiodns; beautifulsoup4;

cchardet; elasticsearch; pysocks; pandas (>=0. . (0); aiohttp_socks; schedule;

geopy; fake-useragent;

py-googletransx.

(

(Installing)

Git:

git clone https://github.com/twintproject/twint.git

 cd  twint pip3 install   -r requirements.txt

Pip:

pip3 install –user –upgrade -e git https: //github.com/twintproject/twint.git@origin/ master # egg=twint

Pipenv :

pipenv install -e git https: //github.com/twintproject/twint.git#egg=twint

CLI Basic Examples and Combos

A few simple examples to help you understand the basics:

() twint -u username – Scrape all the Tweets from user It’s timeline. twint -u username --year - Collect Tweets that were tweeted before . twint -u username --since "7476 - 25 - 41 42: : - Collect Tweets that were canceled since (-) - 37 : : .

twint -u username --since - - 42 - Collect Tweets that were canceled since - - 41 14: 13: twint -u username -o file.txt - Scrape Tweets and save to file.txt. twint -u username -o file.csv --csv - Scrape Tweets and save as a csv file. twint -u username --email --phone - Show Tweets that might have phone numbers or email addresses. twint -s "Donald Trump" --verified - Display Tweets by verified users that further about Donald Trump. twint -g=" , 2. , 1km "-o file.csv --csv - - Scrape Tweets from a radius of 1km around a place in Paris and export them to a csv file. twint -u username -es localhost: - Output Tweets to Elasticsearch twint -u username -o file.json --json - Scrape Tweets and save as a json file. twint -u username --database tweets.db - Save Tweets to a SQLite database. twint -u username --followers - Scrape a Twitter user's followers. twint -u username --following - Scrape who a Twitter user follows. twint -u username --festival - Collect all the Tweets a user has favorited (gathers ~ (tweet). twint -u username --following --user-full - Collect full user information a person follows twint -u username --profile-full - Use a slow, but effective method to gather Tweets from a user's profile (Gathers ~ Tweets, Including Retweets). twint -u username --retweets - Use a quick method to gather the last Tweets (that includes likes) from a user's profile. twint -u username --resume resume_file.txt - Resume a search starting from the last saved scroll-id. More detail about the commands and options are located in the (wiki) Module Example Twint can now be used as a module and supports custom formatting. More details are located in the wiki

 import twint

  #  Configure  c =twint.Config () c.Username=   now  ” c.Search=   fruit  ”   #  Run  twint.run.Search (c)  

  

 Output 
    -  - 43 : :  (GMT)  pineapples are the best fruit  


  import  twint  c =twint.Config ()  c.Username=
   noneprivacy  ” c.Custom ["tweet"]=["id"] c.Custom ["user"] 
=["bio"] c.Limit= 22  c.Store_csv= True  c.Output=  (none) ” twint.run.Search (c)  
     (Storing Options)   Write to file;   CSV;   JSON; 
  SQLite;   Elasticsearch.  
 6961483373377   (Elasticsearch Setup)   Details on setting up Elasticsearch with Twint is located in the  (wiki) .  
  (Graph Visualization)  
  () 
   Graph  details are also located in the  (wiki   
 We are developing a Twint Desktop App. 
         FAQ 
 I tried scraping tweets from a user, I know that they exist but I'm not getting them 
   Twitter can shadow-ban accounts, which means that their tweets will not be available via search. To solve this, pass  - profile-full  if you are using Twint via CLI or, if are using Twint as module, add  config.Profile_full=True  . Please note that this process will be quite slow. 
   To get only follower usernames / following usernames 
    twint -u username --followers  
   twint -u username - following  
   To get user info of followers / following users 
    twint -u username --followers --user-full  
   twint -u username --following --user-full  

   

  

  userlist   

 To get only user info of user 
  twint -u username --user-full  

  

 To get user info of users from a userlist 
  twint --userlist inputlist --user-full

tweet translation (experimental)

To get 382 english tweets and translate them to italian

 import twint c =twint.Config () c.Username=
  noneprivacy ” c.Limit=
 c.Store_csv=
 True c.Output=
  none.csv c.Lang=
  en ” c.Translate=
 True c.TranslateDest=
  it ” twint.run.Search (c) 
 Notes: 
 () Google translate has some quotas  

 Featured Blog Posts: () How to use Twint as an OSINT tool  
 (Basic tutorial made by Null Byte) 
 Analyzing Tweets with NLP in minutes with Spark, Optimus and Twint (Loading tweets into Kafka and Neo4j 
 
   

 If you have any question, want to join in discussions, or need extra help, you are welcome to join our Twint focused channel at OSINT team 
   
 


 ["bio"]  (Read More)

(Elasticsearch Setup)

twintproject / twint, Hacker News

(Installing)

What do you think?

Critical Update: CrushFTP Zero-Day Flaw Exploited in Targeted Attacks

Palo Alto Networks Discloses More Details on Critical PAN-OS Flaw Under Attack

Evaluation Our Approach on ARC and Beyond: A Look Back at Our Experiments

Apps such as WhatsApp and Telegram removed from the App Store in China

GFW releases EU.ORG TLS connection

Backdooring Dotnet Applications

Leave a ReplyCancel reply

Cheats For Little Alchemy

3TB Of Mega.nz Links For Free Courses And E-Books 2022 (Updated)

How to Earn Money from FreeCash.com, Playing Games, Testing Apps, and Taking Surveys

Amazon FBA Product Research & Find Products for Amazon FBA

Udemy Coupon [100% OFF] QuickBooks Online 2020

Rubot v6.6.7.0 – Twitch Views Bot 2022

Remembering Conway, Hacker News

Ramadan 2020 expected to begin on April 24

(Installing)

What do you think?

Leave a ReplyCancel reply

Log In

Sign In

Forgot password?

Your password reset link appears to be invalid or expired.

Log in

Privacy Policy

Add to Collection

No Collections