Twayback: Downloading deleted Tweets from the Wayback Machine, made easy

Last update: Dec 27, 2022

Overview

Twayback: Downloading deleted Tweets from the Wayback Machine, made easy

Finding and downloading deleted Tweets takes a lot of time. Thankfully, with this tool, it becomes a piece of cake! 🎂

Twayback is a portmanteau of Twitter and the Wayback Machine. Enter your desired Twitter username, and let Twayback do the rest!

Requirements

Python 3
- Download
waybackpack
- Install: pip install waybackpack or pip3 install waybackpack
- Link to repo

Features

Can download some or all of a user's archived deleted Tweets.
Allows custom time range to narrow search for deleted Tweets archived between two dates.
Differentiates between accounts that are active, suspended, or don't/no longer exist.
Lets you know if a target handle's archived Tweets have been excluded from the Wayback Machine.

Usage

twayback -u USERNAME [OPTIONS]
Example: twayback -u jack

-u, --username        Specify target user's Twitter handle
-from, --fromdate     Narrow search for deleted Tweets *archived* on and after this date
                      (can be combined with -to)
                      (format YYMMDD)
-to, --todate         Narrow search for deleted Tweets *archived* on and before this date
                      (can be combined with -from)
                      (format YYMMDD)

Installation

For Windows only

Download the latest EXE file.
Launch Command Prompt in the EXE file's directory.
Run the command twayback -u USERNAME (Replace USERNAME with your target handle).

For Windows, Linux, and macOS

Download the latest ZIP file.
Extract ZIP file to a directory of your choice.
Open terminal in that directory.
Run the command pip install -r requirements.txt.
Run the command twayback -u USERNAME (Replace USERNAME with your target handle).

For more information, check out the Usage section above.

Things to keep in mind

Quality of the HTML files depends on how the Wayback Machine saved them. Some are better than others.
This tool is best for text. You might have some luck with photos. You cannot download videos.
By definition, if an account is suspended or no longer exists, all their Tweets would be considered deleted.
Custom date range is not about when Tweets were made, but rather when they were archived. For example, a Tweet from 2011 may have been archived today.

Future plans

GUI. This is a biggie. I don't know shit about Python, let alone GUI. But I'm hoping I can design one using Tkinter Designer. But I don't know how I can link actions to buttons and shit like that, that stuff is super foreign to me, so any help is appreciated, it would mean so much.

Plenty of thanks to jsvine for his amazing work on waybackpack. Without it, this tool cannot work nearly as well.

I hope you enjoy my little script. Please use it for good. Whatever you are, be a good one.

Comments

AttributeError: 'NoneType' object has no attribute 'getText'
I got two errors:

The first error occured after grabbing links from wayback machine UnboundLocalError: local variable 'wayback_id' referenced before assignment

Second, the error occured when I typed 'text' or 'both' AttributeError: 'NoneType' object has no attribute 'getText'

please help, thanks.
bug good first issue
opened by adrn-mm 9
Error in the process

Hello,

Thanks for the app which helps save a lot of time. I have tried with several users and in various ways and it does not work. I pass the screenshots of the errors.

Thank you very much.

opened by barripdmx 8
Twayback Partial Re-Write

Hello,

We recently came across your project, and thought we could contribute by re-writing some parts of the code. An effort was made to keep the logic and structure of the code the same.

By our metrics, we have achieved a speed-up of around 20-30% for accounts with over 1000 tweets. We don't have any actual tests written for the script however, so we have attached it here for review.

A substantial change was moving from Selenium to Playwright. Users need to run 'playwright install' to install the playwright browsers before running the script.

Another smaller, but notable change is that due to the nature of the as_completed function, it is not possible to show any indicator of progress during the gathering of statuses, and the order in which the tweets are returned is scrambled due to the script processing website information based on what web request finishes first.

If the style & format of the re-write is acceptable, and the changes proposed are not considered to be critical, we will proceed with creating a pull request.

twayback2.txt

opened by AccentuSoft 5
Parsing fails when encountering text in cyrillic

Hi, when I try to search for deleted tweets from an account that uses cyrillic, the process fails with the following exception:

Parsing text...: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████| 4/4 [00:07<00:00, 2.00s/it] Traceback (most recent call last): File "twayback.py", line 184, in File "encodings\cp1250.py", line 19, in encode UnicodeEncodeError: 'charmap' codec can't encode character '\u0430' in position 0: character maps to [20476] Failed to execute script 'twayback' due to unhandled exception!

Anything that could be done on my end? Thanks,

opened by Traut89now 5
KeyError: 'closest' when parsing accounts with larger number of tweets.

Hi,

after the latest update, the tool seems to be working perfectly when scraping accounts with lower number of tweets archived, but when I tried accounts with 1000+ tweets archived, the process failed with: Traceback (most recent call last): File "C:\15tway\twayback B\twayback.py", line 112, in wayback_url = (jsonResponse['archived_snapshots']['closest']['url']) KeyError: 'closest'

as also seen below on the screenshots.

opened by Traut89now 4
No deleted tweets have been found

Hi! Does anyone else, after they pull one search of an user, get an "No deleted tweets have been found" when they try to look for another one? I even try with the same one I did first and it pulls the same thing. I even close and open again the terminal but nothing, got the same issue. Was anyone run into this?

opened by sofiemmc 3
Cleaned up code a bit

No changes to functionality. I went through and added some more comments, renamed variables to be more descriptive and combined some redundancy with list creation

opened by humandecoded 2
Feature proxy

to avoid rate limiting for large groups of tweets added the ability to use proxies with our GET requests. User will need to provide a list of proxy URLs for the script to randomly pull from

opened by humandecoded 1
Twitter Rate Limiting

It looks like twitter is throwing up 429 (too many requests) after the first 900 or so hits. According to their API documentation they limit to around 900 hits per 15 minutes. Although we are not using the API, it seems they are putting similar limiting in place per IP.

I've begun working on a branch that lets users plug in a list of proxies the script will rotate through to avoid 429. This does make the tool less approachable but at this point it's not working on large groups of tweets. Will keep everyone updated
enhancement

opened by humandecoded 1
Adding more Detailed information
I am submitting this PR so that newbie cyber people & OSINT Analysts with no python experience can better understand installation and usage.

There is no information about using the command git clone in order to clone the repository

No information in regards to changing directory | cd

No information in regards to the command pip3 install -r requirements.txt

No information in regards to the command python3 twayback.py -u username (if we put ourselves in the shoes of anyone without any python knowledge, they will just write in their terminal twayback.py which won't be of any use, it's typically something I would have done in the past as when reading installation & usage instructions I would just copy paste without trying to understand. 🤦

Hope this helps a little, awesome tool & thanks for putting it out there !
opened by C3n7ral051nt4g3ncy 1

UPDATE requirements.txt

Module aiohttp is required

Traceback (most recent call last):
  File "/home/gdhindii/twayback/twayback.py", line 18, in <module>
    from aiohttp import ClientSession, TCPConnector
ModuleNotFoundError: No module named 'aiohttp'

opened by sam5epi0l 1

Proxy File List

I am not sure this is an issue or just lack of knowledge from my part, I rather suspect it is the latter, however, I am attempting to recover tweets from an account that has a couple of thousand deleted tweets. I eventually encounter a rate limit error as is to be expected. I saw the way to handle requests like these is to use a proxy file. Now I have a .txt file that I added inside the twayback folder with a list of proxies and formatted according to the guidelines:

url:port url:port url:port

and so on. I keep getting error for each proxy server so Im not sure if Im doing this right and what the issue could be. It will stay stuck at 0%, switch between proxies until it eventually just gives up with a "" error.

My request looks like this:

python3 twayback.py --proxy-file proxyfiles.txt -u USERNAME

Am I doing this correctly?

opened by DirkGaston 2
it doesn't work anymore

once I launch the app it closes itself, it was working fine before the last update I tried downloading a pervious version and the same thing happened I tried it on 2 devices and it didn't work on any of them

opened by unaufhaltbar 4

Releases(10-16-22)

10-16-22(Oct 16, 2022)

Broke the program out in to separate files for more modularity and easier understanding. Added the ability to rotate through a list of proxies to avoid 429 errors.
Source code(tar.gz)
Source code(zip)
03/09/2022(Mar 9, 2022)

The dreaded asyncio.exceptions.TimeoutError has finally been resolved! No more Twayback leaving you behind :)
Source code(tar.gz)
Source code(zip)
twayback.exe(64.39 MB)
twayback.zip(4.26 KB)
02/18/2022(Feb 18, 2022)

HUGE improvements to async have arrived, thanks to the awesome @humandecoded! Status checking should be much speedier. ⏩

Twayback A is discontinued, as speed was the point there were two versions. 👋

Happy Pluto Day! You'll always be a planet in my heart. 💔
Source code(tar.gz)
Source code(zip)
twayback.exe(63.73 MB)
twayback.zip(15.50 KB)
02/16/2022(Feb 16, 2022)

Twayback B has been updated!

async has been implemented, all thanks to @humandecoded , they are awesome! 💛

No more need to ping Archive.org to convert Twitter URLs to Wayback links! Now that happens locally on your computer. Saves you time, bandwidth, and errors! (#3)
Source code(tar.gz)
Source code(zip)
twayback.exe(63.24 MB)
twayback.zip(3.58 KB)
twayback_B.exe(63.72 MB)
twayback_B.zip(15.50 KB)
02/14/2022(Feb 14, 2022)

Many bugs have been fixed! When you downloaded HTML pages before, you might have noticed all files were the same. Well, no more. Python script should work on Windows, Linux, and macOS with equal reliability. Non-Latin characters (such as in Arabic or Cyrillic) should display properly. Small but important change: now you can type the date as 2022-02-14 or 2022/02/14, and it'll work! Or, if you don't like punctuation, 20220214 is perfectly fine! 📅

Twayback now has a sibling! 👯

There is Twayback A and Twayback B.

Use Twayback A if you like speed, don't want Python to check the status code of every archive URL, and are trying to get some or all deleted archived Tweets from a handle with less than 3,200 active Tweets. 👍🏻

Use Twayback B if you like to get all deleted archived Tweets from someone with over 3,200 active Tweets. It does this through status code-checking. 😉

Added timeouts and better exception handling so Twayback can retry in cases of failure. ❌

Also made some things tidy for the birds. 🦢
Source code(tar.gz)
Source code(zip)
twayback.exe(63.24 MB)
twayback.zip(3.58 KB)
twayback_B.exe(63.20 MB)
twayback_B.zip(3.50 KB)
02/13/2022(Feb 13, 2022)

New feature: Screenshots! (Requires Chrome and Chrome driver.) 😊

More bugs have been rescued. 🐜

Parsing Tweets to text ACTUALLY WORKS this time. ✍

Happy Galentine's Day! 💘
Source code(tar.gz)
Source code(zip)
twayback.exe(62.81 MB)
twayback.zip(3.35 KB)
02/12/2022(Feb 12, 2022)

Tweets download using the requests library 💻

Twayback can now parse the text of Tweets and extract it to a file 📝

Sweeping dust and cleaning up here and there 🧹
Source code(tar.gz)
Source code(zip)
twayback.exe(62.58 MB)
twayback.zip(2.67 KB)
02/11/2022(Feb 11, 2022)

Bugs are annoying. (Except caterpillars, they're awesome.) 🐛

This release fixes any crashes you have encountered. 👍

Also, I didn't import waybackpack before. Now it is imported. 🤦‍♂️

Happy International Day of Women and Girls in Science! 💪🔬🔭🧪
Source code(tar.gz)
Source code(zip)
twayback.exe(62.53 MB)
twayback.zip(2.36 KB)
02/06/2022(Feb 6, 2022)
Progress bars are awesome, and they're here! 📶

Peppered some color here and there 🌈

Twayback can now tell you if the Wayback Machine excludes archived Tweets for the target handle (it can happen) ❌

Changed from BingBot to DuckDuckBot for checking status codes (I ♥ DuckDuckGo)

Now powered by waybackpack(thanks jsvine!) 🙏

Source code(tar.gz)
Source code(zip)
twayback.exe(37.89 MB)
twayback.zip(2.45 KB)
02/04/2022(Feb 5, 2022)

Second version, yay!

Batch has left the chat. Code is now 100% Python.
Source code(tar.gz)
Source code(zip)
twayback.exe(33.39 MB)
twayback.zip(2.02 KB)
02/01/2022(Feb 1, 2022)

First version. Pretty messy, but it works.
Source code(tar.gz)
Source code(zip)
twayback.zip(2.39 KB)

Owner

He/Him. Good-for-nothing GitHubber.

GitHub Repository https://mennaruuk.github.io/twayback

Script that allows to download portable installers of different versions Adobe software for macOS

What is this and for what This is a script that allows you to download portable installers of programs from Adobe for macOS with different versions. T

715 Jan 06, 2023

The free and open-source Download Manager written in pure Python

2.7k Dec 31, 2022

😷 Dowload dos documentos da CPI da Pandemia

A CPI da Pandemia recebeu milhares de documentos públicos, todos disponibilizados no site do Senado Federal.

98 Sep 23, 2022

YouTube Downloader Bot With Python

TG YᴏᴜTᴜʙᴇ Uᴘʟᴏᴀᴅᴇʀ * Commands YouTube for Audio & Video and sends it to telegram after receiving valid URL [Do not forwarded any just copy and paste

5 Oct 21, 2022

Google Art Image Downloader Tkinter

Google-Art-Image-Downloader-Tkinter 由 google-art-downloader 整改的批量 Google 艺术展平台高清图片下载 ⭐ It works perfectly from 2018 year till today, thanks for stars!

1 Jan 05, 2022

Twitter Media Downloader (Telegram Bot)

8 Oct 27, 2022

Copy online media to your USB pen by night and watch it on your daily commute

commute-tube commute-tube is your friend on your daily commute. It will download videos of your interest to your USB pen by night so that you're able

19 Mar 23, 2022

📼Command line tool based on youtube-dl to easily download selected channels from your subscriptions.

youtube-cdl Command line tool based on youtube-dl to easily download selected channels from your subscriptions. This tool is very handy if you want to

64 Dec 25, 2022

The sole purpose of this script is to download any NFT collection from OpenSea

OpenSea NFT Stealer The sole purpose of this script is to download any NFT collection from OpenSea. Setup Prerequisites: Python 3 Python requests libr

9 Sep 04, 2022

Downloads and Updates GOG Galaxy 2.0 Plugins/Integrations

GOG Galaxy Plugins Downloader Summary This program downloads GOG Galaxy 2.0 Plugins and installs them to the proper location. You probably do not want

253 Dec 12, 2022

Shit-fetch - Shitpost fetcher (downloader)

shit-fetch Download shitpost (random) from https://random-shitpost.com/ Usage ./shitfetch.py --nsfw (true/false) --output ~/Downloads (default : ./)

1 Jan 02, 2022

Youtube-music - Youtube music with python

youtube-music fzf on https://github.com/junegunn/fzf python3 ytb.py [no/yes] yes

0 Feb 03, 2022

Python script to download all images/webms of a 4chan thread

Python3 script to continuously download all images/webms of multiple 4chan thread simultaneously - without installation

208 Jan 04, 2023

This script fully automates of downloading tiktok videos, editing them,compiling them and finally uploading them to youtube.

This script fully automates of downloading tiktok videos, editing them,compiling them and finally uploading them to youtube. If you wanted to create a tiktok video compiilation youtubbe channel this

32 Dec 16, 2022

Tool to get Canvas cover videos from Spotify tracks.

Spotify Canvas Downloader Tool to get Canvas cover videos from Spotify tracks. ✨ Try it out Building Clone the repository git clone https://github.com

35 Dec 28, 2022

Simple package for Sublime Text 4; download URL's for local viewing and editing

URLDownloader This is a simple example package that allows you to easily download the contents of any web URL to edit locally. Given a URL, the packag

3 Mar 05, 2022

YouTube Video publisher using youtube-dl & ROS2🐢

YouTube-publisher-ROS2 Publish sensor_msgs/Image by "YouTube" 🤗 🤗 🤗 ! You don't have to use webcamera or your video to check demos. Purpose Quick d

5 Dec 04, 2022

I sure love the mix of newsboat+mpv+youtube-dl to watch videos from my favourite creators directly from my command line. But sometimes I want to download them beforehand and have them sorted into different folders. Here is the script to do exactly that.

newsboat_video_downloader I sure love the mix of newsboat+mpv+youtube-dl to watch videos from my favourite creators directly from my command line. But

16 Dec 12, 2022

Animoo - Python scraper made with BeautifulSoup4 that scrapes images from /c/.

Animoo - Python scraper made with BeautifulSoup4 that scrapes images from /c/. Features Scrapes 10 pages Scrapes each thread Downloads all the images

1 Dec 29, 2021

File Downloader

File Downloader Watches a file containing download links and runs a command to download them. The link file is in form of: # comment DOWNLOAD_LINK

1 Jan 08, 2022

Twayback: Downloading deleted Tweets from the Wayback Machine, made easy

Related tags

Overview

Twayback: Downloading deleted Tweets from the Wayback Machine, made easy

Requirements

Features

Usage

Installation

For Windows only

For Windows, Linux, and macOS

Things to keep in mind

Future plans

Comments

Releases(10-16-22)

10-16-22(Oct 16, 2022)

03/09/2022(Mar 9, 2022)

02/18/2022(Feb 18, 2022)

HUGE improvements to async have arrived, thanks to the awesome @humandecoded! Status checking should be much speedier. ⏩

Twayback A is discontinued, as speed was the point there were two versions. 👋

Happy Pluto Day! You'll always be a planet in my heart. 💔

02/16/2022(Feb 16, 2022)

02/14/2022(Feb 14, 2022)

02/13/2022(Feb 13, 2022)

02/12/2022(Feb 12, 2022)

02/11/2022(Feb 11, 2022)

02/06/2022(Feb 6, 2022)

02/04/2022(Feb 5, 2022)

02/01/2022(Feb 1, 2022)

Owner

Script that allows to download portable installers of different versions Adobe software for macOS

The free and open-source Download Manager written in pure Python

😷 Dowload dos documentos da CPI da Pandemia

YouTube Downloader Bot With Python

Google Art Image Downloader Tkinter

Twitter Media Downloader (Telegram Bot)

Copy online media to your USB pen by night and watch it on your daily commute

📼Command line tool based on youtube-dl to easily download selected channels from your subscriptions.

The sole purpose of this script is to download any NFT collection from OpenSea

Downloads and Updates GOG Galaxy 2.0 Plugins/Integrations

Shit-fetch - Shitpost fetcher (downloader)

Youtube-music - Youtube music with python

Python script to download all images/webms of a 4chan thread

This script fully automates of downloading tiktok videos, editing them,compiling them and finally uploading them to youtube.

Tool to get Canvas cover videos from Spotify tracks.

Simple package for Sublime Text 4; download URL's for local viewing and editing

YouTube Video publisher using youtube-dl & ROS2🐢

I sure love the mix of newsboat+mpv+youtube-dl to watch videos from my favourite creators directly from my command line. But sometimes I want to download them beforehand and have them sorted into different folders. Here is the script to do exactly that.

Animoo - Python scraper made with BeautifulSoup4 that scrapes images from /c/.

File Downloader