A web scraper for nomadlist.com, made to avoid website restrictions.

Gypsylist

gypsylist.py is a web scraper for nomadlist.com, made to avoid website restrictions.

nomadlist.com is a website with a wealth of information for digital nomads, helping location-independent remote workers find the best places to live and work. Unfortunately, most of this content is restricted unless you are a member of the website.

This script doesn't cover all of the information retrievable from the website; it is just an entry point for evaluating the site without signing up.

Installation

Before using gypsylist, you have to install some requirements:

pip3 install -r requirements.txt

Additionally, since selenium is a dependency, you also have to set up the browser driver. To install it, take a look here: https://www.selenium.dev/documentation/webdriver/getting_started/install_drivers/.

Now you should be ready to run the script.
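A quick way to confirm the setup is to check that selenium is importable and that a driver binary is on your PATH. The sketch below is not part of gypsylist: the function name check_setup is made up for illustration, and the driver names chromedriver and geckodriver are assumptions, since this README does not say which browser the script drives.

```python
import importlib.util
import shutil

# Driver executables to look for. Which browser gypsylist.py actually uses is
# not stated in this README, so both common drivers are checked (assumption).
DRIVER_NAMES = ("chromedriver", "geckodriver")


def check_setup(driver_names=DRIVER_NAMES):
    """Report whether selenium is importable and which drivers are on PATH."""
    report = {"selenium": importlib.util.find_spec("selenium") is not None}
    for name in driver_names:
        # shutil.which returns the full path of the executable, or None.
        report[name] = shutil.which(name)
    return report


if __name__ == "__main__":
    for item, status in check_setup().items():
        print(f"{item}: {status if status else 'not found'}")
```

A driver reported as "not found" only means it is not on PATH; it may still be installed elsewhere on your system.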

Usage

To use gypsylist, first browse the nomadlist.com website and apply the filters you need for your research. Then copy the URL path (everything after the domain, including the query string) from your browser's address bar.

Use this path to scrape with gypsylist:

./gypsylist.py --path "safe-places-for-remote-workers-to-live?sort=cost_for_nomad_in_usd&order=asc" --emoji
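The value passed to --path in the command above is the part of the URL after the domain, query string included. As a minimal sketch, it can be derived from a copied URL with the standard library. The helper name path_from_url is hypothetical, and dropping the leading slash is an assumption based on the example command.

```python
from urllib.parse import urlsplit


def path_from_url(url):
    """Return the path and query string of a URL, without the leading slash."""
    parts = urlsplit(url)
    path = parts.path.lstrip("/")
    # Keep the query string, since it carries the sort and filter settings.
    return f"{path}?{parts.query}" if parts.query else path


if __name__ == "__main__":
    url = ("https://nomadlist.com/safe-places-for-remote-workers-to-live"
           "?sort=cost_for_nomad_in_usd&order=asc")
    # Prints: safe-places-for-remote-workers-to-live?sort=cost_for_nomad_in_usd&order=asc
    print(path_from_url(url))
```

Keep the value quoted on the command line, as in the gypsylist command above, because the shell would otherwise interpret the ? and & characters.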

The expected output of gypsylist looks like this:

#1
🏙️  city: Lisbon
🌎 country: Portugal
⭐️ overall: 4/5
💵 cost: 4/5
📡 internet: 5/5
😀 fun: 5/5
👮 safety: 4/5

...

#440
🏙️  city: Zurich
🌎 country: Switzerland
⭐️ overall: 3/5
💵 cost: 1/5
📡 internet: 5/5
😀 fun: 4/5
👮 safety: 4/5

#441
🏙️  city: Leiden
🌎 country: Netherlands
⭐️ overall: 3/5
💵 cost: 1/5
📡 internet: 5/5
😀 fun: 4/5
👮 safety: 4/5

#442
🏙️  city: Honolulu, Hawaii
🌎 country: United States
⭐️ overall: 4/5
💵 cost: 1/5
📡 internet: 5/5
😀 fun: 5/5
👮 safety: 4/5

#443
🏙️  city: Lake Tahoe, CA
🌎 country: United States
⭐️ overall: 3/5
💵 cost: 1/5
📡 internet: 5/5
😀 fun: 4/5
👮 safety: 4/5

(Always remember --emoji). Have fun!
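Because the printed records follow a regular layout, they can be post-processed once saved to a file or captured from the terminal. The following is only a sketch, assuming the output looks exactly like the sample in this section (a "#N" line followed by "label: value" lines). The function parse_records is not part of gypsylist, and ratings are kept as plain strings such as "4/5".

```python
def parse_records(text):
    """Parse gypsylist --emoji output into a list of dicts, one per city."""
    records = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("#") and line[1:].isdigit():
            # A line such as "#1" starts a new record.
            current = {"rank": int(line[1:])}
            records.append(current)
        elif current is not None and ": " in line:
            label, _, value = line.partition(": ")
            # The label is "<emoji> <name>"; keep only the name.
            words = label.split()
            if words:
                current[words[-1]] = value
    return records


if __name__ == "__main__":
    sample = """#1
🏙️  city: Lisbon
🌎 country: Portugal
⭐️ overall: 4/5

...

#442
🏙️  city: Honolulu, Hawaii
💵 cost: 1/5
"""
    for record in parse_records(sample):
        print(record)
```

Lines that match neither pattern, such as blank lines or "...", are skipped.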

Known Issues

This is not what you would call "well-written code" (sorry, Gods of programming). For this reason, there are several code smells and bugs that have not been reviewed, due to the short time I dedicated to writing the script.

  • When the --headless / -H parameter is used to run the browser in headless mode, only the first page of results is retrieved from the website.
Owner
Alessio Greggi
Computer scientist, graduated from the University of Rome Tor Vergata. Currently working as a Linux engineer. CTF player in my free time.