A dead simple crawler to get books information from Douban.

Last update: Jan 10, 2022

Related tags

Web Crawling douban-books-crawler

Overview

Introduction

A dead simple crawler to get books information from Douban.

Pre-requesites

Python 3
Install dependencies from requirements.txt
(Optional) Install Anaconda to handle environment

Usage

Run get_tags to fetch all the trending tags.

# This will generate a file tags.csv
python app.py get_tags

Run crawl_books to start crawling the books by the tags from the previous step.

python app.py crawl_books -i tags.csv

Certainly, you can create the tags.csv without using the get_tags script. You might want to make sure the tags you specified can lead to any actual result of books.

License

MIT © mogita

Owner

Yun Wang

GitHub Repository

A simple reddit scraper to get memes (only images) from r/ProgrammerHumor.

memey A simple reddit scraper to get memes (only images) from r/ProgrammerHumor. Note Only works if you have firefox installed (yet). Instructions foo

2 Nov 16, 2021

An utility library to scrape data from TikTok, Instagram, Twitch, Youtube, Twitter or Reddit in one line!

Social Media Scraper An utility library to scrape data from TikTok, Instagram, Twitch, Youtube, Twitter or Reddit in one line! Go to the website » Vie

2 Aug 03, 2022

Python scraper to check for earlier appointments in Clalit Health Services

clalit-appt-checker Python scraper to check for earlier appointments in Clalit Health Services Some background If you ever needed to schedule a doctor

16 Sep 17, 2022

A dead simple crawler to get books information from Douban.

Introduction A dead simple crawler to get books information from Douban. Pre-requesites Python 3 Install dependencies from requirements.txt (Optional)

1 Jan 10, 2022

Instagram profile scrapper with python

IG Profile Scrapper Instagram profile Scrapper Just type the username, and boo! :D Instalation clone this repo to your computer git clone https://gith

6 Nov 07, 2022

Anonymously scrapes onlinesim.ru for new usable phone numbers.

phone-scraper Anonymously scrapes onlinesim.ru for new usable phone numbers. Usage Clone the repository $ git clone https://github.com/thomasgruebl/ph

16 Oct 08, 2022

用python爬取江苏几大高校的就业网站，并提供3种方式通知给用户，分别是通过微信发送、命令行直接输出、windows气泡通知。

crawler_for_university 用python爬取江苏几大高校的就业网站，并提供3种方式通知给用户，分别是通过微信发送、命令行直接输出、windows气泡通知。环境依赖 wxpy,requests,bs4等库功能描述该项目基于python，通过爬虫爬各高校的就业信息网，爬取招聘信

8 Aug 16, 2021

Scrapegoat is a python library that can be used to scrape the websites from internet based on the relevance of the given topic irrespective of language using Natural Language Processing

Scrapegoat is a python library that can be used to scrape the websites from internet based on the relevance of the given topic irrespective of language using Natural Language Processing. It can be ma

10 Jul 06, 2022

此脚本为 python 脚本,实现原理为利用 selenium 定位相关元素,再配合点击事件完成浏览器的自动化.

5 Nov 19, 2021

一个m3u8视频流下载脚本

一个Python的m3u8流视频下载脚本介绍 m3u8流视频日益常见，目前好用的下载器也有很多，我把之前自己写的一个小脚本分享出来，供广大网友使用。写此程序的目的在于给视频下载爱好者提供一个下载样例，可直接调用，勿再重复造轮子。使用方法在python中直接运行程序或进行外部调用 import

0 Oct 10, 2021

🐞 Douban Movie / Douban Book Scarpy

Python3-based Douban Movie/Douban Book Scarpy crawler for cover downloading + data crawling + review entry.

1 Dec 03, 2022

Web scraper for Zillow

Zillow-Scraper Instructions All terminal commands are highlighted. Make sure you first have python 3 installed. You can check this by running "python

1 Nov 23, 2021

Video Games Web Scraper is a project that crawls websites and APIs and extracts video game related data from their pages.

Video Games Web Scraper Video Games Web Scraper is a project that crawls websites and APIs and extracts video game related data from their pages. This

1 Jan 12, 2022

A dead simple crawler to get books information from Douban.

Related tags

Overview

Introduction

Pre-requesites

Usage

License

Owner

Yun Wang

A simple reddit scraper to get memes (only images) from r/ProgrammerHumor.

An utility library to scrape data from TikTok, Instagram, Twitch, Youtube, Twitter or Reddit in one line!

Python scraper to check for earlier appointments in Clalit Health Services

A dead simple crawler to get books information from Douban.

Instagram profile scrapper with python

Anonymously scrapes onlinesim.ru for new usable phone numbers.

用python爬取江苏几大高校的就业网站，并提供3种方式通知给用户，分别是通过微信发送、命令行直接输出、windows气泡通知。

Scrapegoat is a python library that can be used to scrape the websites from internet based on the relevance of the given topic irrespective of language using Natural Language Processing

此脚本为 python 脚本,实现原理为利用 selenium 定位相关元素,再配合点击事件完成浏览器的自动化.

一个m3u8视频流下载脚本

🐞 Douban Movie / Douban Book Scarpy

Web scraper for Zillow

Video Games Web Scraper is a project that crawls websites and APIs and extracts video game related data from their pages.

An introduction to free, automated web scraping with GitHub’s powerful new Actions framework.

A leetcode scraper to compile all questions in leetcode free tier to text file. pdf also available.

Bulk download tool for the MyMedia platform

simple http & https proxy scraper and checker

UsernameScraperTool - Username Scraper Tool With Python

抢京东茅台脚本，定时自动触发，自动预约，自动停止

Download images from forum threads