Scrape plants scientific name information from Agroforestry Species Switchboard 2.0.

Last update: Dec 23, 2021

Overview

Agroforestry Species Switchboard 2.0 Scraper

Scrape plants scientific name information from Species Switchboard 2.0.

Requirements

python >= 3.10 (you can use pyenv for easier python version management)
pipenv

How to run

Install dependencies

cp env.sample .env
pipenv --python 3
pipenv install

Run
```
pipenv run python main.py
```
The result will be placed in a file named result.*.csv

Test Shell

pipenv run scrapy shell 'http://apps.worldagroforestry.org/products/switchboard/index.php/species_search/Acacia%20abyssinica'

Cleanup All Outputs

rm result.* && rm log.*

Special Cases

Case	Link	Note
ICRAF Databases Not Found	Engelhardia spicata
Genus Found	Forficula	What to do next?
Multiple Species Found	Alstonia spectabilis	Get the matched species right?
Species Variant Found	Engelhardtia spicata	Need human to check
Similar Species Found	Costus speciosus	Need human to check

Contributing

Fork this repo
Develop
Create pull request
Tag @rizqirizqi for review
Merge~~

License

GPL-3.0

Scrape plants scientific name information from Agroforestry Species Switchboard 2.0.

Related tags

Overview

Agroforestry Species Switchboard 2.0 Scraper

Requirements

How to run

Test Shell

Cleanup All Outputs

Special Cases

Contributing

License

Owner

Mgs. M. Rizqi Fadhlurrahman

A simple, configurable and expandable combined shop scraper to minimize the costs of ordering several items

A way to scrape sports streams for use with Jellyfin.

Pro Football Reference Game Data Webscraper

Scrapes the Sun Life of Canada Philippines web site for historical prices of their investment funds and then saves them as CSV files.

A web service for scanning media hosted by a Matrix media repository

Web scraped S&P 500 Data from Wikipedia using Pandas and performed Exploratory Data Analysis on the data.

CRI Scrape is a tool for get general info about Italian Red Cross in GAIA Platform

This was supposed to be a web scraping project, but somehow I've turned it into a spamming project

A Powerful Spider(Web Crawler) System in Python.

A simple Discord scraper for discord bots

Twitter Eye is a Twitter Information Gathering Tool With Twitter Eye

Scraping and visualising India's real-time COVID-19 data from the MOHFW dataset.

京东秒杀商品抢购Python脚本

Shopee Scraper - A web scraper in python that extract sales, price, avaliable stock, location and more of a given seller in Brazil

A Python module to bypass Cloudflare's anti-bot page.

一个m3u8视频流下载脚本

New World Market Scraper

Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.

腾讯课堂，模拟登陆，获取课程信息，视频下载，视频解密。

Haphazard scripts for scraping bitcoin/bitcoin data from GitHub