A web scraper, built with the Python framework Scrapy, that extracts data from the Deals of the Day section of the Mercado Livre website.

Last update: Jan 12, 2022

Related tags

Web Crawling mercadolivre-scraper

Overview

Deals of the Day

This web scraper uses the Python framework Scrapy to extract data such as price and product name from the Deals of the Day section of the Mercado Livre website.

What Data Do We Want to Scrape?

Product Name
Original Price
Current Price
Product Url
Data Extraction Date
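
As an illustration only, the fields above could be modeled as a small record type. The sketch below is not taken from the repository: the class name `DealItem`, the helper `parse_brl`, and the assumption that prices arrive as Brazilian-formatted strings (for example `R$ 1.299,90`) are all hypothetical.

```python
from dataclasses import dataclass, asdict
from datetime import date
from decimal import Decimal
from typing import Optional


def parse_brl(text: str) -> Decimal:
    """Convert a Brazilian-formatted price string to a Decimal.

    Assumes '.' is the thousands separator and ',' the decimal mark.
    """
    cleaned = text.replace("R$", "").strip()
    cleaned = cleaned.replace(".", "").replace(",", ".")
    return Decimal(cleaned)


@dataclass
class DealItem:
    """Hypothetical record holding the five scraped fields."""
    product_name: str
    original_price: Optional[Decimal]  # may be absent if no discount is shown
    current_price: Decimal
    product_url: str
    extraction_date: date


# Example values are made up for illustration.
item = DealItem(
    product_name="Example product",
    original_price=parse_brl("R$ 1.299,90"),
    current_price=parse_brl("R$ 999"),
    product_url="https://example.com/p/1",
    extraction_date=date(2022, 1, 12),
)
print(asdict(item))
```

Using `Decimal` rather than `float` avoids rounding surprises when prices are compared or summed later.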

Note: The scraper handles pagination and extracts the aforementioned data throughout the entire Deals of the Day section.
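
The pagination idea can be sketched independently of Scrapy: collect the items on a page, then follow the "next page" link until there is none. The code below is a minimal sketch under stated assumptions, not the repository's spider. The names `crawl_all_pages`, `Fetcher`, and `FAKE_SITE` are hypothetical, and the fake site stands in for real HTTP requests. In a Scrapy spider, the same loop is usually expressed by yielding `response.follow(next_href, callback=self.parse)` from the parse callback.

```python
from typing import Callable, Iterator, List, Optional, Tuple
from urllib.parse import urljoin

# A fetcher takes a page URL and returns
# (items found on that page, href of the next page or None).
Fetcher = Callable[[str], Tuple[List[dict], Optional[str]]]


def crawl_all_pages(start_url: str, fetch: Fetcher) -> Iterator[dict]:
    """Yield items from every page, following next-page links until none remain."""
    url: Optional[str] = start_url
    seen = set()  # guards against a next link that loops back to a visited page
    while url and url not in seen:
        seen.add(url)
        items, next_href = fetch(url)
        yield from items
        # Resolve a relative href against the current page URL.
        url = urljoin(url, next_href) if next_href else None


# Hypothetical three-page listing used in place of real network access.
FAKE_SITE = {
    "https://example.com/deals": ([{"name": "A"}], "/deals?page=2"),
    "https://example.com/deals?page=2": ([{"name": "B"}], "/deals?page=3"),
    "https://example.com/deals?page=3": ([{"name": "C"}], None),
}

names = [
    item["name"]
    for item in crawl_all_pages("https://example.com/deals", FAKE_SITE.__getitem__)
]
print(names)
```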

💻 Requirements

Before you start, please check if you have met these few basic requirements:

Installed the latest stable Python version (Python 3.7 or later).
Created a virtual environment to run the Scrapy framework on your machine.
Installed Scrapy 1.6 or a later stable version.

Note: It is strongly recommended that you install Scrapy in a dedicated virtualenv, to avoid conflicting with your system packages.

Getting Started

From terminal

Create an environment:

$ mkdir virtual-environments
$ cd virtual-environments
$ python3 -m venv venv

Activate it:
Linux/macOS

$ source venv/bin/activate

Install the Scrapy framework:

$ pip install Scrapy

🚀 How to Use:

Clone this repository into your workspace:

$ git clone https://github.com/david-adds/mercadolivre-scraper.git

Once you have cloned the repository, change into its directory so you can run the scraper:

$ cd mercadolivre-scraper

Then, run the spider to scrape the data:

$ scrapy crawl deals

Owner

David Souza
