WebScrapping Project - G1 Latest News

Last update: Feb 13, 2022

Related tags

Overview

Web Scrapping com Python

Esse projeto consiste em um código para o usuário buscar as últimas nóticias sobre um termo qualquer, no site G1. Para esse projeto foi escolhida a linguagem de programação Python. Para que fosse possível realizar essa busca, foram utilizadas três bibiliotecas, que foram:

selenium - Utilizada para automatizar o processo e obter o conteúdo da página Web.
bs4 - BeautifoulSoup - Utilizada para manipular o conteúdo HTML.
Pandas - Utilizada para criar e exportar um dataframe com as informações obtidas.

💻 Pré-Requisitos

Antes de comerçar, verifique se você atende os seguintes requisitos:

Possuir Windows, Linux or Mac.
Possuir o Python instalado em sua máquina.
Possuir o navegador Google Chrome instalado em sua máquina na versão 97.0.4692.71.
Possuir conexão à Internet

💻 Running

Instale os pacotes necessários:

$ pip install -r requirements.txt

Execute o arquivo main.py, aguarde alguns segundos e será gerada uma planilha XLSX e um arquivo CSV com as informações.

License

MIT

Free Software, Hell Yeah!

WebScrapping Project - G1 Latest News

Related tags

Overview

Web Scrapping com Python

💻 Pré-Requisitos

💻 Running

License

Owner

Eduardo Henrique

A web service for scanning media hosted by a Matrix media repository

Python script who crawl first shodan page and check DBLTEK vulnerability

京东秒杀商品抢购Python脚本

Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

This was supposed to be a web scraping project, but somehow I've turned it into a spamming project

:arrow_double_down: Dumb downloader that scrapes the web

茅台抢购最新优化版本，茅台秒杀，优化了抢购协程队列

Facebook Group Scraping Using Beautiful Soup & Selenium

自动完成每日体温上报（Github Actions）

CRI Scrape is a tool for get general info about Italian Red Cross in GAIA Platform

Scraping and visualising India's real-time COVID-19 data from the MOHFW dataset.

Incredibly fast crawler designed for OSINT.

A high-level distributed crawling framework.

Linkedin webscraping - Linkedin web scraping with python

A Scrapper with python

爬取各大SRC当日公告 | 通过微信通知的小工具 | 赏金工具

👨🏼‍⚖️ reddit bot that turns comment chains into ace attorney scenes

爱奇艺会员,腾讯视频,哔哩哔哩,百度,各类签到

河南工业大学完美校园自动校外打卡

A python module to parse the Open Graph Protocol

WebScrapping Project - G1 Latest News

Related tags

Overview

Web Scrapping com Python

💻 Pré-Requisitos

💻 Running

License

Owner

Eduardo Henrique

A web service for scanning media hosted by a Matrix media repository

Python script who crawl first shodan page and check DBLTEK vulnerability

京东秒杀商品抢购Python脚本

Instagram_scrapper - This project allow you to scrape the list of followers, following or both from a public Instagram account, and create a csv or excel file easily.

This was supposed to be a web scraping project, but somehow I've turned it into a spamming project

:arrow_double_down: Dumb downloader that scrapes the web

茅台抢购最新优化版本，茅台秒杀，优化了抢购协程队列

Facebook Group Scraping Using Beautiful Soup & Selenium

自动完成每日体温上报（Github Actions）

CRI Scrape is a tool for get general info about Italian Red Cross in GAIA Platform

Scraping and visualising India's real-time COVID-19 data from the MOHFW dataset.

Incredibly fast crawler designed for OSINT.

A high-level distributed crawling framework.

Linkedin webscraping - Linkedin web scraping with python

A Scrapper with python

爬取各大SRC当日公告 | 通过微信通知的小工具 | 赏金工具

👨🏼‍⚖️ reddit bot that turns comment chains into ace attorney scenes

爱奇艺会员,腾讯视频,哔哩哔哩,百度,各类签到

河南工业大学 完美校园 自动校外打卡

A python module to parse the Open Graph Protocol

河南工业大学完美校园自动校外打卡