Fundamentus_scrapy

Baixa informacões que os outros scrapys do fundamentus não realizam.

Para iniciar (python main.py), sera criado um arquivo chamado acoes.csv ao termino do scrapy.

Não é um codigo elegante, mas funcional.

As informacões baixadas são:

       columns = ['Papel', 'Cotação', 'Tipo', 'Data ult cot', 'Empresa', 'Min 52 sem',
                  'Setor', 'Max 52 sem', 'Subsetor', 'Vol $ méd (2m)', 'Valor de mercado',
                  'Últ balanço processado', 'Valor da firma', 'Nro. Ações',

                  'Dia', 'P/L',
                  'LPA', 'Mês', 'P/VP', 'VPA', '30 dias', 'P/EBIT', 'Marg. Bruta',
                  '12 meses', 'PSR', 'Marg. EBIT', '2021', 'P/Ativos', 'Marg. Líquida',
                  '2020', 'P/Cap. Giro', 'EBIT / Ativo', '2019', 'P/Ativ Circ Liq',
                  'ROIC', '2018', 'Div. Yield', 'ROE', '2017', 'EV / EBITDA',
                  'Liquidez Corr', '2016', 'EV / EBIT', 'Div Br/ Patrim', '2015',
                  'Cres. Rec (5a)', 'Giro Ativos',

                  'Ativo',
                  'Dív. Bruta',
                  'Disponibilidades',
                  'Dív. Líquida',
                  'Ativo Circulante',               
                  'Depósitos',
                  'Cart. de Crédito',
                  'Patrim. Líq',

                  'Receita Líquida_12meses',         
                  'Receita Líquida_3meses', 'EBIT_12meses', 'EBIT_3meses',
                  'Lucro Líquido_12meses', 'Lucro Líquido_3meses']

Realizei este projeto com o fim de aprendizado e por não encontrar no github nenhum scrapy que pegue todas as informaçoes que eu precisava como setores e subsetores para realizar modelos KNN de machine learning.

Fundamentus scrapy

Related tags

Overview

Fundamentus_scrapy

Owner

Guilherme Silva Uchoa

Python scraper to check for earlier appointments in Clalit Health Services

Crawler do site Fundamentus.com com o uso do framework scrapy, tanto da aba detalhada como a de resumo.

High available distributed ip proxy pool, powerd by Scrapy and Redis

A scrapy pipeline that provides an easy way to store files and images using various folder structures.

Parse feeds in Python

Command line program to download documents from web portals.

The first public repository that provides free BUBT website scraping API script on Github.

联通手机营业厅自动做任务、签到、领流量、领积分等。

News, full-text, and article metadata extraction in Python 3. Advanced docs:

This is a python api to scrape search results from a url.

PS5 bot to find a console in france for chrismas 🎄🎅🏻 NOT FOR SCALPERS

Scrape Twitter for Tweets

Scrap the 42 Intranet's elearning videos in a single click

A python script to extract answers to any question on Quora (Quora+ included)

让中国用户使用git从github下载的速度提高1000倍!

Scraping Top Repositories for Topics on GitHub,

Pyrics is a tool to scrape lyrics, get rhymes, generate relevant lyrics with rhymes.

Html Content / Article Extractor, web scrapping lib in Python

script to scrape direct download links (ddls) from google drive index.

VG-Scraper is a python program using the module called BeautifulSoup which allows anyone to scrape something off an website. This program lets you put in a number trough an input and a number is 1 news article.