Newsscraper - A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

Last update: Jan 02, 2022

Overview

NewsScraper

A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

🔧 Installation

Clone the repo locally.
Use the package manager pip to install the requirements.

pip install -r requirements.txt

✨ Basic Usage

import NewsScraper

all_data = NewsScraper.fetch_all()
news_data = NewsScraper.fetch_news_data()
crypto_data = NewsScraper.fetch_crypto_data()

fetch_all()

Returns a set of NewsScraper.Result containing fetched results from all available RSS feeds

Can include categories: GLOBAL, US, EU, CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

fetch_news_data()

Returns a set of NewsScraper.Result containing fetched results from CNN, ABC News, Yahoo News, Fox News RSS feeds

Can include categories: GLOBAL, US, EU.

fetch_crypto_data()

Returns a set of NewsScraper.Result containing fetched results from CoinJournal, Crypto Currency News RSS feeds.

Can include categories: CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

🔨 Advanced Usage

NewsScraper.Result class

A class used to represent a returned article.

Attributes

context : str

A string describing the category of the article.

ex. "GLOBAL", "US", "BLOCKCHAIN", "BTC".
title : str

A string containing the name of the article.
summary : str

A string containing the summary of the article.

NOTE: sometimes it can have the value of "", because the RSS feed didn't provide a summary.
content : str

A string containing the content of the article.

Methods

Result.json()

Returns a dictionary with the attributes of the class formatted in JSON.

ex.

{
  "context": "global",
  "title": "title of the article",
  "summary": "summary of the article",
  "content": "content of the article"
}

News RSS Feeds

All of these functions return a set of NewsScraper.Result containing fetched results of the described RSS feeds.

fetch_abc()
fetch_cnn()
fetch_yahoo()
fetch_fox_news()

Can include categories: GLOBAL, US, EU.

Alternatively, you can use fetch_news_data() to receive results from all of them.

Crypto RSS Feeds

All of these functions return a set of NewsScraper.Result containing fetched results of the described RSS feeds.

fetch_coinjournal()
fetch_cryptocurrencynews()

Can include categories: CRYPTO, BLOCKCHAIN, BTC, ETH, LTC.

Alternatively, you can use fetch_news_data() to receive results from all of them.

🤝 Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

📝 License

This project is licensed under the MIT license.

Newsscraper - A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

Related tags

Overview

NewsScraper

🔧 Installation

✨ Basic Usage

🔨 Advanced Usage

NewsScraper.Result class

context : str

title : str

summary : str

content : str

Result.json()

News RSS Feeds

Crypto RSS Feeds

🤝 Contributing

📝 License

Owner

Rokas

An automated, headless YouTube Watcher and Scraper

Scrapy uses Request and Response objects for crawling web sites.

A multithreaded tool for searching and downloading images from popular search engines. It is straightforward to set up and run!

A scalable frontier for web crawlers

Deep Web Miner Python | Spyder Crawler

Python framework to scrape Pastebin pastes and analyze them

FilmMikirAPI - A simple rest-api which is used for scrapping on the Kincir website using the Python and Flask package

HappyScrapper - Google news web scrapper with python

Dictionary - Application focused on word search through web scraping

基于Github Action的定时HITsz疫情上报脚本，开箱即用

Script for scrape user data like "id,username,fullname,followers,tweets .. etc" by Twitter's search engine .

This code will be able to scrape movies from a movie website and also provide download links to newly uploaded movies.

Displays market info for the LUNI token on the Terra Blockchain

Pythonic Crawling / Scraping Framework based on Non Blocking I/O operations.

Simple library for exploring/scraping the web or testing a website you’re developing

A Spider for BiliBili comments with a simple API server.

A web crawler script that crawls the target website and lists its links

Html Content / Article Extractor, web scrapping lib in Python

OSTA web scraper, for checking the status of school buses in Ottawa

A tool for scraping and organizing data from NewsBank API searches