Iptvcrawl - A scrapy project for crawl IPTV playlist

Related tags

Web Crawlingiptvcrawl
Overview

iptvcrawl

a scrapy project for crawl IPTV playlist.

Dependency

  • Python3
  • pip install scrapy

Usage

scrapy crawl ejatv

Output

Output playlist file is playlist.m3u. You should note that this file will be overwritten every time when you run spider.

Customize

You can customer the filter condition. Just edit the start_urls in ejatv.py

Example:

this url

https://eja.tv/?limit=0&country=js&language=Chinese&category=&level=0&search=

means channel from Japan, language is Chinese, and any category

Avaliable parameters value are follow:

Category

Animation => Animation
Auto => Auto
Business => Business
Classic => Classic
Comedy => Comedy
Cooking => Cooking
Culture => Culture
Documentary => Documentary
Education => Education
Entertainment => Entertainment
Family => Family
Fashion => Fashion
General => General
Kids => Kids
Legislative => Legislative
Lifestyle => Lifestyle
Local => Local
Movies => Movies
Music => Music
News => News
Outdoor => Outdoor
Relax => Relax
Religious => Religious
Science => Science
Series => Series
Shop => Shop
Sport => Sport
Sports => Sports
Travel => Travel
Weather => Weather
XXX => XXX
Youtube => Youtube
VOD => VOD

Language

Akan => Akan
Albanian => Albanian
Amharic => Amharic
Arabic => Arabic
Armenian => Armenian
Azerbaijani => Azerbaijani
Bosnian => Bosnian
Bulgarian => Bulgarian
Catalan => Catalan
Chinese => Chinese
Croatian => Croatian
Czech => Czech
Danish => Danish
Divehi => Divehi
Dutch => Dutch
English => English
Estonian => Estonian
Faroese => Faroese
Finnish => Finnish
French => French
Galician => Galician
Georgian => Georgian
German => German
Greek => Greek
Hebrew => Hebrew
Hindi => Hindi
Hungarian => Hungarian
Icelandic => Icelandic
Ignota => Ignota
Indonesian => Indonesian
Italian => Italian
Japanese => Japanese
Javanese => Javanese
Kannada => Kannada
Kazakh => Kazakh
Khmer => Khmer
Kinyarwanda => Kinyarwanda
Korean => Korean
Kurdish => Kurdish
Lao => Lao
Latvian => Latvian
Lithuanian => Lithuanian
Luxembourgish => Luxembourgish
Macedonian => Macedonian
Malay => Malay
Malay => Malay 
Malayalam => Malayalam
Maltese => Maltese
Mandarin%20Chinese => Mandarin Chinese
Min%20Nan%20Chinese => Min Nan Chinese
Modern%20Greek => Modern Greek 
Montenegrin => Montenegrin
Music => Music
Māori => Māori
Norwegian%20Bokmål => Norwegian Bokmål
Persian => Persian
Polish => Polish
Portuguese => Portuguese
Punjabi => Punjabi
Pushto => Pushto
Romanian => Romanian
Russian => Russian
Serbian => Serbian
Sinhala => Sinhala
Slovak => Slovak
Slovenian => Slovenian
Somali => Somali
Spanish => Spanish
Sundanese => Sundanese
Swahili => Swahili
Swedish => Swedish
Tagalog => Tagalog
Tamil => Tamil
Telugu => Telugu
Thai => Thai
Turkish => Turkish
Ukrainian => Ukrainian
Urdu => Urdu
Uzbek => Uzbek
Vietnamese => Vietnamese
Western%20Frisian => Western Frisian
Yue%20Chinese => Yue Chinese

Country

af => Afghanistan
al => Albania
dz => Algeria
ar => Argentina
am => Armenia
aw => Aruba
au => Australia
at => Austria
az => Azerbaijan
bh => Bahrain
bb => Barbados
by => Belarus
be => Belgium
bo => Bolivia
ba => Bosnia and Herzegovina
br => Brazil
bg => Bulgaria
bf => Burkina Faso
kh => Cambodia
cm => Cameroon
ca => Canada
cl => Chile
cn => China
co => Colombia
cr => Costa Rica
hr => Croatia
cw => Curacao
cy => Cyprus
cz => Czechia
cd => Democratic Republic of the Congo
dk => Denmark
do => Dominican Republic
ec => Ecuador
eg => Egypt
sv => El Salvador
gq => Equatorial Guinea
ee => Estonia
et => Ethiopia
fo => Faroe Islands
fj => Fiji
fi => Finland
fr => France
ge => Georgia
de => Germany
gh => Ghana
gr => Greece
gt => Guatemala
gy => Guyana
ht => Haiti
hn => Honduras
hk => Hong Kong
hu => Hungary
is => Iceland
in => India
id => Indonesia
int => International
ir => Iran
iq => Iraq
ie => Ireland
il => Israel
it => Italy
jm => Jamaica
jp => Japan
jo => Jordan
kz => Kazakhstan
ke => Kenya
xk => Kosovo
kw => Kuwait
kg => Kyrgyzstan
la => Laos
lv => Latvia
lb => Lebanon
ly => Libya
li => Liechtenstein
lt => Lithuania
lu => Luxembourg
mo => Macao
my => Malaysia
mv => Maldives
mt => Malta
mx => Mexico
md => Moldova
mc => Monaco
me => Montenegro
ma => Morocco
mz => Mozambique
nl => Netherlands
an => Netherlands Antilles
nz => New Zealand
ni => Nicaragua
ng => Nigeria
mk => North Macedonia
no => Norway
om => Oman
pk => Pakistan
ps => Palestinian Territory
pa => Panama
py => Paraguay
pe => Peru
ph => Philippines
pl => Poland
pt => Portugal
pr => Puerto Rico
qa => Qatar
cg => Republic of the Congo
ro => Romania
ru => Russia
rw => Rwanda
sm => San Marino
sa => Saudi Arabia
sn => Senegal
rs => Serbia
sl => Sierra Leone
sg => Singapore
sk => Slovakia
si => Slovenia
so => Somalia
kr => South Korea
es => Spain
lk => Sri Lanka
sd => Sudan
se => Sweden
ch => Switzerland
sy => Syria
tw => Taiwan
tj => Tajikistan
tz => Tanzania
th => Thailand
tn => Tunisia
tr => Turkey
vi => U.S. Virgin Islands
ug => Uganda
ua => Ukraine
ae => United Arab Emirates
gb => United Kingdom
us => United States
uy => Uruguay
uz => Uzbekistan
va => Vatican
ve => Venezuela
vn => Vietnam
ye => Yemen
zm => Zambia
Owner
Zhijun
Zhijun
a small library for extracting rich content from urls

A small library for extracting rich content from urls. what does it do? micawber supplies a few methods for retrieving rich metadata about a variety o

Charles Leifer 588 Dec 27, 2022
A Python package that scrapes Google News article data while remaining undetected by Google.

A Python package that scrapes Google News article data while remaining undetected by Google. Our scraper can scrape page data up until the last page and never trigger a CAPTCHA (download stats: https

Geminid Systems, Inc 6 Aug 10, 2022
Scrapes the Sun Life of Canada Philippines web site for historical prices of their investment funds and then saves them as CSV files.

slocpi-scraper Sun Life of Canada Philippines Inc Investment Funds Scraper Install dependencies pip install -r requirements.txt Usage General format:

Daryl Yu 2 Jan 07, 2022
Facebook Group Scraping Using Beautiful Soup & Selenium

Extract Facebook group posts that are related to a specific topic and write them to a .json file.

Fatima Ghadieh 14 Aug 12, 2022
This program will help you to properly scrape all data from a specific website

This program will help you to properly scrape all data from a specific website

MD. MINHAZ 0 May 15, 2022
Scrape puzzle scrambles from csTimer.net

Scroodle Selenium script to scrape scrambles from csTimer.net csTimer runs locally in your browser, so this doesn't strain the servers any more than i

Jason Nguyen 1 Oct 29, 2021
Automatically scrapes all menu items from the Taco Bell website

Automatically scrapes all menu items from the Taco Bell website. Returns as PANDAS dataframe.

Sasha 2 Jan 15, 2022
A Powerful Spider(Web Crawler) System in Python.

pyspider A Powerful Spider(Web Crawler) System in Python. Write script in Python Powerful WebUI with script editor, task monitor, project manager and

Roy Binux 15.7k Jan 04, 2023
A Smart, Automatic, Fast and Lightweight Web Scraper for Python

AutoScraper: A Smart, Automatic, Fast and Lightweight Web Scraper for Python This project is made for automatic web scraping to make scraping easy. It

Mika 4.8k Jan 04, 2023
哔哩哔哩爬取器:以个人为中心

Open Bilibili Crawer 哔哩哔哩是一个信息非常丰富的社交平台,我们基于此构造社交网络。在该网络中,节点包括用户(up主),以及视频、专栏等创作产物;关系包括:用户之间,包括关注关系(following/follower),回复关系(评论区),转发关系(对视频or动态转发);用户对创

Boshen Shi 3 Oct 21, 2021
Scrapy-soccer-games - Scraping information about soccer games from a few websites

scrapy-soccer-games Esse projeto tem por finalidade pegar informação de tabela d

Caio Alves 2 Jul 20, 2022
Deep Web Miner Python | Spyder Crawler

Webcrawler written in Python. This crawler does dig in till the 3 level of inside addressed and mine the respective data accordingly

Karan Arora 17 Jan 24, 2022
Introduction to WebScraping Workshop - Semcomp 24 Beta

Extrair informações da internet de forma automatizada. Existem diversas maneiras de fazer isso, nesse tutorial vamos ver algumas delas, por meio de bibliotecas de python.

Luísa Moura 19 Sep 11, 2022
Github scraper app is used to scrape data for a specific user profile created using streamlit and BeautifulSoup python packages

Github Scraper Github scraper app is used to scrape data for a specific user profile. Github scraper app gets a github profile name and check whether

Siva Prakash 6 Apr 05, 2022
A Telegram crawler to search groups and channels automatically and collect any type of data from them.

Introduction This is a crawler I wrote in Python using the APIs of Telethon months ago. This tool was not intended to be publicly available for a numb

39 Dec 28, 2022
Scraping web pages to get data

Scraping Data Get public data and save in database This is project use Python How to run a project 1 - Clone the repository 2 - Install beautifulsoup4

Soccer Project 2 Nov 01, 2021
Newsscraper - A simple Python 3 module to get crypto or news articles and their content from various RSS feeds.

NewsScraper A simple Python 3 module to get crypto or news articles and their content from various RSS feeds. 🔧 Installation Clone the repo locally.

Rokas 3 Jan 02, 2022
Open Crawl Vietnamese Text

Open Crawl Vietnamese Text This repo contains crawled Vietnamese text from multiple sources. This list of a topic-centric public data sources in high

QAI Research 4 Jan 05, 2022
Linkedin webscraping - Linkedin web scraping with python

linkedin_webscraping This is the first step of a full project called "LinkedIn J

Pedro Dib 4 Apr 24, 2022
feapder 是一款简单、快速、轻量级的爬虫框架。以开发快速、抓取快速、使用简单、功能强大为宗旨。支持分布式爬虫、批次爬虫、多模板爬虫,以及完善的爬虫报警机制。

feapder 是一款简单、快速、轻量级的爬虫框架。起名源于 fast、easy、air、pro、spider的缩写,以开发快速、抓取快速、使用简单、功能强大为宗旨,历时4年倾心打造。支持轻量爬虫、分布式爬虫、批次爬虫、爬虫集成,以及完善的爬虫报警机制。 之

boris 1.4k Dec 29, 2022