Speech Rankings

This project mimics CSRankings: it generates an ordered list of researchers in speech/spoken language processing, along with their possible research topics, based on recent publications at important venues in the field. It is meant to help students looking for PhD positions find suitable advisors.

How to use

The pre-generated report is available here. To build it yourself:

Run prepare_data.py to build publications.json and authors.json, or simply use the provided data, which covers publications from 2011 to 2021.
Run export.py to generate the report.

How does it work

We scrape author metadata and publication data from DBLP for the following three types of venues:

Speech venues: Interspeech, Speech Communication, SLT, SSW, ASRU, IWSLT
Mixed venues: ICASSP, TASLP
General venues: NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL, KDD, AAAI, IJCAI

All publications in Speech venues are included. For Interspeech in particular, the section/field of each paper is collected from the ISCA Archive to indicate the possible research topics of each researcher. Keywords from IEEE Xplore serve the same purpose for papers published at IEEE-held venues. Keywords (as well as titles) are also used by a set of rules to filter out non-speech papers in Mixed venues, while titles are used to identify speech papers in General venues. Researchers are sorted by their total number of publications.
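The filtering and sorting described above can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the record fields (`venue`, `title`, `authors`), the `SPEECH_KEYWORDS` list, and the helper names are all assumptions made for the example, and the real rule set (which also uses IEEE Xplore keywords for Mixed venues) is likely more elaborate.

```python
from collections import Counter

# Hypothetical keyword list; the project's real filtering rules are not shown here.
SPEECH_KEYWORDS = ("speech", "speaker", "spoken", "voice")


def is_speech_title(title):
    """Return True if the title contains any of the assumed speech keywords."""
    lowered = title.lower()
    return any(keyword in lowered for keyword in SPEECH_KEYWORDS)


def rank_authors(publications, speech_venues):
    """Count publications per author and sort by count, highest first.

    Papers from speech venues are always counted; papers from any other
    venue are counted only if their title looks speech-related.
    """
    counts = Counter()
    for pub in publications:
        if pub["venue"] in speech_venues or is_speech_title(pub["title"]):
            counts.update(pub["authors"])
    return counts.most_common()


# Toy records with an assumed schema (not the real publications.json layout).
publications = [
    {"venue": "Interspeech", "title": "A study of prosody", "authors": ["A", "B"]},
    {"venue": "ICML", "title": "Self-supervised speech representations", "authors": ["A"]},
    {"venue": "ICML", "title": "Graph neural networks", "authors": ["C"]},
]
ranking = rank_authors(publications, {"Interspeech"})
```

In this toy input, the third paper is dropped because it comes from a general venue and its title matches none of the assumed keywords, so author C does not appear in the ranking.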

The collected data contain errors, and the project is intended neither to index speech-related papers nor to compare researchers in the field.

Owner

Mutian He
