Speech Rankings
This project mimics CSRankings to generate an ordered list of researchers in speech/spoken language processing along with their possible research topics, based on recent publications on important venues of the field, so as to help students seeking for PhD studies to find desirable advisors.
How to use
The pre-generated report is available at here. To build it by yourself,
- Run
prepare_data.py
to buildpublications.json
andauthors.json
, or simply use the data provided, covering those from 2011 to 2021. - Run
export.py
to generate the report.
How does it work
We scrape author metadata and publication data of the following three types of venues from DBLP, including:
- Speech venues: Interspeech, Speech Communications, SLT, SSW, ASRU, IWSLT
- Mixed venues: ICASSP, TASLP
- General venues: NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL, KDD, AAAI, IJCAI
All publications in Speech venues are included. Paricularly for Interspeech, section/field of each paper are collected from ISCA Archive to show possible research topics of each researcher. So are the keywords from IEEE Xplore for papers published on IEEE-held venues. Keywords (as well as titles) are also used to filter out non-speech papers in Mixed venues by a set of rules. Titles are used to identify speech papers in General venues. Researchers are sorted by the total number of publications.
The collected data contain errors, and the project is neither intended to index speech-related papers nor to compare researchers in the field.