Natural Language Processing Tasks and Examples
With the advancement of A.I. technology in recent years, natural language processing technology has been able to solve so many problems. While working as an NLP engineer, I encountered various tasks, and I thought it would be nice to gather and organize the natural language processing tasks I have dealt with in one place. Borrowing Kyubyong's project format, I organized natural language processing tasks with references and example code.
Automated Essay Scoring
WIKI
Automated Essay ScoringDATA
The Hewlett Foundation: Automated Essay ScoringMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's AES
Automatic Speech Recognition
WIKI
Speech RecognitionDATA
LibriSpeechDATA
AISHELL-1DATA
KsponSpeechMODEL
Deep Speech2MODEL
Listen, Attend and SpellMODEL
Wav2vec 2.0OFF-THE-SHELF
Pororo's ASRCODE
Example with KsponSpeech
Dialogue Generation
WIKI
Dialogue SystemDATA
Persona ChatDATA
Korean SNS CorpusMODEL
Dialogue GPTCODE
Example with Korean SNS Corpus
Dialogue Retrieval
WIKI
Dialogue SystemDATA
Persona ChatDATA
Korean SNS CorpusMODEL
Poly EncoderCODE
Example with Korean SNS Corpus
Fill in the Blank
WIKI
Cloze TestINFO
Masked-Language-Modeling with BERTMODEL
BERTMODEL
RoBERTaOFF-THE-SHELF
Pororo's Fill in the BlankCODE
Example with WikiCorpus
Grammatical Error Correction
WIKI
AutocorrectionDATA
NUS Non-commercial research/trial corpus licenseDATA
Cornell Movie--Dialogs CorpusOFF-THE-SHELF
Pororo's GEC
Grapheme To Phoneme
WIKI
GraphemeWIKI
PhonemeREPRESENTATIVE-DATA
Multilingual Pronunciation DataOFF-THE-SHELF-MODEL
Pororo's G2P
Language Modeling
WIKI
Language ModelINFO
A beginner’s guide to language modelsMODEL
GPT3MODEL
GPT2MODEL
Ken-LMMODEL
RNN-LMCODE
Example with OpenWebText
Machine Reading Comprehension
WIKI
Reading ComprehensionINFO
Machine Reading Comprehension with BERTDATA
SQuADDATA
KorQuadMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's MRCCODE
Example with SQuAD & KorQuad
Machine Translation
WIKI
TranslationDATA
WMT 2014 English-to-FrenchDATA
Korean-English translation corpusMODEL
TransformerOFF-THE-SHELF
Pororo's TranslationCODE
Example with Korean-English translation corpus
Math Word Problem Solving
PAPER-WITH-CODE
Math Word Problem SolvingDATA
DeepMind Mathmatics Dataset
Natural Language Inference
WIKI
Textual EntailmentDATA
GLUE-MNLIDATA
KorNLIMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's NLICODE
Example with GLUE-MNLI
Named Entity Recognition
WIKI
Named Entity RecognitionDATA
CoNLL-2002 NER corpusDATA
CoNLL-2003 NER corpusDATA
Naver NERMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's NERCODE
Example with Naver NER
Paraphrase Generation
WIKI
ParaphraseOFF-THE-SHELF
Pororo's Paraphrase Generation
Phoneme To Grapheme
OFF-THE-SHELF
Pororo's P2G
Sentiment Analysis
WIKI
Sentiment AnalysisDATA
GLUE-SSTDATA
NSMCMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's Sentiment AnalysisCODE
Example with NSMC
Semantic Textual Similarity
WIKI
Semantic SimilarityDATA
GLUE-STSDATA
KorSTSMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's STSCODE
Example with SQuAD
Speech Synthesis
WIKI
Speech SynthesisDATA
LJ SpeechDATA
CSS10DATA
KSSMODEL
Tacotron2MODEL
FastSpeech2MODEL
WaveNetMODEL
Hifi-GANOFF-THE-SHELF
Pororo's TTSCODE
Example with LJ-SpeechCODE
Example with KSS
Summarization
WIKI
Automatic SummarizationDATA
XSumDATA
Korean Summarization CorpusMODEL
BARTOFF-THE-SHELF
Pororo's SummarizationCODE
Example with XSum
Author
- Soohwan Kim @sooftware
- Contacts: [email protected]