Here are
144 public repositories
matching this topic...
curated collection of papers for the nlp practitioner ???????
multi_task_NLP is a utility toolkit enabling NLP developers to easily train and infer a single model for multiple tasks.
-
Updated
Nov 21, 2022
-
Python
Открытые лингвистические датасеты: тональный словарь русского языка КартаСловСент, датасет по семантике, ассоциативный граф и датасет по орфографическим ошибкам и опечаткам.
Chinese, English NER, English-Chinese machine translation dataset. 中英文?????据集,中英文机器???据集, 中文分??据集
-
Updated
Feb 3, 2021
-
Python
chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大?有4675本科幻小?,中文科幻小?自然?言?理?料?,中文科幻小?文本?料?,中文科幻小?文本?据?,科幻小??料
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language
-
Updated
Feb 11, 2024
-
Macaulay2
TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition (ACL 2020)
-
Updated
Jun 15, 2022
-
Python
Implementation of Very Deep Convolutional Neural Network for Text Classification
-
Updated
Jun 28, 2022
-
Python
a Fine-tuned LLaMA that is Good at Arithmetic Tasks
-
Updated
Sep 15, 2023
-
Jupyter Notebook
A Constrained Text Generation Challenge Towards Generative Commonsense Reasoning
-
Updated
Jan 5, 2024
-
Python
手工整理??行???、??等?料。可用于?音??、??系?等各?nlp模型??。
chinese NLP corpus of chinese science fiction, chinese science fiction corpus: Archive of the Ark Plan of Ula Science Fiction Website ?拉科幻小??方舟??存?,中文科幻小?自然?言?理?料?,中文科幻小?文本?料?,中文科幻小?文本?据?,科幻小??料
?字?据集,包括?字的相?信息,例如???、部首、?音、英文??/同??等。
Code and data for "Summarising Historical Text in Modern Languages" (EACL 2021)
-
Updated
Apr 22, 2021
-
Jupyter Notebook
The release of the FreebaseQA data set (NAACL 2019).
Yoruba language training text for NLP, ASR and TTS tasks
-
Updated
Mar 3, 2023
-
Python
Extracts Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpus
-
Updated
Dec 4, 2019
-
Python
A collection of datasets for Ukrainian language
-
Updated
Nov 25, 2023
-
Python
WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000+ "why" question-answer-rationale triplets.
-
Updated
Dec 7, 2023
-
Python
Improve this page
Add a description, image, and links to the
nlp-datasets
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
nlp-datasets
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.