1. Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
   In: https://hal.inria.fr/hal-03550289 (2022)
   BASE

3. Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
   In: https://hal.inria.fr/hal-03177623 (2021)

4. The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
   In: Proceedings of the 1st Workshop on Natural Language Generation, Evaluation, and Metrics (GEM 2021), Aug 2021, Online, France, pp. 96-120, ⟨10.18653/v1/2021.gem-1.10⟩ ; https://hal.archives-ouvertes.fr/hal-03466171 (2021)

8. Transformers: State-of-the-Art Natural Language Processing ...

10. Learning Representations of Text through Language and Discourse Modeling: From Characters to Sentences

11. A Fast Variational Approach for Learning Markov Random Field Language Models
    In: DTIC (2015)