DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6 7 8 9 10...32.432
Hits 101 – 120 of 648.621

101
Linguistic resources for paraphrase generation in Portuguese: a Lexicon-Grammar approach
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-03548861 ; Language Resources and Evaluation, Springer Verlag, 2022, ⟨10.1007/s10579-021-09561-5⟩ ; https://link.springer.com/article/10.1007/s10579-021-09561-5 (2022)
BASE
Show details
102
Caveats of Measuring Semantic Change of Cognates and Borrowings using Multilingual Word Embeddings
In: LChange'22 - 3rd International Workshop on Computational Approaches to Historical Language Change 2022 ; https://hal.inria.fr/hal-03635005 ; LChange'22 - 3rd International Workshop on Computational Approaches to Historical Language Change 2022, May 2022, Dublin, Ireland (2022)
BASE
Show details
103
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
In: https://hal.inria.fr/hal-03536361 ; 2022 (2022)
Abstract: 12 pages, 6 figures, 2 tables ; The need for raw large raw corpora has dramatically increased in recent years with the introduction of transfer learning and semi-supervised learning methods to Natural Language Processing. And while there have been some recent attempts to manually curate the amount of data necessary to train large language models, the main way to obtain this data is still through automatic web crawling. In this paper we take the existing multilingual web corpus OSCAR and its pipeline Ungoliant that extracts and classifies data from Common Crawl at the line level, and propose a set of improvements and automatic annotations in order to produce a new document-oriented version of OSCAR that could prove more suitable to pre-train large generative language models as well as hopefully other applications in Natural Language Processing and Digital Humanities.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Common Crawl; Language Modeling; Web corpus
URL: https://hal.inria.fr/hal-03536361
BASE
Hide details
104
DeepL et Google Translate face à l'ambiguïté phraséologique
In: https://hal.archives-ouvertes.fr/hal-03583995 ; 2022 (2022)
BASE
Show details
105
Preprint Citation Praxis in PLOS
In: ISSN: 0138-9130 ; EISSN: 1588-2861 ; Scientometrics ; https://hal.archives-ouvertes.fr/hal-03506094 ; In press (2022)
BASE
Show details
106
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
BASE
Show details
107
Mapping of Language-and-Memory Networks in Patients With Temporal Lobe Epilepsy by Using the GE2REC Protocol
In: ISSN: 1662-5161 ; Frontiers in Human Neuroscience ; https://hal-cnrs.archives-ouvertes.fr/hal-03529823 ; Frontiers in Human Neuroscience, Frontiers, 2022, 15, ⟨10.3389/fnhum.2021.752138⟩ (2022)
BASE
Show details
108
Wh-interrogatives in ancient Greek ; Wh-interrogatives in ancient Greek: Disentangling focus- and wh-movement
In: ISSN: 0039-3193 ; EISSN: 1467-9582 ; Studia Linguistica ; https://hal.archives-ouvertes.fr/hal-03579191 ; Studia Linguistica, Wiley-Blackwell, In press (2022)
BASE
Show details
109
Morphology in the Corsican Language Database (BDLC) : assessment and perspectives ; La morphologie dans la Banque de Données Langue Corse : bilan et perspectives
In: ISSN: 1638-9808 ; EISSN: 1765-3126 ; Corpus ; https://hal.archives-ouvertes.fr/hal-03591866 ; Corpus, Bases, Corpus, Langage - UMR 7320, 2022, Corpus et données en morpholgie, ⟨10.4000/corpus.7115⟩ ; https://journals.openedition.org/corpus/7115 (2022)
BASE
Show details
110
Starting a new treebank? Go SUD! Theoretical and practical benefits of the Surface-Syntactic distributional approach
In: Sixth International Conference on Dependency Linguistics (Depling, SyntaxFest 2021) ; https://hal.inria.fr/hal-03509136 ; Sixth International Conference on Dependency Linguistics (Depling, SyntaxFest 2021), Mar 2022, Sofia, Bulgaria (2022)
BASE
Show details
111
L’enseignement/apprentissage du romani en Serbie : entre micro-actes glottopolitiques et reconfiguration des politiques officielles à l’école
In: EISSN: 1769-7425 ; Glottopol ; https://hal.archives-ouvertes.fr/hal-03603324 ; Glottopol, Université de Rouen, Laboratoire Dylis, 2022, Glottopolitiques engagées et solidaires : contextes, idéologies, histoire ; http://glottopol.univ-rouen.fr/ (2022)
BASE
Show details
112
Rapport et Bilan Scientifique - 2e Symposium sur la Politique Linguistique Familiale 2021
In: https://hal-inalco.archives-ouvertes.fr/hal-03525635 ; [Rapport de recherche] INALCO, Sorbonne Paris-Cité (SPC). 2022 (2022)
BASE
Show details
113
One model for the learning of language.
In: Proceedings of the National Academy of Sciences of the United States of America, vol 119, iss 5 (2022)
BASE
Show details
114
The Open Handbook of Linguistic Data Management
Berez-Kroeker, Andrea L; McDonnell, Bradley; Koller, Eve. - : eScholarship, University of California, 2022
BASE
Show details
115
The puzzling nuanced status of who free relative clauses in English: a follow-up to Patterson and Caponigro (2015)
In: ENGLISH LANGUAGE & LINGUISTICS, vol 26, iss 1 (2022)
BASE
Show details
116
Family history of FXTAS is associated with age-related cognitive-linguistic decline among mothers with the FMR1 premutation.
In: Journal of neurodevelopmental disorders, vol 14, iss 1 (2022)
BASE
Show details
117
For Critical Language Awareness and Against the “Exclusive-use-of-the-target-language” Myth: The Effects of Sociolinguistic Content in English in an Elementary Spanish Classroom
In: L2 Journal, vol 14, iss 3 (2022)
BASE
Show details
118
Providing a parent-administered outcome measure in a bilingual family of a father and a mother of two adolescents with ASD: brief report.
In: Developmental neurorehabilitation, vol 25, iss 2 (2022)
BASE
Show details
119
Shakespeare at the Crossroads of Race, Language, and American Empire
Richter, Stephen. - : eScholarship, University of California, 2022
BASE
Show details
120
Language Contact
In: UCLA Encyclopedia of Egyptology, vol 1, iss 1 (2022)
BASE
Show details

Page: 1 2 3 4 5 6 7 8 9 10...32.432

Catalogues
32.469
5.063
21.069
0
503
2.135
2.883
Bibliographies
157.602
1.273
4.545
0
56
31
225
8.219
2.185
Linked Open Data catalogues
0
Online resources
5.182
1.448
380
135
Open access documents
453.747
729
0
1
1
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern