3 |
TaPaCo: A Corpus of Sentential Paraphrases for 73 Languages ...
4 |
TaPaCo: A Corpus of Sentential Paraphrases for 73 Languages ...
5 |
LSDC - A Comprehensive Dataset for Low Saxon Dialect Classification ...
6 |
An Evaluation Benchmark for Testing the Word Sense Disambiguation Capabilities of Machine Translation Systems ...
7 |
An Evaluation Benchmark for Testing the Word Sense Disambiguation Capabilities of Machine Translation Systems ...
8 |
An Evaluation Benchmark for Testing the Word Sense Disambiguation Capabilities of Machine Translation Systems ...
12 |
A quantitative approach to Swiss German - Dialectometric analyses and comparisons of linguistic levels
14 |
ArchiMob: Ein multidialektales Korpus schweizerdeutscher Spontansprache
In: Linguistik Online; Vol. 98 No. 5 (2019): Alemannische Dialektologie – Forschungsstand und Perspektiven. Sonderheft; 425-454 ; 1615-3014 (2019)
15 |
Donnez votre Français à la Science ! Internet et la documentation de la diversité linguistique : présentation de la plateforme et premiers résultats
In: 6e Congrès Mondial de Linguistique Française ; https://hal.archives-ouvertes.fr/hal-02271315 ; 6e Congrès Mondial de Linguistique Française, Jul 2018, Mons, Belgium. ⟨10.1051/shsconf/20184602003⟩ (2018)
16 |
The WMT'18 Morpheval test suites for English-Czech, English-German, English-Finnish and Turkish-English
In: Proceedings of the Third Conference on Machine Translation ; 3rd Conference on Machine Translation (WMT 18) ; https://hal.archives-ouvertes.fr/hal-01910244 ; 3rd Conference on Machine Translation (WMT 18), Oct 2018, Bruxelles, Belgium. pp.550-564, ⟨10.18653/v1/W18-64060⟩ ; http://www.statmt.org/wmt18/ (2018)
17 |
Crowdsourcing Regional Variation Data and Automatic Geolocalisation of Speakers of European French
In: International Conference on Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-02271314 ; International Conference on Language Resources and Evaluation, European Language Resources Association (ELRA), May 2018, Miyazaki, Japan (2018)
Abstract:
We present the crowdsourcing platform Donnez Votre Français à la Science (DFS, or “Give your French to Science”), which aims to collect linguistic data and document language use, with a special focus on regional variation in European French. The activities not only gather data that is useful for scientific studies, but also provide feedback to the general public; this is important in order to reward participants, to encourage them to follow future surveys, and to foster interaction with the scientific community. The two main activities described here are 1) a linguistic survey on lexical variation with immediate feedback and 2) a speaker geolocalisation system, i.e., a quiz that guesses the linguistic origin of the participant by comparing their answers with previously gathered linguistic data. For the geolocalisation activity, we set up a simulation framework to optimise predictions. Three classification algorithms are compared: the first one uses clustering and shibboleth detection, whereas the other two rely on feature elimination techniques with Support Vector Machines and Maximum Entropy models as underlying base classifiers. The best-performing system uses a selection of 17 questions and reaches a localisation accuracy of 66%, extending the prediction from the one-best area (one among 109 base areas) to its first-order and second-order neighbouring areas.
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; linguistique
URL: https://hal.archives-ouvertes.fr/hal-02271314
18 |
Crowdsourcing regional variables and automatic geolocalisation of speakers of European French
In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) ; https://hal.archives-ouvertes.fr/hal-02498762 ; Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), 2018, Miyazaki, Japan ; https://www.aclweb.org/anthology/L18-1527/ (2018)
19 |
Measuring Semantic Abstraction of Multilingual NMT with Paraphrase Recognition and Generation Tasks ...