1 |
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
|
|
|
|
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
|
|
BASE
|
|
Show details
|
|
2 |
Didactique de l’oral et interactions : rétrospective méthodologie et expérimentation en contexte algérien
|
|
|
|
In: ISSN: 0077-2712 ; EISSN: 1952-4250 ; Mélanges CRAPEL ; https://hal.archives-ouvertes.fr/hal-03630135 ; Mélanges CRAPEL, Centre de recherches et d'applications pédagogiques en langues, 2022, Enseignement du français parlé aujourd’hui : Recherches et expériences de terrain, Mélanges CRAPEL (43/1) ; https://www.atilf.fr/wp-content/uploads/publications/MelangesCrapel/Melanges_43_1_6_Cortier_et_al.pdf (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Language identification, a tool for Corsican and for the evaluation of linguistic resources ; L'identification de langue, un outil au service du corse et de l'évaluation des ressources linguistiques
|
|
|
|
In: Traitement Automatique des Langues ; https://hal.archives-ouvertes.fr/hal-03633290 ; Traitement Automatique des Langues, 2022, Diversité Linguistique, 62 (3), pp.13-37 ; https://www.atala.org/content/diversité-linguistique-linguistic-diversity-natural-language-processing (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Machine Translation and Gender biases in video game localisation: a corpus-based analysis
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03540605 ; 2022 (2022)
|
|
Abstract:
The video game industry has been a historically gender-biased terrain due to a higher number of male protagonists and hypersexualised representations [Dietz, 1998; Downs & Smith, 2010; Lynch et al., 2016]. Nowadays, echoing the debate on inclusive language, companies attempt to erase gender disparity by introducing main female characters as well as non-binary characters. From a technological point of view, even though recent studies show that Machine Translation remains largely unadopted by individual video game localisers [Rivas Ginel, 2021], multilanguage vendors are willing to invest in these tools to reduce costs [LIND, 2020]. However, the predominance of the masculine in Natural Language Processing and Machine Learning has created allocation and representation biases in Neural Machine Translation [Crawford, 2017].This paper aims to analyse the percentage of gender bias resulting from the use of Google Translate, DeepL, and SmartCat when translating in-game raw content from English into French. The games DeltaRune, The Devil's Womb and The Faces of the Forest were chosen due to the presence of non-binary characters, non-sexualized characters, and female protagonists. We compared the results in order to recount and analyse the differences between these tools' output when in terms of errors related to gender. To this end, we created a parallel corpus to compare source documents and all the translations to visualise the semantic and grammatical directions of the words embeddings [Zhou; Shi; Zhao and al, 2019] and extracted the collocations and concordance lines that represented gender identity by analysing the patterns in the source language.
|
|
Keyword:
[SHS.LANGUE]Humanities and Social Sciences/Linguistics; corpora; Gender biases; Machine Translation; Neural Machine Translation NMT; Video game Localisation
|
|
URL: https://hal.archives-ouvertes.fr/hal-03540605/file/Machine%20Translation%20and%20Gender%20biases%20in%20video%20game%20localisation%20%281%29.pdf https://hal.archives-ouvertes.fr/hal-03540605 https://hal.archives-ouvertes.fr/hal-03540605/document
|
|
BASE
|
|
Hide details
|
|
8 |
Gendered body language in children’s literature over time ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Measuring Terminology Consistency in Translated Corpora: Implementation of the Herfindahl-Hirshman Index
|
|
|
|
In: Information; Volume 13; Issue 2; Pages: 43 (2022)
|
|
BASE
|
|
Show details
|
|
10 |
Frequency, Informativity and Word Length: Insights from Typologically Diverse Corpora
|
|
|
|
In: Entropy; Volume 24; Issue 2; Pages: 280 (2022)
|
|
BASE
|
|
Show details
|
|
11 |
Text+: Language- and text-based Research Data Infrastructure ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Text+: Language- and text-based Research Data Infrastructure ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Text+: Language- and text-based Research Data Infrastructure ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
DCT vs. corpus orales: reflexiones metodológicas sobre el estudio de los actos de habla ; DCT vs. spoken corpora: methodologic reflections on the study of speech acts
|
|
|
|
In: Pragmalingüística, (29), 377-395 (2022)
|
|
BASE
|
|
Show details
|
|
15 |
Source language difficulties in learner translation: Evidence from an error-annotated corpus
|
|
|
|
BASE
|
|
Show details
|
|
16 |
ANLIzing the Adversarial Natural Language Inference Dataset
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2022)
|
|
BASE
|
|
Show details
|
|
17 |
Chinese Idioms: Stepping Into L2 Student’s Shoes
|
|
|
|
In: Acta Linguistica Asiatica, Vol 12, Iss 1 (2022) (2022)
|
|
BASE
|
|
Show details
|
|
|
|