1 |
ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities
|
|
|
|
In: ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’22) ; https://hal-universite-paris-saclay.archives-ouvertes.fr/hal-03650618 ; 2022 (2022)
|
|
BASE
|
|
Show details
|
|
2 |
RETRIEVING SPEAKER INFORMATION FROM PERSONALIZED ACOUSTIC MODELS FOR SPEECH RECOGNITION
|
|
|
|
In: IEEE ICASSP 2022 ; https://hal.archives-ouvertes.fr/hal-03539741 ; IEEE ICASSP 2022, 2022, Singapour, Singapore (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Towards combined semantic and lexical scores based on a new representation of textual data to extract experimental data from scientific publications
|
|
|
|
In: ISSN: 1751-5858 ; EISSN: 1751-5866 ; International Journal of Intelligent Information and Database Systems ; https://hal.inrae.fr/hal-03616243 ; International Journal of Intelligent Information and Database Systems, Inderscience, 2022, 15 (1), pp.78. ⟨10.1504/IJIIDS.2022.120146⟩ (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Obvie: interface web pour la fouille et la comparaison de textes
|
|
|
|
In: Atelier DigitAl Humanities and cuLtural herItAge: data and knowledge management and analysis durant la conférence francophone sur l'Extraction et la Gestion des Connaissances (egc2022) ; https://hal.archives-ouvertes.fr/hal-03543362 ; Atelier DigitAl Humanities and cuLtural herItAge: data and knowledge management and analysis durant la conférence francophone sur l'Extraction et la Gestion des Connaissances (egc2022), Jan 2022, Blois, France ; https://egc2022.univ-tours.fr/ateliers/ (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Preprint Citation Praxis in PLOS
|
|
|
|
In: ISSN: 0138-9130 ; EISSN: 1588-2861 ; Scientometrics ; https://hal.archives-ouvertes.fr/hal-03506094 ; In press (2022)
|
|
BASE
|
|
Show details
|
|
6 |
Automatic generation of the complete vocal tract shape from the sequence of phonemes to be articulated
|
|
|
|
In: ISSN: 0167-6393 ; EISSN: 1872-7182 ; Speech Communication ; https://hal.univ-lorraine.fr/hal-03650212 ; Speech Communication, Elsevier : North-Holland, 2022, ⟨10.1016/j.specom.2022.04.004⟩ (2022)
|
|
BASE
|
|
Show details
|
|
7 |
Qui parle du climat en France ? Ce que nous apprennent les réseaux sociaux
|
|
|
|
In: ISSN: 2431-2134 ; The Conversation ; https://halshs.archives-ouvertes.fr/halshs-03629060 ; The Conversation, The Conversation France, 2022 ; https://theconversation.com/fr (2022)
|
|
BASE
|
|
Show details
|
|
8 |
Le Post-it, le président, les mèmes et nous
|
|
|
|
In: ISSN: 2431-2134 ; The Conversation ; https://halshs.archives-ouvertes.fr/halshs-03563945 ; The Conversation, The Conversation France, 2022 (2022)
|
|
BASE
|
|
Show details
|
|
9 |
À quoi reconnait-on un discours totalitaire ou fasciste ?
|
|
|
|
In: ISSN: 2431-2134 ; The Conversation ; https://hal.univ-rennes2.fr/hal-03617854 ; The Conversation, The Conversation France, 2022 (2022)
|
|
BASE
|
|
Show details
|
|
10 |
Corpus d’enquêtes sur les pratiques d’information scientifique des chercheurs. Constitution et exploitation des données
|
|
|
|
In: EISSN: 2263-0856 ; Revue française des sciences de l'information et de la communication ; https://hal.archives-ouvertes.fr/hal-03618819 ; Revue française des sciences de l'information et de la communication, Société Française des Sciences de l'Information et de la Communication, 2022, Data Paper : émergence d’une nouvelle donne scientifique (2022)
|
|
BASE
|
|
Show details
|
|
11 |
Satisfaction can co-exist with hesitation: qualitative analysis of acceptability of telemedicine among multi-lingual patients in a safety-net healthcare system during the COVID-19 pandemic.
|
|
|
|
In: BMC health services research, vol 22, iss 1 (2022)
|
|
BASE
|
|
Show details
|
|
12 |
Islands and Bridges of Language: Bio-Inspired Structural Analysis of Language Embedding Data
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Telemedicine implementation and use in community health centers during COVID-19: Clinic personnel and patient perspectives.
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Les Langages de Pao de Jack Vance et le pouvoir linguistique
|
|
|
|
In: Dixième colloque Stella Incognita : pouvoir(s), responsabilités et cas de conscience en science-fiction ; https://halshs.archives-ouvertes.fr/halshs-03636499 ; Dixième colloque Stella Incognita : pouvoir(s), responsabilités et cas de conscience en science-fiction, Université de Reims; Association Stella Incognita, Apr 2022, Reims, France (2022)
|
|
BASE
|
|
Show details
|
|
15 |
Assessing the impact of OCR noise on multilingual event detection over digitised documents
|
|
|
|
In: ISSN: 1432-5012 ; EISSN: 1432-1300 ; International Journal on Digital Libraries ; https://hal.archives-ouvertes.fr/hal-03635985 ; International Journal on Digital Libraries, Springer Verlag, 2022, ⟨10.1007/s00799-022-00325-2⟩ (2022)
|
|
BASE
|
|
Show details
|
|
16 |
Introducing the HIPE 2022 Shared Task: Named Entity Recognition and Linking in Multilingual Historical Documents
|
|
|
|
In: Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II ; https://hal.archives-ouvertes.fr/hal-03635971 ; Matthias Hagen; Suzan Verberne; Craig Macdonald; Christin Seifert; Krisztian Balog; Kjetil Nørvåg; Vinay Setty. Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II, 13186, Springer International Publishing, pp.347-354, 2022, Lecture Notes in Computer Science, 978-3-030-99738-0. ⟨10.1007/978-3-030-99739-7_44⟩ (2022)
|
|
BASE
|
|
Show details
|
|
17 |
Can Character-based Language Models Improve Downstream Task Performance in Low-Resource and Noisy Language Scenarios?
|
|
|
|
In: Seventh Workshop on Noisy User-generated Text (W-NUT 2021, colocated with EMNLP 2021) ; https://hal.inria.fr/hal-03527328 ; Seventh Workshop on Noisy User-generated Text (W-NUT 2021, colocated with EMNLP 2021), Jan 2022, punta cana, Dominican Republic ; https://aclanthology.org/2021.wnut-1.47/ (2022)
|
|
BASE
|
|
Show details
|
|
18 |
Cross-lingual few-shot hate speech and offensive language detection using meta learning
|
|
|
|
In: ISSN: 2169-3536 ; EISSN: 2169-3536 ; IEEE Access ; https://hal.archives-ouvertes.fr/hal-03559484 ; IEEE Access, IEEE, 2022, 10, pp.14880-14896. ⟨10.1109/ACCESS.2022.3147588⟩ (2022)
|
|
Abstract:
International audience ; Automatic detection of abusive online content such as hate speech, offensive language, threats, etc. has become prevalent in social media, with multiple efforts dedicated to detecting this phenomenon in English. However, detecting hatred and abuse in low-resource languages is a non-trivial challenge. The lack of sufficient labeled data in low-resource languages and inconsistent generalization ability of transformer-based multilingual pre-trained language models for typologically diverse languages make these models inefficient in some cases. We propose a meta learning-based approach to study the problem of few-shot hate speech and offensive language detection in low-resource languages that will allow hateful or offensive content to be predicted by only observing a few labeled data items in a specific target language. We investigate the feasibility of applying a meta learning approach in cross-lingual few-shot hate speech detection by leveraging two meta learning models based on optimization-based and metric-based (MAML and Proto-MAML) methods. To the best of our knowledge, this is the first effort of this kind. To evaluate the performance of our approach, we consider hate speech and offensive language detection as two separate tasks and make two diverse collections of different publicly available datasets comprising 15 datasets across 8 languages for hate speech and 6 datasets across 6 languages for offensive language. Our experiments show that meta learning-based models outperform transfer learning-based models in a majority of cases, and that Proto-MAML is the best performing model, as it can quickly generalize and adapt to new languages with only a few labeled data points (generally, 16 samples per class yields an effective performance) to identify hateful or offensive content.
|
|
Keyword:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-NI]Computer Science [cs]/Networking and Internet Architecture [cs.NI]; [INFO.INFO-SI]Computer Science [cs]/Social and Information Networks [cs.SI]; Cross-lingual classification; Few-shot learning; Hate speech; Meta learning; Offensive language; Transfer learning; XLMRoBERTa
|
|
URL: https://doi.org/10.1109/ACCESS.2022.3147588 https://hal.archives-ouvertes.fr/hal-03559484
|
|
BASE
|
|
Hide details
|
|
19 |
Computational Measures of Deceptive Language: Prospects and Issues
|
|
|
|
In: ISSN: 2297-900X ; EISSN: 2297-900X ; Frontiers in Communication ; https://hal.archives-ouvertes.fr/hal-03629780 ; Frontiers in Communication, Frontiers, 2022, 7, pp.792378. ⟨10.3389/fcomm.2022.792378⟩ (2022)
|
|
BASE
|
|
Show details
|
|
20 |
‘I will never concede’: Donald Trump’s discourse of denial on Twitter (Nov. 4th 2020 – Jan. 8)
|
|
|
|
In: ISSN: 1278-3331 ; EISSN: 2427-0466 ; Anglophonia / Caliban - French Journal of English Linguistics ; https://hal.archives-ouvertes.fr/hal-03606771 ; Anglophonia / Caliban - French Journal of English Linguistics, Presses universitaires du Midi, 2022, A. Culioli’s Contribution to English Linguistics:1960-2022, ⟨10.4000/anglophonia.4613⟩ (2022)
|
|
BASE
|
|
Show details
|
|
|
|