Catalogue search • Linguistik portal • Fachinformationsdienst (FID)

1	Cross-lingual and cross-domain evaluation of Machine Reading Comprehension with Squad and CALOR-Quest corpora
	Charlet, Delphine; Damnati, Géraldine; Bechet, Frédéric; Marzinotto, Gabriel; Heinecke, Johannes
	In: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), May 2020, Marseille, France, pp. 5491-5497 ; https://hal.archives-ouvertes.fr/hal-02973245 ; https://lrec2020.lrec-conf.org/en/ (2020)
	Abstract: Machine Reading has recently received a lot of attention thanks to both the availability of very large corpora such as SQuAD or MS MARCO containing triplets (document, question, answer), and the introduction of Transformer Language Models such as BERT, which obtain excellent results, even matching human performance according to the SQuAD leaderboard. One of the key features of Transformer Models is their ability to be jointly trained across multiple languages, using a shared subword vocabulary, leading to the construction of cross-lingual lexical representations. This feature has recently been used to perform zero-shot cross-lingual experiments in which a multilingual BERT model, fine-tuned on a machine reading comprehension task exclusively for English, was directly applied to Chinese and French documents with interesting performance. In this paper we study the cross-language and cross-domain capabilities of BERT on a Machine Reading Comprehension task on two corpora: SQuAD and a new French Machine Reading dataset called CALOR-QUEST. The semantic annotation available on CALOR-QUEST allows us to give a detailed analysis of the kinds of questions that are properly handled through the cross-language process.
	Keyword: [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; cross-lingual; FrameNet; Machine Reading Comprehension
	URL: https://hal.archives-ouvertes.fr/hal-02973245 https://hal.archives-ouvertes.fr/hal-02973245/document https://hal.archives-ouvertes.fr/hal-02973245/file/2020.lrec-1.674.pdf
	BASE

2	Handling Normalization Issues for Part-of-Speech Tagging of Online Conversational Text
	Damnati, Géraldine; Auguste, Jeremy; Nasr, Alexis...
	In: LREC proceedings ; Eleventh International Conference on Language Resources and Evaluation (LREC 2018), 2018, Miyazaki, Japan ; https://hal.archives-ouvertes.fr/hal-01943391 (2018)
	BASE

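3	Prédiction de l'échec d'une conversation médiée dans un contexte de dialogues à rôles asymétriques [Predicting the failure of a mediated conversation in a context of dialogues with asymmetric roles]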
	Carbou, Romain; Charlet, Delphine; Damnati, Géraldine...
	In: Vingt-cinquième conférence sur le Traitement Automatique des Langues Naturelles (TALN), ATALA, May 2018, Rennes, France ; https://hal.archives-ouvertes.fr/hal-01798604 (2018)
	BASE
