Hits 1 – 13 of 13

1	Investigating alignment interpretability for low-resource NMT
	Zanon Boito, Marcely; Villavicencio, Aline; Besacier, Laurent
	In: ISSN: 0922-6567 ; EISSN: 1573-0573 ; Machine Translation ; https://hal.archives-ouvertes.fr/hal-03139744 ; Machine Translation, Springer Verlag, 2021, ⟨10.1007/s10590-020-09254-w⟩ (2021)
	BASE

2	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE

3	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE

4	LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
	Evain, Solène; Nguyen, Ha; Le, Hang...
	In: INTERSPEECH 2021: Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-03317730 ; INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic (2021)
	BASE

5	Investigating Language Impact in Bilingual Approaches for Computational Language Documentation
	Zanon Boito, Marcely; Villavicencio, Aline; Besacier, Laurent
	In: Proceedings of the 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020) ; SLTU-CCURL workshop, LREC 2020 ; https://hal.archives-ouvertes.fr/hal-02895907 ; SLTU-CCURL workshop, LREC 2020, May 2020, Marseille, France (2020)
	BASE

6	MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible
	Zanon Boito, Marcely; Havard, William; Garnerin, Mahault...
	In: Proceedings of The 12th Language Resources and Evaluation Conference ; https://hal.archives-ouvertes.fr/hal-02611059 ; Proceedings of The 12th Language Resources and Evaluation Conference, May 2020, Marseille, France. pp.6486 - 6493 (2020)
	BASE

7	Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-resource Settings
	Zanon Boito, Marcely; Villavicencio, Aline; Besacier, Laurent
	In: Interspeech 2019 ; https://hal.archives-ouvertes.fr/hal-02193867 ; Interspeech 2019, Sep 2019, Graz, Austria (2019)
	Abstract: Since Bahdanau et al. [1] first introduced attention for neural machine translation, most sequence-to-sequence models have made use of attention mechanisms [2, 3, 4]. While they produce soft-alignment matrices that could be interpreted as alignments between target and source languages, we lack metrics to quantify their quality, so it is unclear which approach produces the best alignments. This paper presents an empirical evaluation of three of the main sequence-to-sequence models for word discovery from unsegmented phoneme sequences: CNN, RNN and Transformer-based. This task consists of aligning word sequences in a source language with phoneme sequences in a target language and inferring from the alignment a word segmentation on the target side [5]. Evaluating word segmentation quality can be seen as an extrinsic evaluation of the soft-alignment matrices produced during training. Our experiments in a low-resource scenario on the Mboshi and English languages (both aligned to French) show that RNNs surprisingly outperform CNNs and Transformers for this task. Our results are confirmed by an intrinsic evaluation of alignment quality using Average Normalized Entropy (ANE). Lastly, we improve our best word discovery model by using an alignment entropy confidence measure that accumulates ANE over all the occurrences of a given alignment pair in the collection.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; computational language documentation; low-resource languages; sequence-to-sequence models; soft-alignment matrices; word discovery
	URL: https://hal.archives-ouvertes.fr/hal-02193867/file/IS2019marcely-camera-ready.pdf https://hal.archives-ouvertes.fr/hal-02193867 https://hal.archives-ouvertes.fr/hal-02193867/document
	BASE

8	How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages
	Zanon Boito, Marcely; Villavicencio, Aline; Besacier, Laurent
	In: Journées Scientifiques du Groupement de Recherche: Linguistique Informatique, Formelle et de Terrain (LIFT) ; https://hal.archives-ouvertes.fr/hal-02895895 ; Journées Scientifiques du Groupement de Recherche: Linguistique Informatique, Formelle et de Terrain (LIFT), Nov 2019, Orléans, France (2019)
	BASE

9	A small Griko-Italian speech translation corpus
	Zanon Boito, Marcely; Anastasopoulos, Antonios; Lekakou, Marika...
	In: 6th international workshop on spoken language technologies for under-resourced languages (SLTU'18) ; https://hal.archives-ouvertes.fr/hal-01962528 ; 6th international workshop on spoken language technologies for under-resourced languages (SLTU'18), Aug 2018, New Delhi, India (2018)
	BASE

10	Unsupervised Word Segmentation from Speech with Attention
	Godard, Pierre; Zanon Boito, Marcely; Ondel, Lucas...
	In: Interspeech 2018 ; https://hal.archives-ouvertes.fr/hal-01818092 ; Interspeech 2018, Sep 2018, Hyderabad, India (2018)
	BASE

11	Unsupervised Word Segmentation from Speech with Attention ...
	Godard, Pierre; Zanon-Boito, Marcely; Ondel, Lucas. - arXiv, 2018
	BASE

12	Unwritten Languages Demand Attention Too! Word Discovery with Encoder-Decoder Models
	Zanon Boito, Marcely; Bérard, Alexandre; Villavicencio, Aline...
	In: IEEE Automatic Speech Recognition and Understanding (ASRU) ; https://hal.archives-ouvertes.fr/hal-01592091 ; IEEE Automatic Speech Recognition and Understanding (ASRU), Dec 2017, Okinawa, Japan (2017)
	BASE

13	Unsupervised Word Discovery Using Attentional Encoder-Decoder Models
	Zanon Boito, Marcely; Besacier, Laurent; Villavicencio, Aline
	In: WiNLP workshop, ACL 2017 ; https://hal.archives-ouvertes.fr/hal-02895851 ; WiNLP workshop, ACL 2017, Jul 2017, Vancouver, Canada (2017)
	BASE
