1 |
An Overview of Indian Spoken Language Recognition from Machine Learning Perspective
|
|
|
|
In: ISSN: 2375-4699 ; EISSN: 2375-4702 ; ACM Transactions on Asian and Low-Resource Language Information Processing ; https://hal.inria.fr/hal-03616853 ; ACM Transactions on Asian and Low-Resource Language Information Processing, ACM, In press, ⟨10.1145/3523179⟩ (2022)
|
|
Abstract:
International audience ; Automatic spoken language identification (LID) is a very important research field in the era of multilingual voice-command-based human-computer interaction (HCI). A front-end LID module helps to improve the performance of many speech-based applications in the multilingual scenario. India is a populous country with diverse cultures and languages. The majority of the Indian population needs to use their respective native languages for verbal interaction with machines. Therefore, the development of efficient Indian spoken language recognition systems is useful for adapting smart technologies in every section of Indian society. The field of Indian LID has started gaining momentum in the last two decades, mainly due to the development of several standard multilingual speech corpora for the Indian languages. Even though significant research progress has already been made in this field, to the best of our knowledge, there are not many attempts to analytically review them collectively. In this work, we have conducted one of the very first attempts to present a comprehensive review of the Indian spoken language recognition research field. In-depth analysis has been presented to emphasize the unique challenges of low-resource and mutual influences for developing LID systems in the Indian contexts. Several essential aspects of the Indian LID research, such as the detailed description of the available speech corpora, the major research contributions, including the earlier attempts based on statistical modeling to the recent approaches based on different neural network architectures, and the future research trends are discussed. This review work will help assess the state of the present Indian LID research by any active researcher or any research enthusiasts from related fields.
|
|
Keyword:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [SCCO.LING]Cognitive science/Linguistics; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; [STAT.ML]Statistics [stat]/Machine Learning [stat.ML]; acoustic phonetics; code-switching; corpora development; discriminative model; Indian language identification; Language resources; language similarity; Machine learning; Signal processing systems Low-resourced languages
|
|
URL: https://hal.inria.fr/hal-03616853/file/TALLIP_Overview.pdf https://doi.org/10.1145/3523179 https://hal.inria.fr/hal-03616853 https://hal.inria.fr/hal-03616853/document
|
|
BASE
|
|
Hide details
|
|
2 |
Speech Perception and Implementation in a Virtual Medical Assistant
|
|
|
|
In: 6. ICAART – 14th International Conference on Agents and Artificial Intelligence ; https://hal.archives-ouvertes.fr/hal-03621550 ; 6. ICAART – 14th International Conference on Agents and Artificial Intelligence, Feb 2022, Vienna, Austria (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Assessing the impact of OCR noise on multilingual event detection over digitised documents
|
|
|
|
In: ISSN: 1432-5012 ; EISSN: 1432-1300 ; International Journal on Digital Libraries ; https://hal.archives-ouvertes.fr/hal-03635985 ; International Journal on Digital Libraries, Springer Verlag, 2022, ⟨10.1007/s00799-022-00325-2⟩ (2022)
|
|
BASE
|
|
Show details
|
|
4 |
Introducing the HIPE 2022 Shared Task: Named Entity Recognition and Linking in Multilingual Historical Documents
|
|
|
|
In: Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II ; https://hal.archives-ouvertes.fr/hal-03635971 ; Matthias Hagen; Suzan Verberne; Craig Macdonald; Christin Seifert; Krisztian Balog; Kjetil Nørvåg; Vinay Setty. Advances in Information Retrieval. 44th European Conference on IR Research, ECIR 2022, Stavanger, Norway, April 10–14, 2022, Proceedings, Part II, 13186, Springer International Publishing, pp.347-354, 2022, Lecture Notes in Computer Science, 978-3-030-99738-0. ⟨10.1007/978-3-030-99739-7_44⟩ (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Surgical Video Summarization: Multifarious Uses, Summarization Process and Ad-Hoc Coordination
|
|
|
|
In: ISSN: 2573-0142 ; EISSN: 2573-0142 ; Proceedings of the ACM on Human-Computer Interaction ; CHI '21: CHI Conference on Human Factors in Computing Systems ; https://hal.archives-ouvertes.fr/hal-03160860 ; Proceedings of the ACM on Human-Computer Interaction , Association for Computing Machinery (ACM), 2021, 4 (4), ⟨10.1145/3449214⟩ (2021)
|
|
BASE
|
|
Show details
|
|
6 |
Cartolabe: A Web-Based Scalable Visualization of Large Document Collections
|
|
|
|
In: ISSN: 0272-1716 ; IEEE Computer Graphics and Applications ; https://hal.inria.fr/hal-02499006 ; IEEE Computer Graphics and Applications, Institute of Electrical and Electronics Engineers, 2021, 41 (2), pp.76--88. ⟨10.1109/MCG.2020.3033401⟩ (2021)
|
|
BASE
|
|
Show details
|
|
7 |
SonAmi: A Tangible Creativity Support Tool for Productive Procrastination
|
|
|
|
In: C&C ’21 - 13th ACM Conference on Creativity & Cognition ; https://hal.inria.fr/hal-03442565 ; C&C ’21 - 13th ACM Conference on Creativity & Cognition, Jun 2021, Virtual Event, Italy. pp.1-10, ⟨10.1145/3450741.3465250⟩ (2021)
|
|
BASE
|
|
Show details
|
|
8 |
How Hermeneutic Spirals may reduce Complexity to Narrative Schemata - expanding on "Complexity and the Userly Text"
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03254233 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Sequence-to-Sequence Predictive Model: From Prosody To Communicative Gestures
|
|
|
|
In: Proceeding of the 24th International Conference on Human-Computer Interaction ; https://hal.archives-ouvertes.fr/hal-03428910 ; Proceeding of the 24th International Conference on Human-Computer Interaction, 2021 (2021)
|
|
BASE
|
|
Show details
|
|
10 |
Die Synthese und Decodierung der Bedeutung ; The Synthesis and Decoding of Meaning ; La synthèse et le décodage du sens
|
|
|
|
In: Journal of Artificial General Intelligence ; https://hal.archives-ouvertes.fr/hal-01422672 ; Journal of Artificial General Intelligence, 2021, 12 (1), pp.26-70. ⟨10.2478/jagi-2021-0002⟩ (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Impact of interface design on drivers’ behavior in partially automated cars: An on-road study
|
|
|
|
In: ISSN: 1369-8478 ; Transportation Research Part F: Traffic Psychology and Behaviour ; https://hal.archives-ouvertes.fr/hal-03322187 ; Transportation Research Part F: Traffic Psychology and Behaviour, Elsevier, 2021, 81, pp.508-521. ⟨10.1016/j.trf.2021.06.019⟩ (2021)
|
|
BASE
|
|
Show details
|
|
12 |
Measuring the sensitivity and significance of the French version of the System Usability Scale ; Mesure de la sensibilité et de la signification de la version française du System Usability Scale
|
|
|
|
In: Actes de la 32e conférence francophone sur l'Interaction Humain-Machine (IHM'20.21) ; https://hal.archives-ouvertes.fr/hal-03567056 ; Actes de la 32e conférence francophone sur l'Interaction Humain-Machine (IHM'20.21), Apr 2021, Virtual Event, France. pp.2:1-13, ⟨10.1145/3450522.3451241⟩ (2021)
|
|
BASE
|
|
Show details
|
|
13 |
Towards alignment strategies in human-agent interactions based on measures of lexical repetitions
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-03147824 ; Language Resources and Evaluation, Springer Verlag, 2021, 55 (2), pp.353-388. ⟨10.1007/s10579-021-09532-w⟩ ; https://link.springer.com/journal/10579/volumes-and-issues/55-2 (2021)
|
|
BASE
|
|
Show details
|
|
14 |
Towards alignment strategies in human-agent interactions based on measures of lexical repetitions
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1572-8412 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-03147824 ; Language Resources and Evaluation, Springer Verlag, 2021, ⟨10.1007/s10579-021-09532-w⟩ (2021)
|
|
BASE
|
|
Show details
|
|
15 |
État de l'art du changement sémantique à partir de plongements contextualisés
|
|
|
|
In: COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference ; https://hal.archives-ouvertes.fr/hal-03320337 ; COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference, Apr 2021, Grenoble (virtuel), France (2021)
|
|
BASE
|
|
Show details
|
|
16 |
Le projet LELREP : vers une lexicologie appliquée à l'école
|
|
|
|
In: ISSN: 0023-8376 ; Les Langues Modernes ; https://hal.archives-ouvertes.fr/hal-03621818 ; Les Langues Modernes, Association des professeurs de langues vivantes (APLV), 2021 (2021)
|
|
BASE
|
|
Show details
|
|
17 |
Méthodes pour l’étude du lexique – Mémos
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03443313 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
18 |
Atténuer les erreurs de numérisation dans la reconnaissance d'entités nommées pour les documents historiques
|
|
|
|
In: Conférence en Recherche d'Informations et Applications (CORIA 2021) ; https://hal.archives-ouvertes.fr/hal-03320332 ; Conférence en Recherche d'Informations et Applications (CORIA 2021), ARIA : Association Francophone de Recherche d’Information (RI) et Applications, Apr 2021, Grenoble (virtuel), France. pp.1 - 7 ; http://coria.asso-aria.org/2021/articles/mini_24/main.pdf (2021)
|
|
BASE
|
|
Show details
|
|
19 |
Prosodic Boundary Prediction Model for Vietnamese Text-To-Speech
|
|
|
|
In: Proc. Interspeech 2021 ; Interspeech 2021 ; https://hal.archives-ouvertes.fr/hal-03329116 ; Interspeech 2021, Aug 2021, Brno, Czech Republic. pp.3885-3889, ⟨10.21437/interspeech.2021-125⟩ (2021)
|
|
BASE
|
|
Show details
|
|
20 |
A systematic review of mode awareness measurements for automated driving
|
|
|
|
In: 7th International Conference on Driver Distraction and Inattention - DDI 2021 ; https://hal.archives-ouvertes.fr/hal-03114205 ; 7th International Conference on Driver Distraction and Inattention - DDI 2021, Oct 2021, Lyon, France (2021)
|
|
BASE
|
|
Show details
|
|
|
|