1 |
Integration and evaluation of social competences such as humor in an artificial interactive agent
|
|
|
|
In: Proceedings of the 1st ACM SIGCHI International Workshop on Investigating Social Interactions with Artificial Agents ; the 1st ACM SIGCHI International Workshop ; https://hal.archives-ouvertes.fr/hal-01807782 ; the 1st ACM SIGCHI International Workshop, Nov 2017, Glasgow, United Kingdom. pp.41-42, ⟨10.1145/3139491.3139495⟩ (2017)
|
|
BASE
|
|
Show details
|
|
3 |
Comparing stochastic approaches to spoken language understanding in multiple languages
|
|
|
|
In: ISSN: 1558-7916 ; IEEE Transactions on Audio, Speech and Language Processing ; https://hal.inria.fr/hal-00746965 ; IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2011, 19 (6), pp.1569-1583 (2011)
|
|
Abstract:
International audience ; One of the first steps in building a spoken language understanding (SLU) module for dialogue systems is the extraction of flat concepts out of a given word sequence, usually provided by an automatic speech recognition (ASR) system. In this paper, six different modeling approaches are investigated to tackle the task of concept tagging. These methods include classical, well-known generative and discriminative methods like Finite State Transducers (FSTs), Statistical Machine Translation (SMT), Maximum Entropy Markov Models (MEMMs), or Support Vector Machines (SVMs) as well as techniques recently applied to natural language processing such as Conditional Random Fields (CRFs) or Dynamic Bayesian Networks (DBNs). Following a detailed description of the models, experimental and comparative results are presented on three corpora in different languages and with different complexity. The French MEDIA corpus has already been exploited during an evaluation campaign and so a direct comparison with existing benchmarks is possible. Recently collected Italian and Polish corpora are used to test the robustness and portability of the modeling approaches. For all tasks, manual transcriptions as well as ASR inputs are considered. Additionally to single systems, methods for system combination are investigated. The best performing model on all tasks is based on conditional random fields. On the MEDIA evaluation corpus, a concept error rate of 12.6% could be achieved. Here, additionally to attribute names, attribute values have been extracted using a combination of a rule-based and a statistical approach. Applying system combination using weighted ROVER with all six systems, the concept error rate (CER) drops to 12.0%.
|
|
Keyword:
[INFO.INFO-OH]Computer Science [cs]/Other [cs.OH]
|
|
URL: https://hal.inria.fr/hal-00746965/file/plugin-05639034.pdf https://hal.inria.fr/hal-00746965 https://hal.inria.fr/hal-00746965/document
|
|
BASE
|
|
Hide details
|
|
4 |
Using MMIL for the High Level Semantic Annotation of the French MEDIA Dialogue Corpus
|
|
|
|
In: Ninth International Conference on Computational Semantics - IWCS 2011 ; https://hal.inria.fr/inria-00638000 ; Ninth International Conference on Computational Semantics - IWCS 2011, ACL, Jan 2011, London, United Kingdom (2011)
|
|
BASE
|
|
Show details
|
|
6 |
Composition sémantique pour la compréhension de la parole dans un cadre de dialogue
|
|
|
|
In: Les 27e Journées d’Etudes sur la Parole (JEP) ; https://hal.archives-ouvertes.fr/hal-01159983 ; Les 27e Journées d’Etudes sur la Parole (JEP), Jun 2008, Avignon, France (2008)
|
|
BASE
|
|
Show details
|
|
7 |
Semantic composition process in a speech understanding system
|
|
|
|
In: The 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) ; https://hal.archives-ouvertes.fr/hal-01158578 ; The 2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2008, Las Vegas, United States. ⟨10.1109/ICASSP.2008.4518788⟩ (2008)
|
|
BASE
|
|
Show details
|
|
9 |
Advances in Transcription of Broadcast News and Conversational Telephone Speech Within the Combined EARS BBN/LIMSI System
|
|
|
|
In: ISSN: 1558-7916 ; IEEE Transactions on Audio, Speech and Language Processing ; https://hal.archives-ouvertes.fr/hal-01299058 ; IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2006 (2006)
|
|
BASE
|
|
Show details
|
|
15 |
Dynamic Lexicon for a Very Large Vocabulary Vocal Dictation
|
|
|
|
In: EUROSPEECH 1997 - Fifth European Conference of Speech Communication and Technology ; https://hal.archives-ouvertes.fr/hal-01627669 ; EUROSPEECH 1997 - Fifth European Conference of Speech Communication and Technology, Sep 1997, Rhodes, Greece. pp.2691-2694 (1997)
|
|
BASE
|
|
Show details
|
|
16 |
K-Nearest Neighbours Estimator in a HMM-Based System
|
|
|
|
In: Computational Models of Speech Patterns Processing ; NATO Advanced Study Institute on Computational Models of Speech Pattern Processing ; https://hal.archives-ouvertes.fr/hal-01574484 ; NATO Advanced Study Institute on Computational Models of Speech Pattern Processing, Jul 1997, St. Helier, Jersey, United Kingdom. pp.96-101, ⟨10.1007/978-3-642-60087-6_10⟩ (1997)
|
|
BASE
|
|
Show details
|
|
17 |
K-NN Versus Gaussian in HMM-Based Recognition System
|
|
|
|
In: Fifth European Conference on Speech Communication and Technology, EUROSPEECH 1997 ; https://hal.archives-ouvertes.fr/hal-01624700 ; Fifth European Conference on Speech Communication and Technology, EUROSPEECH 1997, Sep 1997, Rhodes, Greece. pp.529-532 (1997)
|
|
BASE
|
|
Show details
|
|
|
|