DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...18
Hits 1 – 20 of 348

1
Improved facial expression recognition based on DWT feature for deep CNN
In: ISSN: 2079-9292 ; Electronics ; https://hal.archives-ouvertes.fr/hal-03143597 ; Electronics, MDPI, 2019, 8 (3), pp.324. ⟨10.3390/electronics8030324⟩ (2019)
BASE
Show details
2
The Airbus Air Traffic Control speech recognition 2018 challenge: towards ATC automatic transcription and call sign detection
In: Proceedings of INTERSPEECH 2019 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ; https://hal.archives-ouvertes.fr/hal-02419437 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), Sep 2019, Graz, Austria. pp.2993-2997 (2019)
Abstract: International audience ; In this paper, we describe the outcomes of the challenge organized and run by Airbus and partners in 2018 on Air Traffic Control (ATC) speech recognition. The challenge consisted of two tasks applied to English ATC speech: 1) automatic speech-to-text transcription, 2) call sign detection (CSD). The registered participants were provided with 40 hours of speech along with manual transcriptions. Twenty-two teams submitted predictions on a five hour evaluation set. ATC speech processing is challenging for several reasons: high speech rate, foreign-accented speech with a great diversity of accents, noisy communication channels. The best ranked team achieved a 7.62% Word Error Rate and a 82.41% CSD F1-score. Transcribing pilots' speech was found to be twice as harder as controllers' speech. Remaining issues towards solving ATC ASR are also discussed in the paper.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Air traffic control; Specialized language; Speech recognition
URL: https://hal.archives-ouvertes.fr/hal-02419437/document
https://hal.archives-ouvertes.fr/hal-02419437/file/pellegrini_25003.pdf
https://hal.archives-ouvertes.fr/hal-02419437
BASE
Hide details
3
Challenges in Audio Processing of Terrorist-Related Data
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02415176 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
BASE
Show details
4
Impact and Detection of Facial Beautification in Face Recognition: An Overview
In: ISSN: 2169-3536 ; EISSN: 2169-3536 ; IEEE Access ; https://hal.inria.fr/hal-02378939 ; IEEE Access, IEEE, 2019, ⟨10.1109/ACCESS.2019.DOI⟩ (2019)
BASE
Show details
5
Char+CV-CTC: Combining Graphemes and Consonant/Vowel Units for CTC-Based ASR Using Multitask Learning
In: Proceedings of INTERSPEECH 2019 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) ; https://hal.archives-ouvertes.fr/hal-02419431 ; 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019), Sep 2019, Graz, Austria. pp.1611-1615 (2019)
BASE
Show details
6
Interests of using Automatic Speech recognition for Speech-Language Therapists
In: World Congress of the International Association of Logopedics and Phoniatrics ; https://hal.archives-ouvertes.fr/hal-03012571 ; World Congress of the International Association of Logopedics and Phoniatrics, IALP : International Association of Logopedics and Phoniatrics, Aug 2019, Taipei, Taiwan. pp.(electronic medium) ; http://www.ialptaipei2019.org/ (2019)
BASE
Show details
7
Challenges in Audio Processing of Terrorist-Related Data
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02387373 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
BASE
Show details
8
A replicable comparison study of NER software: StanfordNLP, NLTK, OpenNLP, SpaCy, Gate
In: Sixth International Conference on Social Networks Analysis, Management and Security, SNAMS 2019 ; https://hal.archives-ouvertes.fr/hal-03002671 ; Sixth International Conference on Social Networks Analysis, Management and Security, SNAMS 2019, Oct 2019, Grenada, Spain. ⟨10.1109/SNAMS.2019.8931850⟩ (2019)
BASE
Show details
9
Extractive Text-Based Summarization of Arabic videos: Issues, Approaches and Evaluations
In: ICALP: International Conference on Arabic Language Processing ; https://hal.archives-ouvertes.fr/hal-02314238 ; ICALP: International Conference on Arabic Language Processing, Oct 2019, Nancy, France. pp.65-78, ⟨10.1007/978-3-030-32959-4_5⟩ (2019)
BASE
Show details
10
Brain and perceptual representations of faces, voices, and person identity
Tsantani, Maria Stephanie. - : Brunel University London, 2019
BASE
Show details
11
Judging Normality and Attractiveness in Faces: Direct Evidence of a More Refined Representation for Own-Race, Young Adult Faces
Zhou, Xiaomei; Short, Lindsey A.; Chan, Harmonie S. J.. - : Sage Publications, 2019
BASE
Show details
12
Extracting Data From Line Charts in Scanned Medical Documents
Silva de Azevedo, Kathleen. - : Auckland University of Technology, 2019
BASE
Show details
13
Segmentation-Free Bangla Offline Handwriting Recognition Using Sequential Detection of Characters and Diacritics with a Faster R-CNN
In: Electrical and Computer Engineering Faculty Publications and Presentations (2019)
BASE
Show details
14
Sign Language Video Analysis For Automatic Recognition and Detection
In: 14th IEEE International Conference on Automatic Face and Gesture Recognition ; https://hal.archives-ouvertes.fr/hal-02146366 ; 14th IEEE International Conference on Automatic Face and Gesture Recognition, May 2019, Lille, France (2019)
BASE
Show details
15
Phoneme‐Order Encoding During Spoken Word Recognition: A Priming Investigation
In: ISSN: 0364-0213 ; EISSN: 1551-6709 ; Cognitive Science ; https://hal.archives-ouvertes.fr/hal-02292742 ; Cognitive Science, Wiley, 2019, 43 (10), pp.1-16. ⟨10.1111/cogs.12785⟩ (2019)
BASE
Show details
16
Multimodal deep networks for text and image-based document classification ; Réseau de neurones multimodal pour la classification de documents image/texte
In: Conférence Nationale sur les Applications Pratiques de l'Intelligence Artificielle (APIA) ; https://hal.archives-ouvertes.fr/hal-02163257 ; Conférence Nationale sur les Applications Pratiques de l'Intelligence Artificielle (APIA), Jul 2019, Toulouse, France (2019)
BASE
Show details
17
Adapting a FrameNet Semantic Parser for Spoken Language Understanding Using Adversarial Learning
In: Interspeech 2019 ; https://hal.archives-ouvertes.fr/hal-02298417 ; Interspeech 2019, Sep 2019, Graz, Austria. pp.799-803, ⟨10.21437/Interspeech.2019-2732⟩ (2019)
BASE
Show details
18
A unified multilingual handwriting recognition system using multigrams sub-lexical units
In: ISSN: 0167-8655 ; Pattern Recognition Letters ; https://hal-normandie-univ.archives-ouvertes.fr/hal-02075654 ; Pattern Recognition Letters, Elsevier, 2019, 121, pp.68-76. ⟨10.1016/j.patrec.2018.07.027⟩ (2019)
BASE
Show details
19
Automatic recognition of Sign Language structures in RGB videos: the detection of pointing and lexical signs
In: https://hal.archives-ouvertes.fr/hal-02146368 ; 2019 (2019)
BASE
Show details
20
Temporary ambiguity in whispered word recognition: a semantic priming study
In: ISSN: 2044-5911 ; EISSN: 2044-592X ; Journal of Cognitive Psychology ; https://hal.archives-ouvertes.fr/hal-01994539 ; Journal of Cognitive Psychology, Taylor & Francis edition, 2019, 31 (2), pp.157-174. ⟨10.1080/20445911.2019.1573243⟩ ; https://www.tandfonline.com/doi/full/10.1080/20445911.2019.1573243 (2019)
BASE
Show details

Page: 1 2 3 4 5...18

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
348
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern