1 |
Multi-Lingual Dialogue Act Recognition with Deep Learning Methods
In: Interspeech 2019 ; https://hal.archives-ouvertes.fr/hal-02319818 ; Interspeech 2019, Sep 2019, Graz, Austria. ⟨10.21437/Interspeech.2019-1691⟩ (2019)
BASE
2 |
Multi-lingual Dialogue Act Recognition with Deep Learning Methods ...
3 |
On the Effects of Using word2vec Representations in Neural Networks for Dialogue Act Recognition
In: ISSN: 0885-2308 ; EISSN: 1095-8363 ; Computer Speech and Language ; https://hal.archives-ouvertes.fr/hal-01581410 ; Computer Speech and Language, Elsevier, 2018, 47, pp.175 - 193. ⟨10.1016/j.csl.2017.07.009⟩ (2018)
4 |
Czech Text Document Corpus v 2.0
Abstract:
BASIC INFORMATION --- Czech Text Document Corpus v 2.0 is a collection of text documents for automatic document classification in the Czech language. It consists of text documents provided by the Czech News Agency and is freely available for research purposes. The corpus was created to allow a straightforward comparison of document classification approaches on Czech data. It is intended in particular for evaluating multi-label document classification approaches, because a document is usually assigned more than one label. Besides the document classes, the corpus is also annotated at the morphological layer. The main part (for training and testing) consists of 11,955 real newspaper articles. We also provide a development set of 2,735 additional articles, intended for tuning the hyper-parameters of the created models. There are 60 categories in total, of which the 37 most frequent are used for classification. This reduction keeps only the classes with enough occurrences to train the models.
TECHNICAL DETAILS --- Text documents are stored in individual text files using UTF-8 encoding. Each filename consists of the serial number and the list of category abbreviations, separated by underscores, followed by the .txt suffix. Serial numbers have five digits and start at one. For instance, the file 00046_kul_nab_mag.txt is document number 46, annotated with the categories kul (culture), nab (religion) and mag (magazine selection). The content of the document, i.e. the word tokens, is stored on one line, with tokens separated by spaces. Every text document was also automatically morphologically analyzed. This analysis includes lemmatization, POS tagging and syntactic parsing. The fully annotated files are stored in .conll files. We also provide the lemmatized form (files with the .lemma suffix) and the corresponding POS tags (.pos files). The tokenized version of the documents is available in .tok files. This corpus is available free of charge for research purposes only. Commercial use in any form is strictly excluded.
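The filename convention described above can be unpacked with a few lines of Python. This is only a minimal sketch based on the abstract: the function names parse_filename and read_tokens are hypothetical, and the assumption that category abbreviations contain only letters and digits is inferred from the single example filename, not from any official corpus tooling.

```python
import re
from pathlib import Path

# Assumed pattern, inferred from the description: a five-digit serial number,
# one or more category abbreviations joined by underscores, then ".txt".
# The character class for abbreviations is a guess based on "kul", "nab", "mag".
FILENAME_RE = re.compile(r"^(\d{5})((?:_[A-Za-z0-9]+)+)\.txt$")


def parse_filename(name: str) -> tuple[int, list[str]]:
    """Split a corpus filename into (serial number, category abbreviations)."""
    match = FILENAME_RE.match(name)
    if match is None:
        raise ValueError(f"unexpected filename: {name!r}")
    serial = int(match.group(1))
    # group(2) looks like "_kul_nab_mag"; drop the leading underscore and split.
    categories = match.group(2).strip("_").split("_")
    return serial, categories


def read_tokens(path: Path) -> list[str]:
    """Read one document: a single UTF-8 line of space-separated tokens."""
    return path.read_text(encoding="utf-8").split()
```

With the example from the abstract, parse_filename("00046_kul_nab_mag.txt") returns (46, ["kul", "nab", "mag"]), which gives the multi-label target for that document directly from its name.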
Keyword:
corpus; Czech; document classification; multi-label; text
URL: http://hdl.handle.net/11234/1-2884
5 |
Deep Neural Networks for Czech Multi-label Document Classification ...
6 |
Investigation of word senses over time using linguistic corpora
7 |
Dialogue act recognition approaches
In: ISSN: 1335-9150 ; Computing and Informatics ; https://hal.inria.fr/inria-00431396 ; Computing and Informatics, Slovak University Press, Bratislava, 2010, 29 (2), pp.227--250 (2010)
8 |
Lexical Structure for Dialogue Act Recognition
In: ISSN: 1796-2048 ; Journal of Multimedia ; https://hal.inria.fr/inria-00184475 ; Journal of Multimedia, Academy Publisher, 2007, 2 (3), pp.1-8 (2007)
9 |
Automatic Recognition of Dialogue Acts ; Reconnaissance automatique des actes de dialogue
In: https://tel.archives-ouvertes.fr/tel-01748248 ; Modeling and Simulation. Université Henri Poincaré - Nancy 1, 2007. English. ⟨NNT : 2007NAN10114⟩ (2007)
10 |
Sentence Modality Recognition In French Based On Prosody ...
11 |
Sentence Modality Recognition In French Based On Prosody ...
12 |
Automatic Recognition of Dialogue Acts ; Reconnaissance automatique des actes de dialogue
13 |
Automatic dialog acts recognition based on sentence structure
In: IEEE International Conference on Acoustics, Speech, and Signal Processing - ICASSP/2006 ; https://hal.archives-ouvertes.fr/hal-00078245 ; 2006, pp.61-64 (2006)
14 |
Automatic Dialog Acts Recognition based on Words Clusters
In: 9th Western Pacific Acoustics Conference - WESPAC IX 2006 ; https://hal.archives-ouvertes.fr/hal-00086310 ; 2006, 6 p (2006)
15 |
Sentence structure for dialog act recognition in Czech
In: 2nd IEEE International Conference on Information and Communication Technologies: from Theory to Applications - ICTTA'06 ; https://hal.archives-ouvertes.fr/hal-00078247 ; 2006 (2006)
16 |
Combination of classifiers for automatic recognition of dialog acts
In: Proceedings of the 9th European Conference on Speech Communication and Technology - Interspeech - Eurospeech 2005 - Lisbon, Portugal ; https://hal.archives-ouvertes.fr/hal-00013940 ; 2005, pp.825-828 (2005)
17 |
Analysis of Importance of the prosodic Features for Automatic Sentence Modality Recognition in French in real Conditions
In: WSEAS International Conference on Electronics, Control and Signal Processing - ICECS'04 ; https://hal.inria.fr/inria-00100102 ; WSEAS International Conference on Electronics, Control and Signal Processing - ICECS'04, Nov 2004, Crete, Greece, pp.1820-1824 (2004)