81 |
Dynamic Extension of ASR Lexicon Using Wikipedia Data
|
|
|
|
In: IEEE Workshop on Spoken and Language Technology (SLT) ; https://hal.archives-ouvertes.fr/hal-01874495 ; IEEE Workshop on Spoken and Language Technology (SLT), Dec 2018, Athens, Greece (2018)
|
|
BASE
|
|
82 |
Categorization of B2B Service Offers: Lessons learnt from the Silex Use case
|
|
|
|
In: 4ème conférence sur les Applications Pratiques de l'Intelligence Artificielle APIA2018 ; https://hal.archives-ouvertes.fr/hal-01830905 ; 4ème conférence sur les Applications Pratiques de l'Intelligence Artificielle APIA2018, Jul 2018, Nancy, France (2018)
|
|
Abstract:
In information retrieval and natural language processing, text classification has become a crucial task. In this article, we share our experience of text categorization in an industrial context and present a comparative evaluation of binary and multi-label classification algorithms applied to texts describing service offers from the SILEX B2B platform for recommending service providers. We show that in some practical use cases, such as the one we consider, a traditional "bag of words" representation of texts gives better classification results than the reputedly more promising "word embeddings" representation.
|
|
Keyword:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO]Computer Science [cs]; Bag of words; Machine Learning; Text categorization; Word embedding
|
|
URL: https://hal.archives-ouvertes.fr/hal-01830905/document https://hal.archives-ouvertes.fr/hal-01830905 https://hal.archives-ouvertes.fr/hal-01830905/file/APIA_2018_paper_15.pdf
|
|
BASE
|
|
83 |
Facing the facts of fake: a distributional semantics and corpus annotation approach
|
|
|
|
In: ISSN: 2197-2796 ; Yearbook of the German Cognitive Linguistics Association ; https://hal.archives-ouvertes.fr/hal-01959609 ; Yearbook of the German Cognitive Linguistics Association, De Gruyter, 2018, 6 (9-42) (2018)
|
|
BASE
|
|
84 |
Building and evaluating resources for sentiment analysis in the Greek language
|
|
|
|
In: ISSN: 1574-020X ; EISSN: 1574-0218 ; Language Resources and Evaluation ; https://hal.archives-ouvertes.fr/hal-03382985 ; Language Resources and Evaluation, Springer Verlag, 2018, 52 (4), pp.1021-1044. ⟨10.1007/s10579-018-9420-4⟩ (2018)
|
|
BASE
|
|
85 |
Exploration par apprentissage de discussions de personnes en détresse psychologique
|
|
|
|
In: 29es Journées Francophones d'Ingénierie des Connaissances, IC 2018 ; https://hal.archives-ouvertes.fr/hal-01839561 ; 29es Journées Francophones d'Ingénierie des Connaissances, IC 2018, Jul 2018, Nancy, France. pp.95-102 ; http://pfia2018.loria.fr/ (2018)
|
|
BASE
|
|
86 |
Unsupervised Creation of Normalisation Dictionaries for Micro-Blogs in Arabic, French and English
|
|
|
|
In: 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2018) ; https://hal.archives-ouvertes.fr/hal-01795348 ; 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2018), Mar 2018, Hanoi, Vietnam (2018)
|
|
BASE
|
|
87 |
Unsupervised Creation of Normalization Dictionaries for Micro-Blogs in Arabic, French and English
|
|
|
|
In: ISSN: 1405-5546 ; EISSN: 2007-9737 ; Computación y sistemas ; https://hal.archives-ouvertes.fr/hal-01958675 ; Computación y sistemas, Instituto Politécnico Nacional IPN Centro de Investigación en Computación, 2018, 19th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2018), 22 (3), pp.729-737. ⟨10.13053/CyS-22-3-3034⟩ ; https://www.cys.cic.ipn.mx/ojs/index.php/CyS/article/view/3034/2514 (2018)
|
|
BASE
|
|
88 |
Word Embeddings for Wine Recommender Systems Using Vocabularies of Experts and Consumers
|
|
|
|
In: ISSN: 2199-188X ; Open Journal of Web Technologies ; https://halshs.archives-ouvertes.fr/halshs-01872273 ; Open Journal of Web Technologies, RonPub, 2018, Special Issue: Proceedings of the International Workshop on Web Data Processing & Reasoning (WDPAR 2018) in conjunction with the 41st German Conference on Artificial Intelligence, 5 (1), pp.23-30 ; https://www.ronpub.com/ojwt/OJWT_2018v5i1n04_Cruz.html (2018)
|
|
BASE
|
|
91 |
A Framework to Understand Emoji Meaning: Similarity and Sense Disambiguation of Emoji using EmojiNet
|
|
|
|
In: http://rave.ohiolink.edu/etdc/view?acc_num=wright1547506375922938 (2018)
|
|
BASE
|
|
92 |
A Semi-supervised Corpus Annotation for Saudi Sentiment Analysis Using Twitter
|
|
|
|
BASE
|
|
93 |
An Empirical Study of Word Embedding Dimensionality Reduction ...
|
|
|
|
BASE
|
|
94 |
An Empirical Study of Word Embedding Dimensionality Reduction ...
|
|
|
|
BASE
|
|
95 |
Sparse distributed representations as word embeddings for language understanding
|
|
|
|
BASE
|
|
96 |
The Effect of Data Quantity on Dialog System Input Classification Models ; Datamängdens effekt på modeller för avsiktsklassificering i chattkonversationer
|
|
|
|
BASE
|
|
97 |
Bidirectional Recurrent Neural Network Approach for Arabic Named Entity Recognition
|
|
|
|
In: Future Internet ; Volume 10 ; Issue 12 (2018)
|
|
BASE
|
|
98 |
An Integrated Graph Model for Document Summarization
|
|
|
|
In: Information ; Volume 9 ; Issue 9 (2018)
|
|
BASE
|
|
99 |
Combining Word Embedding and Knowledge-Based Topic Modeling for Entity Summarization
|
|
|
|
In: Computer Science Faculty Publications (2018)
|
|
BASE
|
|
100 |
Παράσταση γλωσσολογικού συναισθηματικού περιεχομένου με χρήση υπολογιστικής νοημοσύνης ...
|
|
|
|
BASE
|
|
|
|