1 |
From bag-of-words towards natural language: adapting topic models to avoid stop word removal ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
On the use of machine translation and topic-modeling to analyze non-parallel multilingual corpora: A case study in the history of philosophy of science ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
On the use of machine translation and topic-modeling to analyze non-parallel multilingual corpora: A case study in the history of philosophy of science ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Analysis of Destination Images in the Emerging Ski Market: The Case Study in the Host City of the 2022 Beijing Winter Olympic Games
|
|
|
|
In: Sustainability; Volume 14; Issue 1; Pages: 555 (2022)
|
|
BASE
|
|
Show details
|
|
5 |
Same same, but different ? On the Relation of Information Science and the Digital Humanities A Scientometric Comparison of Academic Journals Using LDA and Hierarchical Clustering ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
A Deep Topical N-gram Model and Topic Discovery on COVID-19 News and Research Manuscripts
|
|
|
|
In: Electronic Thesis and Dissertation Repository (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Determining Tone of a Body of Text
|
|
|
|
In: Senior Projects Spring 2020 (2020)
|
|
BASE
|
|
Show details
|
|
8 |
Modeling the Research Landscapes of Artificial Intelligence Applications in Diabetes (GAPRESEARCH)
|
|
|
|
In: International Journal of Environmental Research and Public Health ; Volume 17 ; Issue 6 (2020)
|
|
BASE
|
|
Show details
|
|
9 |
A special case of long distance agreement in Marathi
|
|
|
|
In: Glossa: a journal of general linguistics; Vol 5, No 1 (2020); 93 ; 2397-1835 (2020)
|
|
BASE
|
|
Show details
|
|
10 |
An LDA-based Approach for Product Attribute Identification from Online Customer Reviews
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Detection of weak signals in weakly structured data masses ; Détection de signaux faibles dans des masses de données faiblement structurées
|
|
|
|
In: EISSN: 2516-3280 ; Recherche d’Information, Document et Web Sémantique ; https://hal.archives-ouvertes.fr/hal-02552771 ; Recherche d’Information, Document et Web Sémantique, ISTE OpenScience, 2019, 3 (1), ⟨10.21494/ISTE.OP.2020.0463⟩ (2019)
|
|
BASE
|
|
Show details
|
|
12 |
Mouse tracking as a window into decision making
|
|
|
|
In: ISSN: 1554-351X ; EISSN: 1554-3528 ; Behavior Research Methods ; https://hal.archives-ouvertes.fr/hal-02274523 ; Behavior Research Methods, Psychonomic Society, Inc, 2019, 51 (3), pp.1085-1101. ⟨10.3758/s13428-018-01194-x⟩ (2019)
|
|
BASE
|
|
Show details
|
|
13 |
Acoustic distances, Pillai scores and LDA classification scores as metrics of L2 comprehensibility and nativelikeness
|
|
|
|
In: Proceedings of the 19th International Congress of Phonetic Sciences, Melbourne, Australia 2019 ; ICPhS2019 ; https://hal.archives-ouvertes.fr/hal-03046802 ; ICPhS2019, Aug 2019, Melbourne, Australia (2019)
|
|
BASE
|
|
Show details
|
|
14 |
Expanding the analysis of functional Near-Infrared Spectroscopy (fNIRS) data with multivariate techniques ... : application to a children’s literacy study ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Scalable Collapsed Inference for High-Dimensional Topic Models ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Comparison studies on active cross-situational object-word learning using Non-Negative Matrix Factorization and Latent Dirichlet Allocation
|
|
|
|
In: ISSN: 2379-8920 ; EISSN: 2379-8939 ; IEEE Transactions on Cognitive and Developmental Systems ; https://hal.archives-ouvertes.fr/hal-01561168 ; IEEE Transactions on Cognitive and Developmental Systems, Institute of Electrical and Electronics Engineers, Inc, In press, ⟨10.1109/TCDS.2017.2725304⟩ (2018)
|
|
BASE
|
|
Show details
|
|
18 |
Introducing Semantics in Short Text Classification
|
|
|
|
In: ISSN: 0302-9743 ; Lecture Notes in Computer Science ; https://hal.archives-ouvertes.fr/hal-03625724 ; Lecture Notes in Computer Science, Springer, 2018, Computational Linguistics and Intelligent Text Processing 17th International Conference, CICLing 2016, Konya, Turkey, April 3–9, 2016, Revised Selected Papers, Part II, 9624, pp.433 - 445. ⟨10.1007/978-3-319-75487-1_34⟩ (2018)
|
|
Abstract:
International audience ; To overcome short text classification issues due to shortness and sparseness, the enrichment process is classically proposed: topics (word clusters) are extracted from external knowledge sources using Latent Dirichlet Allocation. All the words, associated to topics which encompass short text words, are added to the initial short text content. We propose (i) an explicit representation of a two-level enrichment method in which the enrichment is considered either with respect to each word in the text or to the global semantic meaning of the short text and (ii) a new semantic Random Forest kind in which semantic relations between features are taken into account at node level rather than at tree level as it was recently proposed in the literature to avoid potential tree correlation. We demonstrate that our enrichment method is valid not only for Random Forest based methods but also for other methods like MaxEnt, SVM and Naive Bayes.
|
|
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; LDA; semantics; Semantics LDA; short text classification; text enrichment
|
|
URL: https://hal.archives-ouvertes.fr/hal-03625724/file/211.pdf https://hal.archives-ouvertes.fr/hal-03625724 https://hal.archives-ouvertes.fr/hal-03625724/document https://doi.org/10.1007/978-3-319-75487-1_34
|
|
BASE
|
|
Hide details
|
|
19 |
A fuzzy credibility model to estimate the operational value at risk using internal and external data of risk events
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Korean vowel identification by English and Mandarin listeners: Effects of L1-L2 vowel inventory size and acoustic relationship
|
|
|
|
In: Toronto Working Papers in Linguistics; Vol 40 (2018): Special issue from the CRC-sponsored phonology/phonetics workshops ; 1718-3510 ; 1705-8619 (2018)
|
|
BASE
|
|
Show details
|
|
|
|