61 |
Automatic processing of Tunisian dialect: construction of linguistic resources ; TRAITEMENT AUTOMATIQUE DU DIALECTE TUNISIEN : CONSTRUCTION DE RESSOURCES LINGUISTIQUES
|
|
|
|
In: https://hal.archives-ouvertes.fr/tel-02869866 ; Informatique et langage [cs.CL]. Université de Sfax (Tunisie), 2016. Français (2016)
|
|
BASE
|
|
Show details
|
|
62 |
Cross-lingual alignment transfer: a chicken-and-egg story?
|
|
|
|
In: Workshop on Multilingual and Cross-lingual Methods in NLP ; https://hal.archives-ouvertes.fr/hal-01622815 ; Workshop on Multilingual and Cross-lingual Methods in NLP , Jun 2016, San Diego, CA, United States. pp.35-44, ⟨10.18653/v1/W16-1205⟩ ; https://aclanthology.coli.uni-saarland.de/volumes/proceedings-of-the-workshop-on-multilingual-and-cross-lingual-methods-in-nlp (2016)
|
|
BASE
|
|
Show details
|
|
63 |
A semi-automatic method for constructing MUSE sentiment-annotated corpora ; Une méthode semi-automatique de construction des corpus MUSE annotés en sentiments
|
|
|
|
In: ICAL ; https://hal.archives-ouvertes.fr/hal-01526827 ; ICAL, Nguyen Tat Thanh University, Dec 2016, Ho Chi Minh City, Vietnam. pp.17-18 ; http://ical.amu.edu.pl/ (2016)
|
|
BASE
|
|
Show details
|
|
64 |
MultiVec: a Multilingual and Multilevel Representation Learning Toolkit for NLP
|
|
|
|
In: The 10th edition of the Language Resources and Evaluation Conference (LREC) ; https://hal.archives-ouvertes.fr/hal-01335930 ; The 10th edition of the Language Resources and Evaluation Conference (LREC), May 2016, Portoroz, Slovenia ; http://lrec2016.lrec-conf.org/en/ (2016)
|
|
BASE
|
|
Show details
|
|
65 |
Text extraction in document images: highlight on using corner points
|
|
|
|
In: Proceedings of 12th International Workshop on Document Analysis Systems ; International Workshop on Document Analysis Systems (DAS) ; https://hal.archives-ouvertes.fr/hal-01269802 ; International Workshop on Document Analysis Systems (DAS), Apr 2016, Santorini, Greece (2016)
|
|
BASE
|
|
Show details
|
|
66 |
Éthique et traitement automatique des langues et de la parole : entre truismes et tabous
|
|
|
|
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.archives-ouvertes.fr/hal-01422254 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2016, TAL et éthique, 57 (2), pp.7-19 ; https://www.atala.org/-Revue-TAL- (2016)
|
|
BASE
|
|
Show details
|
|
67 |
Exploring Natural Language Processing Methods for Finno-Ugric Langages
|
|
|
|
In: Second International Workshop on Computational Linguistics for Uralic Languages ; https://hal.archives-ouvertes.fr/hal-01273769 ; Second International Workshop on Computational Linguistics for Uralic Languages, Jan 2016, Szeged, Hungary (2016)
|
|
BASE
|
|
Show details
|
|
68 |
Investigating gender adaptation for speech translation ; Étude de l’adaptation au genre du locuteur pour la traduction de la parole
|
|
|
|
In: Actes de la conférence Traitement Automatique des Langues Naturelles ; 23ème Conférence sur le Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-01353860 ; 23ème Conférence sur le Traitement Automatique des Langues Naturelles, Jul 2016, Paris, France. pp.490-497 (2016)
|
|
BASE
|
|
Show details
|
|
69 |
Construction de dictionnaire électronique des verbes du malgache
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-01371850 ; Editions universitaires europeennes, 2016, 978-3-8417-2775-6 ; https://www.editions-ue.com/ (2016)
|
|
BASE
|
|
Show details
|
|
70 |
Une approche ontologique d’intégration de ressources dictionnairiques et terminologiques, dans le contexte du Web des Données Ouvertes (LOD) pour les Humanités Numériques
|
|
|
|
In: TOTh ; https://hal-cnam.archives-ouvertes.fr/hal-02555547 ; TOTh, Jun 2016, Chambéry, France (2016)
|
|
BASE
|
|
Show details
|
|
71 |
Towards an Automatic Analyze and Standardization of Unstructured Data in the Context of Big and Linked Data
|
|
|
|
In: International ACM Conference on Management of Digital EcoSystems (MEDES'16) ; https://hal-cnam.archives-ouvertes.fr/hal-02555545 ; International ACM Conference on Management of Digital EcoSystems (MEDES'16), Nov 2016, Hendaye, France (2016)
|
|
BASE
|
|
Show details
|
|
72 |
Interfacing the Domain of Global Security with Natural Language Processing: the Role of Language Modelling
|
|
|
|
In: The Tenth International Conference on Natural Language Processing (HrTAL2016) ; https://hal.archives-ouvertes.fr/hal-01954609 ; The Tenth International Conference on Natural Language Processing (HrTAL2016), Sep 2016, Dubrovnik, Croatia. pp.249-252 ; https://hrcak.srce.hr/171801 (2016)
|
|
BASE
|
|
Show details
|
|
73 |
Sentiment detection in micro-blogs using unsupervised chunk extraction
|
|
|
|
In: Lingua Sinica ; https://hal.archives-ouvertes.fr/hal-01573567 ; Lingua Sinica, 2016, 2 (1), ⟨10.1186/s40655-015-0010-8⟩ ; https://link.springer.com/article/10.1186/s40655-015-0010-8 (2016)
|
|
Abstract:
International audience ; In this paper, we present a proposed system designed for sentiment detection for micro-blog data in Chinese. Our system surprisingly benefits from the lack of word boundary in Chinese writing system and shifts the focus directly to larger and more relevant chunks. We use an unsupervised Chinese word segmentation system and binomial test to extract specific and endogenous lexicon chunks from the training corpus. We combine the lexicon chunks with other external resources to train a maximum entropy model for document classification. With this method, we obtained an averaged F1 score of 87.2 which outperforms the state-of-the-art approach based on the released data in the second SocialNLP shared task. 1 Background Recently, due to its great potential applications such as opinion mining and topic detection , sentiment analysis on micro-blog data has gained much attention than ever before. The state-of-the art approaches to sentiment analysis/detection involves attributing a polarity to a textual message. The polarity may accept different sets of values depending on the tasks, such as ratings and binary or ternary values (positive, negative, neutral). The original form of this work had been prepared for the participation in the shared task at the second SocialNLP workshop. The task targets on sentiment detection in Chi-nese micro-blogs, which posts were extracted from Plurk online service and were mainly written in Modern Standard Chinese (MSC) with some code switching or code mixing in English, Japanese, and Taiwanese Hokkien. Messages are provided with meta-data, including timestamps, user IDs of the original posters and repliers. Besides, the posts were grouped topic-wise into 95 files by the task organizers a. In this task, the provided Plurk micro-blogging messages are typically short and are manually annotated with positive or negative polarities by the organizer. In addition to the provided data, external resources of other kinds are also combined for this task, which will be detailed in Section 3. Although it is worth noting that applying result comparisons on different languages or corpora are hazardous for this task, we achieved a score which resembles state-of-the-art on similar tasks in other languages.
|
|
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; [SCCO.LING]Cognitive science/Linguistics; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Emotion lexicon; Sentiment analysis; Unsupervised learning
|
|
URL: https://hal.archives-ouvertes.fr/hal-01573567/file/magistry_hsieh_chang_sentiments.pdf https://doi.org/10.1186/s40655-015-0010-8 https://hal.archives-ouvertes.fr/hal-01573567/document https://hal.archives-ouvertes.fr/hal-01573567
|
|
BASE
|
|
Hide details
|
|
74 |
Supervised Topic Models for Diagnosis Code Assignment to Discharge Summaries
|
|
|
|
In: 17th International Conference on Intelligent Text Processing and Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-02052345 ; 17th International Conference on Intelligent Text Processing and Computational Linguistics, Apr 2016, Konya, Turkey (2016)
|
|
BASE
|
|
Show details
|
|
75 |
Automatic Text Summarization Approaches to Speed up Topic Model Learning Process
|
|
|
|
In: ISSN: 0976-0962 ; International Journal of Computational Linguistics and Applications ; https://hal.archives-ouvertes.fr/hal-02356467 ; International Journal of Computational Linguistics and Applications, Alexander Gelbukh, 2016, 7, pp.87 - 109 (2016)
|
|
BASE
|
|
Show details
|
|
76 |
A semi-supervised Learning Approach to find equivalent long-string Organization Names
|
|
|
|
In: Colloque- Forum PEPS EXIA ; https://hal-enpc.archives-ouvertes.fr/hal-02310298 ; Colloque- Forum PEPS EXIA, Oct 2016, Champs sur Marne, France. 2016 (2016)
|
|
BASE
|
|
Show details
|
|
77 |
A French weblog corpus for new insights on blog post tagging
|
|
|
|
In: 8th International Conference on Corpus Linguistics ; https://hal-auf.archives-ouvertes.fr/hal-01358274 ; 8th International Conference on Corpus Linguistics , Mar 2016, Malaga, Spain ; http://tecnolengua.uma.es/cilc2016/?lang=en (2016)
|
|
BASE
|
|
Show details
|
|
78 |
Building a General Knowledge Base of Physical Objects for Robots
|
|
|
|
In: 13th International Conference, ESWC 2016 ; https://hal.inria.fr/hal-01330142 ; 13th International Conference, ESWC 2016, May 2016, Helaklion, Greece (2016)
|
|
BASE
|
|
Show details
|
|
79 |
Delaunay triangulation-based features for camera-based document image retrieval system
|
|
|
|
In: Document Analysis Systems ; https://hal.archives-ouvertes.fr/hal-01320491 ; Document Analysis Systems, Apr 2016, Santorini, Greece. pp.1-6 (2016)
|
|
BASE
|
|
Show details
|
|
80 |
A Pragma-Semantic Analysis of the Emotion/Sentiment Relation in Debates
|
|
|
|
In: 4th International Workshop on Artificial Intelligence and Cognition ; https://hal.inria.fr/hal-01342438 ; 4th International Workshop on Artificial Intelligence and Cognition, Jul 2016, New York, United States (2016)
|
|
BASE
|
|
Show details
|
|
|
|