21 |
Detecting salient events in large corpora by a combination of NLP and data mining techniques (poster)
|
|
|
|
In: Supplementary Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2013) ; https://hal.archives-ouvertes.fr/hal-01023926 ; Supplementary Proceedings of the 14th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2013), Mar 2013, Samos, Greece (2013)
|
|
BASE
|
|
Show details
|
|
22 |
Let Everything Turn Well in Your Wife
|
|
|
|
In: The 51st Annual Meeting of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-01074637 ; The 51st Annual Meeting of the Association for Computational Linguistics, Aug 2013, Sofia, Bulgaria. 6 p (2013)
|
|
BASE
|
|
Show details
|
|
23 |
Aspects de l'itération. L'expression de la répétition en français : analyse linguistique et formalisation.
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-01074784 ; Peter Lang, 106, 2013, 978-3-0343-1415-2 ; http://www.peterlang.com (2013)
|
|
BASE
|
|
Show details
|
|
24 |
Vers une approche « rhétorique » en TAL : application à la veille épidémiologique multilingue
|
|
|
|
In: SEPTET, Des mots aux actes ; https://hal.archives-ouvertes.fr/hal-01074771 ; SEPTET, Des mots aux actes, Editions Anagrammes, 2013, [13 p.] (2013)
|
|
BASE
|
|
Show details
|
|
25 |
Named Entity Filtering based on Concept Association Graphs
|
|
|
|
In: 14th International Conference in Computational Linguistics and Intelligent Text Processing (CICLing 2013) ; https://hal.archives-ouvertes.fr/hal-01073640 ; 14th International Conference in Computational Linguistics and Intelligent Text Processing (CICLing 2013), Mar 2013, Samos, Greece. 11 p (2013)
|
|
BASE
|
|
Show details
|
|
26 |
Overview of INEX 2013
|
|
|
|
In: Information Access Evaluation. Multilinguality, Multimodality, and Visualization ; 4th Conference on Multilingual and Multimodal Information Access Evaluation - CLEF 2013 ; https://hal.archives-ouvertes.fr/hal-01147314 ; 4th Conference on Multilingual and Multimodal Information Access Evaluation - CLEF 2013, Sep 2013, Valencia, Spain. pp.269-281 (2013)
|
|
BASE
|
|
Show details
|
|
27 |
Overview of the INEX 2013 Social Book Search Track
|
|
|
|
In: Information Access Evaluation meets Multilinguality, Multimodality, and Visualization" - Fourth International Conference of the Cross-Language Evaluation Forum, CLEF 2013 ; https://hal.archives-ouvertes.fr/hal-01073644 ; Information Access Evaluation meets Multilinguality, Multimodality, and Visualization" - Fourth International Conference of the Cross-Language Evaluation Forum, CLEF 2013, Sep 2013, Valencia, Spain. 26 p (2013)
|
|
BASE
|
|
Show details
|
|
28 |
Any Language Early Detection of Epidemic Diseases from Web News Streams
|
|
|
|
In: Healthcare Informatics (ICHI), 2013 IEEE International Conference on ; https://hal.archives-ouvertes.fr/hal-01073195 ; Healthcare Informatics (ICHI), 2013 IEEE International Conference on, Sep 2013, philadelphie, United States. pp.159 - 168, ⟨10.1109/ICHI.2013.94⟩ (2013)
|
|
BASE
|
|
Show details
|
|
29 |
Extensible Document-Based Model Web Engineering
|
|
|
|
In: Research Challenges in Information Science (RCIS), 2013 IEEE Seventh International Conference on ; https://hal.archives-ouvertes.fr/hal-01073662 ; Research Challenges in Information Science (RCIS), 2013 IEEE Seventh International Conference on, May 2013, Paris, France. pp.1 - 11, ⟨10.1109/RCIS.2013.6577713⟩ (2013)
|
|
BASE
|
|
Show details
|
|
30 |
DAnIEL, parsimonious yet high-coverage multilingual epidemic surveillance ; DAnIEL : Veille épidémiologique multilingue parcimonieuse
|
|
|
|
In: 20ème conférence du Traitement Automatique du Langage Naturel 2013 (TALN 2013) ; https://hal.archives-ouvertes.fr/hal-01074881 ; 20ème conférence du Traitement Automatique du Langage Naturel 2013 (TALN 2013), Jun 2013, Sables d'Olonne, France. p.787-788 (2013)
|
|
BASE
|
|
Show details
|
|
31 |
Which granularity to bootstrap a multilingual method of document alignment: character N-grams or word N-grams?
|
|
|
|
In: EISSN: 1877-0428 ; Procedia - Social and Behavioral Sciences ; https://hal.archives-ouvertes.fr/hal-01074838 ; Procedia - Social and Behavioral Sciences, Elsevier, 2013, pp.473 - 481 (2013)
|
|
Abstract:
International audience ; This article tackle multilingual automatic alignment. Alignment refers to the process by which segments that are translation ofone another are automatically matched. Instead of comparing only pairs of languages at sentence level, as it is usually done toconform to human process in translation. The computer is used here for its capacity to infer semantic alignment from a collection oftexts that are translations of the same content. The corpus contains press releases from Europa, the European Community website,available in up to 23 languages. The alignment process takes advantage of frequency similarity between different linguistic versionsof a document by computing matching features for each repeated string in all versions. This is done to find reliable anchors inthe process of linking versions. The question of the best granularity is raised to bring out some semantic equivalences, whencomparing two linguistic versions, character N-grams or word N-grams. The alignment systems are traditionally based on wordN-grams splitting. The observation of the morphological variety of languages, even inside a single linguistic family, quickly showsthat the word granularity is inadequate to provide a widely multilingual system, i.e. a language independent system able to handleflexional languages as well as positional languages. Instead, when starting from a multilingual collection to focus on pairs of texts,we defend that character N-grams alignment is more efficient than word N-grams alignment.
|
|
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; alignment; character N-grams based method; Corpus linguistic; matching; multidocuments; multilinguism; Natural Language Processing (NLP)
|
|
URL: https://hal.archives-ouvertes.fr/hal-01074838/file/CHS-LECLUZE-2013-1.pdf https://hal.archives-ouvertes.fr/hal-01074838 https://hal.archives-ouvertes.fr/hal-01074838/document
|
|
BASE
|
|
Hide details
|
|
32 |
Added-Value of Automatic Multilingual Text Analysis for Epidemic Surveillance
|
|
|
|
In: 14th Conference on Artificial Intelligence in Medicine ; https://hal.archives-ouvertes.fr/hal-01074535 ; 14th Conference on Artificial Intelligence in Medicine, May 2013, Murcia, Spain. pp.284 - 294, ⟨10.1007/978-3-642-38326-7_40⟩ (2013)
|
|
BASE
|
|
Show details
|
|
33 |
Parallel areas detection in multi-documents for multilingual alignment ; Détection de zones parallèles à l’intérieur de multi-documents pour l’alignement multilingue
|
|
|
|
In: 20ème conférence du Traitement Automatique du Langage Naturel 2013 (TALN 2013) ; https://hal.archives-ouvertes.fr/hal-01074950 ; 20ème conférence du Traitement Automatique du Langage Naturel 2013 (TALN 2013), Jun 2013, Sables d'Olonne, France (2013)
|
|
BASE
|
|
Show details
|
|
34 |
Graph Mining under Linguistic Constraints to Explore Large Texts
|
|
|
|
In: Computación y sistemas ; 14th international conference on Computational Linguistics and Intelligent Text Processing (CICLing'13) ; https://hal.archives-ouvertes.fr/hal-00817068 ; 14th international conference on Computational Linguistics and Intelligent Text Processing (CICLing'13), Mar 2013, Samos, Greece. pp.239-250 (2013)
|
|
BASE
|
|
Show details
|
|
35 |
SDMC : un outil en ligne d'extraction de motifs séquentiels pour la fouille de textes
|
|
|
|
In: Conférence Francophone sur l'Extraction et la Gestion des Connaissances (EGC'13) ; https://hal.archives-ouvertes.fr/hal-00817074 ; Conférence Francophone sur l'Extraction et la Gestion des Connaissances (EGC'13), Jan 2013, Toulouse, France (2013)
|
|
BASE
|
|
Show details
|
|
36 |
Personalized Semantic Resources: The SemComp Project Presentation and Preliminary Works
|
|
|
|
In: International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (KEOD 2013) ; https://hal.archives-ouvertes.fr/hal-01073599 ; International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (KEOD 2013), Sep 2013, vilamoura, Portugal. ⟨10.5220/0004539501640169⟩ (2013)
|
|
BASE
|
|
Show details
|
|
37 |
Multilingual epidemic surveillance : a parsimonious caracter-based approach ; Veille épidémiologique multilingue : une approche parcimonieuse au grain caractèrefondée sur le genre textuel
|
|
|
|
In: https://hal.archives-ouvertes.fr/tel-01074940 ; Traitement du texte et du document. Université de Caen, 2013. Français (2013)
|
|
BASE
|
|
Show details
|
|
38 |
Opinion analysis: the effect of negation on polarity and intensity
|
|
|
|
In: Proceedings of KONVENS workhop PATHOS - 1st Workshop on Practice and Theory of Opinion Mining and Sentiment Analysis ; KONVENS workhop PATHOS - 1st Workshop on Practice and Theory of Opinion Mining and Sentiment Analysis ; https://hal.archives-ouvertes.fr/hal-01071601 ; KONVENS workhop PATHOS - 1st Workshop on Practice and Theory of Opinion Mining and Sentiment Analysis, Sep 2012, Vienne, Austria. pp.282-290 (2012)
|
|
BASE
|
|
Show details
|
|
39 |
Fouille de données pour la stylistique : cas des motifs séquentiels émergents
|
|
|
|
In: 11es Journées internationales d'Analyse statistique des Données Textuelles (JADT) ; 11es Journées Internationales d'Analyse Statistique des Données Textuelles (JADT'12) ; https://hal.archives-ouvertes.fr/hal-00675586 ; 11es Journées Internationales d'Analyse Statistique des Données Textuelles (JADT'12), Jun 2012, Liège, Belgique. pp.821-833 ; http://lexicometrica.univ-paris3.fr/jadt/jadt2012/tocJADT2012.htm (2012)
|
|
BASE
|
|
Show details
|
|
40 |
Opinion Mining in an Informative Corpus: Building Lexicons
|
|
|
|
In: Proceedings of KONVENS workhop PATHOS - 1st Workshop on Practice and Theory of Opinion Mining and Sentiment Analysis ; KONVENS workhop PATHOS - 1st Workshop on Practice and Theory of Opinion Mining and Sentiment Analysis ; https://hal.archives-ouvertes.fr/hal-01071162 ; KONVENS workhop PATHOS - 1st Workshop on Practice and Theory of Opinion Mining and Sentiment Analysis, 2012, vienne, Austria. pp.314-318 (2012)
|
|
BASE
|
|
Show details
|
|
|
|