DE eng

Search in the Catalogues and Directories

Hits 1 – 2 of 2

1
Multilingual document clusters discovery. RIAO’2004
In: http://www-list.cea.fr/gb/publications/docs/si/ingenierie_connaissance/gb/riao_2004_mathieu.pdf (2004)
BASE
Show details
2
Multilingual Document Clusters Discovery
In: http://www.riao.org/Proceedings-2004/papers/0090.pdf (2004)
Abstract: Cross Language Information Retrieval community has brought up search engines over multilingual corpora, and multilingual text categorization systems. In this paper, we focus on the multilingual clusters discovery problem, which aim is to extract topic-related multilingual document clusters from a multilingual document collection in an unsupervised way. Our approach is based on a linguistic analysis of the documents that allows to identify relevant features for a vector representation of the documents, each language being associated with a different vector space. We propose a cross-lingual similarity measure for the documents, using bilingual dictionaries. A Shared Nearest Neighbor clustering algorithm is then used to build the clusters. We present an evaluation framework for this task, analyze and discuss the results we obtained and propose directions for future works.
URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.59.4877
http://www.riao.org/Proceedings-2004/papers/0090.pdf
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern