1. Mining an English-Chinese parallel Dataset of Financial News
In: Journal of Open Humanities Data; Vol 8 (2022); 9 ; 2059-481X (2022)
BASE

4. The rumour spectrum
In: ISSN: 1932-6203 ; EISSN: 1932-6203 ; PLoS ONE ; https://hal.archives-ouvertes.fr/hal-01691934 ; PLoS ONE, Public Library of Science, 2018, 13 (1), pp.e0189080.1-27. ⟨10.1371/journal.pone.0189080⟩ (2018)
BASE

5. A semi-supervised Learning Approach to find equivalent long-string Organization Names
In: Colloque- Forum PEPS EXIA ; https://hal-enpc.archives-ouvertes.fr/hal-02310298 ; Colloque- Forum PEPS EXIA, Oct 2016, Champs sur Marne, France. 2016 (2016)
BASE

6. On a Possible Similarity between Gene and Semantic Networks ...
BASE

7. Duplicate Detection with Efficient Language Models for Automatic Bibliographic Heterogeneous Data Integration
In: https://hal.archives-ouvertes.fr/hal-03373972 ; 2015 (2015)
BASE

8. svcR: An R Package for Support Vector Clustering improved with Geometric Hashing applied to Lexical Pattern Discovery
In: https://hal.archives-ouvertes.fr/hal-03373979 ; 2015 (2015)
BASE

9. On a Possible Similarity between Gene and Semantic Networks
In: https://hal.archives-ouvertes.fr/hal-03373977 ; 2015 (2015)
BASE

10. Duplicate Detection with Efficient Language Models for Automatic Bibliographic Heterogeneous Data Integration ...
Abstract:
We present a new method for detecting duplicates when merging different bibliographic record corpora, using lexical and social information. As we show, no trivial key is available for deleting redundant documents. Merging heterogeneous document databases to gather as much information as possible can be of interest. In our case, we try to build a document corpus about the TOR molecule so as to extract relationships with other gene components from the PubMed and WebOfScience document databases. Our approach builds key fingerprints based on n-grams. We built two document gold standards from this corpus for evaluation. A comparison with other well-known deduplication methods gives the best scores for recall (95%) and precision (100%). ...
Keyword:
Databases cs.DB; FOS Computer and information sciences
URL: https://arxiv.org/abs/1504.07597 https://dx.doi.org/10.48550/arxiv.1504.07597
BASE

11. svcR: An R Package for Support Vector Clustering improved with Geometric Hashing applied to Lexical Pattern Discovery ...
BASE

13. Clustering and Relational Ambiguity: from Text Data to Natural Data
In: EISSN: 2416-5999 ; Journal of Data Mining and Digital Humanities ; https://hal.archives-ouvertes.fr/hal-00920423 ; Journal of Data Mining and Digital Humanities, Episciences.org, 2013, 1 (1), pp.1 (2013)
BASE

14. Knowledge Needs and Information Extraction
In: https://hal.inrae.fr/hal-02804243 ; Wiley-ISTE, 269 p., 2013, Computer Engineering and IT series, 978-1-84821-515-3 (2013)
BASE

16. Modeling Noun-Phrases Dynamics in Specialized Text Collections
In: ISSN: 0929-6174 ; Journal of Quantitative Linguistics ; https://hal.archives-ouvertes.fr/hal-02054488 ; Journal of Quantitative Linguistics, Taylor & Francis (Routledge), 2010, 17 (3), pp.212-228. ⟨10.1080/09296174.2010.485447⟩ (2010)
BASE

17. Bayesian Discriminant Analysis for Lexical Semantic Tagging
In: European Meeting on Cybernetics and Systems Research (EMCSR) ; https://hal.archives-ouvertes.fr/hal-03373905 ; European Meeting on Cybernetics and Systems Research (EMCSR), Apr 2002, Vienna, Austria (2002)
BASE

18. Apprentissage statistique pour l'extraction de concepts à partir de textes : application au filtrage d'informations textuelles
In: https://tel.archives-ouvertes.fr/tel-00006210 ; domain_stic.gest. Université Louis Pasteur - Strasbourg I, 2000. French (2000)
BASE