DE eng

Search in the Catalogues and Directories

Hits 1 – 1 of 1

1
Term ranking adaptation to the domain: genetic algorithm based optimisation of the C-Value
In: International Conference on Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01972763 ; International Conference on Natural Language Processing, Springer, Jan 2014, Warsaw, Poland (2014)
Abstract: International audience ; Approaches based on linguistic rules have been proposed to automatically extract candidate terms to help the terminology building from corpora. However, they face to the difficulty to identify the relevant terms among the noun phrases extracted. Although several statistical measures as the frequency or the C-Value have been proposed to ranked the terms according to their termhood, they fail to propose corpus and domain-independent ranking. We tackle this problem by proposing a parametrised C-Value which optimally considers the length and the syntactic roles of the nested terms thanks to a genetic algorithm. We compare its impact on the ranking of term extracted from on three corpora. Results show average precision increases by 9% above the frequency based ranking and by 12% above the C-Value based ranking.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Genetic Algorithm; Term extraction; Term ranking; Terminology
URL: https://hal.archives-ouvertes.fr/hal-01972763
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern