1 |
Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness
|
|
|
|
In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies ; https://hal.archives-ouvertes.fr/hal-03272840 ; Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jun 2021, Online, France. pp.4185-4193, ⟨10.18653/v1/2021.naacl-main.330⟩ (2021)
|
|
BASE
|
|
Show details
|
|
2 |
Redefining Absent Keyphrases and their Effect on Retrieval Effectiveness
|
|
|
|
In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies ; https://hal.archives-ouvertes.fr/hal-03477781 ; Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jun 2021, Online, France. pp.4185-4193, ⟨10.18653/v1/2021.naacl-main.330⟩ (2021)
|
|
BASE
|
|
Show details
|
|
3 |
An Evaluation Dataset for Identifying Communicative Functions of Sentences in English Scholarly Papers
|
|
|
|
In: Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020) ; 12th Conference on Language Resources and Evaluation (LREC 2020) ; https://hal.archives-ouvertes.fr/hal-03272825 ; 12th Conference on Language Resources and Evaluation (LREC 2020), May 2020, Marseille, France (2020)
|
|
Abstract:
International audience ; Formulaic expressions, such as 'in this paper we propose', are used by authors of scholarly papers to perform communicative functions; the communicative function of the present example is 'stating the aim of the paper'. Collecting such expressions and pairing them with their communicative functions would be highly valuable for various tasks, particularly for writing assistance. However, such collection and paring in a principled and automated manner would require high-quality annotated data, which are not available. In this study, we address this shortcoming by creating a manually annotated dataset for detecting communicative functions in sentences. Starting from a seed list of labelled formulaic expressions, we retrieved new sentences from scholarly papers in the ACL Anthology and asked multiple human evaluators to label communicative functions. To show the usefulness of our dataset, we conducted a series of experiments that determined to what extent sentence representations acquired by recent models, such as word2vec and BERT, can be employed to detect communicative functions in sentences.
|
|
Keyword:
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; communicative function; formulaic expression; multi-word expression; rhetorical structure; sentence representation
|
|
URL: https://hal.archives-ouvertes.fr/hal-03272825/document https://hal.archives-ouvertes.fr/hal-03272825 https://hal.archives-ouvertes.fr/hal-03272825/file/2020.lrec-1.212.pdf
|
|
BASE
|
|
Hide details
|
|
4 |
Keyphrase Generation for Scientific Document Retrieval
|
|
|
|
In: The 58th Annual Meeting of the Association for Computational Linguistics (ACL) ; https://hal.archives-ouvertes.fr/hal-02556086 ; The 58th Annual Meeting of the Association for Computational Linguistics (ACL), Jul 2020, Online, United States. ⟨10.18653/v1/2020.acl-main.105⟩ (2020)
|
|
BASE
|
|
Show details
|
|
5 |
KPTimes: A Large-Scale Dataset for Keyphrase Generation on News Documents
|
|
|
|
In: 12th International Conference on Natural Language Generation (INLG) ; https://hal.archives-ouvertes.fr/hal-02395709 ; 12th International Conference on Natural Language Generation (INLG), Oct 2019, Tokyo, Japan. pp.130-135, ⟨10.18653/v1/W19-8617⟩ (2019)
|
|
BASE
|
|
Show details
|
|
6 |
KPTimes: A Large-Scale Dataset for Keyphrase Generation on News Documents ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Unsupervised Keyphrase Extraction with Multipartite Graphs
|
|
|
|
In: Proceedings of NAACL-HLT 2018, ; 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT) ; https://hal.archives-ouvertes.fr/hal-01983546 ; 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT), Jun 2018, Nouvelle Orléans, United States. pp.667 - 672 (2018)
|
|
BASE
|
|
Show details
|
|
8 |
Indexation d'articles scientifiques Présentation et résultats du défi fouille de textes DEFT 2016
|
|
|
|
In: Atelier DEFT 2016 ; https://hal.archives-ouvertes.fr/hal-01693785 ; Atelier DEFT 2016, Jul 2016, Paris, France (2016)
|
|
BASE
|
|
Show details
|
|
9 |
TermITH-Eval: a French Standard-Based Resource for Keyphrase Extraction Evaluation
|
|
|
|
In: LREC - Language Resources and Evaluation Conference ; https://hal.archives-ouvertes.fr/hal-01693805 ; LREC - Language Resources and Evaluation Conference, May 2016, Potoroz, Slovenia (2016)
|
|
BASE
|
|
Show details
|
|
10 |
Label Pre-annotation for Building Non-projective Dependency Treebanks for French
|
|
|
|
In: 15th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2014) ; https://hal.archives-ouvertes.fr/hal-01004007 ; 15th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2014), Apr 2014, Kathmandu, Nepal. pp.1-12 (2014)
|
|
BASE
|
|
Show details
|
|
11 |
Keyphrase Extraction for N-best Reranking in Multi-Sentence Compression
|
|
|
|
In: North American Chapter of the Association for Computational Linguistics (NAACL) ; https://hal.archives-ouvertes.fr/hal-00816353 ; North American Chapter of the Association for Computational Linguistics (NAACL), Jun 2013, Atlanta, United States (2013)
|
|
BASE
|
|
Show details
|
|
12 |
Improving Update Summarization by Revisiting the MMR Criterion ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|