1 |
The NLP4NLP Corpus (II): 50 Years of Research in Speech and Language Processing
In: ISSN: 2504-0537 ; EISSN: 2504-0537 ; Frontiers in Research Metrics and Analytics ; https://hal.archives-ouvertes.fr/hal-02413749 ; Frontiers in Research Metrics and Analytics, Frontiers Media, 2019, 3, pp.1-30 (2019)
BASE
2 |
The NLP4NLP Corpus (I): 50 Years of Publication, Collaboration and Citation in Speech and Language Processing
In: ISSN: 2504-0537 ; EISSN: 2504-0537 ; Frontiers in Research Metrics and Analytics ; https://hal.archives-ouvertes.fr/hal-02413751 ; Frontiers in Research Metrics and Analytics, Frontiers Media, 2019, 3, pp.1-30 (2019)
BASE
3 |
Reuse and Plagiarism in Speech and Natural Language Processing
In: ISSN: 1432-5012 ; EISSN: 1432-1300 ; International Journal on Digital Libraries ; https://hal.archives-ouvertes.fr/hal-01840700 ; International Journal on Digital Libraries, Springer Verlag, 2017, 18, pp.1-14 (2017)
BASE
4 |
A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers
In: BIRNDL 2016 ; https://hal.archives-ouvertes.fr/hal-01840817 ; BIRNDL 2016, Jan 2016, Newark, United States (2016)
Abstract:
International audience ; The aim of this experiment is to present an easy way to compare fragments of texts in order to detect (supposed) results of copy & paste operations between articles in the domain of Natural Language Processing, including Speech Processing (NLP). The search space of the comparisons is a corpus labelled as NLP4NLP, which includes 34 different sources and gathers a large part of the publications in the NLP field over the past 50 years. This study considers the similarity between the papers of each individual source and the complete set of papers in the whole corpus, according to four different types of relationship (self-reuse, self-plagiarism, reuse and plagiarism) and in both directions: a source paper borrowing a fragment of text from another paper of the collection, or in the reverse direction, fragments of text from the source paper being borrowed and inserted in another paper of the collection.
Keywords:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Informetrics; Natural Language Processing; Plagiarism Detection; Scientometrics; Speech Processing; Text reuse
URL: https://hal.archives-ouvertes.fr/hal-01840817
BASE