Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Hits 1 – 4 of 4

1	The NLP4NLP Corpus (II): 50 Years of Research in Speech and Language Processing
	Mariani, Joseph,; Francopoulo, Gil; Paroubek, Patrick...
	In: ISSN: 2504-0537 ; EISSN: 2504-0537 ; Frontiers in Research Metrics and Analytics ; https://hal.archives-ouvertes.fr/hal-02413749 ; Frontiers in Research Metrics and Analytics, Frontiers Media, 2019, 3, pp.1-30 (2019)
	BASE
	Show details

2	The NLP4NLP Corpus (I): 50 Years of Publication, Collaboration and Citation in Speech and Language Processing
	Mariani, Joseph,; Francopoulo, Gil; Paroubek, Patrick
	In: ISSN: 2504-0537 ; EISSN: 2504-0537 ; Frontiers in Research Metrics and Analytics ; https://hal.archives-ouvertes.fr/hal-02413751 ; Frontiers in Research Metrics and Analytics, Frontiers Media, 2019, 3, pp.1-30 (2019)
	BASE
	Show details

3	Reuse and Plagiarism in Speech and Natural Language Processing
	Mariani, Joseph,; Francopoulo, Gil; Paroubek, Patrick
	In: ISSN: 1432-5012 ; EISSN: 1432-1300 ; International Journal on Digital Libraries ; https://hal.archives-ouvertes.fr/hal-01840700 ; International Journal on Digital Libraries, Springer Verlag, 2017, 18, pp.1-14 (2017)
	BASE
	Show details

4	A Study of Reuse and Plagiarism in Speech and Natural Language Processing papers
	Mariani, Joseph,; Francopoulo, Gil; Paroubek, Patrick
	In: BIRNDL 2016 ; https://hal.archives-ouvertes.fr/hal-01840817 ; BIRNDL 2016, Jan 2016, Newark, United States (2016)
	Abstract: International audience ; The aim of this experiment is to present an easy way to compare fragments of texts in order to detect (supposed) results ofcopy & paste operations between articles in the domain of Natural Language Processing, including Speech Processing (NLP).The search space of the comparisons is a corpus labelled as NLP4NLP, which includes 34 different sources and gathers alarge part of the publications in the NLP field over the past 50 years. This study considers the similarity between the papersof each individual source and the complete set of papers in the whole corpus, according to four different types of relationship(self-reuse, self-plagiarism, reuse and plagiarism) and in both directions: a source paper borrowing a fragment of text fromanother paper of the collection, or in the reverse direction, fragments of text from the source paper being borrowed andinserted in another paper of the collection.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO]Computer Science [cs]; Informetrics; Natural Language Processing; Plagiarism Detection; Scientometrics; Speech Processing; Text reuse
	URL: https://hal.archives-ouvertes.fr/hal-01840817
	BASE
	Hide details

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern