1 |
A Dataset for Toponym Resolution in Nineteenth-Century English Newspapers
|
|
|
|
In: Journal of Open Humanities Data; Vol 8 (2022); 3 ; 2059-481X (2022)
|
|
BASE
|
|
Show details
|
|
4 |
A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers
|
|
|
|
In: SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval ; https://hal.archives-ouvertes.fr/hal-03418387 ; SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 2021, Virtual Event, Canada. pp.2328-2334, ⟨10.1145/3404835.3463255⟩ (2021)
|
|
Abstract:
International audience ; Named entity processing over historical texts is more and more being used due to the massive documents and archives being stored in digital libraries. However, due to the poor annotated resources of historical nature, information extraction performances fall behind those on contemporary texts. In this paper, we introduce the development of the NewsEye resource, a multilingual dataset for named entity recognition and linking enriched with stances towards named entities. The dataset is comprised of diachronic historical newspaper material published between 1850 and 1950 in French, German, Finnish, and Swedish. Such historical resource is essential in the context of developing and evaluating named entity processing systems. It evenly allows enhancing the performances of existing approaches on historical documents which enables adequate and efficient semantic indexing of historical documents on digital cultural heritage collections.
|
|
Keyword:
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL]; [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]; [INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]; [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]; [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing; datasets; diachronic historical newspapers; entity linking; multilingual; named entity recognition; stance detection
|
|
URL: https://hal.archives-ouvertes.fr/hal-03418387 https://hal.archives-ouvertes.fr/hal-03418387/file/SIGIR2021_NER-resources.pdf https://doi.org/10.1145/3404835.3463255 https://hal.archives-ouvertes.fr/hal-03418387/document
|
|
BASE
|
|
Hide details
|
|
7 |
A Corpus Approach Study on the Manzanar Free Press
|
|
|
|
In: University Honors Theses (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Gado2: multilingual newspapers from the Netherlands Indies ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Gado2: multilingual newspapers from the Netherlands Indies ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Gado2: multilingual newspapers from the Netherlands Indies ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Gado2: multilingual newspapers from the Netherlands Indies ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
ГАЗЕТТІК МАҚАЛАЛАРДЫ АУДАРУДЫҢ МӘДЕНИЕТАРАЛЫҚ ЕРЕКШЕЛІКТЕРІ ... : МЕЖКУЛЬТУРНЫЕ ОСОБЕННОСТИ ПЕРЕВОДА ГАЗЕТНЫХ СТАТЕЙ ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
"Our Gaelic Department": The Irish-Language Column in the New York Irish-American, 1857-1896
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Speaking to the Masses - Hybrid Poetics and Marshall McLuhan's "Newspaper Landscape"
|
|
|
|
In: Studies in Arts and Humanities ; 2 ; 5-21 (2021)
|
|
BASE
|
|
Show details
|
|
17 |
The Wave of ’68. Portraits of Rebel Students in the Italian Press
|
|
|
|
In: El Futuro del Pasado; Vol. 12 (2021); 481-503 ; 1989-9289 ; 10.14201/fdp.202112 (2021)
|
|
BASE
|
|
Show details
|
|
19 |
The Wave of '68. Portraits of Rebel Students in the Italian Press
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Make leave, not war. Intertextual references in the British press coverage of Brexit
|
|
|
|
In: Topics in Linguistics, Vol 22, Iss 2, Pp 1-14 (2021) (2021)
|
|
BASE
|
|
Show details
|
|
|
|