Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5...17

Hits 1 – 20 of 340

1	Automated construction of a French Entity Linking dataset to geolocate social network posts in the context of natural disasters
	Caillaut, Gaëtan; Gracianne, Cécile; Abadie, Nathalie...
	In: ISCRAM ; https://hal.archives-ouvertes.fr/hal-03631387 ; ISCRAM, May 2022, Tarbes, France (2022)
	BASE
	Show details

2	Documenting the gender gap in Indian Wikipedia communities: Findings from a qualitative pilot study
	Chakraborty, Anwesha; Hussain, Netha
	In: First Monday; Volume 27, Number 3 - 7 March 2022 ; 1396-0466 (2022)
	BASE
	Show details

3	Using Geolocated Text to Quantify Location in Real Estate Appraisal
	Heuwinkel, Tim; Kucklick, Jan-Peter; Müller, Oliver. - 2022
	BASE
	Show details

4	Creating Biographical Networks from Chinese and English Wikipedia
	Blouin, Baptiste; Van Den Bosch, Nora; Magistry, Pierre
	In: https://halshs.archives-ouvertes.fr/halshs-03217972 ; 2021 (2021)
	Abstract: With the rise of digital humanities, historians explore how to intellectually engage with textual sources given the available computational tools of today. The ENP-China project employs Natural Language Processing methods to tap into sources of unprecedented scale with the goal to study the transformation of elites in Modern China (1830-1949). One of the subprojects is extracting various kinds of data from biographies and, for that, we created a large corpus of biographies automatically collected from the Chinese and English Wikipedia. The dataset contains 228,144 biographical articles from the offline Chinese Wikipedia copy and is supplemented with 110,713 English biographies that are linked to a Chinese page. We also enriched this bilingual corpus with metadata that records every mentioned person, organization, geopolitical entity and location per Wikipedia biography and links the names to their counterpart in the other language. This data structure allows the researcher to analyze the relationships between biographies via shared contents and compare networks in different language settings. In this paper we will describe our methodology for building this new dataset. The first step was to use automatic text classification for extracting Chinese biographies. We trained a binary classifier to detect biographies on manually classified examples and used a subset of unseen texts to assess its accuracy. The second step used Named Entity Recognition to generate metadata and extract relations from the links in Wikipedia. Furthermore, we will delve into the method for building networks from this dataset. We argue that depending on the specific research question, different networks may be built. Using the metadata, researchers can create various kinds of networks to suit their needs. On top of releasing this dataset as an enriched bilingual corpus, we will provide an online interface to query and explore it. Our interface benefits from the bipartite graph structure (it can be seen as a network of documents and entities) and applies the same exploration and clustering strategy as in Cillex.
	Keyword: [SHS.STAT]Humanities and Social Sciences/Methods and statistics; BERT; biography; deep learning; historical network analysis; NER; Wikidata; Wikipedia
	URL: https://halshs.archives-ouvertes.fr/halshs-03217972/file/Creating_Biographical_Networks_from_Chinese_and_English_Wikipedia.pdf https://halshs.archives-ouvertes.fr/halshs-03217972 https://halshs.archives-ouvertes.fr/halshs-03217972/document
	BASE
	Hide details

5	Who cares about calling non-consensual sex "rape" in summaries of fictional narratives on Wikipedia? From a gender identity hypothesis to recurrent activist discursive practices
	Grand d'Esnon, Anne
	In: Exploring Gender Identities Online ; https://hal-univ-bourgogne.archives-ouvertes.fr/hal-03293248 ; Exploring Gender Identities Online, Jul 2021, Greifswald / Constance (on line), Germany (2021)
	BASE
	Show details

6	Tabouid: un jeu de langage et de culture générale généré à partir de Wikipédia
	Bernard, Timothée
	In: Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale ; Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-03265882 ; Traitement Automatique des Langues Naturelles, 2021, Lille, France. pp.278-279 (2021)
	BASE
	Show details

7	Alfabetizzazione digitale, scrittura enciclopedica ed educazione linguistica democratica ...
	Tavosanis, Mirko Luigi Aurelio. - : University of Salento, 2021
	BASE
	Show details

8	Comparable corpora of South-Slavic Wikipedias CLASSLA-Wikipedia 1.0
	Ljubešić, Nikola; Markoski, Filip; Markoska, Elena. - : Jožef Stefan Institute, 2021
	BASE
	Show details

9	ViquiQuAD: an extractive QA dataset from Catalan Wikipedia ...
	Rodriguez-Penagos, Carlos Gerardo; Armentano-Oller, Carme. - : Zenodo, 2021
	BASE
	Show details

10	FAIR and Open multilingual clinical trials in Wikidata and Wikipedia ...
	Rasberry, Lane; Kwok, Cherrie. - : Zenodo, 2021
	BASE
	Show details

11	ViquiQuAD: an extractive QA dataset from Catalan Wikipedia ...
	Rodriguez-Penagos, Carlos Gerardo; Armentano-Oller, Carme. - : Zenodo, 2021
	BASE
	Show details

12	WikiProject Clinical Trials for multilingual access to information ...
	Rasberry, Lane. - : Zenodo, 2021
	BASE
	Show details

13	FAIR and Open multilingual clinical trials in Wikidata and Wikipedia ...
	Rasberry, Lane; Kwok, Cherrie. - : Zenodo, 2021
	BASE
	Show details

14	The Influence of Multilingualism and Mutual Intelligibility on Wikipedia Reading Behaviour: A Research Proposal ...
	Meier, Florian. - : Universität Regensburg, 2021
	BASE
	Show details

15	WikiProject Clinical Trials for multilingual access to information ...
	Rasberry, Lane. - : Zenodo, 2021
	BASE
	Show details

16	FAIR and Open multilingual clinical trials in Wikidata and Wikipedia ...
	Rasberry, Lane; Kwok, Cherrie. - : Zenodo, 2021
	BASE
	Show details

17	WikiProject Clinical Trials for multilingual access to information ...
	Rasberry, Lane. - : Zenodo, 2021
	BASE
	Show details

18	Graphs, Computation, and Language ...
	Ustalov, Dmitry. - : Zenodo, 2021
	BASE
	Show details

19	Graphs, Computation, and Language ...
	Ustalov, Dmitry. - : Zenodo, 2021
	BASE
	Show details

20	Extracting Relations from Italian Wikipedia using Self-Training ...
	Siciliani, Lucia; Cassotti, Pierluigi; Basile, Pierpaolo. - : Zenodo, 2021
	BASE
	Show details

Page: 1 2 3 4 5...17

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern