By the People Crowdsourcing Datasets from the Library of Congress
In: Journal of Open Humanities Data; Vol 8 (2022); 5; 2059-481X (2022)
Abstract: The By the People (BTP) datasets comprise text of selected collections of the Library of Congress (LOC) created by volunteers in the By the People crowdsourced transcription program, which invites public transcription of historical documents. All transcriptions are created and reviewed by volunteers in a consensus-based model in which two or more volunteers must agree on a transcription for it to be considered complete. Resulting transcriptions are added to the digital collections alongside the images to enable search and accessibility of the collections. Additionally, completed transcription “campaigns” are published as freely downloadable datasets of .CSV files containing all campaign transcriptions, as well as minimal metadata. The datasets can support a multitude of purposes including computational research in fields such as history, linguistics, economics, and political science.
Keywords: accessibility; American Civil War history; baseball and sports history; civil rights history; crowdsourcing; cultural heritage; digital humanities; feminist history; Handwritten Text Recognition; history; text; transcription; women's suffrage
URL: https://openhumanitiesdata.metajnl.com/jms/article/view/67
https://doi.org/10.5334/johd.67
BASE