DE eng

Search in the Catalogues and Directories

Hits 1 – 3 of 3

1
Dependency Lengths in Speech and Writing: A Cross-Linguistic Comparison via YouDePP, a Pipeline for Scraping and Parsing YouTube Captions ...
Kramer, Alex. - : University of Massachusetts Amherst, 2021
BASE
Show details
2
Dependency Lengths in Speech and Writing: A Cross-Linguistic Comparison via YouDePP, a Pipeline for Scraping and Parsing YouTube Captions
In: Proceedings of the Society for Computation in Linguistics (2021)
Abstract: Recording, transcribing, and annotating naturalistic spoken data is typically difficult and time-intensive. Online sources, however, are a rich and relatively untapped source of naturalistic speech. Using corpora of 7 languages gathered via YouDePP, a pipeline for scraping and dependency-parsing pre-transcribed speech from YouTube, I investigate how dependency length minimization (DLM) varies across written and spoken modalities. I compare the dependency length growth rates in these corpora to those in Universal Dependencies 2.6 and find that dependency lengths in writing are not consistently longer than those in speech. Rather, the dependency lengths of more head-initial, SVO languages grew at a slightly faster rate in speech than in writing, while the reverse pattern held for more head-final, SOV languages.
Keyword: Computational Linguistics; dependency length minimization; dependency locality; spoken corpora; Typological Linguistics and Linguistic Diversity; typology; YouTube
URL: https://scholarworks.umass.edu/scil/vol4/iss1/37
https://scholarworks.umass.edu/cgi/viewcontent.cgi?article=1175&context=scil
BASE
Hide details
3
A Data-driven Approach to Crosslinguistic Structural Biases
In: Proceedings of the Society for Computation in Linguistics (2021)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
3
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern