DE eng

Search in the Catalogues and Directories

Hits 1 – 2 of 2

1
Evaluating Information Loss from Phonological Dimensionality Reduction ...
Abstract: As in many sciences, cross-linguistic data is often complex and multi-dimensional. The PHOIBLE database of phonological inventories (Moran et al. 2014) is one example, containing 2160 distinct segments across 2155 phoneme inventories. Each segment type is defined by a unique vector of (mostly binary) distinctive phonetic and phonological features. This multivariate dataset can be modeled as a set of coordinates, where each variable (e.g. segment, its distinctive features, its presence in a language, the language's genealogy and location) is an axis in high-dimensional space. Often, such high dimensionality is reduced during analysis. For example, methods which match segments between phonological inventories, such as phonetic string alignment algorithms used for the automated detection of cognates or phonetic similarity measures, typically collapse the vast variability of segment types into as few as ten sound classes. What is unclear is how much information is lost and whether this loss is uniform across ...
Keyword: FOS Languages and literature; Linguistics
URL: https://dx.doi.org/10.6084/m9.figshare.4465976
https://figshare.com/articles/presentation/Evaluating_Information_Loss_from_Phonological_Dimensionality_Reduction/4465976
BASE
Hide details
2
Evaluating Information Loss from Phonological Dimensionality Reduction ...
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
2
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern