2 |
The extent and degree of utterance-final word lengthening in spontaneous speech from 10 languages
|
|
|
|
In: Linguistics Vanguard ; https://hal.univ-lyon2.fr/hal-03167445 ; Linguistics Vanguard, 2021, 7 (1), pp.20190063. ⟨10.1515/lingvan-2019-0063⟩ (2021)
|
|
BASE
|
|
Show details
|
|
3 |
Finding Concept-specific Biases in Form--Meaning Associations ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Finding Concept-specific Biases in Form–Meaning Associations ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Finding Concept-specific Biases in Form–Meaning Associations
|
|
|
|
In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2021)
|
|
Abstract:
This work presents an information-theoretic operationalisation of cross-linguistic non-arbitrariness. It is not a new idea that there are small, cross-linguistic associations between the forms and meanings of words. For instance, it has been claimed (Blasi et al., 2016) that the word for “tongue” is more likely than chance to contain the phone [l]. By controlling for the influence of language family and geographic proximity within a very large concept-aligned, cross-lingual lexicon, we extend methods previously used to detect within language non-arbitrariness (Pimentel et al., 2019) to measure cross-linguistic associations. We find that there is a significant effect of non-arbitrariness, but it is unsurprisingly small (less than 0.5% on average according to our information-theoretic estimate). We also provide a concept-level analysis which shows that a quarter of the concepts considered in our work exhibit a significant level of cross-linguistic non-arbitrariness. In sum, the paper provides new methods to detect cross-linguistic associations at scale, and confirms their effects are minor.
|
|
URL: https://doi.org/10.3929/ethz-b-000518985 https://hdl.handle.net/20.500.11850/518985
|
|
BASE
|
|
Hide details
|
|
6 |
CLDF dataset derived from Wichmann's "Lexicostatistical Dataset of Mixe-Zoquean" from 2006 ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
CLDF dataset derived from Wichmann's "Lexicostatistical Dataset of Mixe-Zoquean" from 2006 ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Finding Concept-specific Biases in Form--Meaning Associations ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
The extent and degree of utterance-final word lengthening in spontaneous speech from 10 languages
|
|
|
|
BASE
|
|
Show details
|
|
10 |
English colour terms carry gender and valence biases: A corpus study using word embeddings
|
|
|
|
In: PLOS ONE, vol. 16, no. 6, pp. e0251559 (2021)
|
|
BASE
|
|
Show details
|
|
11 |
CLDF dataset derived from Wichmann et al.'s "ASJP Database" v17 from 2016 ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
CLDF dataset derived from Wichmann et al.'s "ASJP Database" v18 from 2018 ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
CLDF dataset derived from Wichmann et al.'s "ASJP Database" v19 from 2020 ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
CLDF dataset derived from Wichmann et al.'s "ASJP Database" v13 from 2010 ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
CLDF dataset derived from Wichmann et al.'s "ASJP Database" v15 from 2012 ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
CLDF dataset derived from Wichmann et al.'s "ASJP Database" v19 from 2020 ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
CLDF dataset derived from Wichmann et al.'s "ASJP Database" v19 from 2020 ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
CLDF dataset derived from Wichmann et al.'s "ASJP Database" from 2010 ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
CLDF dataset derived from Wichmann et al.'s "ASJP Database" v14 from 2011 ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
CLDF dataset derived from Wichmann et al.'s "ASJP Database" v16 from 2013 ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|