2 |
On Homophony and Rényi Entropy ...
|
|
|
|
Abstract:
Homophony's widespread presence in natural languages is a controversial topic. Recent theories of language optimality have tried to justify its prevalence, despite its negative effects on cognitive processing time; e.g., Piantadosi et al. (2012) argued homophony enables the reuse of efficient wordforms and is thus beneficial for languages. This hypothesis has recently been challenged by Trott and Bergen (2020), who posit that good wordforms are more often homophonous simply because they are more phonotactically probable. In this paper, we join in on the debate. We first propose a new information-theoretic quantification of a language's homophony: the sample Rényi entropy. Then, we use this quantification to revisit Trott and Bergen's claims. While their point is theoretically sound, a specific methodological issue in their experiments raises doubts about their results. After addressing this issue, we find no clear pressure either towards or against homophony -- a much more nuanced result than either ... : Accepted for publication in EMNLP 2021. Code available in https://github.com/rycolab/homophony-as-renyi-entropy ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/2109.13766 https://dx.doi.org/10.48550/arxiv.2109.13766
|
|
BASE
|
|
Hide details
|
|
4 |
A surprisal--duration trade-off across and within the world's languages ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
What About the Precedent: An Information-Theoretic Analysis of Common Law ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
What About the Precedent: An Information-Theoretic Analysis of Common Law
|
|
|
|
In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Synthetic Textual Features for the Large-Scale Detection of Basic-level Categories in English and Mandarin ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Proceedings of the 17th European Chapter of the Association for Computational Linguistics ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Metaphor Detection Using Context and Concreteness
|
|
|
|
In: Proceedings of the Second Workshop on Figurative Language Processing (2020)
|
|
BASE
|
|
Show details
|
|
10 |
Variable typing: Assigning meaning to variables in mathematical text
|
|
Stathopoulos, YA; Baker, Simon; Rei, Marek. - : Association for Computational Linguistics, 2018. : https://aclanthology.org/volumes/N18-1/, 2018. : NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference, 2018
|
|
BASE
|
|
Show details
|
|
12 |
Unsupervised Timeline Generation for Wikipedia History Articles ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Improving argument overlap for proposition-based summarisation ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Predicting the impact of scientific concepts using full‐text features
|
|
|
|
BASE
|
|
Show details
|
|
15 |
Unsupervised Timeline Generation for Wikipedia History Articles
|
|
Bauer, Sandro; Teufel, Simone. - : Association for Computational Linguistics, 2016. : https://aclanthology.org/volumes/D16-1/, 2016. : Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016
|
|
BASE
|
|
Show details
|
|
16 |
Improving argument overlap for proposition-based summarisation
|
|
Fang, Y; Teufel, Simone. - : Association for Computational Linguistics, 2016. : http://aclanthology.info/papers/P16-2078/improving-argument-overlap-for-proposition-based-summarisation, 2016. : 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Short Papers, 2016
|
|
BASE
|
|
Show details
|
|
17 |
Identifying Problem Statements in Scientific Text
|
|
Heffernan, Kevin; Teufel, Simone. - : University of Potsdam, 2016. : http://www.ling.uni-potsdam.de/comma2016/, 2016. : Workshop on Foundations of the Language of Argumentation (in conjunction with COMMA), 2016
|
|
BASE
|
|
Show details
|
|
18 |
A Proposition-based Abstractive Summarizer
|
|
Fang, Y; Zhu, H; Muszynska, E. - : International Committee on Computational Linguistics, 2016. : https://aclanthology.org/volumes/C16-1/, 2016. : Proceedings of COLING 2016, 2016
|
|
BASE
|
|
Show details
|
|
20 |
MEAD - A Platform for Multidocument Multilingual Text Summarization
|
|
|
|
BASE
|
|
Show details
|
|
|
|