Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3

Hits 1 – 20 of 52

1	Probing for the Usage of Grammatical Number ...
	Lasri, Karim; Pimentel, Tiago; Lenci, Alessandro. - : arXiv, 2022
	BASE
	Show details

2	On Homophony and Rényi Entropy ...
	Pimentel, Tiago; Meister, Clara Isabel; Teufel, Simone. - : ETH Zurich, 2021
	BASE
	Show details

3	On Homophony and Rényi Entropy ...
	Pimentel, Tiago; Meister, Clara; Teufel, Simone. - : arXiv, 2021
	BASE
	Show details

4	On Homophony and Rényi Entropy ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Cotterell, Ryan; Meister, Clara. - : Underline Science Inc., 2021
	BASE
	Show details

5	Finding Concept-specific Biases in Form--Meaning Associations ...
	Pimentel, Tiago; Roark, Brian; Wichmann, Søren. - : arXiv, 2021
	BASE
	Show details

6	Quantifying Gender Bias Towards Politicians in Cross-Lingual Language Models ...
	Stańczak, Karolina; Choudhury, Sagnik Ray; Pimentel, Tiago. - : arXiv, 2021
	BASE
	Show details

7	Revisiting the Uniform Information Density Hypothesis ...
	Meister, Clara; Pimentel, Tiago; Haller, Patrick. - : arXiv, 2021
	BASE
	Show details

8	Revisiting the Uniform Information Density Hypothesis ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Cotterell, Ryan; Haller, Patrick. - : Underline Science Inc., 2021
	BASE
	Show details

9	Modeling the Unigram Distribution ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Blasi, Damián; Cotterell, Ryan. - : Underline Science Inc., 2021
	BASE
	Show details

10	A Bayesian Framework for Information-Theoretic Probing ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Cotterell, Ryan; Pimentel, Tiago. - : Underline Science Inc., 2021
	BASE
	Show details

11	A surprisal--duration trade-off across and within the world's languages ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Blasi, Damián; Cotterell, Ryan. - : Underline Science Inc., 2021
	BASE
	Show details

12	Revisiting the Uniform Information Density Hypothesis ...
	Meister, Clara Isabel; Pimentel, Tiago; Haller, Patrick. - : ETH Zurich, 2021
	BASE
	Show details

13	What About the Precedent: An Information-Theoretic Analysis of Common Law ...
	Valvoda, Josef; Pimentel, Tiago; Stoehr, Niklas. - : ETH Zurich, 2021
	BASE
	Show details

14	Modeling the Unigram Distribution ...
	Nikkarinen, Irene; Pimentel, Tiago; Blasi, Damián. - : ETH Zurich, 2021
	BASE
	Show details

15	Finding Concept-specific Biases in Form–Meaning Associations ...
	Pimentel, Tiago; Roark, Brian; Wichmann, Søren. - : ETH Zurich, 2021
	BASE
	Show details

16	How (Non-)Optimal is the Lexicon? ...
	Pimentel, Tiago; Nikkarinen, Irene; Mahowald, Kyle. - : arXiv, 2021
	BASE
	Show details

17	Disambiguatory Signals are Stronger in Word-initial Positions ...
	Pimentel, Tiago; Cotterell, Ryan; Roark, Brian. - : arXiv, 2021
	BASE
	Show details

18	Modeling the Unigram Distribution
	Blasi, Damián; Pimentel, Tiago; Nikkarinen, Irene; Cotterell, Ryan
	In: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (2021)
	Abstract: The unigram distribution is the non-contextual probability of finding a specific word form in a corpus. While of central importance to the study of language, it is commonly approximated by each word’s sample frequency in the corpus. This approach, being highly dependent on sample size, assigns zero probability to any out-of-vocabulary (oov) word form. As a result, it produces negatively biased probabilities for any oov word form, while positively biased probabilities to in corpus words. In this work, we argue in favor of properly modeling the unigram distribution—claiming it should be a central task in natural language processing. With this in mind, we present a novel model for estimating it in a language (a neuralization of Goldwater et al.’s (2011) model) and show it produces much better estimates across a diverse set of 7 languages than the naïve use of neural character-level language models.
	URL: https://hdl.handle.net/20.500.11850/518989 https://doi.org/10.3929/ethz-b-000518989
	BASE
	Hide details

19	What About the Precedent: An Information-Theoretic Analysis of Common Law
	Teufel, Simone; Valvoda, Josef; Stoehr, Niklas...
	In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2021)
	BASE
	Show details

20	Finding Concept-specific Biases in Form–Meaning Associations
	Roark, Brian; Blasi, Damián; Pimentel, Tiago...
	In: Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2021)
	BASE
	Show details

Page: 1 2 3

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern