1 |
Finding Variants for Construction-Based Dialectometry: A Corpus-Based Approach to Regional CxGs ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Representations of Language Varieties Are Reliable Given Corpus Similarity Measures ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Learned Construction Grammars Converge Across Registers Given Increased Exposure ...
|
|
|
|
Abstract:
This paper measures the impact of increased exposure on whether learned construction grammars converge onto shared representations when trained on data from different registers. Register influences the frequency of constructions, with some structures common in formal but not informal usage. We expect that a grammar induction algorithm exposed to different registers will acquire different constructions. To what degree does increased exposure lead to the convergence of register-specific grammars? The experiments in this paper simulate language learning in 12 languages (half Germanic and half Romance) with corpora representing three registers (Twitter, Wikipedia, Web). These simulations are repeated with increasing amounts of exposure, from 100k to 2 million words, to measure the impact of exposure on the convergence of grammars. The results show that increased exposure does lead to converging grammars across all languages. In addition, a shared core of register-universal constructions remains constant across ...
|
|
Keyword:
Computational Linguistics; Machine Learning; Machine Learning and Data Mining; Natural Language Processing
|
|
URL: https://dx.doi.org/10.48448/5gyq-3d26 https://underline.io/lecture/39859-learned-construction-grammars-converge-across-registers-given-increased-exposure
|
|
BASE
|
|
Hide details
|
|
5 |
Production vs Perception: The Role of Individuality in Usage-Based Grammar Induction ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Global Syntactic Variation in Seven Languages: Towards a Computational Dialectology ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Learned Construction Grammars Converge Across Registers Given Increased Exposure
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Production vs Perception: The Role of Individuality in Usage-Based Grammar Induction
|
|
|
|
BASE
|
|
Show details
|
|
10 |
Representations of Language Varieties Are Reliable Given Corpus Similarity Measures
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Modeling Global Syntactic Variation in English Using Dialect Classification ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Mapping Languages and Demographics with Georeferenced Corpora
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Global Syntactic Variation in Seven Languages: Toward a Computational Dialectology
|
|
|
|
In: Front Artif Intell (2019)
|
|
BASE
|
|
Show details
|
|
17 |
Modeling the Complexity and Descriptive Adequacy of Construction Grammars
|
|
|
|
In: Proceedings of the Society for Computation in Linguistics (2018)
|
|
BASE
|
|
Show details
|
|
18 |
Learnability and falsifiability of Construction Grammars
|
|
|
|
In: Proceedings of the Linguistic Society of America; Vol 2 (2017): Proceedings of the Linguistic Society of America; 1:1–15 ; 2473-8689 (2017)
|
|
BASE
|
|
Show details
|
|
19 |
The Linguistic Status of Predictions and Feature Ranks from SVM Text Classifiers
|
|
|
|
In: LSA Annual Meeting Extended Abstracts; Vol 6: LSA Annual Meeting Extended Abstracts 2015; 5:1-5 ; 2377-3367 (2015)
|
|
BASE
|
|
Show details
|
|
|
|