2 |
The corpora they are a-changing: a case study in Italian newspapers
In: Basile, Pierpaolo orcid:0000-0002-0545-1105 , Caputo, Annalina orcid:0000-0002-7144-8545 , Caselli, Tommaso orcid:0000-0003-2936-0256 , Cassotti, Pierluigi and Varvara, Rossella orcid:0000-0001-9957-2807 (2021) The corpora they are a-changing: a case study in Italian newspapers. In: 2nd International Workshop on Computational Approaches to Historical Language Change 2021, Online. (2021)
3 |
DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues
In: Castilho, Sheila orcid:0000-0002-8416-6555 , Cavalheiro Camargo, João Lucas orcid:0000-0003-3746-1225 , Menezes, Miguel and Way, Andy orcid:0000-0001-5736-5930 (2021) DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues. In: Sixth Conference on Machine Translation (WMT21), 10-11 Nov 2021, Punta Cana, Dominican Republic (Online). ISBN 978-1-954085-94-7 (2021)
4 |
English machine reading comprehension: new approaches to answering multiple-choice questions
Dzendzik, Daria. - : Dublin City University. School of Computing, 2021. : Dublin City University. ADAPT, 2021
In: Dzendzik, Daria (2021) English machine reading comprehension: new approaches to answering multiple-choice questions. PhD thesis, Dublin City University. (2021)
5 |
Chinese character decomposition for neural MT with multi-word expressions
In: Han, Lifeng orcid:0000-0002-3221-2185 , Jones, Gareth J.F. orcid:0000-0003-2923-8365 , Smeaton, Alan F. orcid:0000-0003-1028-8389 and Bolzoni, Paolo (2021) Chinese character decomposition for neural MT with multi-word expressions. In: 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021), 31 May - 2 June 2021, Reykjavik, Iceland (Online). (In Press) (2021)
6 |
cushLEPOR uses LABSE distilled knowledge to improve correlation with human translation evaluations
In: Erofeev, Gleb, Sorokina, Irina, Han, Lifeng orcid:0000-0002-3221-2185 and Gladkoff, Serge (2021) cushLEPOR uses LABSE distilled knowledge to improve correlation with human translation evaluations. In: Machine Translation Summit 2021, 16-20 Aug 2021, USA (online). (In Press) (2021)
7 |
Monte Carlo modelling of confidence intervals in translation quality evaluation (TQE) and post-editing distance (PED) measurement
In: Alekseeva, Alexandra orcid:0000-0002-7990-4592 , Gladkoff, Serge, Sorokina, Irina and Han, Lifeng orcid:0000-0002-3221-2185 (2021) Monte Carlo modelling of confidence intervals in translation quality evaluation (TQE) and post-editing distance (PED) measurement. In: Metrics 2021: Workshop on Informetric and Scientometric Research (SIG-MET), 23-24 Oct 2021, Online. (2021)
8 |
Meta-evaluation of machine translation evaluation methods
In: Han, Lifeng orcid:0000-0002-3221-2185 (2021) Meta-evaluation of machine translation evaluation methods. In: Workshop on Informetric and Scientometric Research (SIG-MET), 23-24 Oct 2021, Online. (2021)
9 |
Proactive information retrieval
Sen, Procheta. - : Dublin City University. School of Computing, 2021. : Dublin City University. ADAPT, 2021
In: Sen, Procheta (2021) Proactive information retrieval. PhD thesis, Dublin City University. (2021)
10 |
Is there a bilingual disadvantage for word segmentation? A computational modeling approach
In: ISSN: 0305-0009 ; EISSN: 1469-7602 ; Journal of Child Language ; https://hal.archives-ouvertes.fr/hal-03498905 ; Journal of Child Language, Cambridge University Press (CUP), 2021, pp.1-28. ⟨10.1017/S0305000921000568⟩ (2021)
11 |
SCALa: A blueprint for computational models of language acquisition in social context
In: ISSN: 0010-0277 ; EISSN: 1873-7838 ; Cognition ; https://hal.inria.fr/hal-03373586 ; Cognition, Elsevier, 2021, 213, pp.104779. ⟨10.1016/j.cognition.2021.104779⟩ (2021)
12 |
Buzz or Change: How the Social Network Structure Conditions the Fate of Lexical Innovations on Twitter
In: 8th Conference on CMC and Social Media Corpora for the Humanities (CMC-Corpora 2021) ; https://hal.archives-ouvertes.fr/hal-03426028 ; 8th Conference on CMC and Social Media Corpora for the Humanities (CMC-Corpora 2021), Oct 2021, Nijmegen, Radboud University, Netherlands (2021)
13 |
Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics ; Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics: Dagstuhl Seminar 21351
In: Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03507948 ; Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics, Aug 2021, pp.89--138, 2021, 2192-5283. ⟨10.4230/DagRep.11.7.89⟩ ; https://gitlab.com/unlid/dagstuhl-seminar/-/wikis/home (2021)
14 |
Do Infants Really Learn Phonetic Categories?
In: EISSN: 2470-2986 ; Open Mind ; https://hal.archives-ouvertes.fr/hal-03550830 ; Open Mind, MIT Press, 2021, 5, pp.113-131. ⟨10.1162/opmi_a_00046⟩ (2021)
15 |
Type-logical investigations: proof-theoretic, computational and linguistic aspects of modern type-logical grammars
In: https://hal-lirmm.ccsd.cnrs.fr/tel-03452731 ; Computation and Language [cs.CL]. Université Montpellier, 2021 (2021)
16 |
Weak supervision for learning discourse structure in multi-party dialogues ; Supervision distante pour l'apprentissage de structures discursives dans les conversations multi-locuteurs
In: https://tel.archives-ouvertes.fr/tel-03622653 ; Artificial Intelligence [cs.AI]. Université Paul Sabatier - Toulouse III, 2021. English. ⟨NNT : 2021TOU30138⟩ (2021)
17 |
Functional pressures and linguistic typology
Abstract:
The explanation of linguistic variation and change is one of the central questions in the language sciences. Functional explanations focus on how the needs and abilities of language users shape the distribution of linguistic structures that typically conventionalize -- e.g. structures that are harder to perceive or learn accurately are less likely to conventionalize accurately. Perceptibility effects are common sound patterns that seem closely related to the relative confusability of different speech sound sequences. One class of explanations -- purely phonological accounts -- have assumed speakers (implicitly) know how confusability varies as a function of immediately adjacent sounds, and that this is a rich enough description of confusability to explain perceptibility effects. Chapter 2 shows that the perceptibility of tokens of any given sound in American English systematically varies based on a listener's incrementally-adjusted expectations about what the speaker intends to say, and shows that this variation is significantly greater than variation due to immediately adjacent sounds. To derive this result, I present a computational psycholinguistic model of word recognition and apply it to experimental confusability data and a transcribed lexicon of 10^4 words. I conclude that purely phonological accounts of perceptibility effects need to be more complicated and less modular than currently appreciated. Chapter 3 applies the same word recognition model and novel information-theoretic measures of confusability to two conversational corpora and shows that words that are more contextually confusable are lengthened in contexts where they are more confusable, and shortened where they are less so. This is a crucial step towards a linking hypothesis between the real-time perceptibility of different speech sound sequences and conventionalized perceptibility effects.
Chapter 4 considers morphology. Prior research has observed an inverse relation between morphological complexity and demographic variables like speech community size and proportion of adult learners. Recent work has hypothesized that higher complexity may be helpful to child learners, and that populations with differing demographics constitute environments with different 'selection pressures' for language variants to 'evolve' in. I argue that mathematical formulations of Darwinian evolution suggest a more likely explanation: 'neutral' change caused by random fluctuations in variant frequency ('drift') is much more powerful in small populations and can easily overwhelm selection relative to large populations.
Keyword:
computational psycholinguistics; language change; Linguistics; morphology; phonology
URL: https://escholarship.org/uc/item/50g9r4tb
18 |
Arc-Eager Construction Provides Learning Advantage Beyond Stack Management
19 |
Neuro-computational models of language processing
In: EISSN: 2333-9691 ; Annual Review of Linguistics ; https://hal.archives-ouvertes.fr/hal-03334485 ; Annual Review of Linguistics, Annual Reviews, In press, ⟨10.1146/lingbuzz/006147⟩ (2021)
20 |
Handling Heavily Abbreviated Manuscripts: HTR engines vs text normalisation approaches
In: International Conference on Document Analysis and Recognition 2021 ; https://hal-enc.archives-ouvertes.fr/hal-03279602 ; International Conference on Document Analysis and Recognition 2021, 2021, Lausanne, Switzerland. pp.306-316, ⟨10.1007/978-3-030-86159-9_21⟩ (2021)