2 |
The corpora they are a-changing: a case study in Italian newspapers
|
|
|
|
In: Basile, Pierpaolo orcid:0000-0002-0545-1105 , Caputo, Annalina orcid:0000-0002-7144-8545 , Caselli, Tommaso orcid:0000-0003-2936-0256 , Cassotti, Pierluigi and Varvara, Rossella orcid:0000-0001-9957-2807 (2021) The corpora they are a-changing: a case study in Italian newspapers. In: 2nd International Workshop on Computational Approaches to Historical Language Change 2021, Online. (2021)
|
|
BASE
|
|
Show details
|
|
3 |
DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues
|
|
|
|
In: Castilho, Sheila orcid:0000-0002-8416-6555 , Cavalheiro Camargo, João Lucas orcid:0000-0003-3746-1225 , Menezes, Miguel and Way, Andy orcid:0000-0001-5736-5930 (2021) DELA Corpus - A Document-Level Corpus Annotated with Context-Related Issues. In: Sixth Conference on Machine Translation (WMT21), 10-11 Nov 2021, Punta Cana, Dominican Republic (Online). ISBN 978-1-954085-94-7 (2021)
|
|
BASE
|
|
Show details
|
|
4 |
English machine reading comprehension: new approaches to answering multiple-choice questions
|
|
Dzendzik, Daria. - : Dublin City University. School of Computing, 2021. : Dublin City University. ADAPT, 2021
|
|
In: Dzendzik, Daria (2021) English machine reading comprehension: new approaches to answering multiple-choice questions. PhD thesis, Dublin City University. (2021)
|
|
BASE
|
|
Show details
|
|
5 |
Chinese character decomposition for neural MT with multi-word expressions
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 , Jones, Gareth J.F. orcid:0000-0003-2923-8365 , Smeaton, Alan F. orcid:0000-0003-1028-8389 and Bolzoni, Paolo (2021) Chinese character decomposition for neural MT with multi-word expressions. In: 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021), 31 May- 2 June 2021, Reykjavik, Iceland (Online). (In Press) (2021)
|
|
BASE
|
|
Show details
|
|
6 |
cushLEPOR uses LABSE distilled knowledge to improve correlation with human translation evaluations
|
|
|
|
In: Erofeev, Gleb, Sorokina, Irina, Han, Lifeng orcid:0000-0002-3221-2185 and Gladkoff, Serge (2021) cushLEPOR uses LABSE distilled knowledge to improve correlation with human translation evaluations. In: Machine Translation Summit 2021, 16-20 Aug 2021, USA (online). (In Press) (2021)
|
|
BASE
|
|
Show details
|
|
7 |
Monte Carlo modelling of confidence intervals in translation quality evaluation (TQE) and post-editing dstance (PED) measurement
|
|
|
|
In: Alekseeva, Alexandra orcid:0000-0002-7990-4592 , Gladkoff, Serge, Sorokina, Irina and Han, Lifeng orcid:0000-0002-3221-2185 (2021) Monte Carlo modelling of confidence intervals in translation quality evaluation (TQE) and post-editing dstance (PED) measurement. In: Metrics 2021: Workshop on Informetric and Scientometric Research (SIG-MET), 23-24 Oct 2021, Online. (2021)
|
|
BASE
|
|
Show details
|
|
8 |
Meta-evaluation of machine translation evaluation methods
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185 (2021) Meta-evaluation of machine translation evaluation methods. In: Workshop on Informetric and Scientometric Research (SIG-MET), 23-24 Oct 2021, Online. (2021)
|
|
BASE
|
|
Show details
|
|
9 |
Proactive information retrieval
|
|
Sen, Procheta. - : Dublin City University. School of Computing, 2021. : Dublin City University. ADAPT, 2021
|
|
In: Sen, Procheta (2021) Proactive information retrieval. PhD thesis, Dublin City University. (2021)
|
|
BASE
|
|
Show details
|
|
10 |
Is there a bilingual disadvantage for word segmentation? A computational modeling approach
|
|
|
|
In: ISSN: 0305-0009 ; EISSN: 1469-7602 ; Journal of Child Language ; https://hal.archives-ouvertes.fr/hal-03498905 ; Journal of Child Language, Cambridge University Press (CUP), 2021, pp.1-28. ⟨10.1017/S0305000921000568⟩ (2021)
|
|
BASE
|
|
Show details
|
|
11 |
SCALa: A blueprint for computational models of language acquisition in social context
|
|
|
|
In: ISSN: 0010-0277 ; EISSN: 1873-7838 ; Cognition ; https://hal.inria.fr/hal-03373586 ; Cognition, Elsevier, 2021, 213, pp.104779. ⟨10.1016/j.cognition.2021.104779⟩ (2021)
|
|
BASE
|
|
Show details
|
|
12 |
Buzz or Change: How the Social Network Structure Conditions the Fate of Lexical Innovations on Twitter
|
|
|
|
In: 8th Conference on CMC and Social Media Corpora for the Humanities (CMC-Corpora 2021) ; https://hal.archives-ouvertes.fr/hal-03426028 ; 8th Conference on CMC and Social Media Corpora for the Humanities (CMC-Corpora 2021), Oct 2021, Nijmegen, Radboud University, Netherlands (2021)
|
|
Abstract:
International audience ; The diffusion process of linguistic innovations has long been a topic of interest in sociolinguistics (Weinreich et al., 1968) and many studies have highlighted the influence of social structures on change (Labov, 2001; Milroy & Milroy, 1997). The recent access to massive social network data and the advent of computational sociolinguistics (Nguyen et al., 2016) allow an approach to this phenomenon that combines a large amount of data and a fine-grained temporality. Using methods from both computational sociolinguistics and network science, we focus on the diffusion of lexical innovations and we ask what differentiates, after an expansion phase, those that stabilize within our observation period from those that are eventually abandoned. In particular, we examine the impact of the social structure of linguistic communities on these diffusion and acceptance processes. We rely on a corpus of French tweets, that spans from 2012 to 2019 and includes about 600 million tweets from more than two million users. Based on the evolution over time of the rate of use of each linguistic form, we select those that appear during the period covered by the corpus and then we distinguish the forms that stabilize from those that eventually die out. By modeling the trajectories, we then identify the three characteristic periods of the diffusion of an innovation (Fagyal et al., 2010). Finally, by establishing the network of contact between users on the basis of their followers and followees, we examine the circulation of forms between them at different periods, and identify factors that condition the stabilization or not of innovations. This poster will present the methodologies used to identify linguistic innovations and to model their trajectory. We will also present the first results on the connection between the evolution of forms and the structure of the contact network.
|
|
Keyword:
[INFO]Computer Science [cs]; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; computational sociolinguistics; language change; lexicon; Twitter
|
|
URL: https://hal.archives-ouvertes.fr/hal-03426028
|
|
BASE
|
|
Hide details
|
|
13 |
Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics ; Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics: Dagstuhl Seminar 21351
|
|
|
|
In: Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03507948 ; Universals of Linguistic Idiosyncrasy in Multilingual Computational Linguistics, Aug 2021, pp.89--138, 2021, 2192-5283. ⟨10.4230/DagRep.11.7.89⟩ ; https://gitlab.com/unlid/dagstuhl-seminar/-/wikis/home (2021)
|
|
BASE
|
|
Show details
|
|
14 |
Do Infants Really Learn Phonetic Categories?
|
|
|
|
In: EISSN: 2470-2986 ; Open Mind ; https://hal.archives-ouvertes.fr/hal-03550830 ; Open Mind, MIT Press, 2021, 5, pp.113-131. ⟨10.1162/opmi_a_00046⟩ (2021)
|
|
BASE
|
|
Show details
|
|
15 |
Type-logical investigations: proof-theoretic, computational and linguistic aspects of modern type-logical grammars
|
|
|
|
In: https://hal-lirmm.ccsd.cnrs.fr/tel-03452731 ; Computation and Language [cs.CL]. Université Montpellier, 2021 (2021)
|
|
BASE
|
|
Show details
|
|
16 |
Weak supervision for learning discourse structure in multi-party dialogues ; Supervision distante pour l'apprentissage de structures discursives dans les conversations multi-locuteurs
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-03622653 ; Artificial Intelligence [cs.AI]. Université Paul Sabatier - Toulouse III, 2021. English. ⟨NNT : 2021TOU30138⟩ (2021)
|
|
BASE
|
|
Show details
|
|
18 |
Arc-Eager Construction Provides Learning Advantage Beyond Stack Management
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Neuro-computational models of language processing
|
|
|
|
In: EISSN: 2333-9691 ; Annual Review of Linguistics ; https://hal.archives-ouvertes.fr/hal-03334485 ; Annual Review of Linguistics, Annual Reviews, In press, ⟨10.1146/lingbuzz/006147⟩ (2021)
|
|
BASE
|
|
Show details
|
|
20 |
Handling Heavily Abbreviated Manuscripts: HTR engines vs text normalisation approaches
|
|
|
|
In: International Conference on Document Analysis and Recognition 2021 ; https://hal-enc.archives-ouvertes.fr/hal-03279602 ; International Conference on Document Analysis and Recognition 2021, 2021, Lausanne, Switzerland. pp.306-316, ⟨10.1007/978-3-030-86159-9_21⟩ (2021)
|
|
BASE
|
|
Show details
|
|
|
|