1 |
JSEALS Special Publication No. 9 Vietnamese Linguistics: State of the Field
|
|
|
|
In: Journal of the Southeast Asian Linguistics Society (2022)
|
|
BASE
|
|
|
|
2 |
Why final stop phonemes became unreleased stops? The cases of Vietnamese and Korean
|
|
|
|
In: RFP 2021 - 18èmes rencontres du Réseau Français de Phonologie / 18th Meeting of the French Phonology Network (RFP2021), E.A. 999 – Université Clermont Auvergne, Jul 2021, Clermont-Ferrand, France. pp.51-53 ; https://hal.archives-ouvertes.fr/hal-03481440 ; https://rfp2021.sciencesconf.org/ (2021)
|
|
BASE
|
|
|
|
3 |
Prosodic Boundary Prediction Model for Vietnamese Text-To-Speech
|
|
|
|
In: Proc. Interspeech 2021, Aug 2021, Brno, Czech Republic. pp.3885-3889, ⟨10.21437/interspeech.2021-125⟩ ; https://hal.archives-ouvertes.fr/hal-03329116 (2021)
|
|
BASE
|
|
|
|
4 |
Phonologie du vietnamien
|
|
|
|
In: https://hal.archives-ouvertes.fr/hal-03278515 (2021)
|
|
BASE
|
|
|
|
5 |
André Palmeiro's Epistola (Macau 8/V 1632) cum paradigmate Orationis Dominicae Pater Noster in lingua Sinica, Japonica, Annamitica: A linguistic analysis
|
|
|
|
In: Otto Zwartjes; Paolo De Troia. Missionary Linguistics VI. Missionary Linguistics in Asia. Selected Papers from the Tenth International Conference on Missionary Linguistics, Rome 21-24 March 2018, 130, John Benjamins, pp.1-76, 2021, Studies in the History of the Language Sciences, 9789027210043. ⟨10.1075/sihols.130.01zwa⟩ ; https://hal.archives-ouvertes.fr/hal-03420073 ; https://benjamins.com/catalog/sihols.130.01zwa (2021)
|
|
BASE
|
|
|
|
6 |
Automatic Language Identification in Code-Switched Hindi-English Social Media Text
|
|
|
|
In: Journal of Open Humanities Data, Vol 7 (2021) ; 2059-481X
|
|
Abstract:
Natural Language Processing (NLP) tools typically struggle to process code-switched data and so linguists are commonly forced to annotate such data manually. As this data becomes more readily available, automatic tools are increasingly needed to help speed up the annotation process and improve consistency. Last year, such a toolkit was developed to semi-automatically annotate transcribed bilingual code-switched Vietnamese-English speech data with token-based language information and POS tags (hereafter the CanVEC toolkit, L. Nguyen & Bryant, 2020). In this work, we extend this methodology to another language pair, Hindi-English, to explore the extent to which we can standardise the automation process. Specifically, we applied the principles behind the CanVEC toolkit to data from the International Conference on Natural Language Processing (ICON) 2016 shared task, which consists of social media posts (Facebook, Twitter and WhatsApp) that have been annotated with language and POS tags (Molina et al., 2016). We used the ICON-2016 annotations as the gold-standard labels in the language identification task. Ultimately, our tool achieved an F1 score of 87.99% on the ICON-2016 data. We then evaluated the first 500 tokens of each social media subset manually, and found almost 40% of all errors were caused entirely by problems with the gold-standard, i.e., our system was correct. It is thus likely that the overall accuracy of our system is higher than reported. This shows great potential for effectively automating the annotation of code-switched corpora, on different language combinations, and in different genres. We finally discuss some limitations of our approach and release our code and human evaluation together with this paper.
|
|
Keyword:
automatic annotation; code-switching; Computational Linguistics; English; Hindi; language identification; Linguistics; Vietnamese
|
|
URL: https://openhumanitiesdata.metajnl.com/jms/article/view/44 https://doi.org/10.5334/johd.44
|
|
BASE
|
|
|
|
8 |
Cross-generational linguistic variation in the Canberra Vietnamese heritage language community: A corpus-centred investigation ...
|
|
Nguyen, Li. - : Apollo - University of Cambridge Repository, 2021
|
|
BASE
|
|
|
|
9 |
The Interactional Structure of Nominals: An Investigation of Paranouns ...
|
|
|
|
BASE
|
|
|
|
10 |
An exploratory study of predictors of vocabulary knowledge of Vietnamese preschool-age children in a city
|
|
|
|
In: Dutch Journal of Applied Linguistics, Vol 10 (2021)
|
|
BASE
|
|
|
|
11 |
International Phonetic Alphabet (Vietnamese version) ; Alphabet Phonétique International (version vietnamienne) ; Bảng phiên âm quốc tế. Bản tiếng Việt
|
|
|
|
In: https://halshs.archives-ouvertes.fr/halshs-02469549 (2020)
|
|
BASE
|
|
|
|
14 |
Relating production and perception of L2 tone: supplementary materials ...
|
|
Unknown. - : University of Edinburgh. School of Philosophy, Psychology and Language Sciences. Linguistics and English Language, 2020
|
|
BASE
|
|
|
|
17 |
Cross-generational linguistic variation in the Canberra Vietnamese heritage language community: A corpus-centred investigation
|
|
Nguyen, Li. - : University of Cambridge, 2020. : Churchill, 2020
|
|
BASE
|
|
|
|
18 |
Glottal Stop Initials and Nasalization in Sino-Vietnamese and Southern Chinese
|
|
|
|
BASE
|
|
|
|
19 |
Socio-cultural and contextual variables in medical encounters between a doctor, an elderly patient, and the patient’s companion in Vietnam
|
|
Tran, Thi Thao Phuong. - : The University of Queensland, School of Health and Rehabilitation Sciences, 2020
|
|
BASE
|
|
|
|
20 |
Examining the Part-of-speech Features in Assessing the Readability of Vietnamese Texts
|
|
|
|
In: Acta Linguistica Asiatica, Vol 10, Iss 2 (2020)
|
|
BASE
|
|
|
|
|
|