Home
Catalogue search
Refine your search:
Keyword
Creator / Publisher:
Arcan, Mihael (7)
Chakravarthi, Bharathi Raja (7)
McCrae, John P. (6)
Alonso, Juan (1)
Casas, Noe (1)
European Regional Development Fund (1)
Fundación BBVA (1)
Horizon 2020 (1)
Jayapal, Arun (1)
Masoud, Maraim (1)
more
Year
Medium
Type
BLLDB-Access:
free (7)
subject to license (0)
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 7 of 7
1
Bilingual lexicon induction across orthographically-distinct under-resourced Dravidian languages
Chakravarthi, Bharathi Raja
;
Rajasekaran, Navaneethan
;
Arcan, Mihael
;
McGuinness, Kevin
;
O'Connor, Noel E.
;
McCrae, John P.
In: Chakravarthi, Bharathi Raja orcid:0000-0002-4575-7934 , Rajasekaran, Navaneethan, Arcan, Mihael orcid:0000-0002-3116-621X , McGuinness, Kevin orcid:0000-0003-1336-6477 , O'Connor, Noel E. orcid:0000-0002-4033-9135 and McCrae, John P. orcid:0000-0002-7227-1331 (2020) Bilingual lexicon induction across orthographically-distinct under-resourced Dravidian languages. In: 7th Workshop on NLP for Similar Languages, Varieties and Dialects, 13 Dec 2020, Barcelona, Spain (Online). (2020)
Abstract:
Bilingual lexicons are a vital tool for under-resourced languages and recent state-of-the-art approaches to this leverage pretrained monolingual word embeddings using supervised or semi- supervised approaches. However, these approaches require cross-lingual information such as seed dictionaries to train the model and find a linear transformation between the word embedding spaces. Especially in the case of low-resourced languages, seed dictionaries are not readily available, and as such, these methods produce extremely weak results on these languages. In this work, we focus on the Dravidian languages, namely Tamil, Telugu, Kannada, and Malayalam, which are even more challenging as they are written in unique scripts. To take advantage of orthographic information and cognates in these languages, we bring the related languages into a single script. Previous approaches have used linguistically sub-optimal measures such as the Levenshtein edit distance to detect cognates, whereby we demonstrate that the longest common sub-sequence is linguistically more sound and improves the performance of bilingual lexicon induction. We show that our approach can increase the accuracy of bilingual lexicon induction methods on these languages many times, making bilingual lexicon induction approaches feasible for such under-resourced languages.
Keyword:
Computational linguistics
;
Information retrieval
;
Machine translating
URL:
http://doras.dcu.ie/25223/
BASE
Hide details
2
Aspects of Terminological and Named Entity Knowledge within Rule-Based Machine Translation Models for Under-Resourced Neural Machine Translation Scenarios ...
Torregrosa, Daniel
;
Pasricha, Nivranshu
;
Masoud, Maraim
. - : arXiv, 2020
BASE
Show details
3
Comparison of Different Orthographies for Machine Translation of Under-Resourced Dravidian Languages
Chakravarthi, Bharathi Raja
;
Arcan, Mihael
;
McCrae, John P.
. - : Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2019. : OASIcs - OpenAccess Series in Informatics. 2nd Conference on Language, Data and Knowledge (LDK 2019), 2019
BASE
Show details
4
Improving wordnets for under-resourced languages using machine translation
Chakravarthi, Bharathi Raja
;
Arcan, Mihael
;
McCrae, John P.
. - : Global Wordnet Association, 2019
BASE
Show details
5
WordNet gloss translation for under-resourced languages using multilingual neural machine translation
McCrae, John P.
;
Arcan, Mihael
;
Chakravarthi, Bharathi Raja
. - : European Association for Machine Translation, 2019
BASE
Show details
6
Multilingual multimodal machine translation for Dravidian languages utilizing phonetic transcription
Arcan, Mihael
;
Chakravarthi, Bharathi Raja
;
Priyadharshini, Ruba
. - : European Association for Machine Translation, 2019
BASE
Show details
7
Comparison of Different Orthographies for Machine Translation of Under-Resourced Dravidian Languages ...
Chakravarthi, Bharathi Raja
;
Arcan, Mihael
;
McCrae, John P.
. - : Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik GmbH, Wadern/Saarbruecken, Germany, 2019
BASE
Show details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
7
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern