Home
Catalogue search
Refine your search:
Keyword:
Computation and Language cs.CL (7)
FOS Computer and information sciences (7)
Machine translation (3)
code-mixing (3)
dataset (3)
sentiment analysis (3)
Dravidian languages (2)
Phonetic transcription (2)
Tamil (2)
Tamil, Malayalam, Kannada, Dravidian languages, Sentiment Analysis, Offensive langauge identification, Code-mixed, corpora (2)
more
Creator / Publisher:
Chakravarthi, Bharathi Raja (24)
McCrae, John P. (16)
Suryawanshi, Shardul (9)
Arcan, Mihael (8)
Jose, Navya (8)
Priyadharshini, Ruba (8)
European Regional Development Fund (6)
Horizon 2020 (6)
Irish Research Council (5)
Science Foundation Ireland (5)
more
Year
Medium
Type
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Page:
1
2
Hits 1 – 20 of 25
1
DravidianCodeMix: Sentiment Analysis and Offensive Language Identification Dataset for Dravidian Languages in Code-Mixed Text ...
Chakravarthi, Bharathi Raja
;
Priyadharshini, Ruba
;
Muralidaran, Vigneshwaran
. - : Zenodo, 2021
BASE
Show details
2
DravidianCodeMix: Sentiment Analysis and Offensive Language Identification Dataset for Dravidian Languages in Code-Mixed Text ...
Chakravarthi, Bharathi Raja
;
Priyadharshini, Ruba
;
Muralidaran, Vigneshwaran
. - : Zenodo, 2021
BASE
Show details
3
DravidianCodeMix: Sentiment Analysis and Offensive Language Identification Dataset for Dravidian Languages in Code-Mixed Text ...
Chakravarthi, Bharathi Raja
;
Priyadharshini, Ruba
;
Muralidaran, Vigneshwaran
. - : arXiv, 2021
BASE
Show details
4
Attentive fine-tuning of Transformers for Translation of low-resourced languages @LoResMT 2021 ...
Puranik, Karthik
;
Hande, Adeep
;
Priyadharshini, Ruba
. - : arXiv, 2021
BASE
Show details
5
Hypers at ComMA@ICON: Modelling Aggressiveness, Gender Bias and Communal Bias Identification ...
Benhur, Sean
;
Nayak, Roshan
;
Sivanraju, Kanchana
. - : arXiv, 2021
BASE
Show details
6
Dataset for Identification of Homophobia and Transophobia in Multilingual YouTube Comments ...
Chakravarthi, Bharathi Raja
;
Priyadharshini, Ruba
;
Ponnusamy, Rahul
. - : arXiv, 2021
BASE
Show details
7
https://www.aclweb.org/anthology/W19-6809/ ...
Chakravarthi, Bharathi Raja
. - : Zenodo, 2021
BASE
Show details
8
https://www.aclweb.org/anthology/W19-6809/ ...
Chakravarthi, Bharathi Raja
. - : Zenodo, 2021
BASE
Show details
9
Bilingual lexicon induction across orthographically-distinct under-resourced Dravidian languages
Chakravarthi, Bharathi Raja
;
Rajasekaran, Navaneethan
;
Arcan, Mihael
;
McGuinness, Kevin
;
O'Connor, Noel E.
;
McCrae, John P.
In: Chakravarthi, Bharathi Raja orcid:0000-0002-4575-7934 , Rajasekaran, Navaneethan, Arcan, Mihael orcid:0000-0002-3116-621X , McGuinness, Kevin orcid:0000-0003-1336-6477 , O'Connor, Noel E. orcid:0000-0002-4033-9135 and McCrae, John P. orcid:0000-0002-7227-1331 (2020) Bilingual lexicon induction across orthographically-distinct under-resourced Dravidian languages. In: 7th Workshop on NLP for Similar Languages, Varieties and Dialects, 13 Dec 2020, Barcelona, Spain (Online). (2020)
Abstract:
Bilingual lexicons are a vital tool for under-resourced languages and recent state-of-the-art approaches to this leverage pretrained monolingual word embeddings using supervised or semi- supervised approaches. However, these approaches require cross-lingual information such as seed dictionaries to train the model and find a linear transformation between the word embedding spaces. Especially in the case of low-resourced languages, seed dictionaries are not readily available, and as such, these methods produce extremely weak results on these languages. In this work, we focus on the Dravidian languages, namely Tamil, Telugu, Kannada, and Malayalam, which are even more challenging as they are written in unique scripts. To take advantage of orthographic information and cognates in these languages, we bring the related languages into a single script. Previous approaches have used linguistically sub-optimal measures such as the Levenshtein edit distance to detect cognates, whereby we demonstrate that the longest common sub-sequence is linguistically more sound and improves the performance of bilingual lexicon induction. We show that our approach can increase the accuracy of bilingual lexicon induction methods on these languages many times, making bilingual lexicon induction approaches feasible for such under-resourced languages.
Keyword:
Computational linguistics
;
Information retrieval
;
Machine translating
URL:
http://doras.dcu.ie/25223/
BASE
Hide details
10
A sentiment analysis dataset for code-mixed Malayalam-English
Sherly, Elizabeth
;
Jose, Navya
;
McCrae, John P.
. - : European Language Resources Association (ELRA), 2020
BASE
Show details
11
Corpus creation for sentiment analysis in code-mixed Tamil-English text
McCrae, John P.
;
Priyadharshini, Ruba
;
Muralidaran, Vigneshwaran
. - : European Language Resources Association (ELRA), 2020
BASE
Show details
12
Leveraging orthographic information to improve machine translation of under-resourced languages
Asoka Chakravarthi, Bharathi Raja
. - : NUI Galway, 2020
BASE
Show details
13
A comparative study of different state-of-the-art hate speech detection methods in Hindi-English code-mixed data
Fransen, Theodorus
;
McCrae, John P.
;
Chakravarthi, Bharathi Raja
. - : European Language Resources Association (ELRA), 2020
BASE
Show details
14
A survey of current datasets for code-switching research
Jose, Navya
;
Chakravarthi, Bharathi Raja
;
Suryawanshi, Shardul
. - : IEEE, 2020
BASE
Show details
15
NUIG-Panlingua-KMI Hindi-Marathi MT Systems for Similar Language Translation Task @ WMT 2020
Chakravarthi, Bharathi Raja
;
Kumar, Ritesh
;
Rani, Priya
. - : Association for Computational Linguistics, 2020
BASE
Show details
16
A Sentiment Analysis Dataset for Code-Mixed Malayalam-English ...
Chakravarthi, Bharathi Raja
;
Jose, Navya
;
Suryawanshi, Shardul
. - : arXiv, 2020
BASE
Show details
17
ULD@NUIG at SemEval-2020 Task 9: Generative Morphemes with an Attention Model for Sentiment Analysis in Code-Mixed Text ...
Goswami, Koustava
;
Rani, Priya
;
Chakravarthi, Bharathi Raja
. - : arXiv, 2020
BASE
Show details
18
A Sentiment Analysis Dataset for Code-Mixed Malayalam-English ...
Chakravarthi, Bharathi Raja
;
Jose, Navya
;
Suryawanshi, Shardul
. - : Zenodo, 2020
BASE
Show details
19
A Sentiment Analysis Dataset for Code-Mixed Malayalam-English ...
Chakravarthi, Bharathi Raja
;
Jose, Navya
;
Suryawanshi, Shardul
. - : Zenodo, 2020
BASE
Show details
20
Aspects of Terminological and Named Entity Knowledge within Rule-Based Machine Translation Models for Under-Resourced Neural Machine Translation Scenarios ...
Torregrosa, Daniel
;
Pasricha, Nivranshu
;
Masoud, Maraim
. - : arXiv, 2020
BASE
Show details
Page:
1
2
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
0
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
0
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
25
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern