1 |
Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining ...
BASE
2 |
Beyond Offline Mapping: Learning Cross-lingual Word Embeddings through Context Anchoring ...
3 |
Label Verbalization and Entailment for Effective Zero and Few-Shot Relation Extraction ...
4 |
A Call for More Rigor in Unsupervised Cross-lingual Learning ...
7 |
Beyond Offline Mapping: Learning Cross Lingual Word Embeddings through Context Anchoring ...
8 |
Translation Artifacts in Cross-lingual Transfer Learning ...
9 |
Learning about phraseology from corpora: A linguistically motivated approach for Multiword Expression identification
In: PLoS One (2020)
Abstract:
Multiword Expressions (MWEs) are idiosyncratic combinations of words which pose important challenges to Natural Language Processing. Some kinds of MWEs, such as verbal ones, are particularly hard to identify in corpora, due to their high degree of morphosyntactic flexibility. This paper describes a linguistically motivated method to gather detailed information about verb+noun MWEs (VNMWEs) from corpora. Although the main focus of this study is Spanish, the method is easily adaptable to other languages. Monolingual and parallel corpora are used as input, and data about the morphosyntactic variability of VNMWEs is extracted. This information is then tested in an identification task, obtaining an F score of 0.52, which is considerably higher than related work.
Keyword:
Research Article
URL: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7451662/ http://www.ncbi.nlm.nih.gov/pubmed/32853283 https://doi.org/10.1371/journal.pone.0237767
10 |
Adapting NMT to caption translation in Wikimedia Commons for low-resource languages
In: Poncelas, Alberto orcid:0000-0002-5089-1687, Sarasola, Kepa orcid:0000-0003-4349-6088, Dowling, Meghan orcid:0000-0003-1637-4923, Way, Andy orcid:0000-0001-5736-5930, Labaka, Gorka orcid:0000-0003-4611-2502 and Alegria, Iñaki orcid:0000-0002-0272-1472 (2019) Adapting NMT to caption translation in Wikimedia Commons for low-resource languages. Procesamiento del Lenguaje Natural, 63, pp. 33-40. ISSN 1135-5948
11 |
Adapting NMT to caption translation in Wikimedia Commons for low-resource languages
In: Poncelas, Alberto orcid:0000-0002-5089-1687, Sarasola, Kepa orcid:0000-0003-4349-6088, Dowling, Meghan orcid:0000-0003-1637-4923, Way, Andy orcid:0000-0001-5736-5930, Labaka, Gorka orcid:0000-0003-4611-2502 and Alegria, Iñaki orcid:0000-0002-0272-1472 (2019) Adapting NMT to caption translation in Wikimedia Commons for low-resource languages. Procesamiento del Lenguaje Natural, 63, pp. 33-40. ISSN 1135-5948
12 |
Analyzing the Limitations of Cross-lingual Word Embedding Mappings ...
13 |
Neural machine translation of clinical texts between long distance languages
In: J Am Med Inform Assoc (2019)
14 |
Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation ...
17 |
Using linguistic data for English and Spanish verb-noun combination identification
20 |
Comparing rule-based and data-driven approaches to Spanish-to-Basque machine translation
In: Labaka, Gorka, Stroppa, Nicolas, Way, Andy orcid:0000-0001-5736-5930 and Sarasola, Kepa (2007) Comparing rule-based and data-driven approaches to Spanish-to-Basque machine translation. In: Machine Translation Summit XI, 10-14 September 2007, Copenhagen, Denmark.