1 |
Chinese character decomposition for neural MT with multi-word expressions
|
|
|
|
In: Han, Lifeng orcid:0000-0002-3221-2185, Jones, Gareth J.F. orcid:0000-0003-2923-8365, Smeaton, Alan F. orcid:0000-0003-1028-8389 and Bolzoni, Paolo (2021) Chinese character decomposition for neural MT with multi-word expressions. In: 23rd Nordic Conference on Computational Linguistics (NoDaLiDa 2021), 31 May - 2 June 2021, Reykjavik, Iceland (Online). (In Press) (2021)
|
|
Abstract:
Chinese character decomposition has been used as a feature to enhance Machine Translation (MT) models, combining radicals into character and word level models. Recent work has investigated ideograph or stroke level embedding. However, questions remain about different decomposition levels of Chinese character representations, radical and strokes, best suited for MT. To investigate the impact of Chinese decomposition embedding in detail, i.e., radical, stroke, and intermediate levels, and how well these decompositions represent the meaning of the original character sequences, we carry out analysis with both automated and human evaluation of MT. Furthermore, we investigate if the combination of decomposed Multiword Expressions (MWEs) can enhance the model learning. MWE integration into MT has seen more than a decade of exploration. However, decomposed MWEs have not previously been explored.
|
|
Keywords:
Algorithms; Artificial intelligence; Computational linguistics; Computer software; Language; Linguistics; Machine translating; Semantics
|
|
URL: http://doras.dcu.ie/25742/
|
|
BASE
|
|
2 |
Quantification: the view from natural language generation ...
|
|
|
|
BASE
|
|
3 |
Quantification: the view from natural language generation
|
|
|
|
In: Frontiers in Artificial Intelligence ; 2021. - https://doi.org/10.3389/frai.2021.627177 (2021)
|
|
BASE
|
|
4 |
A study of semantics across different representations of language
|
|
|
|
BASE
|
|
7 |
Desambiguación Verbal Automática. Un estudio sobre el rendimiento de la información semántica argumental
|
|
|
|
BASE
|
|
9 |
An LFG analysis of pronominal binding in Mandarin Chinese
|
|
|
|
In: Proceedings of the Linguistic Society of America; Vol 1 (2016): Proceedings of the Linguistic Society of America; 2:1–15 ; 2473-8689 (2016)
|
|
BASE
|
|
10 |
Tool paper: Combining Alf and UML in modeling tools: An example with papyrus
|
|
|
|
In: 15th International Workshop on OCL and Textual Modeling, OCL 2015 ; https://hal-cea.archives-ouvertes.fr/cea-01844056 ; 15th International Workshop on OCL and Textual Modeling, OCL 2015, Sep 2015, Ottawa, Canada. pp.105-119 (2015)
|
|
BASE
|
|
11 |
Investigating the use of distributional semantic models for co-hyponym identification in special corpora
|
|
|
|
BASE
|
|
12 |
Analysing entity context in multilingual wikipedia to support entity-centric retrieval applications ...
|
|
|
|
BASE
|
|
13 |
Common-sense knowledge for natural language understanding: Experiments in unsupervised and supervised settings
|
|
|
|
BASE
|
|
16 |
CEA LIST's participation at the CLEF CHiC 2013
|
|
|
|
In: 2013 Cross Language Evaluation Forum Conference, CLEF 2013 ; https://hal-cea.archives-ouvertes.fr/cea-01844707 ; 2013 Cross Language Evaluation Forum Conference, CLEF 2013, Sep 2013, Valencia, Spain (2013)
|
|
BASE
|
|
17 |
Semantisches Wörterbuch der deutschen Sprache für maschinelle Sprachverarbeitungssysteme [Online resource]
|
|
|
|
In: Aussiger Beiträge : germanistische Schriftenreihe aus Forschung und Lehre / Katedra germanistiky FF UJEP 7 (2013), 103-117
|
|
Linguistik-Repository
|
|
|
|