1 |
Adapting NMT to caption translation in Wikimedia Commons for low-resource languages ; Adaptando NMT a la traducción de pies de imagen en Wikimedia Commons para idiomas con pocos recursos
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Lexical semantics, Basque and Spanish in QTLeap: Quality Translation by Deep Language Engineering Approaches ; QTLeap - Traducción de calidad mediante tratamientos profundos de ingeniería lingüística
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Matxin-Informatika: versión del traductor Matxin adaptada al dominio de la informática ; Matxin-Informatika, version of Matxin translation system adapted to the computer science domain
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Uso de información morfológica en el alineamiento español-euskera
|
|
|
|
BASE
|
|
Show details
|
|
5 |
An open-source shallow-transfer machine translation engine for the Romance languages of Spain
|
|
Corbí Bellot, Antonio Miguel; Forcada, Mikel L.; Ortiz Rojas, Sergio; Pérez-Ortiz, Juan Antonio; Ramírez Sánchez, Gema; Sánchez-Martínez, Felipe; Alegría Loinaz, Iñaki; Mayor Martínez, Aingeru; Sarasola Gabiola, Kepa. - : European Association for Machine Translation, 2005
|
|
Abstract:
We present the current status of development of an open-source shallow-transfer machine translation engine for the Romance languages of Spain (the main ones being Spanish, Catalan and Galician) as part of a larger government-funded project which includes non-Romance languages such as Basque and involving both universities and linguistic technology companies. The machine translation architecture uses finite-state transducers for lexical processing, hidden Markov models for part-of-speech tagging, and finite-state based chunking for structural transfer, and is largely based upon that of systems already developed by the Transducens group at the Universitat d'Alacant, such as interNOSTRUM (Spanish—Catalan) and Traductor Universia (Spanish—Portuguese). The possible scope of the project, however, is wider, since it will be possible to use the resulting machine translation system with new pairs of languages; to that end, the project also aims at proposing standard formats to encode the linguistic data needed. This paper briefly describes the machine translation engine, the formats it uses for linguistic data, and the compilers that convert these data into an efficient format used by the engine. ; Work funded by projects FIT-340101-2004-3 (Spanish Ministry of Industry, Commerce and Tourism) and TIC2003-08681-C02-01 (Spanish Ministry of Science and Technology). Felipe Sánchez-Martínez is supported by the Spanish Ministry of Science and Education and the European Social Fund through grant BES-2004-4711.
|
|
Keyword:
Lenguajes y Sistemas Informáticos; Machine translation; Romance languages; Shallow-transfer
|
|
URL: http://hdl.handle.net/10045/27525
|
|
BASE
|
|
Hide details
|
|
6 |
Construcción de un corpus etiquetado sintácticamente para el euskera
|
|
|
|
BASE
|
|
Show details
|
|
7 |
Extracción masiva de información sobre subcategorización verbal vasca a partir de corpus
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Etiquetado semiautomático del rasgo semántico de animicidad para su uso en un sistema de traducción automática
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Subcategorización verbal vasca : propuesta inicial y herramienta de validación
|
|
|
|
BASE
|
|
Show details
|
|
|
|