61 |
Domain-Specific Term-List Expansion Using Existing Linguistic Resources
|
|
|
|
In: DTIC (2002)
|
|
BASE
|
|
Show details
|
|
62 |
Improved Word-Level Alignment: Injecting Knowledge about MT Divergences
|
|
|
|
In: DTIC (2002)
|
|
Abstract:
Word-level alignments of bilingual text (bitexts) are not an integral part of statistical machine translation models, but also useful for lexical acquisition, treebank construction. and part-of-speech tagging. The frequent occurrence of divergences, structural differences between languages, presents a great challenge to the alignment task. We resolve some of the most prevalent divergence cases by using syntactic parse information to transform the sentence structure of one language to bear a closer resemblance to that of the other language. In this paper, we show that common divergence types can be found in multiple language pairs (in particular, we focus on English-Spanish and English-Arabic) and systematically identified. We describe our techniques for modifying English parse trees to form resulting sentences that share more similarity with the sentences in the other languages; finally, we present an empirical analysis comparing the complexities of performing word-level alignments with an without divergence handling. Our results suggest that divergence-handling can improve word-level alignment. ; Additional rept. no. UMIACS-TR-2002-15.
|
|
Keyword:
*LEXICOGRAPHY; *MACHINE TRANSLATION; *STATISTICAL ANALYSIS; *WORDS(LANGUAGE); ACQUISITION; ALIGNMENT; EXPERIMENTAL DATA; LANGUAGE; Linguistics; MATHEMATICAL MODELS; Statistics and Probability; TREES
|
|
URL: http://oai.dtic.mil/oai/oai?&verb=getRecord&metadataPrefix=html&identifier=ADA458774 http://www.dtic.mil/docs/citations/ADA458774
|
|
BASE
|
|
Hide details
|
|
63 |
Handling Translation Divergences: Combining Statistical and Symbolic Techniques in Generation-Heavy Machine Translation
|
|
|
|
In: DTIC (2002)
|
|
BASE
|
|
Show details
|
|
64 |
Creating Parsing Lexicons from Semantic Lexicons Automatically and Its Applications
|
|
|
|
In: DTIC (2002)
|
|
BASE
|
|
Show details
|
|
65 |
Constraints on the Generation of Tense, Aspect, and Connecting Words from Temporal Expressions
|
|
|
|
In: DTIC (2002)
|
|
BASE
|
|
Show details
|
|
66 |
Construction of a Chinese-English Verb Lexicon for Embedded Machine Translation in Cross-Language Information Retrieval
|
|
|
|
BASE
|
|
Show details
|
|
67 |
Domain-Specific Term-List Expansion Using Existing Linguistic Resources
|
|
|
|
BASE
|
|
Show details
|
|
68 |
Constraints on the Generation of Tense, Aspect, and Connecting Words from Temporal Expressions
|
|
|
|
BASE
|
|
Show details
|
|
69 |
Handling Translation Divergences: Combining Statistical and Symbolic Techniques in Generation-Heavy Machine Translation
|
|
|
|
BASE
|
|
Show details
|
|
70 |
Handling Translation Divergences in Generation-Heavy Hybrid Machine Translation
|
|
|
|
BASE
|
|
Show details
|
|
72 |
Efficient Language Independent Generation from Lexical Conceptual Structure
|
|
|
|
In: DTIC (2001)
|
|
BASE
|
|
Show details
|
|
73 |
Building a Chinese-English Mapping Between Verb Concepts for Multilingual Applications
|
|
|
|
In: DTIC (2001)
|
|
BASE
|
|
Show details
|
|
74 |
Large Scale Language Independent Generation Using Thematic Hierarchies
|
|
|
|
In: DTIC (2001)
|
|
BASE
|
|
Show details
|
|
75 |
Mapping Lexical Entries in a Verbs Database to WordNet Senses
|
|
|
|
In: DTIC (2001)
|
|
BASE
|
|
Show details
|
|
76 |
Spanish Language Processing at University of Maryland: Building Infrastructure for Multilingual Applications
|
|
|
|
In: DTIC (2001)
|
|
BASE
|
|
Show details
|
|
77 |
Mapping WorldNet Senses to a Lexical Database of Verbs
|
|
|
|
In: DTIC (2001)
|
|
BASE
|
|
Show details
|
|
78 |
Efficient Language Independent Generation from Lexical Conceptual Structures
|
|
|
|
BASE
|
|
Show details
|
|
79 |
Lexical Resource Integration across the Syntax-Semantics Interface
|
|
|
|
BASE
|
|
Show details
|
|
80 |
Mapping Lexical Entries in a Verbs Database to WordNet Senses
|
|
|
|
BASE
|
|
Show details
|
|
|
|