21 |
Simplification Using Paraphrases and Context-Based Lexical Substitution
|
|
|
|
In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies ; https://hal.archives-ouvertes.fr/hal-01838519 ; Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics, Jun 2018, New Orleans, United States (2018)
|
|
BASE
|
|
|
|
22 |
Automated Paraphrase Lattice Creation for HyTER Machine Translation Evaluation
|
|
|
|
In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies ; https://hal.archives-ouvertes.fr/hal-01838521 ; Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics, Jun 2018, New Orleans, United States (2018)
|
|
BASE
|
|
|
|
23 |
Comparing Constraints for Taxonomic Organization
|
|
|
|
In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies ; https://hal.archives-ouvertes.fr/hal-01838520 ; Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Association for Computational Linguistics, Jun 2018, New Orleans, United States (2018)
|
|
BASE
|
|
|
|
24 |
Mapping the Paraphrase Database to WordNet
|
|
|
|
In: Conference on Lexical and Computational Semantics ; https://hal.archives-ouvertes.fr/hal-01838527 ; Conference on Lexical and Computational Semantics, Aug 2017, Vancouver, Canada (2017)
|
|
BASE
|
|
|
|
25 |
Learning Antonyms with Paraphrases and a Morphology-aware Neural Network
|
|
|
|
In: Conference on Lexical and Computational Semantics ; https://hal.archives-ouvertes.fr/hal-01838526 ; Conference on Lexical and Computational Semantics, Aug 2017, Vancouver, Canada (2017)
|
|
BASE
|
|
|
|
26 |
Word Sense Filtering Improves Embedding-Based Lexical Substitution
|
|
|
|
In: Conference of the European Chapter of the Association for Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-01838524 ; Conference of the European Chapter of the Association for Computational Linguistics, Apr 2017, Valencia, Spain (2017)
|
|
BASE
|
|
|
|
27 |
Learning Translations via Matrix Completion
|
|
|
|
In: Conference on Empirical Methods in Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01838532 ; Conference on Empirical Methods in Natural Language Processing, Sep 2017, Copenhagen, Denmark (2017)
|
|
BASE
|
|
|
|
28 |
KnowYourNyms? A Game of Semantic Relationships
|
|
|
|
In: Conference on Empirical Methods in Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01838528 ; Conference on Empirical Methods in Natural Language Processing, Sep 2017, Copenhagen, Denmark (2017)
|
|
BASE
|
|
|
|
29 |
Use of Modality and Negation in Semantically-Informed Syntactic MT ...
|
|
|
|
BASE
|
|
|
|
31 |
Feature-Driven Question Answering with Natural Language Alignment
|
|
|
|
BASE
|
|
|
|
32 |
Using Comparable Corpora to Augment Statistical Machine Translation Models in Low Resource Settings
|
|
|
|
BASE
|
|
|
|
33 |
Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation ...
|
|
|
|
BASE
|
|
|
|
34 |
Fisher and CALLHOME Spanish–English Speech Translation ...
|
|
|
|
BASE
|
|
|
|
35 |
Semantically-Informed Syntactic Machine Translation: A Tree-Grafting Approach ...
|
|
|
|
BASE
|
|
|
|
36 |
Dirt cheap web-scale parallel text from the Common Crawl ...
|
|
|
|
BASE
|
|
|
|
37 |
Dirt cheap web-scale parallel text from the Common Crawl
|
|
|
|
In: Smith, Jason R; Saint-Amand, Herve; Plamada, Magdalena; Koehn, Philipp; Callison-Burch, Chris; Lopez, Adam (2013). Dirt cheap web-scale parallel text from the Common Crawl. In: 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 2013. Association for Computational Linguistics, 1374-1383. (2013)
|
|
Abstract:
Parallel text is the fuel that drives modern machine translation systems. The Web is a comprehensive source of preexisting parallel text, but crawling the entire web is impossible for all but the largest companies. We bring web-scale parallel text to the masses by mining the Common Crawl, a public Web crawl hosted on Amazon's Elastic Cloud. Starting from nothing more than a set of common two-letter language codes, our open-source extension of the STRAND algorithm mined 32 terabytes of the crawl in just under a day, at a cost of about $500. Our large-scale experiment uncovers large amounts of parallel text in dozens of language pairs across a variety of domains and genres, some previously unavailable in curated datasets. Even with minimal cleaning and filtering, the resulting data boosts translation performance across the board for five different language pairs in the news domain, and on open domain test sets we see improvements of up to 5 BLEU. We make our code and data available for other researchers seeking to mine this rich new data resource.
|
|
Keyword:
000 Computer science; 410 Linguistics; Institute of Computational Linguistics; knowledge & systems
|
|
URL: https://www.zora.uzh.ch/id/eprint/80038/1/ACL2013135.pdf https://doi.org/10.5167/uzh-80038 https://www.zora.uzh.ch/id/eprint/80038/ http://www.aclweb.org/anthology/P13-1135
|
|
BASE
|
|
|
|
38 |
Use of Modality and Negation in Semantically-Informed Syntactic MT
|
|
|
|
In: DTIC (2012)
|
|
BASE
|
|
|
|
39 |
Use of Modality and Negation in Semantically-Informed Syntactic MT
|
|
|
|
BASE
|
|
|
|
40 |
Incremental Syntactic Language Models for Phrase-Based Translation
|
|
|
|
In: DTIC (2011)
|
|
BASE
|
|
|
|
|
|