Home Catalogue search

eng

Refine your search:
- Keyword:
- Creator / Publisher
- Year:
  - 2021 (2)
  - 2020 (4)
  - 2019 (2)
  - 2018 (2)
  - 2017 (1)
  - 2014 (3)
  - 2011 (1)
  - 2010 (1)
  - 2009 (1)
  - 2004 (1)
  - more
- Medium:
  - Online (24)
  - Print (12)
- Type
- BLLDB-Access:
  - free (36)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 36

1	Sequence Models for Computational Etymology of Borrowings ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Duh, Kevin; Wu, Winston. - : Underline Science Inc., 2021
	BASE
	Show details

2	BUSINESS MEETING ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Bender, Emily M.; Callison-Burch, Chris. - : Underline Science Inc., 2021
	BASE
	Show details

3	Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions
	McCarthy, Arya D.; Cotterell, Ryan; Williams, Adina...
	In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) (2020)
	BASE
	Show details

4	UniMorph 3.0: Universal Morphology
	Kirov, Christo; Nicolai, Garrett; Arkhangelskij, Timofey...
	In: Proceedings of the 12th Language Resources and Evaluation Conference (2020)
	BASE
	Show details

5	UniMorph 3.0: Universal Morphology ...
	McCarthy, Arya D.; Kirov, Christo; Grella, Matteo. - : ETH Zurich, 2020
	BASE
	Show details

6	Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions ...
	McCarthy, Arya D.; Williams, Adina; Liu, Shijia. - : ETH Zurich, 2020
	BASE
	Show details

7	Massively Multilingual Adversarial Speech Recognition ...
	Adams, Oliver; Wiesner, Matthew; Watanabe, Shinji. - : arXiv, 2019
	BASE
	Show details

8	Modeling Color Terminology Across Thousands of Languages ...
	McCarthy, Arya D.; Wu, Winston; Mueller, Aaron. - : arXiv, 2019
	BASE
	Show details

9	Marrying Universal Dependencies and Universal Morphology ...
	McCarthy, Arya D.; Silfverberg, Miikka; Cotterell, Ryan. - : arXiv, 2018
	BASE
	Show details

10	A keyword search system using open source software ...
	Trmal, Jan; Guoguo Chen; Povey, Dan. - : Figshare, 2018
	BASE
	Show details

11	Paradigm Completion for Derivational Morphology ...
	Cotterell, Ryan; Vylomova, Ekaterina; Khayrallah, Huda. - : arXiv, 2017
	BASE
	Show details

12	FEATURE-DRIVEN QUESTION ANSWERING WITH NATURAL LANGUAGE ALIGNMENT
	Yao, Xuchen. - 2014
	BASE
	Show details

13	Using Comparable Corpora to Augment Statistical Machine Translation Models in Low Resource Settings
	Irvine, Ann. - 2014
	Abstract: Previously, statistical machine translation (SMT) models have been estimated from parallel corpora, or pairs of translated sentences. In this thesis, we directly incorporate comparable corpora into the estimation of end-to-end SMT models. In contrast to parallel corpora, comparable corpora are pairs of monolingual corpora that have some cross-lingual similarities, for example topic or publication date, but that do not necessarily contain any direct translations. Comparable corpora are more readily available in large quantities than parallel corpora, which require significant human effort to compile. We use comparable corpora to estimate machine translation model parameters and show that doing so improves performance in settings where a limited amount of parallel data is available for training. The major contributions of this thesis are the following: * We release ‘language packs’ for 151 human languages, which include bilingual dictionaries, comparable corpora of Wikipedia document pairs, comparable corpora of time-stamped news text that we harvested from the web, and, for non-roman script languages, dictionaries of name pairs, which are likely to be transliterations. * We present a novel technique for using a small number of example word translations to learn a supervised model for bilingual lexicon induction which takes advantage of a wide variety of signals of translation equivalence that can be estimated over comparable corpora. * We show that using comparable corpora to induce new translations and estimate new phrase table feature functions improves end-to-end statistical machine translation performance for low resource language pairs as well as domains. * We present a novel algorithm for composing multiword phrase translations from multiple unigram translations and then use comparable corpora to prune the large space of hypothesis translations. We show that these induced phrase translations improve machine translation performance beyond that of component unigrams. This thesis focuses on critical low resource machine translation settings, where insufficient parallel corpora exist for training statistical models. We experiment with both low resource language pairs and low resource domains of text. We present results from our novel error analysis methodology, which show that most translation errors in low resource settings are due to unseen source language words and phrases and unseen target language translations. We also find room for fixing errors due to how different translations are weighted, or scored, in the models. We target both error types; we use comparable corpora to induce new word and phrase translations and estimate novel translation feature scores. Our experiments show that augmenting baseline SMT systems with new translations and features estimated over comparable corpora improves translation performance significantly. Additionally, our techniques expand the applicability of statistical machine translation to those language pairs for which zero parallel text is available.
	Keyword: machine translation; natural language processing
	URL: http://jhir.library.jhu.edu/handle/1774.2/38018
	BASE
	Hide details

14	A keyword search system using open source software ...
	Trmal, Jan; Guoguo Chen; Povey, Dan. - : Carnegie Mellon University, 2014
	BASE
	Show details

15	Proceedings of the 5th International Joint Conference on Natural Language Processing (IJCNLP), Chiang Mai, Thailand, 8 - 13 November 2011
	Wang, Haifeng (Hrsg.); Yarowsky, David (Hrsg.). - Chiang Mai, Thailand : Asian Federation of Natural Language Processing, 2011
	IDS OBELEX meta
	Show details

16	Word sense disambiguation
	Yarowsky, David
	In: Handbook of natural language processing (Boca Raton, Fla., 2010), p. 315-338
	MPI für Psycholinguistik
	Show details

17	Structural, transitive and latent models for biographic fact extraction
	Garera, Nikesh; Yarowsky, David
	In: Association for Computational Linguistics / European Chapter. Conference of the European Chapter of the Association for Computational Linguistics. - Menlo Park, Calif. : ACL 12 (2009), 300-308
	BLLDB
	Show details

18	Corpus linguistics : readings in a widening discipline
	Kilgarriff, Adam (Mitarb.); Fries, Charles Carpenter (Mitarb.); Francis, Gill (Mitarb.). - London [u.a.] : Continuum, 2004
	BLLDB
	UB Frankfurt Linguistik
	Show details

19	Evaluating word sense disambiguation systems
	Edmonds, Philip (Hrsg.); Kilgarriff, Adam (Hrsg.); Yarowsky, David (Mitarb.)...
	In: Natural language engineering. - Cambridge : Cambridge University Press 8 (2002) 4, 279-390
	BLLDB
	Show details

20	Bootstrapping a Multilingual Part-of-speech Tagger in One Person-day
	Cucerzan, Silviu; Yarowsky, David
	In: DTIC (2002)
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern