Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year
- Medium:
  - Online (36)
  - Print (1)
- Type:
- BLLDB-Access:
  - free (37)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2

Hits 1 – 20 of 37

1	Neural machine translation with a polysynthetic low resource language [<Journal>]
	Ortega, John E. [Verfasser]; Castro Mamani, Richard [Verfasser]; Cho, Kyunghyun [Verfasser]
	DNB Subject Category Language
	Show details

2	Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Cho, Kyunghyun; Kim, Gyuwan. - : Underline Science Inc., 2021
	BASE
	Show details

3	Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; ., Yoonjung; Ahn, Seokchan. - : Underline Science Inc., 2021
	BASE
	Show details

4	Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Cho, Kyunghyun; Kim, Gyuwan. - : Underline Science Inc., 2021
	BASE
	Show details

5	The Future is not One-dimensional: Complex Event Schema Induction by Graph Modeling for Event Prediction ...
	The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Cho, Kyunghyun; Han, Jiawei. - : Underline Science Inc., 2021
	BASE
	Show details

6	Comparing Test Sets with Item Response Theory ...
	The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing 2021; Bowman, Samuel R.; Cho, Kyunghyun. - : Underline Science Inc., 2021
	BASE
	Show details

7	DEEP: DEnoising Entity Pre-training for Neural Machine Translation ...
	Hu, Junjie; Hayashi, Hiroaki; Cho, Kyunghyun. - : arXiv, 2021
	BASE
	Show details

8	VisualSem: A High-quality Knowledge Graph for Vision and Language ...
	Alberts, Houda; Huang, Teresa; Deshpande, Yash. - : arXiv, 2020
	BASE
	Show details

9	Learning to Learn Morphological Inflection for Resource-Poor Languages ...
	Kann, Katharina; Bowman, Samuel R.; Cho, Kyunghyun. - : arXiv, 2020
	BASE
	Show details

10	Improving Conversational Question Answering Systems after Deployment using Feedback-Weighted Learning ...
	The 28th International Conference on Computational Linguistics 2020; Agirre, Eneko; Azkune, Gorka. - : Underline Science Inc., 2020
	BASE
	Show details

11	Cold-start universal information extraction
	Huang, Lifu. - 2020
	Abstract: Who? What? When? Where? Why? are fundamental questions asked when gathering knowledge about and understanding a concept, topic, or event. The answers to these questions underpin the key information conveyed in the overwhelming majority, if not all, of language-based communication. At the core of my research in Information Extraction (IE) is the desire to endow machines with the ability to automatically extract, assess, and understand text in order to answer these fundamental questions. IE has been serving as one of the most important components for many downstream natural language processing (NLP) tasks, such as knowledge base completion, machine reading comprehension, machine translation and so on. The proliferation of the Web also intensifies the need of dealing with enormous amount of unstructured data from various sources, such as languages, genres and domains. When building an IE system, the conventional pipeline is to (1) ask expert linguists to rigorously define a target set of knowledge types we wish to extract by examining a large data set, (2) collect resources and human annotations for each type, and (3) design features and train machine learning models to extract knowledge elements. In practice, this process is very expensive as each step involves extensive human effort which is not always available, for example, to specify the knowledge types for a particular scenario, both consumers and expert linguists need to examine a lot of data from that domain and write detailed annotation guidelines for each type. Hand-crafted schemas, which define the types and complex templates of the expected knowledge elements, often provide low coverage and fail to generalize to new domains. For example, none of the traditional event extraction programs, such as ACE (Automatic Content Extraction) and TAC-KBP, include "donation'' and "evacuation'' in their schemas in spite of their potential relevance to natural disaster management users. Additionally, these approaches are highly dependent on linguistic resources and human labeled data tuned to pre-defined types, so they suffer from poor scalability and portability when moving to a new language, domain, or genre. The focus of this thesis is to develop effective theories and algorithms for IE which not only yield satisfactory quality by incorporating prior linguistic and semantic knowledge, but also greater portability and scalability by moving away from the high cost and narrow focus of large-scale manual annotation. This thesis opens up a new research direction called Cold-Start Universal Information Extraction, where the full extraction and analysis starts from scratch and requires little or no prior manual annotation or pre-defined type schema. In addition to this new research paradigm, we also contribute effective algorithms and models towards resolving the following three challenges: How can machines extract knowledge without any pre-defined types or any human annotated data? We develop an effective bottom-up and unsupervised Liberal Information Extraction framework based on the hypothesis that the meaning and underlying knowledge conveyed by linguistic expressions is usually embodied by their usages in language, which makes it possible to automatically induces a type schema based on rich contextual representations of all knowledge elements by combining their symbolic and distributional semantics using unsupervised hierarchical clustering. How can machines benefit from available resources, e.g., large-scale ontologies or existing human annotations? My research has shown that pre-defined types can also be encoded by rich contextual or structured representations, through which knowledge elements can be mapped to their appropriate types. Therefore, we design a weakly supervised Zero-shot Learning and a Semi-Supervised Vector Quantized Variational Auto-Encoder approach that frames IE as a grounding problem instead of classification, where knowledge elements are grounded into any types from an extensible and large-scale target ontology or induced from the corpora, with available annotations for a few types. How can IE approaches be extent to low-resource languages without any extra human effort? There are more than 6000 living languages in the real world while public gold-standard annotations are only available for a few dominant languages. To facilitate the adaptation of these IE frameworks to other languages, especially low resource languages, a Multilingual Common Semantic Space is further proposed to serve as a bridge for transferring existing resources and annotated data from dominant languages to more than 300 low resource languages. Moreover, a Multi-Level Adversarial Transfer framework is also designed to learn language-agnostic features across various languages.
	Keyword: Common Semantic Space; Event Extraction; Information Extraction; Zero-Shot Learning
	URL: http://hdl.handle.net/2142/109351
	BASE
	Hide details

12	AdapterHub: A Framework for Adapting Transformers
	Pfeiffer, Jonas; Ruckle, Andreas; Poth, Clifton. - : Association for Computational Linguistics, 2020. : Proceedings of the Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP 2020), 2020
	BASE
	Show details

13	Neural Machine Translation with Byte-Level Subwords ...
	Wang, Changhan; Cho, Kyunghyun; Gu, Jiatao. - : arXiv, 2019
	BASE
	Show details

14	Improved Zero-shot Neural Machine Translation via Ignoring Spurious Correlations ...
	Gu, Jiatao; Wang, Yong; Cho, Kyunghyun. - : arXiv, 2019
	BASE
	Show details

15	Emergent Linguistic Phenomena in Multi-Agent Communication Games ...
	Graesser, Laura; Cho, Kyunghyun; Kiela, Douwe. - : arXiv, 2019
	BASE
	Show details

16	Countering Language Drift via Visual Grounding ...
	Lee, Jason; Cho, Kyunghyun; Kiela, Douwe. - : arXiv, 2019
	BASE
	Show details

17	Insertion-based Decoding with Automatically Inferred Generation Order
	Gu, Jiatao; Liu, Qi; Cho, Kyunghyun
	In: Transactions of the Association for Computational Linguistics, Vol 7, Pp 661-676 (2019) (2019)
	BASE
	Show details

18	Meta-Learning for Low-Resource Neural Machine Translation ...
	Gu, Jiatao; Wang, Yong; Chen, Yun. - : arXiv, 2018
	BASE
	Show details

19	Multi-lingual Common Semantic Space Construction via Cluster-consistent Word Embedding ...
	Huang, Lifu; Cho, Kyunghyun; Zhang, Boliang. - : arXiv, 2018
	BASE
	Show details

20	From Characters to Understanding Natural Language (C2NLU): Robust End-to-End Deep Learning for NLP (Dagstuhl Seminar 17042)
	Cho, Kyunghyun; Dyer, Chris; Blunsom, Phil. - : Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2017. : Dagstuhl Reports. Dagstuhl Reports, Volume 7, Issue 1, 2017
	BASE
	Show details

Page: 1 2

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern