DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 37

1
Neural machine translation with a polysynthetic low resource language [<Journal>]
Ortega, John E. [Verfasser]; Castro Mamani, Richard [Verfasser]; Cho, Kyunghyun [Verfasser]
DNB Subject Category Language
Show details
2
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search ...
BASE
Show details
3
Monotonic Simultaneous Translation with Chunk-wise Reordering and Refinement ...
BASE
Show details
4
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search ...
BASE
Show details
5
The Future is not One-dimensional: Complex Event Schema Induction by Graph Modeling for Event Prediction ...
BASE
Show details
6
Comparing Test Sets with Item Response Theory ...
BASE
Show details
7
DEEP: DEnoising Entity Pre-training for Neural Machine Translation ...
BASE
Show details
8
VisualSem: A High-quality Knowledge Graph for Vision and Language ...
BASE
Show details
9
Learning to Learn Morphological Inflection for Resource-Poor Languages ...
BASE
Show details
10
Improving Conversational Question Answering Systems after Deployment using Feedback-Weighted Learning ...
BASE
Show details
11
Cold-start universal information extraction
Huang, Lifu. - 2020
Abstract: Who? What? When? Where? Why? are fundamental questions asked when gathering knowledge about and understanding a concept, topic, or event. The answers to these questions underpin the key information conveyed in the overwhelming majority, if not all, of language-based communication. At the core of my research in Information Extraction (IE) is the desire to endow machines with the ability to automatically extract, assess, and understand text in order to answer these fundamental questions. IE has been serving as one of the most important components for many downstream natural language processing (NLP) tasks, such as knowledge base completion, machine reading comprehension, machine translation and so on. The proliferation of the Web also intensifies the need of dealing with enormous amount of unstructured data from various sources, such as languages, genres and domains. When building an IE system, the conventional pipeline is to (1) ask expert linguists to rigorously define a target set of knowledge types we wish to extract by examining a large data set, (2) collect resources and human annotations for each type, and (3) design features and train machine learning models to extract knowledge elements. In practice, this process is very expensive as each step involves extensive human effort which is not always available, for example, to specify the knowledge types for a particular scenario, both consumers and expert linguists need to examine a lot of data from that domain and write detailed annotation guidelines for each type. Hand-crafted schemas, which define the types and complex templates of the expected knowledge elements, often provide low coverage and fail to generalize to new domains. For example, none of the traditional event extraction programs, such as ACE (Automatic Content Extraction) and TAC-KBP, include "donation'' and "evacuation'' in their schemas in spite of their potential relevance to natural disaster management users. Additionally, these approaches are highly dependent on linguistic resources and human labeled data tuned to pre-defined types, so they suffer from poor scalability and portability when moving to a new language, domain, or genre. The focus of this thesis is to develop effective theories and algorithms for IE which not only yield satisfactory quality by incorporating prior linguistic and semantic knowledge, but also greater portability and scalability by moving away from the high cost and narrow focus of large-scale manual annotation. This thesis opens up a new research direction called Cold-Start Universal Information Extraction, where the full extraction and analysis starts from scratch and requires little or no prior manual annotation or pre-defined type schema. In addition to this new research paradigm, we also contribute effective algorithms and models towards resolving the following three challenges: How can machines extract knowledge without any pre-defined types or any human annotated data? We develop an effective bottom-up and unsupervised Liberal Information Extraction framework based on the hypothesis that the meaning and underlying knowledge conveyed by linguistic expressions is usually embodied by their usages in language, which makes it possible to automatically induces a type schema based on rich contextual representations of all knowledge elements by combining their symbolic and distributional semantics using unsupervised hierarchical clustering. How can machines benefit from available resources, e.g., large-scale ontologies or existing human annotations? My research has shown that pre-defined types can also be encoded by rich contextual or structured representations, through which knowledge elements can be mapped to their appropriate types. Therefore, we design a weakly supervised Zero-shot Learning and a Semi-Supervised Vector Quantized Variational Auto-Encoder approach that frames IE as a grounding problem instead of classification, where knowledge elements are grounded into any types from an extensible and large-scale target ontology or induced from the corpora, with available annotations for a few types. How can IE approaches be extent to low-resource languages without any extra human effort? There are more than 6000 living languages in the real world while public gold-standard annotations are only available for a few dominant languages. To facilitate the adaptation of these IE frameworks to other languages, especially low resource languages, a Multilingual Common Semantic Space is further proposed to serve as a bridge for transferring existing resources and annotated data from dominant languages to more than 300 low resource languages. Moreover, a Multi-Level Adversarial Transfer framework is also designed to learn language-agnostic features across various languages.
Keyword: Common Semantic Space; Event Extraction; Information Extraction; Zero-Shot Learning
URL: http://hdl.handle.net/2142/109351
BASE
Hide details
12
AdapterHub: A Framework for Adapting Transformers
Pfeiffer, Jonas; Ruckle, Andreas; Poth, Clifton. - : Association for Computational Linguistics, 2020. : Proceedings of the Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP 2020), 2020
BASE
Show details
13
Neural Machine Translation with Byte-Level Subwords ...
BASE
Show details
14
Improved Zero-shot Neural Machine Translation via Ignoring Spurious Correlations ...
Gu, Jiatao; Wang, Yong; Cho, Kyunghyun. - : arXiv, 2019
BASE
Show details
15
Emergent Linguistic Phenomena in Multi-Agent Communication Games ...
BASE
Show details
16
Countering Language Drift via Visual Grounding ...
BASE
Show details
17
Insertion-based Decoding with Automatically Inferred Generation Order
In: Transactions of the Association for Computational Linguistics, Vol 7, Pp 661-676 (2019) (2019)
BASE
Show details
18
Meta-Learning for Low-Resource Neural Machine Translation ...
Gu, Jiatao; Wang, Yong; Chen, Yun. - : arXiv, 2018
BASE
Show details
19
Multi-lingual Common Semantic Space Construction via Cluster-consistent Word Embedding ...
BASE
Show details
20
From Characters to Understanding Natural Language (C2NLU): Robust End-to-End Deep Learning for NLP (Dagstuhl Seminar 17042)
Cho, Kyunghyun; Dyer, Chris; Blunsom, Phil. - : Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2017. : Dagstuhl Reports. Dagstuhl Reports, Volume 7, Issue 1, 2017
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
1
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
36
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern