DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...37
Hits 1 – 20 of 723

1
A gentle introduction to Girard's Transcendental Syntax for the linear logician
In: https://hal.archives-ouvertes.fr/hal-02977750 ; 2022 (2022)
BASE
Show details
2
Unsupervised Morphological Segmentation and Part-of-Speech Tagging for Low-Resource Scenarios
Eskander, Ramy. - 2021
BASE
Show details
3
Salience Estimation and Faithful Generation: Modeling Methods for Text Summarization and Generation
Abstract: This thesis is focused on a particular text-to-text generation problem, automatic summarization, where the goal is to map a large input text to a much shorter summary text. The research presented aims to both understand and tame existing machine learning models, hopefully paving the way for more reliable text-to-text generation algorithms. Somewhat against the prevailing trends, we eschew end-to-end training of an abstractive summarization model, and instead break down the text summarization problem into its constituent tasks. At a high level, we divide these tasks into two categories: content selection, or “what to say” and content realization, or “how to say it” (McKeown, 1985). Within these categories we propose models and learning algorithms for the problems of salience estimation and faithful generation. Salience estimation, that is, determining the importance of a piece of text relative to some context, falls into a problem of the former category, determining what should be selected for a summary. In particular, we experiment with a variety of popular or novel deep learning models for salience estimation in a single document summarization setting, and design several ablation experiments to gain some insight into which input signals are most important for making predictions. Understanding these signals is critical for designing reliable summarization models. We then consider a more difficult problem of estimating salience in a large document stream, and propose two alternative approaches using classical machine learning techniques from both unsupervised clustering and structured prediction. These models incorporate salience estimates into larger text extraction algorithms that also consider redundancy and previous extraction decisions. Overall, we find that when simple, position based heuristics are available, as in single document news or research summarization, deep learning models of salience often exploit them to make predictions, while ignoring the arguably more important content features of the input. In more demanding environments, like stream summarization, where heuristics are unreliable, more semantically relevant features become key to identifying salience content. In part two, content realization, we assume content selection has already been performed and focus on methods for faithful generation (i.e., ensuring that output text utterances respect the semantics of the input content). Since they can generate very fluent and natural text, deep learning- based natural language generation models are a popular approach to this problem. However, they often omit, misconstrue, or otherwise generate text that is not semantically correct given the input content. In this section, we develop a data augmentation and self-training technique to mitigate this problem. Additionally, we propose a training method for making deep learning-based natural language generation models capable of following a content plan, allowing for more control over the output utterances generated by the model. Under a stress test evaluation protocol, we demonstrate some empirical limits on several neural natural language generation models’ ability to encode and properly realize a content plan. Finally, we conclude with some remarks on future directions for abstractive summarization outside of the end-to-end deep learning paradigm. Our aim here is to suggest avenues for constructing abstractive summarization systems with transparent, controllable, and reliable behavior when it comes to text understanding, compression, and generation. Our hope is that this thesis inspires more research in this direction, and, ultimately, real tools that are broadly useful outside of the natural language processing community.
Keyword: Computer science; Decision making--Computer programs; Machine learning; Machine learning--Mathematical models
URL: https://doi.org/10.7916/d8-61n8-mg23
BASE
Hide details
4
Multiplicative Linear Logic from Logic Programs and Tilings
In: https://hal.archives-ouvertes.fr/hal-02895111 ; 2021 (2021)
BASE
Show details
5
A gentle introduction to Girard's Transcendental Syntax for the linear logician
In: https://hal.archives-ouvertes.fr/hal-02977750 ; 2021 (2021)
BASE
Show details
6
Stellar Resolution: Multiplicatives - for the linear logician, through examples
In: https://hal.archives-ouvertes.fr/hal-02977750 ; 2021 (2021)
BASE
Show details
7
A gentle introduction to Girard's Transcendental Syntax for the linear logician
In: https://hal.archives-ouvertes.fr/hal-02977750 ; 2021 (2021)
BASE
Show details
8
Stellar Resolution: Multiplicatives - for the linear logician, through examples
In: https://hal.archives-ouvertes.fr/hal-02977750 ; 2021 (2021)
BASE
Show details
9
A Many Small Programs (MSP) Approach in a CS1 Course
Allen, Joe Michael. - : eScholarship, University of California, 2021
BASE
Show details
10
Unsupervised Morphological Segmentation and Part-of-Speech Tagging for Low-Resource Scenarios ...
Eskander, Ramy. - : Columbia University, 2021
BASE
Show details
11
History of Logo
In: Proceedings of the ACM on Programming Languages, vol 4, iss HOPL (2020)
BASE
Show details
12
Preventing Summer Reading Slide: Examining the Effects of Two Computer-Assisted Reading Programs
In: ETSU Faculty Works (2020)
BASE
Show details
13
A call for cautious interpretation of meta-analytic reviews
In: Education Publications (2020)
BASE
Show details
14
Mechanized metatheory revisited
In: ISSN: 0168-7433 ; EISSN: 1573-0670 ; Journal of Automated Reasoning ; https://hal.inria.fr/hal-01884210 ; Journal of Automated Reasoning, Springer Verlag, 2019, 63 (3), pp.625-665. ⟨10.1007/s10817-018-9483-3⟩ (2019)
BASE
Show details
15
Decidable XPath Fragments in the Real World
In: 38th ACM Symposium on Principles of Database Systems (PODS'19) ; https://hal.inria.fr/hal-01852475 ; 38th ACM Symposium on Principles of Database Systems (PODS'19), 2019, Amsterdam, Netherlands. ⟨10.1145/3294052.3319685⟩ (2019)
BASE
Show details
16
Primärprozess in der Katathym Imaginativen Psychotherapie unter dem Einfluss psychotroper Substanzen ...
Grotz, Annalen. - : Universität Ulm, 2019
BASE
Show details
17
The Manifesto Corpus: a new resource for research on political parties and quantitative text analysis
In: Research and Politics ; 3 ; 2 ; 1-8 (2019)
BASE
Show details
18
A sequent calculus with dependent types for classical arithmetic
In: LICS 2018 - 33th Annual ACM/IEEE Symposium on Logic in Computer Science ; https://hal.inria.fr/hal-01703526 ; LICS 2018 - 33th Annual ACM/IEEE Symposium on Logic in Computer Science, Jul 2018, Oxford, United Kingdom. pp.720-729, ⟨10.1145/3209108.3209199⟩ (2018)
BASE
Show details
19
NgramPOS: A Bigram-based Linguistic and Statistical Feature Process Model for Unstructured Text Classification
BASE
Show details
20
ПРОФЕССИОНАЛЬНО-ОРИЕНТИРОВАННОЕ ОБУЧЕНИЕ ИНОСТРАННОМУ ЯЗЫКУ МАГИСТРАНТОВ ТЕХНИЧЕСКИХ СПЕЦИАЛЬНОСТЕЙ
ГРАММА ДАРЬЯ ВИКТОРОВНА; КУЗНЕЦОВА СВЕТЛАНА ВЛАДИМИРОВНА. - : Общество с ограниченной ответственностью Центр научного сотрудничества Интерактив плюс, 2017
BASE
Show details

Page: 1 2 3 4 5...37

Catalogues
0
0
0
0
0
0
2
Bibliographies
0
0
0
0
0
0
0
0
19
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
702
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern