DE eng

Search in the Catalogues and Directories

Page: 1...366 367 368 369 370 371 372 373
Hits 7.381 – 7.400 of 7.453

7381
Deep Learning with Constraints for Answer-Agnostic Question Generation in Legal Text Understanding
Lamba, Deepti. - August
Abstract: Doctor of Philosophy ; Department of Computer Science ; William H. Hsu ; The aim of this dissertation is to develop constraint-based methods that extend and improve on current deep learning neural networks such as transformers and sequence-to-sequence (seq2seq) models, for the problem of question generation based on the analysis of the text of legal agreements, particularly privacy policies. A privacy policy is a legally binding agreement between a customer and service provider. This dissertation focuses on analyzing a privacy policy document to generate questions that capture entities and the relationships between them. Another area of focus is the generation of constraints based on domain knowledge and their application to the deep learning network during the question generation process. A possible use case of this research is development of test corpus for question answering systems in the privacy domain because the shortage of sufficiently large corpora poses a key challenge in the development of question answering and question generation systems. Question generation is the task of generating an interrogative sentence based on some text. Current approaches to question generation use sequence-to-sequence models with additional information like answers, positions of the answers, part-of-speech details, named entity tags among others. The idea behind such approaches is that these models can benefit from additional information about the text (i.e., sentence or paragraph). Recently, transformer-based approaches that offer the benefit of attention mechanism have also been used for generating questions. Transformers have achieved state-of-the-art results in many natural language processing tasks including text classification, machine translation, language understanding, co-reference resolution, and summarization. However, the contribution of transformers towards a task like question generation has not been as significant. This research tries to find ways of improving existing approaches by injecting domain knowledge, modeled as a combination of logical and linguistic constraints, into these deep learning models during the training and validation phases. This work also explores design and implementation of different kind of constraints that can better direct the deep learning model towards the expected output, which in this case refers to syntactically and semantically correct and relevant questions. Another contribution of this research is the creation of custom labels for named entities in the privacy policy domain. Results show that adding some form of domain specific constraints improves the performance of the aforementioned models as compared to the performance of state-of-the-art models on the test bed used in this work. For the given test bed, constrained seq-to-seq approaches perform better than the constrained transformer-based approach.
Keyword: Deep Learning; Legal Text; Natural Language Processing; Privacy Policies
URL: https://hdl.handle.net/2097/41629
BASE
Hide details
7382
Text and Network Mining for Literature-Based Scientific Discovery in Biomedicine.
BASE
Show details
7383
Detecting gross alignment errors in the Spoken British National Corpus
BASE
Show details
7384
An information extraction tool for microbial characters
BASE
Show details
7385
Measuring Semantic Distance using Distributional Profiles of Concepts
Mohammad, Saif. - NO_RESTRICTION
BASE
Show details
7386
Exploiting Linguistic Knowledge to Infer Properties of Neologisms
Cook, C. Paul. - NO_RESTRICTION
BASE
Show details
7387
Exploring neural paraphrasing to improve fluency of rule-based generation
BASE
Show details
7388
Distributed prediction of relations for entities: the Easy, the Difficult, and the impossible
Boleda, Gemma; Gupta, Abhijeet; Padó, Sebastian. - : ACL (Association for Computational Linguistics)
BASE
Show details
7389
Twitter as a lifeline: human-annotated Twitter corpora for NLP of crisis-related messages
BASE
Show details
7390
Task-based Evaluation of the PANACEA Production Chain
BASE
Show details
7391
Documentation of Naive Bayes classifier Web Service
BASE
Show details
7392
First version (v1) of the integrated platform/nand documentation
BASE
Show details
7393
Criteria for evaluation of resources, technology and integration
BASE
Show details
7394
Documentation of P clue/ lexical class from Weka computer Web Service
BASE
Show details
7395
PANACEA: The Platform
BASE
Show details
7396
Third evaluation report. Evaluation of PANACEA v3 and produced resources
BASE
Show details
7397
Final Report on the Corpus Acquisition & Annotation subsystem and its components
BASE
Show details
7398
Architecture and design of the platform
BASE
Show details
7399
Analysis of Industrial User Requirements
BASE
Show details
7400
Integrated Final Version of the Components for Lexical Acquisition
BASE
Show details

Page: 1...366 367 368 369 370 371 372 373

Catalogues
37
0
0
0
0
2
8
Bibliographies
33
0
0
0
0
0
0
5
30
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
7.372
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern