Hits 1 – 2 of 2
1
BiSECT: Learning to Split and Rephrase Sentences with Bitexts ...
Kim, Joongwon; Maddela, Mounica; Kriz, Reno. - : arXiv, 2021
BASE
2
BiSECT: Learning to Split and Rephrase Sentences with Bitexts ...
The 2021 Conference on Empirical Methods in Natural Language Processing 2021; Callison-Burch, Chris; Kim, Joongwon; Kriz, Reno; Maddela, Mounica; Xu, Wei. - : Underline Science Inc., 2021
Abstract:
Anthology paper link: https://aclanthology.org/2021.emnlp-main.500/
An important task in NLP applications such as sentence simplification is the ability to take a long, complex sentence and split it into shorter sentences, rephrasing as necessary. We introduce a novel dataset and a new model for this 'split and rephrase' task. Our BiSECT training data consists of 1 million long English sentences paired with shorter, meaning-equivalent English sentences. We obtain these by extracting 1-2 sentence alignments in bilingual parallel corpora and then using machine translation to convert both sides of the corpus into the same language. BiSECT contains higher quality training examples than previous Split and Rephrase corpora, with sentence splits that require more significant modifications. We categorize examples in our corpus, and use these categories in a novel model that allows us to target specific regions of the input sentence to be split and edited. Moreover, we show that models trained on BiSECT can ...
Keyword:
Computational Linguistics; Machine Learning; Machine Learning and Data Mining; Natural Language Processing
URL:
https://underline.io/lecture/37931-bisect-learning-to-split-and-rephrase-sentences-with-bitexts
https://dx.doi.org/10.48448/98n6-k286
BASE