DE eng

Search in the Catalogues and Directories

Hits 1 – 3 of 3

1
ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization ...
Wang, Jiaan; Meng, Fandong; Lu, Ziyao. - : arXiv, 2022
BASE
Show details
2
A Survey on Cross-Lingual Summarization ...
Wang, Jiaan; Meng, Fandong; Zheng, Duo. - : arXiv, 2022
BASE
Show details
3
SportsSum2.0: Generating High-Quality Sports News from Live Text Commentary ...
Abstract: Sports game summarization aims to generate news articles from live text commentaries. A recent state-of-the-art work, SportsSum, not only constructs a large benchmark dataset, but also proposes a two-step framework. Despite its great contributions, the work has three main drawbacks: 1) the noise existed in SportsSum dataset degrades the summarization performance; 2) the neglect of lexical overlap between news and commentaries results in low-quality pseudo-labeling algorithm; 3) the usage of directly concatenating rewritten sentences to form news limits its practicability. In this paper, we publish a new benchmark dataset SportsSum2.0, together with a modified summarization framework. In particular, to obtain a clean dataset, we employ crowd workers to manually clean the original dataset. Moreover, the degree of lexical overlap is incorporated into the generation of pseudo labels. Further, we introduce a reranker-enhanced summarizer to take into account the fluency and expressiveness of the summarized news. ... : Accepted as a short paper in CIKM 2021 ...
Keyword: Computation and Language cs.CL; FOS Computer and information sciences
URL: https://arxiv.org/abs/2110.05750
https://dx.doi.org/10.48550/arxiv.2110.05750
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
3
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern