1 |
Sememe Prediction for BabelNet Synsets using Multilingual and Multimodal Information ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
YACLC: A Chinese Learner Corpus with Multidimensional Annotation ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Alternated Training with Synthetic and Authentic Data for Neural Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
CPM-2: Large-scale Cost-effective Pre-trained Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Automatic Construction of Sememe Knowledge Bases via Dictionaries ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Sub-Character Tokenization for Chinese Pretrained Language Models ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
CCPM: A Chinese Classical Poetry Matching Dataset ...
|
|
|
|
Abstract:
Poetry is one of the most important art forms of human languages. Recently many studies have focused on incorporating some linguistic features of poetry, such as style and sentiment, into its understanding or generation system. However, there is no focus on understanding or evaluating the semantics of poetry. Therefore, we propose a novel task to assess a model's semantic understanding of poetry by poem matching. Specifically, this task requires the model to select one line of Chinese classical poetry among four candidates according to the modern Chinese translation of a line of poetry. To construct this dataset, we first obtain a set of parallel data of Chinese classical poetry and modern Chinese translation. Then we retrieve similar lines of poetry with the lines in a poetry corpus as negative choices. We name the dataset Chinese Classical Poetry Matching Dataset (CCPM) and release it at https://github.com/THUNLP-AIPoet/CCPM. We hope this dataset can further enhance the study on incorporating deep ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://arxiv.org/abs/2106.01979 https://dx.doi.org/10.48550/arxiv.2106.01979
|
|
BASE
|
|
Hide details
|
|
8 |
MoEfication: Transformer Feed-forward Layers are Mixtures of Experts ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Transfer Learning for Sequence Generation: from Single-source to Multi-source ...
|
|
|
|
BASE
|
|
Show details
|
|
11 |
Segment, Mask, and Predict: Augmenting Chinese Word Segmentation with Self-Supervision ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
OpenAttack: An Open-source Textual Adversarial Attack Toolkit ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
Lexical Sememe Prediction using Dictionary Definitions by Capturing Local Semantic Correspondence ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Improving Back-Translation with Uncertainty-based Confidence Estimation ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Towards Building a Multilingual Sememe Knowledge Base: Predicting Sememes for BabelNet Synsets ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
Modeling Semantic Compositionality with Sememe Knowledge ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
OpenHowNet: An Open Sememe-based Lexical Knowledge Base ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Neural Machine Translation with Explicit Phrase Alignment ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|