1 |
Generating Authentic Adversarial Examples beyond Meaning-preserving with Doubly Round-trip Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
Conditional Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
ConSLT: A Token-level Contrastive Framework for Sign Language Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Focus on the Target's Vocabulary: Masked Label Smoothing for Machine Translation ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
USTC-NELSLIP at SemEval-2022 Task 11: Gazetteer-Adapted Integration Network for Multilingual Complex Named Entity Recognition ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
Towards the Next 1000 Languages in Multilingual Machine Translation: Exploring the Synergy Between Supervised and Self-Supervised Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
GL-CLeF: A Global-Local Contrastive Learning Framework for Cross-lingual Spoken Language Understanding ...
|
|
|
|
BASE
|
|
Show details
|
|
10 |
CINO: A Chinese Minority Pre-trained Language Model ...
|
|
|
|
Abstract:
Multilingual pre-trained language models have shown impressive performance on cross-lingual tasks. It greatly facilitates the applications of natural language processing on low-resource languages. However, there are still some languages that the existing multilingual models do not perform well on. In this paper, we propose CINO (Chinese Minority Pre-trained Language Model), a multilingual pre-trained language model for Chinese minority languages. It covers Standard Chinese, Cantonese, and six other Chinese minority languages. To evaluate the cross-lingual ability of the multilingual models on the minority languages, we collect documents from Wikipedia and build a text classification dataset WCM (Wiki-Chinese-Minority). We test CINO on WCM and two other text classification tasks. Experiments show that CINO outperforms the baselines notably. The CINO model and the WCM dataset are available at http://cino.hfl-rc.com. ... : 4 pages ...
|
|
Keyword:
Computation and Language cs.CL; FOS Computer and information sciences
|
|
URL: https://dx.doi.org/10.48550/arxiv.2202.13558 https://arxiv.org/abs/2202.13558
|
|
BASE
|
|
Hide details
|
|
11 |
Delving Deeper into Cross-lingual Visual Question Answering ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Multi-Level Contrastive Learning for Cross-Lingual Alignment ...
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Cross-Lingual Text Classification with Multilingual Distillation and Zero-Shot-Aware Training ...
|
|
|
|
BASE
|
|
Show details
|
|
15 |
HFL at SemEval-2022 Task 8: A Linguistics-inspired Regression Model with Data Augmentation for Multilingual News Similarity ...
|
|
|
|
BASE
|
|
Show details
|
|
17 |
Controllable Natural Language Generation with Contrastive Prefixes ...
|
|
|
|
BASE
|
|
Show details
|
|
18 |
SGL: Symbolic Goal Learning in a Hybrid, Modular Framework for Human Instruction Following ...
|
|
|
|
BASE
|
|
Show details
|
|
19 |
Local-Global Context Aware Transformer for Language-Guided Video Segmentation ...
|
|
|
|
BASE
|
|
Show details
|
|
20 |
Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|