81. Team ÚFAL at CMCL 2022 Shared Task: Figuring out the correct recipe for predicting Eye-Tracking features using Pretrained Language Models
82. Does Corpus Quality Really Matter for Low-Resource Languages?
83. IIITDWD-ShankarB@ Dravidian-CodeMixi-HASOC2021: mBERT based model for identification of offensive content in south Indian languages
84. mSLAM: Massively multilingual joint pre-training for speech and text
Bapna, Ankur; Cherry, Colin; Zhang, Yu; Jia, Ye; Johnson, Melvin; Cheng, Yong; Khanuja, Simran; Riesa, Jason; Conneau, Alexis. arXiv, 2022
Abstract: We present mSLAM, a multilingual Speech and LAnguage Model that learns cross-lingual, cross-modal representations of speech and text by pre-training jointly on large amounts of unlabeled speech and text in multiple languages. mSLAM combines w2v-BERT pre-training on speech with SpanBERT pre-training on character-level text, along with Connectionist Temporal Classification (CTC) losses on paired speech and transcript data, to learn a single model capable of learning from and representing both speech and text signals in a shared representation space. We evaluate mSLAM on several downstream speech understanding tasks and find that joint pre-training with text improves quality on speech translation, speech intent classification, and speech language-ID, while being competitive on multilingual ASR when compared against speech-only pre-training. Our speech translation model demonstrates zero-shot text translation without seeing any text translation data, providing evidence for cross-modal alignment of representations. ...
Keywords: Computation and Language (cs.CL); FOS: Computer and information sciences; Machine Learning (cs.LG)
URL: https://arxiv.org/abs/2202.01374
DOI: https://dx.doi.org/10.48550/arxiv.2202.01374
85. On the Representation Collapse of Sparse Mixture of Experts
86. Politics and Virality in the Time of Twitter: A Large-Scale Cross-Party Sentiment Analysis in Greece, Spain and United Kingdom
87. L3Cube-MahaHate: A Tweet-based Marathi Hate Speech Detection Dataset and BERT models
88. Few-Shot Cross-lingual Transfer for Coarse-grained De-identification of Code-Mixed Clinical Texts
89. A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model
90. A New Generation of Perspective API: Efficient Multilingual Character-level Transformers
91. Factual Consistency of Multilingual Pretrained Language Models
92. Examining Scaling and Transfer of Language Model Architectures for Machine Translation
93. MuMiN: A Large-Scale Multilingual Multimodal Fact-Checked Misinformation Social Network Dataset
94. Mono vs Multilingual BERT for Hate Speech Detection and Text Classification: A Case Study in Marathi
95. Curlie Dataset - Language-agnostic Website Embedding and Classification
99. Characterizing News Portrayal of Civil Unrest in Hong Kong, 1998–2020