2 |
Methods, Models and Tools for Improving the Quality of Textual Annotations
|
|
|
|
In: Modelling; Volume 3; Issue 2; Pages: 224-242 (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Models of diachronic semantic change using word embeddings ; Modèles diachroniques à base de plongements de mot pour l'analyse du changement sémantique
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-03199801 ; Document and Text Processing. Université Paris-Saclay, 2021. English. ⟨NNT : 2021UPASG006⟩ (2021)
|
|
BASE
|
|
Show details
|
|
4 |
Teachers of Color's Perception on Identity and Academic Success: A Reflective Narrative
|
|
|
|
In: All Antioch University Dissertations & Theses (2021)
|
|
BASE
|
|
Show details
|
|
5 |
A Survey on Multilingual Hate Speech Detection and Classification by Machine Learning Techniques ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
A Survey on Multilingual Hate Speech Detection and Classification by Machine Learning Techniques ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
APiCS-Ligt: Towards Semantic Enrichment of Interlinear Glossed Text ...
|
|
Ionov, Maxim. - : Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021
|
|
BASE
|
|
Show details
|
|
8 |
Towards Learning Terminological Concept Systems from Multilingual Natural Language Text ...
|
|
|
|
BASE
|
|
Show details
|
|
9 |
Improving Multilingual Models for the Swedish Language : Exploring CrossLingual Transferability and Stereotypical Biases
|
|
|
|
BASE
|
|
Show details
|
|
10 |
NASca and NASes: Two Monolingual Pre-Trained Models for Abstractive Summarization in Catalan and Spanish
|
|
|
|
In: Applied Sciences ; Volume 11 ; Issue 21 (2021)
|
|
BASE
|
|
Show details
|
|
11 |
Inductive Bias and Modular Design for Sample-Efficient Neural Language Learning ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Analyzing Non-Textual Content Elements to Detect Academic Plagiarism
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Learning to scale multilingual representations for vision-language tasks
|
|
|
|
BASE
|
|
Show details
|
|
16 |
Inductive Bias and Modular Design for Sample-Efficient Neural Language Learning
|
|
|
|
BASE
|
|
Show details
|
|
17 |
SberQuAD – Russian Reading Comprehension Dataset: Description and Analysis
|
|
|
|
In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020)
|
|
BASE
|
|
Show details
|
|
18 |
Assessing English Writing in Multilingual Writers in Higher Education: A Longitudinal Study
|
|
|
|
In: Applied Linguistics and English as a Second Language Dissertations (2019)
|
|
BASE
|
|
Show details
|
|
19 |
Character language models for generalization of multilingual named entity recognition
|
|
|
|
Abstract:
State-of-the-art Named Entity Recognition (NER) models usually achieve high performance on entities that they have seen in training data, but a significantly lower performance on unseen entities. This is one of the key reasons in performance degradation observed when NER models are evaluated on new domains. Motivated by this observation, quantified for the first time in this thesis, we study an improved, multi-domain and multi-lingual, capability for identifying \what is a name". Character-level patterns have been widely used as features in English Named Entity Recognition (NER) systems. However, to date there has been no direct investigation of the inherent differences between name and non-name tokens in text, nor whether this property holds across multiple languages. The key contribution of this thesis is to develop a Character-level Language Model (CLM) that, as we show, allow us to better learn \what is a name". We analyze the capabilities of corpus-agnostic Character-level Language Models (CLMs) in the binary task of distinguishing name tokens from non-name tokens and demonstrate that CLMs provide a simple yet powerful model for capturing these differences. Specifically, we show that it can identify named entity tokens in a diverse set of languages at close to the performance of full NER systems. Moreover, by adding very simple CLM-based features we can significantly improve the performance of an o -the-shelf NER system for multiple languages.
|
|
Keyword:
Character Language Models; Generalization; Multilingual; Multilingual Named Entity Recognition; Named Entity Recognition; NER
|
|
URL: http://hdl.handle.net/2142/104934
|
|
BASE
|
|
Hide details
|
|
20 |
Multilingual Information Access (MLIA) Tools on Google and WorldCat: Bi/Multilingual University Students’ Experience and Perceptions
|
|
|
|
In: FIMS Publications (2019)
|
|
BASE
|
|
Show details
|
|
|
|