1 |
Assessing the impact of OCR noise on multilingual event detection over digitised documents
|
|
|
|
In: ISSN: 1432-5012 ; EISSN: 1432-1300 ; International Journal on Digital Libraries ; https://hal.archives-ouvertes.fr/hal-03635985 ; International Journal on Digital Libraries, Springer Verlag, 2022, ⟨10.1007/s00799-022-00325-2⟩ (2022)
|
|
BASE
|
|
Show details
|
|
2 |
PROTECT: A Pipeline for Propaganda Detection and Classification
|
|
|
|
In: CLiC-it 2021- Italian Conference on Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-03417019 ; CLiC-it 2021- Italian Conference on Computational Linguistics, Jan 2022, Milan, Italy (2022)
|
|
BASE
|
|
Show details
|
|
3 |
Potential of automatic speech processing technologies for early detection of oral language disorders: a meta-analytic review ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Data for: Speech naturalness detection and language representation in the dog brain ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Data for: Speech naturalness detection and language representation in the dog brain ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
MIss RoBERTa WiLDe: Metaphor Identification Using Masked Language Model with Wiktionary Lexical Definitions
|
|
|
|
In: Applied Sciences; Volume 12; Issue 4; Pages: 2081 (2022)
|
|
Abstract:
Recent years have brought an unprecedented and rapid development in the field of Natural Language Processing. To a large degree this is due to the emergence of modern language models like GPT-3 (Generative Pre-trained Transformer 3), XLNet, and BERT (Bidirectional Encoder Representations from Transformers), which are pre-trained on a large amount of unlabeled data. These powerful models can be further used in the tasks that have traditionally been suffering from a lack of material that could be used for training. Metaphor identification task, which is aimed at automatic recognition of figurative language, is one of such tasks. The metaphorical use of words can be detected by comparing their contextual and basic meanings. In this work, we deliver the evidence that fully automatically collected dictionary definitions can be used as the optimal medium for retrieving the non-figurative word senses, which consequently may help improve the performance of the algorithms used in metaphor detection task. As the source of the lexical information, we use the openly available Wiktionary. Our method can be applied without changes to any other dataset designed for token-level metaphor detection given it is binary labeled. In the set of experiments, our proposed method (MIss RoBERTa WiLDe) outperforms or performs similarly well as the competing models on several datasets commonly chosen in the research on metaphor processing.
|
|
Keyword:
figurative language; language models; lexical definitions; metaphor detection; RoBERTa; Sentence-BERT; Wiktionary
|
|
URL: https://doi.org/10.3390/app12042081
|
|
BASE
|
|
Hide details
|
|
7 |
Detection of Chinese Deceptive Reviews Based on Pre-Trained Language Model
|
|
|
|
In: Applied Sciences; Volume 12; Issue 7; Pages: 3338 (2022)
|
|
BASE
|
|
Show details
|
|
8 |
Measuring Gender Bias in Contextualized Embeddings
|
|
|
|
In: Computer Sciences & Mathematics Forum; Volume 3; Issue 1; Pages: 3 (2022)
|
|
BASE
|
|
Show details
|
|
9 |
Automatic Detection of Plagiarism in Writing
|
|
|
|
In: Studies in Applied Linguistics & TESOL, Vol 21, Iss 2 (2022) (2022)
|
|
BASE
|
|
Show details
|
|
10 |
Automatic Detection of Plagiarism in Writing
|
|
|
|
In: Studies in Applied Linguistics & TESOL, Vol 21, Iss 2 (2022) (2022)
|
|
BASE
|
|
Show details
|
|
11 |
Deceptive Opinions Detection Using New Proposed Arabic Semantic Features
|
|
|
|
In: ISSN: 1877-0509 ; EISSN: 1877-0509 ; Procedia Computer Science ; https://hal.archives-ouvertes.fr/hal-03299022 ; Procedia Computer Science, Elsevier, 2021, 189, pp.29 - 36. ⟨10.1016/j.procs.2021.05.067⟩ (2021)
|
|
BASE
|
|
Show details
|
|
12 |
End-to-end speaker segmentation for overlap-aware resegmentation
|
|
|
|
In: Interspeech 2021 ; https://hal-univ-lemans.archives-ouvertes.fr/hal-03257524 ; Interspeech 2021, Aug 2021, Brno, Czech Republic ; https://www.interspeech2021.org/ (2021)
|
|
BASE
|
|
Show details
|
|
13 |
Tackling Morphological Analogies Using Deep Learning -- Extended Version
|
|
|
|
In: https://hal.inria.fr/hal-03425776 ; 2021 (2021)
|
|
BASE
|
|
Show details
|
|
14 |
Enjeux liés à la détection de l’ironie
|
|
|
|
In: Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 2 : 23e REncontres jeunes Chercheurs en Informatique pour le TAL (RECITAL) ; Traitement Automatique des Langues Naturelles ; https://hal.archives-ouvertes.fr/hal-03265905 ; Traitement Automatique des Langues Naturelles, 2021, Lille, France. pp.55-66 (2021)
|
|
BASE
|
|
Show details
|
|
15 |
Study of non-projective dependencies in French ; Étude des dépendances syntaxiques non projectives en français
|
|
|
|
In: ISSN: 1248-9433 ; EISSN: 1965-0906 ; Revue TAL ; https://hal.inria.fr/hal-03389157 ; Revue TAL, ATALA (Association pour le Traitement Automatique des Langues), 2021, 62 (1) (2021)
|
|
BASE
|
|
Show details
|
|
16 |
COSMO-Onset: A Neurally-Inspired Computational Model of Spoken Word Recognition, Combining Top-Down Prediction and Bottom-Up Detection of Syllabic Onsets
|
|
|
|
In: ISSN: 1662-5137 ; Frontiers in Systems Neuroscience ; https://hal.archives-ouvertes.fr/hal-03318691 ; Frontiers in Systems Neuroscience, Frontiers, 2021, 15, pp.653975. ⟨10.3389/fnsys.2021.653975⟩ (2021)
|
|
BASE
|
|
Show details
|
|
17 |
Hate speech and offensive language detection using transfer learning approaches ; Détection du discours de haine et du langage offensant utilisant des approches de Transfer Learning
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-03276023 ; Document and Text Processing. Institut Polytechnique de Paris, 2021. English. ⟨NNT : 2021IPPAS007⟩ (2021)
|
|
BASE
|
|
Show details
|
|
18 |
Leveraging lyrics from audio for MIR ; Exploiter les paroles de chansons à partir de l'audio pour le MIR
|
|
|
|
In: https://tel.archives-ouvertes.fr/tel-03558515 ; Signal and Image processing. Institut Polytechnique de Paris, 2021. English. ⟨NNT : 2021IPPAT027⟩ (2021)
|
|
BASE
|
|
Show details
|
|
19 |
A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers
|
|
|
|
In: SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval ; https://hal.archives-ouvertes.fr/hal-03418387 ; SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 2021, Virtual Event, Canada. pp.2328-2334, ⟨10.1145/3404835.3463255⟩ (2021)
|
|
BASE
|
|
Show details
|
|
20 |
Breaking Down the Invisible Wall of Informal Fallacies in Online Discussions
|
|
|
|
In: ACL-IJCNLP 2021 - Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing ; https://hal.inria.fr/hal-03351649 ; ACL-IJCNLP 2021 - Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Aug 2021, Online, France ; https://2021.aclweb.org/ (2021)
|
|
BASE
|
|
Show details
|
|
|
|