1. Le modèle Transformer: un « couteau suisse » pour le traitement automatique des langues
In: Techniques de l'Ingénieur, 2022, ⟨10.51257/a-v1-in195⟩ ; https://hal.archives-ouvertes.fr/hal-03619077 ; https://www.techniques-ingenieur.fr/base-documentaire/innovation-th10/innovations-en-electronique-et-tic-42257210/transformer-des-reseaux-de-neurones-pour-le-traitement-automatique-des-langues-in195/
BASE
6. On Multi-domain Sentence Level Sentiment Analysis for Roman Urdu ...
BASE
8. Multilingual Email Zoning - Segmenting Multilingual Email Text Into Zones
Abstract:
Dissertation presented as the partial requirement for obtaining a Master's degree in Information Management, specialization in Knowledge Management and Business Intelligence. The segmentation of emails into functional zones (also called email zoning) is a relevant preprocessing step for most NLP tasks that deal with emails. In this research, we analyze the email zoning literature in depth and develop a business case around CLEVERLY AI, a company from the Customer Service sector. We design a new email zoning classification schema and collect a multilingual corpus of emails from CLEVERLY AI clients. We develop five neural network-based email zoning systems; among them, we introduce OKAPI, the first multilingual email zoning model based on a language-agnostic sentence encoder. Besides outperforming our other systems when tested on CLEVERLY’s emails, OKAPI performs competitively on current English public benchmarks and reaches new state-of-the-art results on English domain adaptation tasks. Moreover, we release a new multilingual benchmark, composed of 625 emails in Portuguese, Spanish and French, and demonstrate that OKAPI can effectively generalize to unseen languages.
Keywords:
Customer Service; Email Zoning; Machine Learning; Multilingual; Natural Language Processing; Text Segmentation
URL: http://hdl.handle.net/10362/119831
BASE
10. ISL-CSLTR: Indian Sign Language Dataset for Continuous Sign Language Translation and Recognition ...
BASE
11. Content4All Open Research Sign Language Translation Datasets ...
BASE
12. ISL-CSLTR: Indian Sign Language Dataset for Continuous Sign Language Translation and Recognition ...
BASE
13. Identity-Based Patterns in Deep Convolutional Networks: Generative Adversarial Phonology and Reduplication ...
BASE
14. A Codicological and Linguistic Typology of Common Torah Codices from the Cairo Genizah ...
BASE
17. Rule-based Morphological Inflection Improves Neural Terminology Translation ...
BASE
18. Translating Headers of Tabular Data: A Pilot Study of Schema Translation ...
BASE
19. A Prototype Free/Open-Source Morphological Analyser and Generator for Sakha ...
BASE