Home
Catalogue search
Refine your search:
Keyword
Creator / Publisher:
Gauvain, Jean-Luc (2)
Huang, Guangpu (2)
Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur (LIMSI) (2)
Lamel, Lori (2)
Laurent, Antoine (2)
ROSARIO, RYAN ROBERT (2)
Sorbonne Université (SU)-Sorbonne Université (SU)-Université Paris-Saclay-Université Paris-Sud - Paris 11 (UP11) (2)
Université Paris Saclay (COmUE)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université - UFR d'Ingénierie (UFR 919) (2)
A Min Tjoa (1)
Abascal, Julio (1)
more
Year
Medium
Type
BLLDB-Access
Search in the Catalogues and Directories
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
AND
OR
AND NOT
All fields
Title
Creator / Publisher
Keyword
Year
Sort by
creator [A → Z]
'
creator [Z → A]
'
publishing year ↑ (asc)
'
publishing year ↓ (desc)
'
title [A → Z]
'
title [Z → A]
'
Simple Search
Hits 1 – 13 of 13
1
Effekten av textaugmenteringsstrategier på träffsäkerhet, F1-värde och viktat F1-värde ; The effect of text data augmentation strategies on Accuracy, F1-score, and weighted F1-score
Shmas, George
;
Svedberg, Jonatan
. - : KTH, Hälsoinformatik och logistik, 2021
Abstract:
Att utveckla en sofistikerad chatbotlösning kräver stora mängder textdata för att kunna anpassalösningen till en specifik domän. Att manuellt skapa en komplett uppsättning textdata, specialanpassat för den givna domänen och innehållandes ett stort antal varierande meningar som en människa kan tänkas yttra, är ett enormt tidskrävande arbete. För att kringgå detta tillämpas dataaugmentering för att generera mer data utifrån en mindre uppsättning redan existerande textdata. Softronic AB vill undersöka alternativa strategier för dataaugmentering med målet att eventuellt ersätta den nuvarande lösningen med en mer vetenskapligt underbyggd sådan. I detta examensarbete har prototypmodeller utvecklats för att jämföra och utvärdera effekten av olika textaugmenteringsstrategier. Resultatet av genomförda experiment med prototypmodellerna visar att augmentering genom synonymutbyten med en domänanpassad synonymordlista, presenterade märkbart förbättrade effekter på förmågan hos en NLU-modell att korrekt klassificera data, gentemot övriga utvärderade strategier. Vidare indikerar resultatet att ett samband föreligger mellan den strukturella variationsgraden av det augmenterade datat och de tillämpade språkparens semantiska likhetsgrad under tillbakaöversättningar. ; Developing a sophisticated chatbot solution requires large amounts of text data to be able to adapt the solution to a specific domain. Manually creating a complete set of text data, specially adapted for the given domain, and containing a large number of varying sentences that a human conceivably can express, is an exceptionally time-consuming task. To circumvent this, data augmentation is applied to generate more data based on a smaller set of already existing text data. Softronic AB wants to investigate alternative strategies for data augmentation with the aim of possibly replacing the current solution with a more scientifically substantiated one. In this thesis, prototype models have been developed to compare and evaluate the effect of different text augmentation strategies. The results of conducted experiments with the prototype models show that augmentation through synonym swaps with a domain-adapted thesaurus, presented noticeably improved effects on the ability of an NLU-model to correctly classify data, compared to other evaluated strategies. Furthermore, the result indicates that there is a relationship between the structural degree of variation of the augmented data and the applied language pair's semantic degree of similarity during back-translations.
Keyword:
back translation
;
brusinjektion
;
F1-score
;
F1-värde
;
Language Technology (Computational Linguistics)
;
noise injection
;
RASA NLU
;
Språkteknologi (språkvetenskaplig databehandling)
;
synonym swap
;
synonymutbyte
;
Text data augmentation
;
Textdataaugmentering
;
tillbakaöversättning
URL:
http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-296550
BASE
Hide details
2
Improving Short Text Classification Through Global Augmentation Methods
Marivate, Vukosi
;
Sefara, Tshephisho
In: Lecture Notes in Computer Science ; 4th International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE) ; https://hal.inria.fr/hal-03414750 ; 4th International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2020, Dublin, Ireland. pp.385-399, ⟨10.1007/978-3-030-57321-8_21⟩ (2020)
BASE
Show details
3
Characterization and classification of semantic image-text relations ...
Otto, Christian
;
Springstein, Matthias
;
Anand, Avishek
. - : London : Springer, 2020
BASE
Show details
4
Characterization and classification of semantic image-text relations ...
Otto, C.
;
Springstein, M.
;
Anand, A.
. - : Berlin : Springer Nature, 2020
BASE
Show details
5
Effective keyword search for low-resourced conversational speech
Lileikyte, Rasa
;
Fraga-Silva, Thiago
;
Lamel, Lori
...
In: icassp 2017 ; https://hal.archives-ouvertes.fr/hal-01744176 ; icassp 2017, IEEE, Mar 2017, La Nouvelle Orléans, United States (2017)
BASE
Show details
6
A Data Augmentation Approach to Short Text Classification
ROSARIO, RYAN ROBERT
. - : eScholarship, University of California, 2017
In: ROSARIO, RYAN ROBERT. (2017). A Data Augmentation Approach to Short Text Classification. UCLA: Statistics 0891. Retrieved from: http://www.escholarship.org/uc/item/9cn7k2xq (2017)
BASE
Show details
7
A Data Augmentation Approach to Short Text Classification
ROSARIO, RYAN ROBERT
. - : eScholarship, University of California, 2017
BASE
Show details
8
How Short is a Piece of String?: the Impact of Text Length and Text Augmentation on Short-text Classification Accuracy
McCartney, Austin
;
Hensman, Svetlana
;
Longo, Luca
In: Conference papers (2017)
BASE
Show details
9
Language Model Data Augmentation for Keyword Spotting
Gorin, Arseniy
;
Lileikyté, Rasa
;
Huang, Guangpu
...
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01837186 ; Annual Conference of the International Speech Communication Association , Jan 2016, San Francisco, United States (2016)
BASE
Show details
10
InLéctor: enhanced bilingual e-books for language learning
Oliver González, Antoni
;
Coll-Florit, Marta
;
Iribarren i Donadeu, Teresa
. - : International Journal on Advances in Education Research, 2014
BASE
Show details
11
Modelling text prediction systems in low- and high-inflected languages
Garay-Vitoria, Nestor
;
Abascal, Julio
In:
Computer speech and language. - Amsterdam [u.a.] : Elsevier
24 (2010) 2, 117-135
BLLDB
OLC Linguistik
Show details
12
Text Augmentation: Inserting markup into natural language text with PPM Models
Yeates, Stuart Andrew
. - : The University of Waikato, 2006
BASE
Show details
13
A Development System for Augmented Transition Network Grammars and a Large Grammar for Technical Prose.
Mayer,John
;
Kieras,David E
In: DTIC AND NTIS (1987)
BASE
Show details
Mobile view
All
Catalogues
UB Frankfurt Linguistik
0
IDS Mannheim
0
OLC Linguistik
1
UB Frankfurt Retrokatalog
0
DNB Subject Category Language
0
Institut für Empirische Sprachwissenschaft
0
Leibniz-Centre General Linguistics (ZAS)
0
Bibliographies
BLLDB
1
BDSL
0
IDS Bibliografie zur deutschen Grammatik
0
IDS Bibliografie zur Gesprächsforschung
0
IDS Konnektoren im Deutschen
0
IDS Präpositionen im Deutschen
0
IDS OBELEX meta
0
MPI-SHH Linguistics Collection
0
MPI for Psycholinguistics
0
Linked Open Data catalogues
Annohub
0
Online resources
Link directory
0
Journal directory
0
Database directory
0
Dictionary directory
0
Open access documents
BASE
12
Linguistik-Repository
0
IDS Publikationsserver
0
Online dissertations
0
Language Description Heritage
0
© 2013 - 2024 Lin|gu|is|tik
|
Imprint
|
Privacy Policy
|
Datenschutzeinstellungen ändern