DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...15
Hits 1 – 20 of 288

1
Statistics in corpus linguistics research: a new approach
Wallis, Sean. - London usw. : Routledge, 2021
IDS Bibliografie zur deutschen Grammatik
Show details
2
Modeling contextual information in neural machine translation
Stojanovski, Dario. - : Ludwig-Maximilians-Universität München, 2021
BASE
Show details
3
Embedding mobile learning into everyday life settings
Schneegass, Christina. - : Ludwig-Maximilians-Universität München, 2021
BASE
Show details
4
Distributed representations for multilingual language processing
Dufter, Philipp. - : Ludwig-Maximilians-Universität München, 2021
Abstract: Distributed representations are a central element in natural language processing. Units of text such as words, ngrams, or characters are mapped to real-valued vectors so that they can be processed by computational models. Representations trained on large amounts of text, called static word embeddings, have been found to work well across a variety of tasks such as sentiment analysis or named entity recognition. More recently, pretrained language models are used as contextualized representations that have been found to yield even better task performances. Multilingual representations that are invariant with respect to languages are useful for multiple reasons. Models using those representations would only require training data in one language and still generalize across multiple languages. This is especially useful for languages that exhibit data sparsity. Further, machine translation models can benefit from source and target representations in the same space. Last, knowledge extraction models could not only access English data, but data in any natural language and thus exploit a richer source of knowledge. Given that several thousand languages exist in the world, the need for multilingual language processing seems evident. However, it is not immediately clear, which properties multilingual embeddings should exhibit, how current multilingual representations work and how they could be improved. This thesis investigates some of these questions. In the first publication, we explore the boundaries of multilingual representation learning by creating an embedding space across more than one thousand languages. We analyze existing methods and propose concept based embedding learning methods. The second paper investigates differences between creating representations for one thousand languages with little data versus considering few languages with abundant data. In the third publication, we refine a method to obtain interpretable subspaces of embeddings. This method can be used to investigate the workings of multilingual representations. The fourth publication finds that multilingual pretrained language models exhibit a high degree of multilinguality in the sense that high quality word alignments can be easily extracted. The fifth paper investigates reasons why multilingual pretrained language models are multilingual despite lacking any kind of crosslingual supervision during training. Based on our findings we propose a training scheme that leads to improved multilinguality. Last, the sixth paper investigates the use of multilingual pretrained language models as multilingual knowledge bases.
Keyword: ddc:000; ddc:004; Fakultät für Mathematik; Informatik und Statistik
URL: http://nbn-resolving.de/urn:nbn:de:bvb:19-280144
https://edoc.ub.uni-muenchen.de/28014/1/Dufter_Philipp.pdf
BASE
Hide details
5
Warum wir so wenig über die Sprachen in Deutschland wissen: Spracheinstellungen als Erkenntnisbarriere
In: Diskurs Kindheits- und Jugendforschung / Discourse. Journal of Childhood and Adolescence Research ; 16 ; 4 ; 403-419 ; Perspektiven von Kindern und Jugendlichen auf sprachliche Diversität und Sprachbildungsprozesse (2021)
BASE
Show details
6
Konzepte und Guidelines für Applikationen in Cinematic Virtual Reality
Rothe, Sylvia. - : Ludwig-Maximilians-Universität München, 2020
BASE
Show details
7
Evaluating Unsupervised Representation Learning for Detecting Stances of Fake News
Guderlei, Maike; Aßenmacher, Matthias. - : Ludwig-Maximilians-Universität München, 2020
BASE
Show details
8
Multi-dimensional analysis : research methods and current issues
Sardinha, Tony Berber (Herausgeber); Pinto, Marcia Veirano (Herausgeber). - Sydney : Bloomsbury Academic, 2019
BLLDB
UB Frankfurt Linguistik
Show details
9
KoGra-R: Standardisierte statistische Auswertung von Korpusrecherchen
In: Grammatik im Korpus (2019), 299-357
IDS Bibliografie zur deutschen Grammatik
Show details
10
Bildungsforschung mit Daten der amtlichen Statistik ; Educational research with data of official statistics
In: Fickermann, Detlef [Hrsg.]; Weishaupt, Horst [Hrsg.]: Bildungsforschung mit Daten der amtlichen Statistik. Münster ; New York : Waxmann 2019, S. 11-18. - (Die Deutsche Schule, Beiheft; 14) (2019)
BASE
Show details
11
Bildungsforschung mit Daten der amtlichen Statistik
Fickermann, Detlef Hrsg.; Weishaupt, Horst Hrsg.. - : Waxmann, 2019. : Münster, 2019. : New York, 2019. : pedocs-Dokumentenserver/DIPF, 2019
In: Münster ; New York : Waxmann 2019, 267 S. - (Die Deutsche Schule, Beiheft; 14) (2019)
BASE
Show details
12
Multilabel text classification of public procurements using deep learning intent detection ; Textklassificering av offentliga upphandlingar med djupa artificiella neuronnät och avsåtsdetektering
Suta, Adin. - : KTH, Matematisk statistik, 2019
BASE
Show details
13
LSTM vs Random Forest for Binary Classification of Insurance Related Text ; LSTM vs Random Forest för binär klassificering av försäkringsrelaterad text
Kindbom, Hannes. - : KTH, Matematisk statistik, 2019
BASE
Show details
14
Public Sentiment on Twitter and Stock Performance : A Study in Natural Language Processing ; Allmänna sentimentet på Twitter och aktiemarknaden : En studie i språkteknologi
Henriksson, Jimmy; Hultberg, Carl. - : KTH, Skolan för elektroteknik och datavetenskap (EECS), 2019
BASE
Show details
15
Integration of Migrant Populations into Health Monitoring in Germany: Results from a Feasibility Study
In: Survey Methods: Insights from the Field ; 1-11 ; Probability and Nonprobability Sampling: Sampling of hard-to-reach survey populations (2019)
BASE
Show details
16
Quantitative methods for second language research : a problem-solving approach
Phakiti, Aek; Röver, Carsten. - London : Routledge, 2018
BLLDB
UB Frankfurt Linguistik
Show details
17
FREDDIE Shiny - an online statistics interface ; FREDDIE Shiny - ein Online-Werkzeug für Statistik
BASE
Show details
18
Ein statistisches Mittel zur Messbarkeit von Semantik : = A statistical mean measuring semantics
Schäfer, Philipp. - Aachen : Shaker Verlag, 2017
BLLDB
UB Frankfurt Linguistik
Show details
19
Supervised and unsupervised methods for learning representations of linguistic units
Rothe, Sascha. - : Ludwig-Maximilians-Universität München, 2017
BASE
Show details
20
Functional linear mixed models for complex correlation structures and general sampling grids
Cederbaum, Jona. - : Ludwig-Maximilians-Universität München, 2017
BASE
Show details

Page: 1 2 3 4 5...15

Catalogues
33
54
0
0
1
7
35
Bibliographies
36
0
91
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
1
0
0
1
Open access documents
58
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern