


Hits 1 – 20 of 288

1	Statistics in corpus linguistics research: a new approach
	Wallis, Sean. - London etc. : Routledge, 2021
	IDS Bibliografie zur deutschen Grammatik

2	Modeling contextual information in neural machine translation
	Stojanovski, Dario. - : Ludwig-Maximilians-Universität München, 2021
	BASE

3	Embedding mobile learning into everyday life settings
	Schneegass, Christina. - : Ludwig-Maximilians-Universität München, 2021
	BASE

4	Distributed representations for multilingual language processing
	Dufter, Philipp. - : Ludwig-Maximilians-Universität München, 2021
	Abstract: Distributed representations are a central element in natural language processing. Units of text such as words, n-grams, or characters are mapped to real-valued vectors so that they can be processed by computational models. Representations trained on large amounts of text, called static word embeddings, have been found to work well across a variety of tasks such as sentiment analysis or named entity recognition. More recently, pretrained language models have been used as contextualized representations, which have been found to yield even better task performance. Multilingual representations that are invariant with respect to languages are useful for multiple reasons. Models using those representations would only require training data in one language and still generalize across multiple languages. This is especially useful for languages that exhibit data sparsity. Further, machine translation models can benefit from source and target representations in the same space. Last, knowledge extraction models could access not only English data but data in any natural language, and thus exploit a richer source of knowledge. Given that several thousand languages exist in the world, the need for multilingual language processing seems evident. However, it is not immediately clear which properties multilingual embeddings should exhibit, how current multilingual representations work, and how they could be improved. This thesis investigates some of these questions. In the first publication, we explore the boundaries of multilingual representation learning by creating an embedding space across more than one thousand languages. We analyze existing methods and propose concept-based embedding learning methods. The second paper investigates differences between creating representations for one thousand languages with little data versus considering few languages with abundant data. In the third publication, we refine a method to obtain interpretable subspaces of embeddings. This method can be used to investigate the workings of multilingual representations. The fourth publication finds that multilingual pretrained language models exhibit a high degree of multilinguality in the sense that high-quality word alignments can be easily extracted. The fifth paper investigates reasons why multilingual pretrained language models are multilingual despite lacking any kind of crosslingual supervision during training. Based on our findings, we propose a training scheme that leads to improved multilinguality. Last, the sixth paper investigates the use of multilingual pretrained language models as multilingual knowledge bases.
	Keyword: ddc:000; ddc:004; Fakultät für Mathematik, Informatik und Statistik
	URL: http://nbn-resolving.de/urn:nbn:de:bvb:19-280144 https://edoc.ub.uni-muenchen.de/28014/1/Dufter_Philipp.pdf
	BASE

5	Warum wir so wenig über die Sprachen in Deutschland wissen: Spracheinstellungen als Erkenntnisbarriere
	Adler, Astrid; Ribeiro Silveira, Maria
	In: Diskurs Kindheits- und Jugendforschung / Discourse. Journal of Childhood and Adolescence Research ; 16 ; 4 ; 403-419 ; Perspektiven von Kindern und Jugendlichen auf sprachliche Diversität und Sprachbildungsprozesse (2021)
	BASE

6	Konzepte und Guidelines für Applikationen in Cinematic Virtual Reality
	Rothe, Sylvia. - : Ludwig-Maximilians-Universität München, 2020
	BASE

7	Evaluating Unsupervised Representation Learning for Detecting Stances of Fake News
	Guderlei, Maike; Aßenmacher, Matthias. - : Ludwig-Maximilians-Universität München, 2020
	BASE

8	Multi-dimensional analysis : research methods and current issues
	Sardinha, Tony Berber (editor); Pinto, Marcia Veirano (editor). - Sydney : Bloomsbury Academic, 2019
	BLLDB
	UB Frankfurt Linguistik

9	KoGra-R: Standardisierte statistische Auswertung von Korpusrecherchen
	Hansen-Morath, Sandra; Schmitz, Hans-Christian; Schneider, Roman...
	In: Grammatik im Korpus (2019), 299-357
	IDS Bibliografie zur deutschen Grammatik

10	Bildungsforschung mit Daten der amtlichen Statistik ; Educational research with data of official statistics
	Fickermann, Detlef; Weishaupt, Horst
	In: Fickermann, Detlef [ed.]; Weishaupt, Horst [ed.]: Bildungsforschung mit Daten der amtlichen Statistik. Münster ; New York : Waxmann 2019, pp. 11-18. - (Die Deutsche Schule, Beiheft; 14) (2019)
	BASE

11	Bildungsforschung mit Daten der amtlichen Statistik
	Fickermann, Detlef (ed.); Weishaupt, Horst (ed.). - : Waxmann, 2019. : Münster, 2019. : New York, 2019. : pedocs-Dokumentenserver/DIPF, 2019
	In: Münster ; New York : Waxmann 2019, 267 pp. - (Die Deutsche Schule, Beiheft; 14) (2019)
	BASE

12	Multilabel text classification of public procurements using deep learning intent detection ; Textklassificering av offentliga upphandlingar med djupa artificiella neuronnät och avsåtsdetektering
	Suta, Adin. - : KTH, Matematisk statistik, 2019
	BASE

13	LSTM vs Random Forest for Binary Classification of Insurance Related Text ; LSTM vs Random Forest för binär klassificering av försäkringsrelaterad text
	Kindbom, Hannes. - : KTH, Matematisk statistik, 2019
	BASE

14	Public Sentiment on Twitter and Stock Performance : A Study in Natural Language Processing ; Allmänna sentimentet på Twitter och aktiemarknaden : En studie i språkteknologi
	Henriksson, Jimmy; Hultberg, Carl. - : KTH, Skolan för elektroteknik och datavetenskap (EECS), 2019
	BASE

15	Integration of Migrant Populations into Health Monitoring in Germany: Results from a Feasibility Study
	Zeisler, Marie-Luise; Lemcke, Johannes
	In: Survey Methods: Insights from the Field ; 1-11 ; Probability and Nonprobability Sampling: Sampling of hard-to-reach survey populations (2019)
	BASE

16	Quantitative methods for second language research : a problem-solving approach
	Phakiti, Aek; Röver, Carsten. - London : Routledge, 2018
	BLLDB
	UB Frankfurt Linguistik

17	FREDDIE Shiny - an online statistics interface ; FREDDIE Shiny - ein Online-Werkzeug für Statistik
	Zamecnik, Jiri; Juskan, Marten. - 2018
	BASE

18	Ein statistisches Mittel zur Messbarkeit von Semantik : = A statistical mean measuring semantics
	Schäfer, Philipp. - Aachen : Shaker Verlag, 2017
	BLLDB
	UB Frankfurt Linguistik

19	Supervised and unsupervised methods for learning representations of linguistic units
	Rothe, Sascha. - : Ludwig-Maximilians-Universität München, 2017
	BASE

20	Functional linear mixed models for complex correlation structures and general sampling grids
	Cederbaum, Jona. - : Ludwig-Maximilians-Universität München, 2017
	BASE

