DE eng

Search in the Catalogues and Directories

Hits 1 – 1 of 1

1
A Swiss German Dictionary: Variation in Speech and Writing ...
Abstract: We introduce a dictionary containing forms of common words in various Swiss German dialects normalized into High German. As Swiss German is, for now, a predominantly spoken language, there is a significant variation in the written forms, even between speakers of the same dialect. To alleviate the uncertainty associated with this diversity, we complement the pairs of Swiss German - High German words with the Swiss German phonetic transcriptions (SAMPA). This dictionary becomes thus the first resource to combine large-scale spontaneous translation with phonetic transcriptions. Moreover, we control for the regional distribution and insure the equal representation of the major Swiss dialects. The coupling of the phonetic and written Swiss German forms is powerful. We show that they are sufficient to train a Transformer-based phoneme to grapheme model that generates credible novel Swiss German writings. In addition, we show that the inverse mapping - from graphemes to phonemes - can be modeled with a transformer ... : 6 pages, 1 figure, 2 tables. To be published in: Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020). Marseille, France. For project reports and to obtain the dictionary see http://tiny.uzh.ch/11X ...
Keyword: 68T50, 68T10; A.2; I.2.7; J.5; Computation and Language cs.CL; FOS Computer and information sciences
URL: https://arxiv.org/abs/2004.00139
https://dx.doi.org/10.48550/arxiv.2004.00139
BASE
Hide details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
1
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern