
Search in the Catalogues and Directories

Hits 1 – 20 of 5,231

1
Wissensrohstoff Text : Eine Einführung in das Text Mining
Biemann, Chris (author); Heyer, Gerhard (author). - 2nd, substantially revised edition 2022. - Wiesbaden : Springer Fachmedien Wiesbaden GmbH, 2022
IDS Mannheim
2
Uncertainty, quantity, and relevance inferences from modified numerals
In: Measurements, Numerals and Scales : Essays in Honour of Stephanie Solt (2022), pp. 59-74
Leibniz-Zentrum Allgemeine Sprachwissenschaft
3
Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources
In: https://hal.inria.fr/hal-03550289 ; 2022 (2022)
BASE
4
Proportionate translation of study materials and measures in a multinational global health trial: methodology development and implementation ...
Charles, Ashleigh; Korde, Palak; Newby, Chris. - : Universität Ulm, 2022
BASE
5
Community Mapping 2.0: Using Technology to Raise Community Awareness
In: Networks: An Online Journal for Teacher Research (2022)
BASE
6
MMTAfrica: Multilingual Machine Translation for African Languages ...
BASE
7
A Feasibility Study of Answer-Agnostic Question Generation for Education ...
BASE
8
Longitudinal Brain Correlates of Multisensory Lexical Processing in Children ...
BASE
9
Language Models Explain Word Reading Times Better Than Empirical Predictability ...
BASE
10
Cyberbullying Classifiers are Sensitive to Model-Agnostic Perturbations ...
BASE
11
SCoT: Sense Clustering over Time: a tool for the analysis of lexical change ...
BASE
12
Automatic Fake News Detection: Are current models "fact-checking" or "gut-checking"? ...
BASE
13
Cerebral Polymorphisms for Lateralisation: Modelling the Genetic and Phenotypic Architectures of Multiple Functional Modules
In: Symmetry; Volume 14; Issue 4; Pages: 814 (2022)
BASE
14
How We Failed in Context: A Text-Mining Approach to Understanding Hotel Service Failures
In: Sustainability; Volume 14; Issue 5; Pages: 2675 (2022)
BASE
15
BAS Edition of German Distant Speech Data Corpus 2014/2015
Abstract: General information: The corpus contains read German speech of 179 different speakers (50 female, 129 male). Each speaker has read randomly selected sentences from four text collections: Wikipedia, the Europarl Corpus, a list of German command and control sentences, and a corpus of web-crawled sentences that represent direct speech. The recording took place at the Language Technology and Telecooperation labs, TU-Darmstadt, Germany in 2014-2015. The task for the speaker was to read fluently and precisely (no dialectal variation). Up to 5 microphones were recorded in parallel: Kinect 1 Beamformed Audio signal through Kinect SDK, Kinect 1 Direct Access as normal microphone, Internal Realtek Mic of Asus PC - near noisy fan, Samson C01U, Yamaha PSG-01S. Distance to mouth for all microphones was approx. 100 cm. Room: 'dry' acoustics ('quiet office'), no noise. Sampling rate: 16 kHz, resolution: 16 bit. The speech data was collected in a controlled environment (same room, same microphone distances, etc.). Each recording has an XML transcription file that also includes speaker metadata. The data is curated (manually checked and corrected) to reduce errors and artefacts. The speech data is divided into three independent data sets: Training / Test / Dev. Test and Dev contain new sentences and new speakers that are not part of the training set, in order to assess model quality in a speaker-independent open-vocabulary setting. Information about the data collection procedure: (1) Train set (recordings in 2014): Sentences were randomly chosen from German Wikipedia and the Europarl Corpus, to be read by the speakers. The Europarl corpus (Release v7) is a collection of the proceedings of the European Parliament between 1996 and 2011, generated by Philipp Koehn (Europarl: A Parallel Corpus for Statistical Machine Translation, Philipp Koehn, MT Summit 2005, http://www.statmt.org/europarl/).
As a third data set, German command and control sentences were manually specified; these are typical of a command and control setting in living rooms. (2) Test/dev set (recordings in 2015): Additional sentences from the German Wikipedia and from the Europarl Corpus were selected for the recordings. Additionally, we collected German sentences from the web by crawling the German top-level domain and applying language filtering and deduplication. Only sentences starting with quotation marks were selected and randomly sampled. The three text sources are represented with approximately equal amounts of recordings in the test/dev set.
Keyword: phonetics
URL: http://hdl.handle.net/11022/1009-0000-0007-F5CB-0
BASE
16
leomccormack/Spatial_Audio_Framework: v1.3.0 ...
BASE
17
ruby-rdf/rdf: Release 3.2.3 ...
BASE
18
leomccormack/Spatial_Audio_Framework: v1.3.0 ...
BASE
19
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale ...
BASE
20
Functional Connectivity and Speech Entrainment Speech Entrainment Improves Connectivity Between Anterior and Posterior Cortical Speech Areas in Non-fluent Aphasia ...
BASE


© 2013 - 2024 Lin|gu|is|tik