DE eng

Search in the Catalogues and Directories

Hits 1 – 12 of 12

1
Automatic Semantic Text Tagging On Historical Lexica By Combining Ocr And Typography Classification ...
BASE
Show details
2
Automatic Semantic Text Tagging On Historical Lexica By Combining Ocr And Typography Classification ...
BASE
Show details
3
For a fistful of blogs: Discovery and comparative benchmarking of republishable German content
BASE
Show details
4
Morphologische und phonologische Repräsentationen in childLex
BASE
Show details
5
Spektrum Patholinguistik = Schwerpunktthema: Besonders behandeln? : Sprachtherapie im Rahmen primärer Störungsbilder
BASE
Show details
6
For a fistful of blogs: Discovery and comparative benchmarking of republishable German content
In: KONVENS 2014, NLP4CMC workshop ; https://hal.archives-ouvertes.fr/hal-01083750 ; KONVENS 2014, NLP4CMC workshop, Oct 2014, Hildesheim, Germany. pp.2-10 ; http://www.uni-hildesheim.de/konvens2014/ (2014)
BASE
Show details
7
Altersgruppeneffekte in childLex
BASE
Show details
8
For a fistful of blogs: Discovery and comparative benchmarking of republishable German content
Abstract: We introduce two corpora gathered on the web and related to computer-mediated communication: blog posts and blog comments. In order to build such corpora, we addressed following issues: website discovery and crawling, content extraction constraints, and text quality assessment. The blogs were manually classified as to their license and content type. Our results show that it is possible to find blogs in German under Creative Commons license, and that it is possible to perform text extraction and linguistic annotation efficiently enough to allow for a comparison with more traditional text types such as newspaper corpora and subtitles. The comparison gives insights on distributional properties of the processed web texts on token and type level. For example, quantitative analysis reveals that blog posts are close to written language, while comments are slightly closer to spoken language.
Keyword: Computerunterstützte Kommunikation; ddc:400; Korpus
URL: https://hildok.bsz-bw.de/frontdoor/index/index/docId/273
https://hildok.bsz-bw.de/files/273/01_01.pdf
https://nbn-resolving.org/urn:nbn:de:gbv:hil2-opus-2902
BASE
Hide details
9
Analysemethode und Datengrundlage können die Ergebnisse beeinflussen
BASE
Show details
10
Spektrum Patholinguistik = Schwerpunktthema: Von der Programmierung zur Artikulation : Sprechapraxie bei Kindern und Erwachsenen
Aichert, Ingrid (Dr.); Staiger, Anja (Dr.); Schulte-Mäter, Anne (Dr.). - 2010
BASE
Show details
11
Finite-state methods and natural language processing : 6th International Workshop, FSMNLP 2007 Potsdam, Germany, september 14 - 16 ; revised papers
BASE
Show details
12dlexDB - lexikalische Datenbank für die psychologische und linguistische Forschung
http://www.dlexdb.de/
Topic: Computational linguistics; Corpus linguistics; Lexicology / Etymology; ...
Language: German, Standard
Forschungstyp: Research projects
Access: additional functions after registration

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
1
0
1
0
Open access documents
11
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern