41 |
Data-driven identification of German phrasal compounds
|
|
|
|
In: Enthalten in: TSD (20. : 2017 : Prag): Text, speech, and dialogue (2017)
|
|
IDS Mannheim
|
|
Show details
|
|
43 |
Data-Driven Identification of German Phrasal Compounds
|
|
|
|
In: Text, Speech, and Dialogue ; https://hal.archives-ouvertes.fr/hal-01575651 ; Kamil Ekštein; Václav Matoušek. Text, Speech, and Dialogue, 10415, Springer International Publishing, pp.192-200, 2017, Lecture Notes in Computer Science, 978-3-319-64205-5. ⟨10.1007/978-3-319-64206-2_22⟩ ; https://link.springer.com/bookseries/558 (2017)
|
|
BASE
|
|
Show details
|
|
44 |
Die Korpusplattform des „Digitalen Wörterbuchs der deutschen Sprache“ (DWDS)
|
|
|
|
In: ISSN: 0301-3294 ; EISSN: 1613-0626 ; Zeitschrift für Germanistische Linguistik ; https://hal.archives-ouvertes.fr/hal-01575661 ; Zeitschrift für Germanistische Linguistik, De Gruyter, 2017, Zeitschrift für Germanistische Linguistik, 45 (2), pp.327-344. ⟨10.1515/zgl-2017-0017⟩ ; https://www.degruyter.com/view/j/zfgl.2017.45.issue-2/zgl-2017-0017/zgl-2017-0017.xml (2017)
|
|
BASE
|
|
Show details
|
|
45 |
Putting Der Brenner on the map
|
|
|
|
In: Corpus Linguistics and Literature Workshop, 43rd Austrian Linguistics Conference ; https://hal.archives-ouvertes.fr/hal-01951848 ; Corpus Linguistics and Literature Workshop, 43rd Austrian Linguistics Conference, Dec 2017, Klagenfurt, Austria. ⟨10.1553/Brenner_map⟩ ; https://epub.oeaw.ac.at/?arp=0x003a1086 (2017)
|
|
BASE
|
|
Show details
|
|
46 |
Toponyms as Entry Points into a Digital Edition: Mapping Die Fackel (1899-1936)
|
|
|
|
In: Digital Humanities ; https://hal.archives-ouvertes.fr/hal-01591628 ; Digital Humanities, Aug 2017, Montréal, Canada. pp.159-161 ; https://dh2017.adho.org/ (2017)
|
|
BASE
|
|
Show details
|
|
47 |
Discriminating between Similar Languages using Weighted Subword Features
|
|
|
|
In: Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2017) ; https://hal.archives-ouvertes.fr/hal-01575656 ; Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2017), Association for Computational Linguistics (ACL), Apr 2017, Valence, Spain. pp.184-189, ⟨10.18653/v1/W17-1223⟩ ; http://ttg.uni-saarland.de/vardial2017/ (2017)
|
|
BASE
|
|
Show details
|
|
48 |
Towards a toolbox to map historical text collections
|
|
|
|
In: 11th Workshop on Geographic Information Retrieval (GIR'17) ; https://hal.archives-ouvertes.fr/hal-01654526 ; 11th Workshop on Geographic Information Retrieval (GIR'17), Nov 2017, Heidelberg, Germany. ⟨10.1145/3155902.3155905⟩ (2017)
|
|
BASE
|
|
Show details
|
|
53 |
Bootstrapped OCR error detection for a less-resourced language variant
|
|
|
|
In: Proceedings of the 13th Conference on Natural Language Processing (KONVENS 2016) ; 13th Conference on Natural Language Processing (KONVENS 2016) ; https://hal.archives-ouvertes.fr/hal-01371689 ; 13th Conference on Natural Language Processing (KONVENS 2016), Sep 2016, Bochum, Germany. pp.21-26 ; https://www.linguistics.ruhr-uni-bochum.de/konvens16/ (2016)
|
|
BASE
|
|
Show details
|
|
54 |
An Unsupervised Morphological Criterion for Discriminating Similar Languages
|
|
|
|
In: 3rd Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2016) ; https://hal.archives-ouvertes.fr/hal-01575653 ; 3rd Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2016), Dec 2016, Osaka, Japan. pp.212-220 ; http://ttg.uni-saarland.de/vardial2016/ (2016)
|
|
BASE
|
|
Show details
|
|
55 |
Visualisierung von Ortsnamen im Deutschen Textarchiv
|
|
|
|
In: DHd 2016 ; https://halshs.archives-ouvertes.fr/halshs-01287931 ; DHd 2016, Mar 2016, Leipzig, Germany. pp.264-267 ; http://dhd2016.de/ (2016)
|
|
BASE
|
|
Show details
|
|
56 |
APIs in Digital Humanities: The Infrastructural Turn
|
|
|
|
In: Digital Humanities 2016 ; https://hal.archives-ouvertes.fr/hal-01348706 ; Digital Humanities 2016, Jul 2016, Cracovie, Poland. pp.93-96 ; http://dh2016.adho.org/ (2016)
|
|
BASE
|
|
Show details
|
|
57 |
Collection and Indexing of Tweets with a Geographical Focus
|
|
|
|
In: Tenth International Conference on Language Resources and Evaluation (LREC 2016) ; https://hal.archives-ouvertes.fr/hal-01323274 ; Tenth International Conference on Language Resources and Evaluation (LREC 2016), May 2016, Portorož, Slovenia. pp.24-27 (2016)
|
|
Abstract:
International audience ; This paper introduces a Twitter corpus currently focused geographically in order to (1) test selection and collection processes for a given region and (2) find a suitable database to query, filter, and visualize the tweets. Due to access restrictions, it is not possible to retrieve all available tweets, which is why corpus construction implies a series of decisions described below. The corpus focuses on Austrian users, as data collection grounds on a two-tier detection process addressing corpus construction and user location issues. The emphasis lies on short messages whose sender mentions a place in Austria as his/her hometown or tweets from places located in Austria. The resulting user base is then queried and enlarged using focused crawling and random sampling, so that the corpus is refined and completed in the way of a monitor corpus. Its current volume is 21.7 million tweets from approximately 125,000 users. The tweets are indexed using Elasticsearch and queried via the Kibana frontend, which allows for queries on metadata as well as for the visualization of geolocalized tweets (currently about 3.3% of the collection).
|
|
Keyword:
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-WB]Computer Science [cs]/Web; [SHS.LANGUE]Humanities and Social Sciences/Linguistics; Computer-Mediated Communication; Database Solutions; Visualization; Web Corpus Construction
|
|
URL: https://hal.archives-ouvertes.fr/hal-01323274v3/document https://hal.archives-ouvertes.fr/hal-01323274 https://hal.archives-ouvertes.fr/hal-01323274v3/file/Barbaresi_CMLC2016_Twitter_archive.pdf
|
|
BASE
|
|
Hide details
|
|
58 |
Extraction and Visualization of Toponyms in Diachronic Text Corpora
|
|
|
|
In: Digital Humanities 2016 ; https://hal.archives-ouvertes.fr/hal-01348696 ; Digital Humanities 2016, Jul 2016, Cracovie, Poland. pp.732-734 ; http://dh2016.adho.org/ (2016)
|
|
BASE
|
|
Show details
|
|
59 |
Efficient construction of metadata-enhanced web corpora
|
|
|
|
In: Proceedings of the 10th Web as Corpus Workshop ; 10th Web as Corpus Workshop ; https://hal.archives-ouvertes.fr/hal-01371704 ; 10th Web as Corpus Workshop, Association for Computational Linguistics (ACL SIGWAC), Aug 2016, Berlin, Germany. pp.7-16, ⟨10.18653/v1/W16-2602⟩ (2016)
|
|
BASE
|
|
Show details
|
|
60 |
Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database
|
|
|
|
In: Graën, Johannes; Clematide, Simon; Volk, Martin (2016). Efficient Exploration of Translation Variants in Large Multiparallel Corpora Using a Relational Database. In: 4th Workshop on the Challenges in the Management of Large Corpora, Portorož, 28 May 2016 - 28 May 2016, 20-23. (2016)
|
|
BASE
|
|
Show details
|
|
|
|