
Search in the Catalogues and Directories

Hits 81 – 100 of 109

81
Von der Form zur Bedeutung: Texte automatisch verarbeiten : proceedings of the biennial GSCL conference 2009 = From form to meaning: processing texts automatically
Chiarcos, Christian [editor]. - Tübingen : Narr, 2009
DNB Subject Category Language
82
ANNIS: a search tool for multi-layer annotated corpora
Zeldes, Amir [author]; Lüdeling, Anke [author]; Ritz, Julia [author]. - Berlin : Humboldt-Universität zu Berlin, 2009
DNB Subject Category Language
83
Von der Form zur Bedeutung: Texte automatisch verarbeiten : proceedings of the Biennial GSCL Conference 2009 = From form to meaning: processing texts automatically
Chiarcos, Christian (ed.). - Tübingen : Narr, 2009
UB Frankfurt Linguistik
84
ANNIS: a search tool for multi-layer annotated corpora ...
Zeldes, Amir; Lüdeling, Anke; Ritz, Julia. - : Humboldt-Universität zu Berlin, Philosophische Fakultät II, 2009
BASE
85
Information Structure in African Languages ...
Chiarcos, Christian; Fiedler, Ines; Grubic, Mira. - : Humboldt-Universität zu Berlin, Philosophische Fakultät II, 2009
BASE
86
Information structure in African languages: corpora and tools
Grubic, Mira; Chiarcos, Christian; Haida, Andreas. - : Association for Computational Linguistics, 2009
BASE
87
ANNIS: a search tool for multi-layer annotated corpora
Zeldes, Amir; Lüdeling, Anke; Ritz, Julia; Chiarcos, Christian. - : Humboldt-Universität zu Berlin, Philosophische Fakultät II, 2009
Abstract: ANNIS (see Dipper & Götze 2005; Chiarcos et al. 2008) is a flexible web-based corpus architecture for search and visualization of multi-layer linguistic corpora. By multi-layer we mean that the same primary datum may be annotated independently with (i) annotations of different types (spans, DAGs with labelled edges and arbitrary pointing relations between terminals or non-terminals), and (ii) annotation structures that possibly overlap and/or conflict hierarchically. In this paper we present the different features of the architecture as well as actual use cases for corpus linguistic research on such diverse areas as information structure, learner language and discourse level phenomena. The supported search functionalities of ANNIS2 include exact and regular expression matching on word forms and annotations, as well as complex relations between individual elements, such as all forms of overlapping, contained or adjacent annotation spans, hierarchical dominance (children, ancestors, left- or rightmost child etc.) and more. As an alternative to the query language, data can be accessed using a graphical query builder. Query matches are visualized depending on annotation types: annotations referring to tokens (e.g. lemma, POS, morphology) are shown immediately in the match list. Spans (covering one or more tokens) are displayed in a grid view, trees/graphs in a tree/graph view, and pointing relations (such as anaphoric links) in a discourse view, with same-colour highlighting for coreferent elements. Full Unicode support is provided and a media player is embedded for rendering audio files linked to the data, allowing for a large variety of corpora. Corpus data is annotated with automatic tools (taggers, parsers etc.) or task-specific expert tools for manual annotation, and then mapped onto the interchange format PAULA (Dipper 2005), where stand-off annotations refer to the same primary data.
Importers exist for many formats, including EXMARaLDA (Schmidt 2004), TigerXML (Brants & Plaehn 2000), MMAX2 (Müller & Strube 2006), RSTTool (O’Donnell 2000), PALinkA (Orasan 2003) and Toolbox (Stuart et al. 2007). Data is compiled into a relational DB for optimal performance. Query matches and their features can also be exported in the ARFF format and processed with the data mining tool WEKA (Witten & Frank 2005), which offers implementations of clustering and classification algorithms. ANNIS2 compares favourably with search functionalities in the above tools as well as other corpus search engines (EXAKT, http://www.exmaralda.org/exakt.html; TIGERSearch, Lezius 2002; CWB, Christ 1994) and other frameworks/architectures (NITE, Carletta et al. 2003; GATE, Cunningham 2002).
Peer Reviewed
Keyword: 400 Sprachwissenschaft; ddc:400; Linguistik
URN: urn:nbn:de:kobv:11-100174292
URL: http://edoc.hu-berlin.de/18452/14089
https://doi.org/10.18452/13437
http://www.linguistik.hu-berlin.de/institut/professuren/korpuslinguistik/mitarbeiter-innen/amir/pdf/CL2009_ANNIS_pre.pdf
BASE
88
Information Structure in African Languages
Zeldes, Amir; Fiedler, Ines; Hartmann, Katharina. - : Humboldt-Universität zu Berlin, Philosophische Fakultät II, 2009
BASE
89
Search And Visualization Of Richly Annotated Corpora With Annis2 ...
BASE
90
Stand off-Annotation für Textdokumente: Vom Konzept zur Implementierung (zur Standardisierung?)
BASE
91
Rhetorical distance revisited : a parameterized approach
In: Constraints in discourse (Amsterdam, 2008), p. 97-116
MPI für Psycholinguistik
92
Rhetorical distance revisited - a parameterized approach
In: Constraints in discourse ; [1]. / ed. by Anton Benz .... - Amsterdam [et al.] : Benjamins (2008), 97-115
BLLDB
93
An ontology of linguistic annotations
In: LDV-Forum. - Regensburg : GLDV 23 (2008) 1, 1-17
BLLDB
94
Semimanuelle Generierung und Auswertung von Alternativtexten
In: Text - Verstehen (2006)
IDS Mannheim
95
Avoiding data graveyards: From heterogeneous data collected in multiple research projects to sustainable linguistic resources
In: E-MELD Workshop on Digital Language Documentation: Tools and Standards – The State of the Art. East Lansing, Michigan (2006)
IDS Bibliografie zur Gesprächsforschung
96
Avoiding Data Graveyards: From Heterogeneous Data Collected in Multiple Research Projects to Sustainable Linguistic Resources
Schmidt, Thomas; Chiarcos, Christian; Lehmberg, Timm. - Hamburg : Uni-Hamburg, 2006
IDS Bibliografie zur Gesprächsforschung
97
E-MELD Workshop on Digital Language Documentation: Tools and Standards - The State of the Art. East Lansing, Michigan
Schmidt, Thomas; Chiarcos, Christian; Lehmberg, Timm. - : Universität Hamburg, 2006
IDS Bibliografie zur Gesprächsforschung
98
Rhetorical distance in cross-language evaluation: a parameterized approach of referential accessibility
In: Studies in contrastive linguistics. - Santiago de Compostela : Univ. de Santiago de Compostela (2006), 167-180
BLLDB
99
Semimanuelle Generierung und Auswertung von Alternativtexten
In: Text - Verstehen. - Berlin [et al.] : de Gruyter (2006), 406-411
BLLDB
100
Sprachtechnologie für die multilinguale Kommunikation : Textproduktion, Recherche, Übersetzung, Lokalisierung
Schmidt, Thomas (contributor); Rösner, Dietmar (contributor); Göbel, Tobias (contributor). - Sankt Augustin : Gardez!-Verl., 2003
BLLDB
UB Frankfurt Linguistik


© 2013 - 2024 Lin|gu|is|tik