DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5 6 7...9
Hits 41 – 60 of 168

41
A scalable method for preserving oral literature from small languages
BIRD, STEVEN. - : Springer, 2010
BASE
Show details
42
The Big Australian Speech Corpus (the Big ASC)
Wagner, Michael; Tran, Dat; Togneri, Roberto. - : Canberra, A.C.T., ASSTA, 2010
BASE
Show details
43
Natural language processing with Python : [analyzing text with the natural language toolkit]
Bird, Steven; Klein, Ewan; Loper, Edward. - Beijing : O'Reilly, 2009
MPI für Psycholinguistik
Show details
44
Querying linguistic trees
In: Journal of logic, language and information. - Dordrecht [u.a.] : Kluwer 19 (2009) 1, 53-73
BLLDB
OLC Linguistik
Show details
45
Natural language processing with Python
Loper, Edward; Klein, Ewan; Bird, Steven. - Beijing [u.a.] : O'Reilly, 2009
BLLDB
UB Frankfurt Linguistik
Show details
46
Phonology
In: The Oxford handbook of computational linguistics (New York, 2009), p. 3-24
MPI für Psycholinguistik
Show details
47
ACL Anthology Reference Corpus
Kan, Min-Yen; Bird, Steven. - : Linguistic Data Consortium, 2009. : https://www.ldc.upenn.edu, 2009
BASE
Show details
48
ACL Anthology Reference Corpus ...
Kan, Min-Yen; Bird, Steven. - : Linguistic Data Consortium, 2009
BASE
Show details
49
A Blueprint for a comprehensive Australian English auditory-visual speech corpus
Burnham, Denis; Ambikairajah, Eliathamby; Cutler, Anne. - : Somerville, MA : Cascadilla Proceedings Project, 2009
BASE
Show details
50
A blueprint for a comprehensive Australian English auditory-visual speech corpus
Ishihara, Shunichi; Fletcher, Janet Mary; Kemp, Nenagh. - : Cascadilla Proceedings Project, 2009
BASE
Show details
51
Curating lexical databases for minority languages
BASE
Show details
52
OLAC: Accessing the world's language resources
BASE
Show details
53
OLAC: accessing the world's language resources
BASE
Show details
54
A blueprint for a comprehensive Australian English auditory-visual speech corpus
Tran, D.; Fletcher, J.M.; Goecke, R.. - : Cascadilla Press, 2009
BASE
Show details
55
OLAC: Accessing the world's language resources
BASE
Show details
56
Curating lexical databases for minority languages
Abstract: One of the biggest challenges in compiling a dictionary of a minority language is managing the large quantity of lexical data. Decisions about the format and content of the dictionary or the orthography typically evolve over the years that such projects usually take. This results in inconsistencies between older and newer entries. Revising the data for publication as a dictionary introduces further inconsistencies as does having multiple contributors and/or editors. Proofreading a lexical database takes a great deal of time and the richer its structure the more this is the case. The tools described in this presentation significantly reduce this effort. Tools developed for checking the consistency of the lexical database in the Iu Mien—Chinese—English dictionary project have proven extremely helpful. Two basic approaches are used: 1) use of a program written to check for likely errors that scans the lexical database and produces an error report that is used by a lexicographer to make appropriate corrections. 2) outputting the lexical data in alternate forms that make it easier for the lexicographer to spot problem areas. These alternative forms include the reverse indexes and views structured according to semantic domains. The Iu Mien—Chinese—English dictionary project, like many minority language dictionary projects, uses SIL's Toolbox software. It is very flexible software but its capabilities to enforce consistency are quite limited. Some parts of the approach described here are specific to MDF (Multi-Dictionary Formatter) lexical databases in Toolbox but will be equally useful for other MDF databases. Other parts are specific to each of the three languages involved but will be useful for non-Toolbox lexical databases. Every dictionary is unique and this applies not only to content of the entries but also the decisions about how entries should be arranged to suit the languages involved. Other decisions about the structure are likely to be made differently even in other dictionaries of the same languages. It is the way that each dictionary combines themes that are found in many dictionaries that makes them unique, e.g. to be root based or not, to have include subentries. Therefore our approach is to use a toolkit based approach to curating lexical databases. This allows checking techniques to be mixed and matched to suit the unique aspects of a lexical project. The checking software is written in Python and relies on the toolbox module in NLTK (The Natural Language Toolkit http://nltk.sourceforge.net).
URL: http://hdl.handle.net/10125/5094
BASE
Hide details
57
A blueprint for a comprehensive Australian English auditory-visual speech corpus
Burnham, Denis K. (R7357); Ambikairajah, Eliathamby; Arciuli, Joanne. - : U.S.A, Cascadilla Proceedings Project, 2009
BASE
Show details
58
The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics
Bird, Steven; Dale, Robert; Dorr, Bonnie J. - : Paris : European Language Resources Association (ELRA), 2008
BASE
Show details
59
Toward a global infrastructure for the sustainability of language resources
Simons, Gary F.; Bird, Steven. - : De La Salle University, 2008
BASE
Show details
60
Defining a core body of knowledge for the introductory computational linguistics curriculum
BIRD, STEVEN. - 2008
BASE
Show details

Page: 1 2 3 4 5 6 7...9

Catalogues
3
7
12
0
1
2
0
Bibliographies
27
0
1
1
0
0
0
0
3
Linked Open Data catalogues
0
Online resources
4
0
1
0
Open access documents
114
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern