1 |
Supporting accessibility and reproducibility in language research in the Alveo virtual laboratory
|
|
|
|
BASE
|
|
|
|
2 |
Case study : the AusTalk corpus
|
|
|
|
Abstract:
This chapter presents details of the Annotation Task of the Big Australian Speech Corpus (Big ASC) project, in which AusTalk, a large audio-visual corpus of Australian English, was collected. We describe the scope of the task and its implementation and give an overview of the results so far. When complete, AusTalk will consist of 3 h of audio-visual recording from each of 1000 speakers of Australian English, across a wide range of tasks including scripted (read) speech, spontaneous speech and dialogue. The read speech of 100 participants has now been manually annotated, but a challenge of the project was to produce transcriptions for the unscripted (spontaneous) speech data. We report on several avenues that have been explored for automating this task. We describe the annotation challenges, the processes that were adopted and the limitations of automated transcription.
|
|
Keyword:
corpora (linguistics); English language; linguistic analysis (linguistics)
|
|
URL: http://handle.westernsydney.edu.au:8081/1959.7/uws:44707 https://doi.org/10.1007/978-94-024-0881-2_49
|
|
BASE
|
|
|
|
3 |
The Alveo Virtual Lab : working with an API for linguistic data (Invited Tutorial)
|
|
|
|
BASE
|
|
|
|
5 |
Two platforms for research in human communication science : the AusTalk Corpus and the Alveo Virtual Laboratory
|
|
|
|
BASE
|
|
|
|
6 |
The Human Communication Science Virtual Lab : a repository microclimate in a rapidly evolving research-ecosystem
|
|
|
|
BASE
|
|
|
|
8 |
The Australian National Corpus : national infrastructure for language resources
|
|
|
|
BASE
|
|
|
|
9 |
The Australian National Corpus: National Infrastructure for Language Resources
|
|
|
|
BASE
|
|
|
|
10 |
The Australian National Corpus: National Infrastructure for Language Resources
|
|
|
|
BASE
|
|
|
|
11 |
Building an audio-visual corpus of Australian English : large corpus collection with an economical portable and replicable Black Box
|
|
|
|
BASE
|
|
|
|
12 |
Updating the ICE annotation system : tagging, parsing and validation
|
|
|
|
BASE
|
|
|
|
13 |
Building an audio-visual corpus of Australian English : large corpus collection with an economical portable and replicable Black Box
|
|
|
|
BASE
|
|
|
|
15 |
Ingesting the Auslan corpus into the DADA annotation store
|
|
Cassidy, Steve; Johnston, Trevor. - Stroudsburg, PA : The Association for Computational Linguistics and The Asian Federation of Natural Language Processing, 2009
|
|
BASE
|
|
|
|
16 |
A Blueprint for a comprehensive Australian English auditory-visual speech corpus
|
|
|
|
BASE
|
|
|
|
17 |
A blueprint for a comprehensive Australian English auditory-visual speech corpus
|
|
|
|
BASE
|
|
|
|
19 |
AnswerFinder at QAst 2007 : named entity recognition for QA on speech transcripts
|
|
|
|
BASE
|
|
|
|
20 |
Named entity recognition in question answering of speech data
|
|
|
|
BASE
|
|
|
|
|
|