41 |
Curation Technologies for a Cultural Heritage Archive: "Project Tongilbu" ...
|
|
|
|
BASE
|
|
Show details
|
|
42 |
Curation Technologies for a Cultural Heritage Archive: "Project Tongilbu" ...
|
|
|
|
Abstract:
We are developing a platform for generic curation technologies, using various NLP procedures, that is specifically targeted at, but not limited to, document collections that are too large for humans to (manually) read and go through. The aim then is to provide prototypical NLP tools like NER, Entity Linking, clustering and summarization in order to support rapid exploration of a data set. In this particular submission, the data set in question is the result of "Project Tongilbu”, a report funded by the Korean Ministry of Re-unification, on the unification of East- and West-Germany in the 1990’s. The majority of the content in this data set is in German, with small parts in Korean. With the collection being a set of PDF files, we first apply OCR to extract machine-readable text. Focusing on German, we then apply an NER model trained on Wikipedia data, retrieve URIs of recognized entities in the GND (Gemeinsame Normdatei, a German database of entities with additional information), perform temporal analysis and ...
|
|
Keyword:
Korean; NLP
|
|
URL: https://zenodo.org/record/3404255 https://dx.doi.org/10.5281/zenodo.3404255
|
|
BASE
|
|
Hide details
|
|
43 |
Correspondence between the Korean and Mandarin Chinese pronunciations of Chinese characters: A comparison at the sub-syllabic level
|
|
|
|
BASE
|
|
Show details
|
|
44 |
Liquids and Loanwords: The Variant Behavior of the Korean /l/
|
|
|
|
BASE
|
|
Show details
|
|
45 |
Interpretation and processing of overt pronouns in Korean, English and L2-acquisition
|
|
|
|
BASE
|
|
Show details
|
|
46 |
COMPARATIVE STUDY ON THE HISTORICAL DEVELOPMENT OF THE GENITIVE CASE MARKERS IN KOREAN AND JAPANESE
|
|
|
|
BASE
|
|
Show details
|
|
47 |
A CROSS-LINGUISTIC AND CROSS-CULTURAL STUDY OF STANCE MARKERS IN RESEARCH ARTICLES IN ENGLISH AND KOREAN
|
|
|
|
BASE
|
|
Show details
|
|
48 |
Effects of Web-based Auditory Training on the Perception of Korean Sounds by Mandarin Learners of Korean
|
|
|
|
BASE
|
|
Show details
|
|
50 |
Parental attitudes towards heritage language resources in the Australian Korean community
|
|
|
|
BASE
|
|
Show details
|
|
51 |
Research on heritage language development and maintenance of Korean
|
|
|
|
BASE
|
|
Show details
|
|
52 |
Characteristic features of Korean heritage learner grammatical errors
|
|
|
|
BASE
|
|
Show details
|
|
53 |
Korean heritage children's language use and maintenance
|
|
Shin, S-C. - : Sotong Publishing, 2019. : Seoul, Korea, 2019
|
|
BASE
|
|
Show details
|
|
54 |
Characteristic features of Korean heritage learner lexical errors
|
|
|
|
BASE
|
|
Show details
|
|
55 |
Characteristic features of Korean heritage learner orthographic errors
|
|
|
|
BASE
|
|
Show details
|
|
56 |
Agency at Play: Impoliteness and Korean Language in Online Interactions
|
|
|
|
BASE
|
|
Show details
|
|
57 |
Definite article bridging relations in L2: A learner corpus study
|
|
|
|
BASE
|
|
Show details
|
|
58 |
The morphosyntax of clause typing: single, double, periphrastic, and multifunctional complementizers in Korean
|
|
|
|
BASE
|
|
Show details
|
|
59 |
Korean Orthography of Loanwords and spelling problems with proper nouns from Slovenia
|
|
|
|
In: Acta Linguistica Asiatica, Vol 9, Iss 2 (2019) (2019)
|
|
BASE
|
|
Show details
|
|
60 |
Allocutive agreement in Korean under cyclic Agree
|
|
|
|
In: Proceedings of the Linguistic Society of America; Vol 4 (2019): Proceedings of the Linguistic Society of America; 56:1–15 ; 2473-8689 (2019)
|
|
BASE
|
|
Show details
|
|
|
|