DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 34

1
A fine-grained recognition of Named Entities in ELTeC collection using cascades
In: Final Action Event of COST Action Distant Reading for European Literary History ; https://hal.archives-ouvertes.fr/hal-03615219 ; Final Action Event of COST Action Distant Reading for European Literary History, Christof Schöch, Apr 2022, Krakow, Poland ; https://www.distant-reading.net/events/conference-programme/ (2022)
Abstract: International audience ; In the scope of the COST action “Distant Reading for European Literary History” (Schöch et al. 2021; Patras et al. 2021) the working group 2 (WG2) responsible for methods and tools suggested a set of seven named entity (NE) categories to be used for annotating novels (the so-called “level-2” text version). Tags to be used for this set are: PERS, LOC, ORG, WORK, EVENT, ROLE, DEMO (Frontini et. al 2020; Šandrih Todorović et al. 2021). The level-2 version of Serbian novels was produced using this set of categories and tags (Krstev et al. 2019).For Serbian and French the fine-grained named entity recognition systems were developed based on exhaustive lexicons of corresponding languages and rules implemented in the form of cascades of finite-state automata (Maurel and Friburger 2014; Krstev et al. 2014). These systems were developed using the open-source corpus processing suite Unitex/GramLab and its module CasSys. Both systems recognize and tag a rich set of NE categories and subcategories and allow entity embedding; moreover, the French system recognizes NEs that correspond to TEI guidelines, chapter 13 (TEI P5). An example that illustrates this in Frenchis (Marquis de la Lande factories): usines de laLande Similarly, in Serbian (Queen Elizabeth of Hungary): kraljice Ugarske Elizabete Moreover, both systems recognize beside broad categories suggested by WG2 the other categories such as temporal or measurement expressions.In both Serbian and French systems, the recognition module is separated from the annotation module, which enables production of output as needed. In this paper we will illustrate this on a few Serbian and French novels from ELTeC corpus chosen to match in respect to corpus balance criteria, namely author’s gender, novel’s size, year of first publication. The novels will be annotated with the simplified tags needed for level-2 text format, and with more elaborate TEI compliant tags that reflect all nuances of recognized NEs.Two output formats for Serbian and French novels will be uploaded into TXM corpus processing systems which will enable both quantitative and qualitative analysis (Krstev et al., 2019). Besides statistical analysis of annotated NER, we will perform contrastive analysis of Serbian and French NEs and for both languages between fine-grained and simplified versions of annotation. The qualitative analysis will reveal interesting examples of annotation, open issues and hard cases. Textometrie analysis in TXM will be illustrated for both fine-grained and simplified versions of annotated samples.Finally, we will go back to the research questions that were posed by Action’s working group 3 (literary theory and history) when the Action started. Namely the first idea and wish of the WG3 was to produce fine grained annotations that will allow, for instance, distinction between cities and villages, different person’s roles (professions, family relations, etc.), person’s gender, types of locations (continent, country, region, city, village, mountain, waterbody, astronym), etc. After the analysis of availability of NER tools, the fine-grained approach was substituted with a much simpler schema. With this research we would like to reopen these questions and establish whether it is possible to meet the need for more detailed literary analysis based on Named Entities.
Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Digital humanities; Distant Reading for European Literary History; Named entities recognition; Unitex
URL: https://hal.archives-ouvertes.fr/hal-03615219
BASE
Hide details
2
The Impact of Game Elements on Learner Motivation: Influence of Initial Motivation and Player Profile
In: EISSN: 1939-1382 ; IEEE Transactions on Learning Technologies ; https://hal.univ-lyon2.fr/hal-03579428 ; IEEE Transactions on Learning Technologies, Institute of Electrical and Electronics Engineers, In press, ⟨10.1109/TLT.2022.3153239⟩ (2022)
BASE
Show details
3
Bislama: An Introduction to the National Language of Vanuatu
Tryon, Darrell T.. - : The Australian National University, 2022
BASE
Show details
4
Förderung des Bildungsspracherwerbs bei heterogenen sprachlichen Voraussetzungen im Unterricht mit digitalen Medien ...
Pöschl, Sonja-Hella. - : Verlag Julius Klinkhardt, 2022
BASE
Show details
5
兒童華語教材中的飲食文化分析:以《學華語向前走》為例 ; Analysis of Food Culture of Chinese Teaching Material for Children — With Reference to “Let’s Learn Chinese”
BASE
Show details
6
The Croatian web dictionary Mrežnik (A-F) 1.0
Hudeček, Lana; Mihaljević, Milica; Blagus Bartolec, Goranka. - : Institute for Croatian Language and Linguistics, 2022
BASE
Show details
7
Förderung des Bildungsspracherwerbs bei heterogenen sprachlichen Voraussetzungen im Unterricht mit digitalen Medien
In: Haider, Michael [Hrsg.]; Schmeinck, Daniela [Hrsg.]: Digitalisierung in der Grundschule. Grundlagen, Gelingensbedingungen und didaktische Konzeptionen am Beispiel des Fachs Sachunterricht. Bad Heilbrunn : Verlag Julius Klinkhardt 2022, S. 124-139 (2022)
BASE
Show details
8
学習を認知・情意・精神運動の領域で捉える提案 : より見通しのきく日本語教育Can-do記述に向けて
鈴木 美加; Mika SUZUKI. - : 国立国語研究所, 2022
BASE
Show details
9
Beginning Spanish ¡Empecemos por aquí!
In: PDXOpen: Open Educational Resources (2022)
BASE
Show details
10
Sentence Level Embedding Detoxification via Toxic Component Removal ...
: University of Virginia, 2022
BASE
Show details
11
Attention-Language Interface in Multilingual Assessment Instrument for Narratives (MAIN) ...
Sekerina, Irina. - : Open Science Framework, 2022
BASE
Show details
12
Investigating secondary mathematics teachers’ analogies to function
In: Research outputs 2022 to 2026 (2022)
BASE
Show details
13
Is a Wizard-of-Oz Required for Robot-Led Conversation Practice in a Second Language?
Águas Lopes, José David; Cumbal, Ronald; Engwall, Olov. - : KTH, Tal-kommunikation, 2022. : KTH, Tal, musik och hörsel, TMH, 2022. : Springer Nature, 2022
BASE
Show details
14
The Big Five Personality Traits and Positive Orientation in Polish Adults with Multiple Sclerosis: The Role of Meaning in Life
In: International Journal of Environmental Research and Public Health; Volume 19; Issue 9; Pages: 5426 (2022)
BASE
Show details
15
Deep Learning-Based End-to-End Language Development Screening for Children Using Linguistic Knowledge
In: Applied Sciences; Volume 12; Issue 9; Pages: 4651 (2022)
BASE
Show details
16
Assessment of the Readability and Quality of Online Patient Education Material for Chronic Medical Conditions
In: Healthcare; Volume 10; Issue 2; Pages: 234 (2022)
BASE
Show details
17
It's a Two-way Street: Informing Irish Pre-sessional EAP Programs with a Needs Analysis of Irish Higher Education
Garska, Jessica Nicole. - : Trinity College Dublin. School of Linguistic Speech & Comm Sci. C.L.C.S., 2022
BASE
Show details
18
The BioVisualSpeech corpus of words with sibilants for speech therapy games development
BASE
Show details
19
Towards the new construct of academic English in the digital age
Khabbazbashi, Nahal; Chan, Sathena Hiu Chong; Clark, Tony. - : Oxford University Press, 2022
BASE
Show details
20
A Visual Decision-Support System using Fingerprint Matrices applied to Cyclical Spatio-Temporal Data from Motorsports
BASE
Show details

Page: 1 2

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
34
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern