Home Catalogue search

eng

Refine your search:
- Keyword
- Creator / Publisher
- Year:
  - 2022 (1)
  - 2021 (2)
  - 2018 (2)
  - 2017 (3)
  - 2016 (4)
  - 2015 (8)
  - 2014 (5)
  - 2013 (6)
  - 2012 (3)
  - 2011 (7)
  - more
- Medium:
  - Online (69)
  - Print (32)
- Type
- BLLDB-Access:
  - free (101)
  - subject to license (0)

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4 5 6

Hits 1 – 20 of 101

1	A fine-grained recognition of Named Entities in ELTeC collection using cascades
	Krstev, Cvetana; Maurel, Denis; Stanković, Ranka
	In: Final Action Event of COST Action Distant Reading for European Literary History ; https://hal.archives-ouvertes.fr/hal-03615219 ; Final Action Event of COST Action Distant Reading for European Literary History, Christof Schöch, Apr 2022, Krakow, Poland ; https://www.distant-reading.net/events/conference-programme/ (2022)
	Abstract: International audience ; In the scope of the COST action “Distant Reading for European Literary History” (Schöch et al. 2021; Patras et al. 2021) the working group 2 (WG2) responsible for methods and tools suggested a set of seven named entity (NE) categories to be used for annotating novels (the so-called “level-2” text version). Tags to be used for this set are: PERS, LOC, ORG, WORK, EVENT, ROLE, DEMO (Frontini et. al 2020; Šandrih Todorović et al. 2021). The level-2 version of Serbian novels was produced using this set of categories and tags (Krstev et al. 2019).For Serbian and French the fine-grained named entity recognition systems were developed based on exhaustive lexicons of corresponding languages and rules implemented in the form of cascades of finite-state automata (Maurel and Friburger 2014; Krstev et al. 2014). These systems were developed using the open-source corpus processing suite Unitex/GramLab and its module CasSys. Both systems recognize and tag a rich set of NE categories and subcategories and allow entity embedding; moreover, the French system recognizes NEs that correspond to TEI guidelines, chapter 13 (TEI P5). An example that illustrates this in Frenchis (Marquis de la Lande factories): usines de laLande Similarly, in Serbian (Queen Elizabeth of Hungary): kraljice Ugarske Elizabete Moreover, both systems recognize beside broad categories suggested by WG2 the other categories such as temporal or measurement expressions.In both Serbian and French systems, the recognition module is separated from the annotation module, which enables production of output as needed. In this paper we will illustrate this on a few Serbian and French novels from ELTeC corpus chosen to match in respect to corpus balance criteria, namely author’s gender, novel’s size, year of first publication. The novels will be annotated with the simplified tags needed for level-2 text format, and with more elaborate TEI compliant tags that reflect all nuances of recognized NEs.Two output formats for Serbian and French novels will be uploaded into TXM corpus processing systems which will enable both quantitative and qualitative analysis (Krstev et al., 2019). Besides statistical analysis of annotated NER, we will perform contrastive analysis of Serbian and French NEs and for both languages between fine-grained and simplified versions of annotation. The qualitative analysis will reveal interesting examples of annotation, open issues and hard cases. Textometrie analysis in TXM will be illustrated for both fine-grained and simplified versions of annotated samples.Finally, we will go back to the research questions that were posed by Action’s working group 3 (literary theory and history) when the Action started. Namely the first idea and wish of the WG3 was to produce fine grained annotations that will allow, for instance, distinction between cities and villages, different person’s roles (professions, family relations, etc.), person’s gender, types of locations (continent, country, region, city, village, mountain, waterbody, astronym), etc. After the analysis of availability of NER tools, the fine-grained approach was substituted with a much simpler schema. With this research we would like to reopen these questions and establish whether it is possible to meet the need for more detailed literary analysis based on Named Entities.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; Digital humanities; Distant Reading for European Literary History; Named entities recognition; Unitex
	URL: https://hal.archives-ouvertes.fr/hal-03615219
	BASE
	Hide details

2	UNITEX 3.3 User Manual
	Paumier, Sébastien; Flury, Wolfgang; Guenthner, Franz...
	In: https://hal.archives-ouvertes.fr/hal-03589580 ; 2021 (2021)
	BASE
	Show details

3	UNITEX 3.3 Manuel d'utilisation
	Paumier, Sébastien; Guenthner, Franz; Laporte, Eric...
	In: https://hal.archives-ouvertes.fr/hal-03589598 ; 2021 (2021)
	BASE
	Show details

4	Mise en ligne de la cascade CasEN de recherche d’entités nommées en français (1.1), version TEI
	Maurel, Denis; Friburger, Nathalie
	In: https://hal.archives-ouvertes.fr/hal-01764756 ; 2018, http://tln.li.univ-tours.fr/Tln_CasEN.html (2018)
	BASE
	Show details

5	Serbian Language Integration in Prolexbase Multilingual Dictionary
	Krstev, Cvetana; Maurel, Denis; Vitas, Dusko
	In: ISSN: 1450-9687 ; EISSN: 2217-9461 ; INFOtheca : Journal of Information and Library Science ; https://hal.archives-ouvertes.fr/hal-02050875 ; INFOtheca : Journal of Information and Library Science, Serbian Academic Library Association, 2018, 18 (2), pp.29-52 ; https://infoteka.bg.ac.rs/ojs/index.php/Infoteka/article/view/2018.18.2.2_en/143 (2018)
	BASE
	Show details

6	ASRextractor: A Tool extracting Semantic Relations between Arabic Named Entities
	Ben Mesmia, Fatma; Zid, Fatma; Haddar, Kais...
	In: 3rd International Conference on Arabic Computational Linguistics (ACLing 2017) ; https://hal-univ-tours.archives-ouvertes.fr/hal-01632858 ; 3rd International Conference on Arabic Computational Linguistics (ACLing 2017), Nov 2017, Dubai, United Arab Emirates ; http://acling2017.org/ (2017)
	BASE
	Show details

7	CasANER: Arabic Named Entity Recognition Tool
	Ben Mesmia, Fatma; Haddar, Kais; Friburger, Nathalie...
	In: Intelligent Natural Language Processing: Trends and Applications ; https://hal.archives-ouvertes.fr/hal-01643271 ; Shaalan K., Hassanien A., Tolba F. Intelligent Natural Language Processing: Trends and Applications, Springer, Cham, pp.173-198, 2017, Studies in Computational Intelligence, 978-3-319-67055-3 (2017)
	BASE
	Show details

8	Extraction of Semantic Relation between Arabic Named Entities Using Different Kinds of Transducer Cascades
	Ben Mesmia, Fatma; Kaouther, Bouabidi; Friburger, Nathalie...
	In: 18th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2017) ; https://hal.archives-ouvertes.fr/hal-01491290 ; 18th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2017), Apr 2017, Budapest, Hungary ; http://www.cicling.org/2017/ (2017)
	BASE
	Show details

9	A Semi-automatic and Low Cost Approach to Build Scalable Lemma-based Lexical Resources for Arabic Verbs
	Doumi, Noureddine; Lehireche, Ahmed; Maurel, Denis...
	In: International Journal of Information Technology and Computer Science(IJITCS) ; https://hal.archives-ouvertes.fr/hal-01270974 ; International Journal of Information Technology and Computer Science(IJITCS), IJIT, 2016, 8 (2), pp.1-13 ; http://www.mecs-press.org/ijitcs/ijitcs-v8-n2/IJITCS-V8-N2-1.pdf (2016)
	BASE
	Show details

10	Covering various Needs in Temporal Annotation: a Proposal of Extension of ISO TimeML that Preserves Upward Compatibility
	Lefeuvre-Halftermeyer, Anaïs; Antoine, Jean-Yves; Couillault, Alain...
	In: LREC 2016 proceedings ; LREC 2016 ; https://halshs.archives-ouvertes.fr/halshs-01318792 ; LREC 2016, May 2016, Portorož Slovenia (2016)
	BASE
	Show details

11	Estimate the notoriety of a Proper name using Wikipedia ; Estimer la notoriété d’un nom propre via Wikipedia
	Elashter, Mouna; Maurel, Denis
	In: TALN 2016 ; https://hal.archives-ouvertes.fr/hal-01455571 ; TALN 2016, Jul 2016, Paris, France ; https://jep-taln2016.limsi.fr/actes/Actes%20JTR-2016/V02-TALN.pdf (2016)
	BASE
	Show details

12	Recognition and TEI annotation of Arabic Events Using Transducers
	Ben Mesmia, Fatma; Friburger, Nathalie; Haddar, Kais...
	In: 17th International Conference on Intelligent Text Processing and Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-01291336 ; 17th International Conference on Intelligent Text Processing and Computational Linguistics, Apr 2016, Konya, Turkey (2016)
	BASE
	Show details

13	Using finite-state transducers to build lexical resources for Unitex Arabic package
	Doumi, Noureddine; Lehireche, Ahmed; Maurel, Denis...
	In: CEC-TAL ; https://hal.archives-ouvertes.fr/hal-01177017 ; CEC-TAL, Mar 2015, Sousse, Tunisia. pp.90-100 (2015)
	BASE
	Show details

14	Mise en ligne de ProLMF, version 2.2, au format XML LMF (norme ISO-24613:2008) de la base de données Prolexbase sur le site du CNRTL
	Maurel, Denis; Bouchou Markhoff, Béatrice; Savary, Agata
	In: https://hal.archives-ouvertes.fr/hal-01229951 ; 2015, http://www.cnrtl.fr/lexiques/prolex/ (2015)
	BASE
	Show details

15	A Multilingual Corpus for the Study of Toponyms in Translation
	Lecuit, Émeline; Maurel, Denis; Vitas, Dusko
	In: Challenges in Synchronic Toponymy: Structure, Context and Use ; https://hal.archives-ouvertes.fr/hal-01230007 ; Challenges in Synchronic Toponymy: Structure, Context and Use, Francke A. Verlag, 2015, 978-3-7720-8479-9 (2015)
	BASE
	Show details

16	Transducers cascades for an automatic recognition of Arabic named entities in order to establish links to free resources
	Ben Mesmia, Fatma; Friburger, Nathalie; Haddar, Kais...
	In: 16th International Conference on Intelligent Text Processing and Computational Linguistics ; https://hal.archives-ouvertes.fr/hal-01230026 ; 16th International Conference on Intelligent Text Processing and Computational Linguistics, Apr 2015, Cairo, Egypt. pp.61-67 ; http://www.cicling.org/2015/ (2015)
	BASE
	Show details

17	Construction d’une cascade de transducteurs pour la reconnaissance des dates à partir d’un corpus Wikipédia
	Ben Mesmia, Fatma; Friburger, Nathalie; Haddar, Kais...
	In: CEC-TAL ; https://hal.archives-ouvertes.fr/hal-01177022 ; CEC-TAL, Mar 2015, Sousse, Tunisie. pp.8-11 (2015)
	BASE
	Show details

18	Automatiser le processus d’anonymisation des corpus oraux : le cas d’ESLO
	Eshkol-Taravella, Iris; Baude, Olivier; Maurel, Denis...
	In: TALN2015 ; https://hal.archives-ouvertes.fr/hal-01174647 ; TALN2015, Jun 2015, Caen, France ; https://taln2015.greyc.fr/articlesenlignetaln/ (2015)
	BASE
	Show details

19	Arabic Named Entity Recognition Process using Transducer Cascade and Arabic Wikipedia
	Ben Mesmia, Fatma; Haddar, Kais; Maurel, Denis...
	In: RANLP 2015 ; https://hal.archives-ouvertes.fr/hal-01376732 ; RANLP 2015, Sep 2015, Hissar, Bulgaria ; http://lml.bas.bg/ranlp2015/start1.php (2015)
	BASE
	Show details

20	Using Finite-State Transducers to Build Lexical Resources for Unitex Arabic Package ...
	Noureddine Doumi; Lehireche, Ahmed; Maurel, Denis. - : Unpublished, 2015
	BASE
	Show details

Page: 1 2 3 4 5 6

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern