Catalogue search • Linguistik portal • Fachinformationsdienst (FID)

1	CasANER: Arabic Named Entity Recognition Tool
	Ben Mesmia, Fatma; Haddar, Kais; Friburger, Nathalie...
	In: Intelligent Natural Language Processing: Trends and Applications ; https://hal.archives-ouvertes.fr/hal-01643271 ; Shaalan K., Hassanien A., Tolba F. Intelligent Natural Language Processing: Trends and Applications, Springer, Cham, pp.173-198, 2017, Studies in Computational Intelligence, 978-3-319-67055-3 (2017)
	BASE
	Show details

2	Construction d’une cascade de transducteurs pour la reconnaissance des dates à partir d’un corpus Wikipédia
	Ben Mesmia, Fatma; Friburger, Nathalie; Haddar, Kais...
	In: CEC-TAL ; https://hal.archives-ouvertes.fr/hal-01177022 ; CEC-TAL, Mar 2015, Sousse, Tunisie. pp.8-11 (2015)
	BASE
	Show details

3	Annotation tools for syntax and named entities in the National Corpus of Polish.
	Waszczuk, Jakub; Glowinska, Katarzyna; Savary, Agata; Przepiórkowski, Adam; Lenart, Michel
	In: ISSN: 1759-1163 ; EISSN: 1759-1171 ; International Journal of Data Mining, Modelling and Management ; https://hal.archives-ouvertes.fr/hal-01021348 ; International Journal of Data Mining, Modelling and Management, Inderscience, 2013, 5 (2), pp.103-122 (2013)
	Abstract: International audience ; The ongoing National Corpus of Polish project assumes several levels of linguistic annotation. We present the technical environment and methodological background developed for the three upper annotation levels: the levels of syntactic words, syntactic groups and named entities. We show how knowledge-based platforms Spejd and Sprout are used for the automatic pre-annotation of the corpus and discuss some particular problems faced during the preparation of the parser grammar, which contains over 1,000 rules and is one of the largest chunking grammars for Polish. We also show how the tree editor TrEd has been customised for manual post-editing of annotations and for further revision of discrepancies. Our XML format converters and customised archiving repository ensure an automatic data flow and efficient corpus file management. We discuss the inter-annotator agreement in the manually annotated data, and present the first results of a CRF classifier trained on these data.
	Keyword: [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]; [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]; automatic data flow; chunking grammars; corpus annotation; customised archiving repository; file management; linguistic annotation; named entities; named entity recognition; National Corpus of Polish; NER; parser grammar; shallow parsing; syntactic groups; syntactic words; syntax; XML converters
	URL: https://hal.archives-ouvertes.fr/hal-01021348
	BASE
	Hide details

Search in the Catalogues and Directories