41 |
Amharic Adhoc Information Retrieval System Based on Morphological Features
In: Applied Sciences; Volume 12; Issue 3; Pages: 1294 (2022)
Abstract:
Information retrieval (IR) is one of the most important research and development areas due to the explosion of digital data and the need to access relevant information from huge corpora. Although IR systems function well for technologically advanced languages such as English, this is not the case for morphologically complex, under-resourced and less-studied languages such as Amharic. Amharic is a Semitic language characterized by a complex morphology in which thousands of words are generated from a single root form through inflection and derivation. This has made the development of Amharic natural language processing (NLP) tools a challenging task. Amharic adhoc retrieval also faces challenges due to the scarcity of linguistic resources, tools and standard evaluation corpora. In this research work, we investigate the impact of morphological features on the representation of Amharic documents and queries for adhoc retrieval. We also analyze the effects of stem-based and root-based text representation, and propose a new architecture for Amharic IR systems. Moreover, we present the resources and corpora we constructed for the evaluation of Amharic IR systems and other NLP tools. We conduct various experiments with a TREC-like approach for an Amharic IR test collection using a standard evaluation framework and measures. Our findings show that root-based text representation outperforms the conventional stem-based representation on Amharic IR.
Keywords:
adhoc retrieval; Amharic; complex morphology; corpus; information retrieval; resources
URL: https://doi.org/10.3390/app12031294
43 |
Community Development of the SWEET Semantic System for Earth and Environmental Data - A Call for Interest ...
44 |
The SWEET (Semantic Web for Earth and Environmental Terminology) Bibliography ...
45 |
Community Development of the SWEET Semantic System for Earth and Environmental Data - A Call for Interest ...
46 |
The SWEET (Semantic Web for Earth and Environmental Terminology) Bibliography ...
47 |
Deep Embeddings for Robust User-Based Amateur Vocal Percussion Classification ...
53 |
ISSUES AND CHALLENGES IN INDIAN MULTI-LINGUAL AND MULTI SCRIPTS BIBLIOGRAPHIC RETRIEVAL SYSTEMS
In: Library Philosophy and Practice (e-journal) (2022)
55 |
ISSumSet: a tweet summarization dataset hidden in a TREC track
In: SAC '21: Proceedings of the 36th Annual ACM Symposium on Applied Computing ; ISBN: 978-1-4503-8104-8 ; 36th ACM/SIGAPP Symposium on Applied Computing (SAC 2021) ; https://hal-univ-tlse3.archives-ouvertes.fr/hal-03244354 ; 36th ACM/SIGAPP Symposium on Applied Computing (SAC 2021), Association for Computing Machinery - Special Interest Group on Applied Computing (SIGAPP), Mar 2021, Republic of Korea (virtual event), South Korea. pp.665-671, ⟨10.1145/3412841.3441946⟩ ; https://dl.acm.org/doi/10.1145/3412841.3441946 (2021)
56 |
High-resolution speaker counting in reverberant rooms using CRNN with Ambisonics features
In: EUSIPCO 2020 - 28th European Signal Processing Conference (EUSIPCO) ; https://hal.archives-ouvertes.fr/hal-03537323 ; EUSIPCO 2020 - 28th European Signal Processing Conference (EUSIPCO), Jan 2021, Amsterdam, Netherlands. pp.71-75, ⟨10.23919/Eusipco47968.2020.9287637⟩ (2021)
57 |
Why Don't You Act Your Age?: Recognizing the Stereotypical 8-12 Year Old Searcher by Their Search Behavior
In: Boise State University Theses and Dissertations (2021)
58 |
Supporting an effective review of telecollaboration for second language learning by visualising the participation and engagement at Dublin City University
In: Lee, Hyowon orcid:0000-0003-4395-7702 , Scriney, Michael orcid:0000-0001-6813-2630 , Dey-Plissonneau, Aparajita and Smeaton, Alan orcid:0000-0003-1028-8389 (2021) Supporting an effective review of telecollaboration for second language learning by visualising the participation and engagement at Dublin City University. In: Virtual Exchange in Higher Education: Charting the Irish Experience, 17 Sept 2021, Online vs MS Teams. (2021)
59 |
English machine reading comprehension: new approaches to answering multiple-choice questions
Dzendzik, Daria. Dublin City University, School of Computing, 2021; Dublin City University, ADAPT, 2021
In: Dzendzik, Daria (2021) English machine reading comprehension: new approaches to answering multiple-choice questions. PhD thesis, Dublin City University. (2021)
60 |
Dataset diversity: measuring and mitigating geographical bias in image search and retrieval
In: Mandal, Abhishek, Leavy, Susan and Little, Suzanne orcid:0000-0003-3281-3471 (2021) Dataset diversity: measuring and mitigating geographical bias in image search and retrieval. In: 1st International Workshop on Trustworthy AI for Multimedia Computing, 24 Oct 2021, Chengdu, China. ISBN 978-1-4503-8674-6 (2021)