3 |
GikiP at GeoCLEF 2008: Joining GIR and QA forces for querying Wikipedia
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Porting a summarizer to the French language
|
|
|
|
In: Bois, Remi, Leveling, Johannes orcid:0000-0003-0603-4191 , Goeuriot, Lorraine orcid:0000-0001-7491-1980 , Jones, Gareth J.F. orcid:0000-0002-4033-9135 and Kelly, Liadh orcid:0000-0003-1131-5238 (2014) Porting a summarizer to the French language. In: Traitement Automatique du Langage Naturel (TALN), 1-4 Jul 2014, Marseille, France. (2014)
|
|
BASE
|
|
Show details
|
|
5 |
Automatic prediction of text aesthetics and interestingness
|
|
|
|
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0002-4033-9135 (2014) Automatic prediction of text aesthetics and interestingness. In: 25th International Conference on Computational Linguistics (COLING 2014), 23-29 Aug 2014, Dublin, Ireland. (2014)
|
|
Abstract:
This paper investigates the problem of automated text aesthetics prediction. The availability of user generated content and ratings, e.g. Flickr, has induced research in aesthetics prediction for non-text domains, particularly for photographic images. This problem, however, has yet not been explored for the text domain. Due to the very subjective nature of text aesthetics, it is dicult to compile human annotated data by methods such as crowd sourcing with a fair degree of inter-annotator agreement. The availability of the Kindle \popular highlights" data has motivated us to compile a dataset comprised of human annotated aesthetically pleasing and interesting text passages. We then undertake a supervised classication approach to predict text aesthetics by constructing real-valued feature vectors from each text passage. In particular, the features that we use for this classification task are word length, repetitions, polarity, part-of-speech, semantic distances; and topic generality and diversity. A traditional binary classication approach is not effective in this case because non-highlighted passages surrounding the highlighted ones do not necessarily represent the other extreme of unpleasant quality text. Due to the absence of real negative class samples, we employ the MC algorithm, in which training can be initiated with instances only from the positive class. On each successive iteration the algorithm selects new strong negative samples from the unlabeled class and retrains itself. The results show that the mapping convergence (MC) algorithm with a Gaussian and a linear kernel used for the mapping and convergence phases, respectively, yields the best results, achieving satisfactory accuracy, precision and recall values of about 74%, 42% and 54% respectively.
|
|
Keyword:
Computational linguistics; Information retrieval
|
|
URL: http://doras.dcu.ie/20379/
|
|
BASE
|
|
Hide details
|
|
6 |
Adaptation of machine translation for multilingual information retrieval in the medical domain
|
|
|
|
In: Pecina, Pavel, Dušek, Ondřej, Goeuriot, Lorraine orcid:0000-0001-7491-1980 , Hajič, Jan, Hlaváčová, Jaroslava, Jones, Gareth J.F. orcid:0000-0002-4033-9135 , Kelly, Liadh orcid:0000-0003-1131-5238 , Leveling, Johannes orcid:0000-0003-0603-4191 , Mareček, David, Novák, Michal, Popel, Martin, Rosa, Rudolf, Tamchyna, Aleš and Urešová, Zdeňka (2014) Adaptation of machine translation for multilingual information retrieval in the medical domain. Artificial Intelligence in Medicine, 61 (3). pp. 165-185. ISSN 1873-2860 (2014)
|
|
BASE
|
|
Show details
|
|
7 |
Adaptation of machine translation for multilingual information retrieval in the medical domain
|
|
|
|
In: ISSN: 0933-3657 ; Artificial Intelligence in Medicine ; https://hal.archives-ouvertes.fr/hal-01921881 ; Artificial Intelligence in Medicine, Elsevier, 2014, 61 (3), pp.165 - 185. ⟨10.1016/j.artmed.2014.01.004⟩ (2014)
|
|
BASE
|
|
Show details
|
|
8 |
A case study in decompounding for Bengali information retrieval
|
|
|
|
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2013) A case study in decompounding for Bengali information retrieval. In: CLEF 2013 - Conference and Labs, 23-26 Sept 2013, Valencia, Spain. (2013)
|
|
BASE
|
|
Show details
|
|
9 |
Overview of the ShARe/CLEF eHealth Evaluation Lab 2013
|
|
|
|
In: Suominen, Hanna, Salanterä, Sanna, Velupillai, Sumithra, Chapman, Wendy, Savova, Guergana, Elhadad, Noemie, Pradhan, Sameer, South, Brett R., Mowery, Danielle L., Jones, Gareth J.F. orcid:0000-0003-2923-8365 , Leveling, Johannes orcid:0000-0003-0603-4191 , Kelly, Liadh orcid:0000-0003-1131-5238 , Goeuriot, Lorraine orcid:0000-0001-7491-1980 , Martinez, David and Zuccon, Guido (2013) Overview of the ShARe/CLEF eHealth Evaluation Lab 2013. In: 4th International Conference of the CLEF Initiative (CLEF 2013), 23-26 Sept 2013, Valencia, Spain. ISBN 978-3-642-40802-1 (2013)
|
|
BASE
|
|
Show details
|
|
10 |
Overview of the ShARe/CLEF eHealth evaluation lab 2013
|
|
|
|
In: Suominen, Hanna, Salanterä, Sanna, Velupillai, Sumithra, Chapman, Wendy, Savova, Guergana, Elhadad, Noemie, Pradhan, Sameer, South, Brett R., Mowery, Danielle L., Jones, Gareth J.F. orcid:0000-0003-2923-8365 , Leveling, Johannes orcid:0000-0003-0603-4191 , Kelly, Liadh orcid:0000-0003-1131-5238 , Martinez, David and Zuccon, Guido (2013) Overview of the ShARe/CLEF eHealth evaluation lab 2013. Information Access Evaluation. Multilinguality, Multimodality, and Visualization, 8138 ( . pp. 212-231. ISSN 0302-9743 (2013)
|
|
BASE
|
|
Show details
|
|
11 |
DCU@FIRE-2012: rule-based stemmers for Bengali and Hindi
|
|
|
|
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2012) DCU@FIRE-2012: rule-based stemmers for Bengali and Hindi. In: FIRE 2012 Workshop, 17-19 Dec 2012, Kolkata, India. (2012)
|
|
BASE
|
|
Show details
|
|
12 |
Cross-lingual topical relevance models
|
|
|
|
In: Ganguly, Debasis orcid:0000-0003-0603-4191 , Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2012) Cross-lingual topical relevance models. In: 24th International Conference on Computational Linguistics (COLING 2012), 8-15 Dec 2012, Mumbai, India. (2012)
|
|
BASE
|
|
Show details
|
|
13 |
Making results fit into 40 characters: a study in document rewriting
|
|
|
|
In: Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2012) Making results fit into 40 characters: a study in document rewriting. In: The 35th Annual ACM SIGIR 2012 Conference, 12-16 Aug 2012, Portland, Oregon. (2012)
|
|
BASE
|
|
Show details
|
|
14 |
Approximate sentence retrieval for scalable and efficient example-based machine translation
|
|
|
|
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 , Dandapat, Sandipan and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2012) Approximate sentence retrieval for scalable and efficient example-based machine translation. In: 24th International Conference on Computational Linguistics (COLING 2012), 8-15 Dec 2012, Mumbai, India. (2012)
|
|
BASE
|
|
Show details
|
|
15 |
LogCLEF: Enabling research on multilingual log files
|
|
|
|
In: Leveling, Johannes orcid:0000-0003-0603-4191 , Di Nunzio, Giorgio Maria and Mandl, Thomas (2011) LogCLEF: Enabling research on multilingual log files. In: The 1st Workshop on Personalised Multilingual Hypertext Retrieval (PMHR 2011), 6 June 2011, Eindhoven, the Netherlands. (2011)
|
|
BASE
|
|
Show details
|
|
16 |
Personalised multilingual hypertext retrieval: An overview
|
|
|
|
In: Agosti, Maristella, De Luca, Ernesto William, Lawless, Séamus and Leveling, Johannes orcid:0000-0003-0603-4191 (2011) Personalised multilingual hypertext retrieval: An overview. In: The First Workshop on Personalised Multilingual Hypertext Retrieval (PMHR 2011), 6 June 2011, Eindhoven, The Netherlands. (2011)
|
|
BASE
|
|
Show details
|
|
17 |
Towards evaluation of personalized and collaborative information retrieval
|
|
|
|
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 , Li, Wei B. orcid:0000-0001-7347-3501 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2011) Towards evaluation of personalized and collaborative information retrieval. In: The First Workshop on Personalised Multilingual Hypertext Retrieval (PMHR 2011), 6th June 2011, Eindhoven, The Netherlands. (2011)
|
|
BASE
|
|
Show details
|
|
18 |
Simulation of within-session query variations using a text segmentation approach
|
|
|
|
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2011) Simulation of within-session query variations using a text segmentation approach. In: CLEF 2011 Conference on Multilingual and Multimodal Information Access Evaluation, 19-22 Sept 2011, Amsterdam, The Netherlands. (2011)
|
|
BASE
|
|
Show details
|
|
19 |
Multilingual log analysis: LogCLEF
|
|
|
|
In: Nunzio, Giorgio Maria Di, Leveling, Johannes orcid:0000-0003-0603-4191 and Mandl, Thomas (2011) Multilingual log analysis: LogCLEF. In: 33rd European Conference on Information Retrieval (ECIR 2011), 18th-21st April 2011, Dublin, Ireland. (2011)
|
|
BASE
|
|
Show details
|
|
20 |
Multilingual adaptive search for digital libraries
|
|
|
|
In: Ghorab, M. Rami, Leveling, Johannes orcid:0000-0003-0603-4191 , Lawless, Séamus, O'Connor, Alexander, Zhou, Dong, Jones, Gareth J.F. orcid:0000-0003-2923-8365 and Wade, Vincent (2011) Multilingual adaptive search for digital libraries. In: 1st International Conference on Theory and Practice of Digital Libraries (TPDL 2011), 25-29 Sept 2011, Berlin, Germany. ISBN 9783642244681 (2011)
|
|
BASE
|
|
Show details
|
|
|
|