DE eng

Search in the Catalogues and Directories

Hits 1 – 13 of 13

1
Tempo-lexical context driven word embedding for cross-session search task extraction
In: Sen, Procheta, Ganguly, Debasis orcid:0000-0003-0050-7138 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2018) Tempo-lexical context driven word embedding for cross-session search task extraction. In: 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 1-6 June 2018, New Orleans, LA, USA. (2018)
BASE
Show details
2
Retrievability of code mixed microblogs
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Bandyopadhyay, Ayan, Mitra, Mandar and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2016) Retrievability of code mixed microblogs. In: 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 17-21 July 2016, Pisa, Italy. ISBN 978-1-4503-4069-4 (2016)
BASE
Show details
3
Joint estimation of topics and hashtag relevance in cross-lingual tweets
In: Sen, Procheta, Ganguly, Debasis orcid:0000-0003-0050-7138 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2016) Joint estimation of topics and hashtag relevance in cross-lingual tweets. In: ACM on International Conference on the Theory of Information Retrieval, ICTIR 2016, 12- 6 Sept 2016., Newark, DE, USA. ISBN 978-1-4503-4497-5 (2016)
BASE
Show details
4
FaDA: fast document aligner using word embedding
In: Lohar, Pintu, Ganguly, Debasis orcid:0000-0003-0050-7138 , Afli, Haithem orcid:0000-0002-7449-4707 , Way, Andy orcid:0000-0001-5736-5930 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2016) FaDA: fast document aligner using word embedding. Prague Bulletin of Mathematical Linguistics (106). pp. 169-179. ISSN 1804-0462 (2016)
BASE
Show details
5
DCU@FIRE-2014: fuzzy queries with rule-based normalization for mixed script information retrieval
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Pal, Santanu and Jones, Gareth J.F. orcid:0000-0002-4033-9135 (2014) DCU@FIRE-2014: fuzzy queries with rule-based normalization for mixed script information retrieval. In: Forum for Information Retrieval Evaluation (FIRE 2014) workshop, 5-7 Dec 2014, Bangalore, India. (2014)
BASE
Show details
6
Automatic prediction of text aesthetics and interestingness
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0002-4033-9135 (2014) Automatic prediction of text aesthetics and interestingness. In: 25th International Conference on Computational Linguistics (COLING 2014), 23-29 Aug 2014, Dublin, Ireland. (2014)
BASE
Show details
7
A case study in decompounding for Bengali information retrieval
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2013) A case study in decompounding for Bengali information retrieval. In: CLEF 2013 - Conference and Labs, 23-26 Sept 2013, Valencia, Spain. (2013)
BASE
Show details
8
DCU@FIRE-2012: rule-based stemmers for Bengali and Hindi
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2012) DCU@FIRE-2012: rule-based stemmers for Bengali and Hindi. In: FIRE 2012 Workshop, 17-19 Dec 2012, Kolkata, India. (2012)
BASE
Show details
9
Cross-lingual topical relevance models
In: Ganguly, Debasis orcid:0000-0003-0603-4191 , Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2012) Cross-lingual topical relevance models. In: 24th International Conference on Computational Linguistics (COLING 2012), 8-15 Dec 2012, Mumbai, India. (2012)
BASE
Show details
10
Approximate sentence retrieval for scalable and efficient example-based machine translation
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 , Dandapat, Sandipan and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2012) Approximate sentence retrieval for scalable and efficient example-based machine translation. In: 24th International Conference on Computational Linguistics (COLING 2012), 8-15 Dec 2012, Mumbai, India. (2012)
Abstract: Approximate sentence matching (ASM) is an important technique for tasks in machine translation (MT) such as example-based MT (EBMT) which influences the translation time and the quality of translation output. We investigate different approaches to find similar sentences in an example base and evaluate their efficiency (runtime), effectiveness, and the resulting quality of translation output. A comparison of approaches demonstrates that i) a sequential computation of the edit distance between an input sentence and all sentences in the example base is not feasible, even when efficient algorithms to compute the edit distance are employed; ii) in-memory data structures such as tries and ternary search trees are more efficient in terms of runtime, but are not scalable for large example bases; iii) standard IR models which only cover material similarity (e.g. term overlap), do not perform well in finding the approximate matches, due to their lack of handling word order and word positions. We propose a new retrieval model derived from language modelling (LM), named LM-ASM, to include positional and ordinal similarities in the matching process, in addition to material similarity. Our IR based retrieval experiments involve reranking the top-ranked documents based on their true edit distance score. Experimental results show that i) IR based approaches result in about 100 times faster translation; ii) LM-ASM approximates edit distance better than standard LM by about 10%; and iii) surprisingly, LM-ASM even improves MT quality by 1:52% in comparison to sequential edit distance computation.
Keyword: Approximate Sentence Matching; Edit Distance; Example-Based Machine Translation; Information retrieval; Machine translating
URL: http://doras.dcu.ie/20362/
BASE
Hide details
11
Towards evaluation of personalized and collaborative information retrieval
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 , Li, Wei B. orcid:0000-0001-7347-3501 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2011) Towards evaluation of personalized and collaborative information retrieval. In: The First Workshop on Personalised Multilingual Hypertext Retrieval (PMHR 2011), 6th June 2011, Eindhoven, The Netherlands. (2011)
BASE
Show details
12
Simulation of within-session query variations using a text segmentation approach
In: Ganguly, Debasis orcid:0000-0003-0050-7138 , Leveling, Johannes orcid:0000-0003-0603-4191 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2011) Simulation of within-session query variations using a text segmentation approach. In: CLEF 2011 Conference on Multilingual and Multimodal Information Access Evaluation, 19-22 Sept 2011, Amsterdam, The Netherlands. (2011)
BASE
Show details
13
DCU@FIRE2010: term conflation, blind relevance feedback, and cross-language IR with manual and automatic query translation
In: Leveling, Johannes orcid:0000-0003-0603-4191 , Ganguly, Debasis orcid:0000-0003-0050-7138 and Jones, Gareth J.F. orcid:0000-0003-2923-8365 (2010) DCU@FIRE2010: term conflation, blind relevance feedback, and cross-language IR with manual and automatic query translation. In: FIRE 2010 - Forum for Information Retrieval Evaluation, 19-21 February 2010, Gandhinagar, India. (2010)
BASE
Show details

Catalogues
0
0
0
0
0
0
0
Bibliographies
0
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
13
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern