1. Decreasing lexical data sparsity in statistical syntactic parsing - experiments with named entities
In: Hogan, Deirdre, Foster, Jennifer orcid:0000-0002-7789-4853 and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) Decreasing lexical data sparsity in statistical syntactic parsing - experiments with named entities. In: Multiword Expressions: from Parsing and Generation to the Real World (MWE). Workshop at ACL 2011, 19-24 June 2011, Portland, Oregon. (2011)
Abstract:
In this paper we present preliminary experiments that aim to reduce lexical data sparsity in statistical parsing by exploiting information about named entities. Words in the WSJ corpus are mapped to named entity clusters and a latent variable constituency parser is trained and tested on the transformed corpus. We explore two different methods for mapping words to entities, and look at the effect of mapping various subsets of named entity types. Thus far, results show no improvement in parsing accuracy over the best baseline score; we identify possible problems and outline suggestions for future directions.
Keywords:
language corpus; Lexicalisation; Machine translating
URL: http://doras.dcu.ie/16465/
2. From news to comment: Resources and benchmarks for parsing the language of web 2.0
In: Foster, Jennifer orcid:0000-0002-7789-4853, Cetinoglu, Ozlem, Wagner, Joachim orcid:0000-0002-8290-3849, Le Roux, Joseph, Nivre, Joakim, Hogan, Deirdre and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) From news to comment: Resources and benchmarks for parsing the language of web 2.0. In: The 5th International Joint Conference on Natural Language Processing (IJCNLP), 08-13 Nov 2011, Chiang Mai, Thailand. ISBN 978-974-466-564-5 (2011)
3. #hardtoparse: POS tagging and parsing the twitterverse
In: Foster, Jennifer orcid:0000-0002-7789-4853, Cetinoglu, Ozlem, Wagner, Joachim orcid:0000-0002-8290-3849, Le Roux, Joseph, Hogan, Stephen, Nivre, Joakim, Hogan, Deirdre and van Genabith, Josef orcid:0000-0003-1322-7944 (2011) #hardtoparse: POS tagging and parsing the twitterverse. In: The AAAI-11 Workshop on Analyzing Microtext, 8 Aug 2011, San Francisco, CA. (2011)
4. LFG without C-structures
In: Cetinoglu, Ozlem, Foster, Jennifer orcid:0000-0002-7789-4853, Nivre, Joakim, Hogan, Deirdre, Cahill, Aoife orcid:0000-0002-3519-7726 and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) LFG without C-structures. In: The 9th International Workshop on Treebanks and Linguistic Theories, 3-4 Dec. 2010, Tartu, Estonia. (2010)
5. Handling unknown words in statistical latent-variable parsing models for Arabic, English and French
In: Attia, Mohammed, Foster, Jennifer orcid:0000-0002-7789-4853, Hogan, Deirdre, Le Roux, Joseph, Tounsi, Lamia and van Genabith, Josef orcid:0000-0003-1322-7944 (2010) Handling unknown words in statistical latent-variable parsing models for Arabic, English and French. In: SPMRL 2010 - 1st Workshop on Statistical Parsing of Morphologically-Rich Languages at NAACL HLT 2010, 5 June 2010, Los Angeles, CA, USA. (2010)
6. Finding common ground: towards a surface realisation shared task
In: Belz, Anya, White, Mike, van Genabith, Josef orcid:0000-0003-1322-7944, Hogan, Deirdre and Stent, Amanda (2010) Finding common ground: towards a surface realisation shared task. In: INLG 2010 - 6th International Natural Language Generation Conference, 7-9 July 2010, Trim, Co. Meath, Ireland. (2010)
7. Handling Unknown Words in Statistical Latent-Variable Parsing Models for Arabic, English and French
In: Proceedings of the First Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010), 2010, United States, pp. 67-75. https://hal.archives-ouvertes.fr/hal-00702414
9. DCU at the TREC 2008 Blog Track
In: Bermingham, Adam, Smeaton, Alan F. orcid:0000-0003-1028-8389, Foster, Jennifer orcid:0000-0002-7789-4853 and Hogan, Deirdre (2008) DCU at the TREC 2008 Blog Track. In: TREC 2008 - Text REtrieval Conference, Gaithersburg, MD. (2008)
10. Parser-based retraining for domain adaptation of probabilistic generators
In: Hogan, Deirdre, Foster, Jennifer orcid:0000-0002-7789-4853, Wagner, Joachim orcid:0000-0002-8290-3849 and van Genabith, Josef (2008) Parser-based retraining for domain adaptation of probabilistic generators. In: INLG 08 - 5th International Natural Language Generation Conference, 12-14 June 2008, Salt Fork, Ohio, USA. (2008)
12. Empirical measurements of lexical similarity in noun phrase conjuncts
In: Hogan, Deirdre (2007) Empirical measurements of lexical similarity in noun phrase conjuncts. In: ACL 2007 - 45th Annual Meeting of the Association for Computational Linguistics, 25-27 June 2007, Prague, Czech Republic. (2007)
13. Exploiting multi-word units in history-based probabilistic generation
In: Hogan, Deirdre, Cafferkey, Conor, Cahill, Aoife orcid:0000-0002-3519-7726 and van Genabith, Josef (2007) Exploiting multi-word units in history-based probabilistic generation. In: EMNLP-CoNLL 2007 - Joint Meeting of the Conference on Empirical Methods in Natural Language Processing and the Conference on Computational Natural Language Learning, 28-30 June 2007, Prague, Czech Republic. (2007)
14. Coordinate noun phrase disambiguation in a generative parsing model
In: Hogan, Deirdre (2007) Coordinate noun phrase disambiguation in a generative parsing model. In: ACL 2007 - 45th Annual Meeting of the Association for Computational Linguistics, 25-27 June 2007, Prague, Czech Republic. (2007)
15. On coordination disambiguation in a generative parsing model, with memory-based techniques for parameter estimation.
Hogan, Deirdre. Trinity College (Dublin, Ireland), School of Computer Science & Statistics, 2007