DE eng

Search in the Catalogues and Directories

Page: 1 2 3 4 5...7
Hits 1 – 20 of 137

1
Penn Discourse Treebank Version 3.0
Prasad, Rashmi; Webber, Bonnie; Lee, Alan; Joshi, Aravind. - : Linguistic Data Consortium, 2019. : https://www.ldc.upenn.edu, 2019
Abstract: *Introduction* Penn Discourse Treebank (PDTB) Version 3.0 is the third release in the Penn Discourse Treebank project, the goal of which is to annotate the Wall Street Journal (WSJ) section of Treebank-2 (LDC95T7) with discourse relations. Penn Discourse Treebank Version 2 (LDC2008T05) contains over 40,600 tokens of annotated relations. In Version 3, an additional 13,000 tokens were annotated, certain pairwise annotations were standardized, new senses were included and the corpus was subject to a series of consistency checks. Details concerning the development of PDTB Version 3.0 can be found in the documentation accompanying this release. Largely because the PDTB project was based on the idea that discourse relations are grounded in an identifiable set of explicit words or phrases (discourse connectives) or simply in the adjacency of two sentences, the PTDB has been used by many researchers in the natural language processing community and more recently, by researchers in psycholinguistics. It has also stimulated the development of similar resources in other languages and domains. *Data* Annotations are provided in the form of separate text files (standoff annotation) that are byte-indexed into the raw WSJ text files in Treebank-2. The raw WSJ files are also included in this release. All text files are plain text, encoded in UTF-8. This corpus contains two tools: (1) The Annotator, used for annotation and adjudication, and which can also be used for viewing the corpus; and (2) The Conversion Tool for converting Version 2 annotation files into the Version 3 format. The documentation directory contains a manual describing what is new in Version 3 and how Version 3 differs from Version 2; the methods and guidelines used in annotating PDTB Version 3; and a range of statistics on the tokens, including the frequency of each connective, its sense labels and its modifiers. More information about the corpus and research carried out by the developers and others using the corpus can be found on the PDTB website. *Samples* One can see samples of the annotation of different types of discourse relations, along with their visualization in the Annotator tool at: * Explicit relations * Implicit relations * Altlex and AltLexC relations * Entity relations * Hypophora relations * NoRel (annotated only between adjacent sentences within a paragraph that are not linked to each other by a discourse relation) *Updates* Experiments carried out in Fall 2019 on the intra-sentential discourse relations in the PDTB-3 revealed two problems with the corpus: (1) the final versions of two gold files of "to clause" annotation had not been loaded, and (2) several tokens were inadvertently omitted on the assumption that they were duplicates, when they were not. Repairing these errors, and correcting a mis-labelled token in file wsj_1026, has added another 45 implicit intra-sentential relations to the corpus. Counts in the Annotation Manual have been adjusted to take these additional tokens into account. Specific changes/additions are recorded in the file "pdtb3-revision-jan-2020.txt". Downloads after February 3, 2020 contain the updated corpus. *Acknowledgment* This work has been funded by the National Science Foundation, under grant NSF IIS 1422186 to the University of Pennsylvania and grant NSF IIS 1421067 to the University of Wisconsin, Milwaukee. The content of this publication does not necessarily reflect the position or policy of the Government, and no official endorsement should be inferred.
URL: https://catalog.ldc.upenn.edu/LDC2019T05
BASE
Hide details
2
Penn Discourse Treebank Version 3.0 ...
Prasad, Rashmi; Webber, Bonnie; Lee, Alan. - : Linguistic Data Consortium, 2019
BASE
Show details
3
The main sessionProceedings of the forty-seventh (47.) annual meeting of the Chicago Linguistic Society 1.
In: The main session (2014), S. 31-46
Leibniz-Zentrum Allgemeine Sprachwissenschaft
Show details
4
A dependency perspective on the adequacy of tree local multi-component tree adjoining grammar
Chen-Main, Joan; Joshi, Aravind K.. - : Oxford University Press, 2012
BASE
Show details
5
LTAG-spinal and the Treebank: a new resource for incremental, dependency and semantic parsing
Shen, Libin; Champollion, Lucas; Joshi, Aravind K.. - : Universität Tübingen, 2012
BASE
Show details
6
Formal grammars in linguistics and psycholinguistics, vol. 1: An introduction to the theory of formal languages and automata. By Willem J. M. Levelt. Amsterdam: John Benjamins, 2008. Pp. XI, 139 [Rezension]
In: Language. - Washington, DC : Linguistic Society of America 87 (2011) 2, 414-416
BLLDB
OLC Linguistik
Show details
7
Discourse Indicators for Content Selection in Summaization
In: Departmental Papers (CIS) (2010)
BASE
Show details
8
Using Entity Features to Classify Implicit Discourse Relations
In: Departmental Papers (CIS) (2010)
BASE
Show details
9
Tree-adjoining grammars
In: The Oxford handbook of computational linguistics (New York, 2009), p. 483-500
MPI für Psycholinguistik
Show details
10
LTAG-spinal and the Treebank : a new resource for incremental, dependency and semantic parsing
Shen, Libin Verfasser]. - Tübingen : Universitätsbibliothek Tübingen, 2008
DNB Subject Category Language
Show details
11
LTAG-spinal and the treebank : a new resource for incremental, depedency and semantic parsing
In: Language resources and evaluation. - Dordrecht [u.a.] : Springer 42 (2008) 1, 1-19
BLLDB
Show details
12
Penn Discourse Treebank Version 2.0
Prasad, Rashmi; Lee, Alan; Dinesh, Nikhil. - : Linguistic Data Consortium, 2008. : https://www.ldc.upenn.edu, 2008
BASE
Show details
13
Penn Discourse Treebank Version 2.0 ...
Prasad, Rashmi; Lee, Alan; Dinesh, Nikhil. - : Linguistic Data Consortium, 2008
BASE
Show details
14
Sense Annotation in the Penn Discourse Treebank
In: Departmental Papers (CIS) (2008)
BASE
Show details
15
Computational linguistics: A new tool for exploring biopolymer structures and statistical mechanics
In: Dill, Ken A; Lucas, Adam; Hockenmaier, Julia; Huang, Liang; Chiang, David; & Joshi, Aravind K.(2007). Computational linguistics: A new tool for exploring biopolymer structures and statistical mechanics. Polymer, 48, 4289 - 4300. doi:10.1016/j.polymer.2007.05.018. UC San Francisco: Retrieved from: http://www.escholarship.org/uc/item/235458px (2007)
BASE
Show details
16
Detecting Compositionality of Verb-Object Combinations using Selectional Preferences
Venkatapathy, Sriram; McCarthy, Diana; Joshi, Aravind K. - : The Association for Computational Linguistics, 2007
BASE
Show details
17
The Penn Discourse Treebank 2.0 Annotation Manual
In: IRCS Technical Reports Series (2007)
BASE
Show details
18
Computing discourse semantics : the predicate-argument semantics of discourse connectives in D-LTAG
In: Journal of semantics. - Oxford : Univ. Press 23 (2006) 1, 55-106
BLLDB
OLC Linguistik
Show details
19
Attribution and its annotation in the Penn Discourse TreeBank
In: Traitement automatique des langues. - Paris : ATALA 47 (2006) 2, 43-64
BLLDB
Show details
20
A short introduction to the Penn Discourse TreeBank
In: Treebanking for discourse and speech. - Frederiksberg : Samfundslitteratur (2006), 9-28
BLLDB
Show details

Page: 1 2 3 4 5...7

Catalogues
11
1
10
0
4
0
2
Bibliographies
50
0
0
1
1
0
0
0
10
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
52
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern