1 |
From Stance to Concern: Adaptation of Propositional Analysis to New Tasks and Domains ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Incremental Phrase Structure Generation and a Universal Theory of V2
|
|
|
|
In: North East Linguistics Society (2020)
|
|
BASE
|
|
Show details
|
|
4 |
Author Commitment and Social Power: Automatic Belief Tagging to Infer the Social Context of Interactions ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
TAG Parsing with Neural Networks and Vector Representations of Supertags
|
|
|
|
In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, ; Conference on Empirical Methods in Natural Language Processing ; https://hal.archives-ouvertes.fr/hal-01771494 ; Conference on Empirical Methods in Natural Language Processing, Sep 2017, Copenhague, Denmark. pp.1712 - 1722 (2017)
|
|
BASE
|
|
Show details
|
|
8 |
Morphologically Annotated Corpora and Morphological Analyzers for Moroccan and Sanaani Yemeni Arabic
|
|
|
|
In: 10th Language Resources and Evaluation Conference (LREC 2016) ; https://hal.archives-ouvertes.fr/hal-01349201 ; 10th Language Resources and Evaluation Conference (LREC 2016), May 2016, Portoroz, Slovenia (2016)
|
|
BASE
|
|
Show details
|
|
11 |
Statistical modality tagging from rule-based annotations and crowdsourcing ...
|
|
|
|
BASE
|
|
Show details
|
|
12 |
Conventional Orthography for Dialectal Arabic (CODA): Principles and Guidelines -- Egyptian Arabic - Version 0.7 - March 2012
|
|
|
|
BASE
|
|
Show details
|
|
13 |
Conventional Orthography for Dialectal Arabic (CODA): Principles and Guidelines -- Egyptian Arabic - Version 0.7 - March 2012 ...
|
|
|
|
BASE
|
|
Show details
|
|
14 |
LDC Arabic Treebanks and Associated Corpora: Data Divisions Manual
|
|
|
|
BASE
|
|
Show details
|
|
15 |
LDC Arabic Treebanks and Associated Corpora: Data Divisions Manual ...
|
|
|
|
BASE
|
|
Show details
|
|
16 |
LDC Arabic Treebanks and Associated Corpora: Data Divisions Manual ...
|
|
|
|
Abstract:
The Linguistic Data Consortium (LDC) has developed hundreds of data corpora for natural language processing (NLP) research. Among these are a number of annotated treebank corpora for Arabic. Typically, these corpora consist of a single collection of annotated documents. NLP research, however, usually requires multiple data sets for the purposes of training models, developing techniques, and final evaluation. Therefore it becomes necessary to divide the corpora used into the required data sets (divisions). Unfortunately, there is no universally accepted convention or standard for dividing bulk corpora. This caused different research groups to either define their own divisions (which makes comparison to similar research results difficult) or adopt existing published divisions (which do not adapt as new corpora versions are released). When a new treebank is released, a new division needs to be developed, which may or may not be consistent with the other treebank divisions. This document details a set of rules ...
|
|
Keyword:
Computer science; Information science
|
|
URL: https://dx.doi.org/10.7916/d8pk0qkd https://academiccommons.columbia.edu/doi/10.7916/D8PK0QKD
|
|
BASE
|
|
Hide details
|
|
17 |
Frame-Based Representation of Lexical, Graphical, and Factual Knowledge for Text-to-Scene Generation
|
|
|
|
BASE
|
|
Show details
|
|
|
|