1 | Speaker Information Can Guide Models to Better Inductive Biases: A Case Study On Predicting Code-Switching ... | BASE
2 | Machine Translation into Low-resource Language Varieties ... | BASE
3 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
4 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
5 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
6 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
7 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
8 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
9 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
10 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
11 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
12 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
13 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
14 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
15 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
16 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
17 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
18 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
19 | Representation and parsing of multiword expressions: Current trends | In: Language Science Press; (2019) | BASE
20 | The Knesset Meetings Corpus 2004-2005 ...
Abstract:
The Knesset Meetings Corpus 2004-2005 is made up of two components:

Raw texts - 282 files totaling 867,725 lines. These can be downloaded in two formats:
  As doc files, encoded using windows-1255 encoding:
    kneset16.zip - Contains 164 text files totaling 543,228 lines. [MILA host] [Github Mirror]
    kneset17.zip - Contains 118 text files totaling 324,497 lines. [MILA host] [Github Mirror]
  As txt files, encoded using utf8 encoding:
    kneset.tar.gz - An archive of all the raw text files, divided into two folders: [Github mirror]
      16 - Contains 164 text files totaling 543,228 lines.
      17 - Contains 118 text files totaling 324,497 lines.
    knesset_txt_16.tar.gz - Contains 164 text files totaling 543,228 lines. [MILA host] [Github Mirror]
    knesset_txt_17.zip - Contains 118 text files totaling 324,497 lines. [MILA host] [Github Mirror]

Tokenized and morphologically tagged texts - Tagged versions exist only for the files in the 16 folder. The texts are ...

The Open Natural Language Processing in Hebrew (NLPH) initiative is a joint effort by members of DataHack and The Public Knowledge Workshop to promote open tools and resources for Natural Language Processing in Hebrew. This community collects resources for NLP in Hebrew as part of the NLPH project. These include corpora, lexicons, dictionaries, treebanks, embeddings, code, services, applications, papers, course materials and presentations, among others. A full list of these resources is maintained at https://github.com/NLPH/NLPH_Resources. If you have a resource you can contribute, to be released under some open license, please submit a pull request or contact us at contact@nlph.org.il. ...
Keyword: Hebrew; Knesset; morphologically tagged text; NLP; NLPH; Tokenization; Transcripts
URL: https://dx.doi.org/10.5281/zenodo.2707356 https://zenodo.org/record/2707356
BASE