1 |
segmented Corpus of Buddhist Sanskrit (proof of concept) ...
|
|
|
|
Abstract:
This is a proof-of-concept Sanskrit corpus developed for the study of Buddhist Sanskrit lexicology. It comprises: 311 lemmatized and metadata-enriched Buddhist Sanskrit texts for a total of ~ 7 million words. a tokenised reference corpus of general Sanskrit including 267 texts for a total of ~ 13 million words a metadata table with information about each text in the Buddhist and Reference corpora The corpora is in romanised Sanskrit (UTF-8 encoding) Limitations As a proof of concept, this corpus suffers from several limitations. It is small by contemporary standards and it has not been proof-read yet. We are grateful to have received an Ashoka grant from the Khyentse Foundation to proofread the Buddhist Sanskrit Corpus. Improved versions will be released in due course. Acknowledgments The corpus had been first realised as part of the project 'Lexis and Tradition: variation in the vocabulary of Sanskrit Mahāyāna literature'. This project was funded by the British Academy through a Newton International ...
|
|
Keyword:
Buddhist Sanskrit; corpus; Sanskrit
|
|
URL: https://dx.doi.org/10.5281/zenodo.5847100 https://zenodo.org/record/5847100
|
|
BASE
|
|
Hide details
|
|
2 |
segmented Corpus of Buddhist Sanskrit (proof of concept) ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
Visual Dictionary and Thesaurus of Buddhist Sanskrit (Data) ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
Visual Dictionary and Thesaurus of Buddhist Sanskrit (Data) ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
Visual Dictionary and Thesaurus of Buddhist Sanskrit (Data) ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
Visual Dictionary and Thesaurus of Buddhist Sanskrit (Data) ...
|
|
|
|
BASE
|
|
Show details
|
|
|
|