3 |
Delimitation of dialect regions, subgroups, areas and types in the Czech Republic
|
|
Goláňová, Hana. - : Charles University, Faculty of Arts, Institute of the Czech National Corpus, 2022
|
|
BASE
|
|
Show details
|
|
7 |
Agreement attraction in English and Czech: A direct experimental comparison ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
The language and functions of Czech counter slogans: 1948 to 1989
|
|
|
|
In: 20 ; 1 ; 28 (2022)
|
|
BASE
|
|
Show details
|
|
12 |
PDT-Vallex: Czech Valency lexicon linked to treebanks 4.0 (PDT-Vallex 4.0)
|
|
|
|
BASE
|
|
Show details
|
|
19 |
FERNET-C5
|
|
|
|
Abstract:
The FERNET-C5 is a monolingual BERT language representation model trained from scratch on the Czech Colossal Clean Crawled Corpus (C5) data - a Czech mutation of the English C4 dataset. The training data contained almost 13 billion words (93 GB of text data). The model has the same architecture as the original BERT model, i.e. 12 transformation blocks, 12 attention heads and the hidden size of 768 neurons. In contrast to Google’s BERT models, we used SentencePiece tokenization instead of the Google’s internal WordPiece tokenization. More details can be found in README.txt. Yet more detailed description is available in https://arxiv.org/abs/2107.10042 The same models are also released at https://huggingface.co/fav-kky/FERNET-C5
|
|
Keyword:
BERT; Czech; Czech language
|
|
URL: http://hdl.handle.net/11234/1-3776
|
|
BASE
|
|
Hide details
|
|
20 |
WALS Online Resources for Czech
|
|
: Max Planck Institute for Evolutionary Anthropology, 2021
|
|
BASE
|
|
Show details
|
|
|
|