1 |
explosion/spaCy: v3.3.0: Improved speed, new trainable lemmatizer, and pipelines for Finnish, Korean and Swedish ...
|
|
|
|
BASE
|
|
Show details
|
|
2 |
explosion/spaCy: v3.3.0: Improved speed, new trainable lemmatizer, and pipelines for Finnish, Korean and Swedish ...
|
|
|
|
BASE
|
|
Show details
|
|
3 |
explosion/spaCy: v3.2.0: Registered scoring functions, Doc input, floret vectors and more ...
|
|
|
|
BASE
|
|
Show details
|
|
4 |
explosion/spaCy: v2.3.6: Bug fixes and base support for Amharic ...
|
|
|
|
BASE
|
|
Show details
|
|
5 |
explosion/spaCy: v3.0.2: CLI overrides and env variables in projects, base support for Setswana, PhraseMatcher for spans and bug fixes ...
|
|
|
|
BASE
|
|
Show details
|
|
6 |
explosion/spaCy: 3.1.1: Support for Ancient Greek and various bug fixes ...
|
|
|
|
BASE
|
|
Show details
|
|
7 |
explosion/spaCy: v2.2.2: Multiprocessing, future APIs, Luxembourgish base support & simpler GPU install ...
|
|
|
|
BASE
|
|
Show details
|
|
8 |
explosion/spaCy: v2.2.3: Tokenizer.explain, Korean base support, dependency scores per label and bug fixes ...
|
|
Montani, Ines; Honnibal, Matthew; Honnibal, Matthew; Landeghem, Sofie Van; Peters, Henning; Samsonov, Maxim; Adrianeboyd; Geovedi, Jim; Regan, Jim; Orosz, György; McCann, Paul O'Leary; Kristiansen, Søren Lind; Altinok, Duygu; , Roman; Howard, Grégory; Wannaphong Phatthiyaphaibun; Bozek, Sam; Explosion Bot; Böing, Björn; Amery, Mark; Vogelsang, Leif Uwe; Tippa, Pradeep Kumar; Jeannefukumaru; GregDubbin; Mazaev, Vadim; Balakrishnan, Ramanan; Møllerhøj, Jens Dahl; Wbwseeker; Burton, Magnus; Avadh Patel. - : Zenodo, 2019
|
|
Abstract:
✨ New features and improvements NEW: Tokenizer.explain method to see which rule or pattern was matched. tok_exp = nlp.tokenizer.explain("(don't)") assert [t[0] for t in tok_exp] == ["PREFIX", "SPECIAL-1", "SPECIAL-2", "SUFFIX"] assert [t[1] for t in tok_exp] == ["(", "do", "n't", ")"] NEW: Official Python 3.8 wheels for spaCy and its dependencies. Base language support for Korean. Add Scorer.las_per_type (labelled depdencency scores per label). Rework Chinese language initialization and tokenization Improve language data for Luxembourgish. 🔴 Bug fixes Fix issue #4573, #4645: Improve tokenizer usage docs. Fix issue #4575: Add error in debug-data if no dev docs are available. Fix issue #4582: Make as_tuples=True in Language.pipe work with multiprocessing. Fix issue #4590: Correctly call on_match in DependencyMatcher . Fix issue #4593: Build wheels for Python 3.8. Fix issue #4604: Fix realloc in Retokenizer.split . Fix issue #4656: Fix conllu2json converter when -n > 1. Fix ...
|
|
URL: https://dx.doi.org/10.5281/zenodo.3550036 https://zenodo.org/record/3550036
|
|
BASE
|
|
Hide details
|
|
|
|