Commit History

Cache label data along with tokenized text data
af84d9b

Tymec committed on

Fix broken tokenization
447f97e

Tymec committed on

Add more vectorizers, classifiers and CLI options
b0ade1a

Tymec committed on

Chunked serialization
afaacd1

Tymec committed on

Update options, force GC, tweak parameters and add flags
18cc46a

Tymec committed on

Ability to change number of parallel jobs for search
8471e78

Tymec committed on

Create model in train_model
3854a1f

Tymec committed on

Tokenization rework
2c1f9dd

Tymec committed on

Change HF entry point and add examples
b42b884

Tymec committed on

Add evaluate command
cdf1241

Tymec committed on

Add cross validation
5a2db0a

Tymec committed on

Use stopwords from NLTK and download NLTK data
204391c

Tymec committed on

Fix merge
0993d5e

Tymec committed on

Completely change the structure of the project
85ac990

Tymec committed on