1 5 24

Thomas Renkert

trenkert

AI & ML interests

None yet

Recent Activity

liked a model 19 days ago

BAAI/bge-reranker-v2-m3

updated a model about 2 months ago

trenkert/hilt_llama3_notteik

updated a model about 2 months ago

trenkert/hilt_llama3

View all activity

Organizations

trenkert's activity

liked a model 19 days ago

BAAI/bge-reranker-v2-m3

Text Classification • Updated Jun 24 • 677k • 393

updated 2 models about 2 months ago

trenkert/hilt_llama3_notteik

Updated Sep 27 • 4

trenkert/hilt_llama3

Updated Sep 26

updated 2 models 2 months ago

trenkert/hilt_phi3_merge

Updated Sep 25 • 1

trenkert/hilt_phi3

Updated Sep 25

liked a Space 3 months ago

Running

🧭

Base Model Explorer

liked a model 3 months ago

numind/NuExtract

Text Generation • Updated Oct 17 • 2.05k • 207

updated a dataset 4 months ago

trenkert/testchatml2

Updated Jul 31 • 4

liked a dataset 5 months ago

stefan-it/HisGermaNER

Preview • Updated Mar 28 • 364 • 2

liked a model 6 months ago

mistralai/Mixtral-8x22B-Instruct-v0.1

Text Generation • Updated Oct 3 • 140k • 690

Reacted to tomaarsen's post with 👍 6 months ago

Post

1940

‼️Sentence Transformers v3.0 is out! You can now train and finetune embedding models with multi-GPU training, bf16 support, loss logging, callbacks & much more. I also release 50+ datasets to train on.

1️⃣ Training Refactor
Embedding models can now be trained using an extensive trainer with a lot of powerful features:
- MultiGPU Training (Data Parallelism (DP) and Distributed Data Parallelism (DDP))
- bf16 training support; loss logging
- Evaluation datasets + evaluation loss
- Improved callback support + an excellent Weights & Biases integration
- Gradient checkpointing, gradient accumulation
- Improved model card generation
- Resuming from a training checkpoint without performance loss
- Hyperparameter Optimization
and much more!
Read my detailed blogpost to learn about the components that make up this new training approach: https://huggingface.co/blog/train-sentence-transformers

2️⃣ Similarity Score
Not sure how to compare embeddings? Don't worry, you can now use model.similarity(embeddings1, embeddings2) and you'll get your similarity scores immediately. Model authors can specify their desired similarity score, so you don't have to worry about it anymore!

3️⃣ Additional Kwargs
Sentence Transformers relies on various Transformers instances (AutoModel, AutoTokenizer, AutoConfig), but it was hard to provide valuable keyword arguments to these (like 'torch_dtype=torch.bfloat16' to load a model a lower precision for 2x inference speedup). This is now easy!

4️⃣ Hyperparameter Optimization
Sentence Transformers now ships with HPO, allowing you to effectively choose your hyperparameters for your data and task.

5️⃣ Dataset Release
To help you out with finetuning models, I've released 50+ ready-to-go datasets that can be used with training or finetuning embedding models: sentence-transformers/embedding-model-datasets-6644d7a3673a511914aa7552

Full release notes: https://github.com/UKPLab/sentence-transformers/releases/tag/v3.0.0