Milan Kryl

mikr

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago
StephanST/WALDO30
upvoted an article about 2 months ago
View all activity

Organizations

mikr's activity

upvoted an article about 2 months ago
view article
Article

🇨🇿 BenCzechMark - Can your LLM Understand Czech?

18
New activity in BUT-FIT/csmpt7b 9 months ago
replied to merve's post 10 months ago
view reply

That's nice! I've requested to join the group, modified my spaces to have proper decorators. So I hope I'll be confirmed :)

Reacted to merve's post with 👍 10 months ago
view post
Post
Migrated all my GPU consuming Spaces to ZERO, it was super easy to do so (add three lines of code and voila!) and the start-up time decreased dramatically as well 💜
·
Reacted to philschmid's post with 👍❤️ 10 months ago
view post
Post
What's the best way to fine-tune open LLMs in 2024? Look no further! 👀 I am excited to share “How to Fine-Tune LLMs in 2024 with Hugging Face” using the latest research techniques, including Flash Attention, Q-LoRA, OpenAI dataset formats (messages), ChatML, Packing, all built with Hugging Face TRL. 🚀

It is created for consumer-size GPUs (24GB) covering the full end-to-end lifecycle with:
💡Define and understand use cases for fine-tuning
🧑🏻‍💻 Setup of the development environment
🧮 Create and prepare dataset (OpenAI format)
🏋️‍♀️ Fine-tune LLM using TRL and the SFTTrainer
🥇 Test and evaluate the LLM
🚀 Deploy for production with TGI

👉  https://www.philschmid.de/fine-tune-llms-in-2024-with-trl

Coming soon: Advanced Guides for multi-GPU/multi-Node full fine-tuning and alignment using DPO & KTO. 🔜
·
New activity in mikr/whisper-small-ro-cv11 11 months ago