Alysson Guimarães

k3ybladewielder

AI & ML interests

NLP, Commonsense Reasoning

Recent Activity

liked a model about 5 hours ago
google/gemma-3-1b-it
liked a model about 5 hours ago
unsloth/gemma-3-1b-pt-bnb-4bit
liked a model about 5 hours ago
google/gemma-3-1b-pt

Organizations

None yet

k3ybladewielder's activity

New activity in google/gemma-7b about 8 hours ago

Error

13
#31 opened about 1 year ago by
trungnd7112004
reacted to clem's post with 🔥 4 days ago
Post
7014
I was chatting with @peakji, one of the cofounders of Manus AI, who told me he was on Hugging Face (very cool!).

He shared an interesting insight: agentic capabilities might be more of an alignment problem than a foundational capability issue. Much like the difference between GPT-3 and InstructGPT, some open-source foundation models are simply trained to 'answer everything in one response regardless of the complexity of the question'; after all, that's the user preference in chatbot use cases. Just a bit of post-training on agentic trajectories can make an immediate and dramatic difference.

As a thank-you to the community, he shared 100 invite codes, first come, first served: just use “HUGGINGFACE” to get access!
upvoted an article 5 days ago
Article

Vision Language Models Explained

287
upvoted 2 articles about 1 month ago
Article

Open-R1: a fully open reproduction of DeepSeek-R1

808
Article

Open-source DeepResearch – Freeing our search agents

1.16k