Alberto Cetoli PRO

fractalego

AI & ML interests

Entity/relation extraction, Q&A, Summarisation

Organizations

Blog-explorers, Hugging Face Discord Community, open/ acc

fractalego's activity

upvoted an article 3 days ago

Visualize and understand GPU memory in PyTorch

162
reacted to mitkox's post with 🤯🔥 10 days ago
2394
Can it run DeepSeek V3 671B is the new 'can it run Doom'.

How minimalistic can I go with on-device AI and behemoth models? Here I'm running DeepSeek V3 MoE on a single A6000 GPU.

Not great, not terrible, for this minimalistic setup. I love the Mixture of Experts architectures. Typically I'm running my core LLM distributed over the 4 GPUs.

Make sure you own your AI. AI in the cloud is not aligned with you; it's aligned with the company that owns it.
reacted to julien-c's post with 🔥 about 2 months ago
2623
wow 😮

INTELLECT-1 is the first collaboratively trained 10 billion parameter language model trained from scratch on 1 trillion tokens of English text and code.

PrimeIntellect/INTELLECT-1-Instruct
reacted to merve's post with ❤️ about 2 months ago
3165
your hugging face profile now has your recent activities 🤗