Alessandro Ercolani
PRO
giux78
67 followers · 37 following
https://alessandroercolani.webflow.io/
AI & ML interests
NLP, Reinforcement Learning, Semantics, Computational Neuroscience
Recent Activity
liked a dataset about 5 hours ago: togethercomputer/CoderForge-Preview
reacted to their post with 🔥 2 days ago
Together with @mferraretto and @efederici we released #Nesso-4B, a new model specialized for agentic workflows. https://huggingface.co/mii-llm/nesso-4B

#Nesso-4B is a fine-tuned version of Qwen-4B, trained on a highly curated and balanced dataset designed specifically for multilingual agentic workflows and conversational use cases. As shown in the video below, we simulate the new “cowork” from #Anthropic without any data sharing, all running on a consumer device. The model can be used to build agentic behavior in #privateAI environments. Not every problem requires super intelligence: in many cases, intelligence at the edge is more than enough.

#Nesso4B #AgenticAI #PrivateAI #EdgeAI #OnDeviceAI
reacted to robtacconelli's post with 🚀 2 days ago
🏆 Nacrith: a 135M model that out-compresses everything on natural language

What if a tiny LM could compress English text better than _every_ compressor out there, classical or neural, small or large? Nacrith pairs SmolLM2-135M with an ensemble of online predictors and high-precision arithmetic coding.

What's inside

The standard LLM+arithmetic coding approach wastes ~75% of CDF precision on large vocabularies. Our CDF-24 fix alone recovers 0.5 bpb. On top: a token N-gram that skips the GPU on predictable tokens, an adaptive bias head, llama.cpp backend (7× faster than PyTorch), multi-GPU parallel compression, and a binary file format (NC06), the first LLM-based binary compressor we know of.

Runs on a GTX 1050 Ti. ~500 MB weights, ~1.2 GB VRAM per worker.

💻 Code: https://github.com/robtacconelli/Nacrith-GPU
⭐ Space: https://huggingface.co/spaces/robtacconelli/Nacrith-GPU
📄 Paper: https://huggingface.co/papers/2602.19626

Try it, break it, share your results: all feedback welcome. ⭐ on the repo appreciated!

Results across all systems we tested:
- alice29.txt → 0.918 bpb (−44% vs CMIX, −20% vs ts_zip), below the 2nd-order Shannon entropy bound
- enwik8 (100 MB) → 0.9389 bpb (−8% vs FineZip/LLMZip's 8B model, −15% vs ts_zip)
- Unseen text → 0.723 bpb on a doc published after training cutoff: no memorization, 26% better than FineZip/LLMZip on the same model

SmolLM2-135M by https://huggingface.co/HuggingFaceTB
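For readers unfamiliar with the bpb (bits per byte) figures quoted in the post, here is a minimal sketch of how that metric relates to a language model's token probabilities. It is not taken from the Nacrith code: the function name `bits_per_byte` and the probabilities are hypothetical, and it computes only the ideal code length that an arithmetic coder approaches, ignoring any coder or file-format overhead.

```python
import math


def bits_per_byte(token_probs, text):
    """Ideal compressed size in bits divided by the UTF-8 size in bytes.

    token_probs: probability the model assigned to each actual token
                 (hypothetical values here, not output from a real model).
    text:        the original string those tokens encode.
    """
    # An ideal entropy coder spends -log2(p) bits on a symbol of probability p.
    total_bits = -sum(math.log2(p) for p in token_probs)
    n_bytes = len(text.encode("utf-8"))
    return total_bits / n_bytes


# Made-up example: three tokens covering an 11-byte string.
text = "hello world"
probs = [0.5, 0.25, 0.125]  # costs 1 + 2 + 3 = 6 bits in total
bpb = bits_per_byte(probs, text)
print(f"{bpb:.4f} bits per byte")
```

Lower is better: uncompressed text sits at 8 bpb, so a result below 1 bpb corresponds to a file less than one eighth of its original size.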
giux78's models (52), sorted by recently updated
giux78/nesso-350M-sft-v0.4-Q8_0-GGUF • 0.6B • Updated Jan 10 • 12
giux78/open-zagreus-350M-sft-Q8_0-GGUF • 0.4B • Updated Nov 28, 2025 • 3
giux78/zagreus-350M-sft-all-union-fixed • Text Generation • 0.4B • Updated Oct 23, 2025 • 2
giux78/open-zagreus-350M-sft • Text Generation • 0.4B • Updated Oct 16, 2025 • 1
giux78/zagreus-test-202000-sft-15 • Text Generation • 0.4B • Updated Oct 14, 2025 • 1
giux78/zagreus-test-202000-sft-14 • Text Generation • 0.4B • Updated Oct 14, 2025 • 1
giux78/zagreus-test-202000-sft-13 • Text Generation • 0.4B • Updated Oct 11, 2025 • 1
giux78/zagreus-test-202000-sft-12 • Text Generation • 0.4B • Updated Oct 10, 2025 • 4
giux78/zagreus-test-202000-sft-11 • Text Generation • 0.4B • Updated Oct 9, 2025 • 1
giux78/zagreus-test-202000-sft-10 • Text Generation • 0.4B • Updated Oct 8, 2025 • 2
giux78/zagreus-test-202000-sft-8 • Text Generation • 0.4B • Updated Oct 8, 2025 • 2
giux78/zagreus-test-202000-sft-7 • Text Generation • 0.4B • Updated Oct 6, 2025 • 2
giux78/zagreus-test-202000-sft-6 • Text Generation • 0.4B • Updated Oct 4, 2025 • 2
giux78/zagreus-test-202000-sft-5 • Text Generation • 0.4B • Updated Oct 2, 2025 • 2
giux78/zagreus-test-202000-sft-4 • Text Generation • 0.4B • Updated Oct 2, 2025 • 2
giux78/zagreus-test-202000-sft-3 • Text Generation • 0.4B • Updated Oct 2, 2025 • 1
giux78/zagreus-test-202000-sft-2 • Text Generation • 0.4B • Updated Oct 1, 2025 • 2
giux78/zagreus-test-202000-sft • Text Generation • 0.4B • Updated Sep 29, 2025 • 1
giux78/zagreus-test-302000 • 0.4B • Updated Sep 27, 2025
giux78/zagreus-test-220000 • 0.4B • Updated Sep 26, 2025
giux78/zagreus-test-202000 • 0.4B • Updated Sep 26, 2025
giux78/zagreus-test-184000 • 0.4B • Updated Sep 26, 2025
giux78/zagreus-test-162000 • 0.4B • Updated Sep 26, 2025
giux78/zagreus-test-186000 • 0.4B • Updated Sep 24, 2025
giux78/zagreus-test-224000 • 0.4B • Updated Sep 22, 2025
giux78/zagreus-test-140000 • 0.4B • Updated Sep 21, 2025
giux78/zagreus-test-70000 • 0.4B • Updated Sep 20, 2025
giux78/pre-bgpt-v.0.1 • Text Generation • 0.3B • Updated Aug 31, 2025 • 2
giux78/test_236000 • 2B • Updated Aug 20, 2025
giux78/test_216000 • 2B • Updated Aug 17, 2025