Joseph

Joseph717171

AI & ML interests

None yet

Recent Activity

Organizations

Joseph717171's activity

Reacted to fdaudens's post with πŸ‘πŸ‘€ 2 days ago
view post
Post
1242
πŸ€– 93% of Gen Z workers use AI tools weekly, but nearly half of all workers aren't comfortable admitting it. The tech adoption gap isn't about usageβ€”it's about openness. Why are we still treating AI tools like a workplace secret? πŸ€”

See this article: https://www.axios.com/2024/11/25/gen-z-ai-work-survey
  • 1 reply
Β·
Reacted to loubnabnl's post with πŸ€—πŸ”₯ 2 days ago
view post
Post
1236
Making SmolLM2 reproducible: open-sourcing our training & evaluation toolkit πŸ› οΈ https://github.com/huggingface/smollm/

- Pre-training code with nanotron
- Evaluation suite with lighteval
- Synthetic data generation using distilabel (powers our new SFT dataset HuggingFaceTB/smoltalk)
- Post-training scripts with TRL & the alignment handbook
- On-device tools with llama.cpp for summarization, rewriting & agents

Apache 2.0 licensed. V2 pre-training data mix coming soon!

Which other tools should we add next?
Reacted to TuringsSolutions's post with πŸ‘€ 5 days ago
view post
Post
800
I created something called 'Hyperbolic Embeddings'. I literally just embed the tokens into Hyperbolic Space instead of Euclidean space. At first, this did not get me the gains I was expecting. I was a sad panda. Then I thought about it, a Hyperbolic Embedding needs a Hyperbolic Optimizer. So, instead of Adam, I used Riemannian Adam (RAdam). "Ladies and Gentlemen, We Got 'Em!"
  • 27 replies
Β·
Reacted to davidberenstein1957's post with πŸ€—β€οΈπŸš€πŸ”₯πŸ‘€ 5 days ago
view post
Post
1014
πŸ€—πŸ”­ Introducing Observers: A Lightweight SDK for AI Observability πŸ”­πŸ€—

Observers is an open-source Python SDK that provides comprehensive observability for AI applications. Our library makes it easy to:

- Track and record interactions with AI models
- Store observations in multiple backends
- Query and analyse your AI interactions with ease

https://huggingface.co/blog/davidberenstein1957/observers-a-lightweight-sdk-for-ai-observability
Reacted to cfahlgren1's post with πŸ”₯πŸ‘€ 7 days ago
view post
Post
838
You can create charts, leaderboards, and filters on top of any Hugging Face dataset in less than a minute

β€’ ASCII Bar Charts πŸ“Š
β€’ Powered by DuckDB WASM ⚑
β€’ Download results to Parquet πŸ’½
β€’ Embed and Share results with friends πŸ“¬

Do you have any interesting queries?
Reacted to jsulz's post with 🧠 7 days ago
view post
Post
2854
When the XetHub crew joined Hugging Face this fall, @erinys and I started brainstorming how to share our work to replace Git LFS on the Hub. Uploading and downloading large models and datasets takes precious time. That’s where our chunk-based approach comes in.

Instead of versioning files (like Git and Git LFS), we version variable-sized chunks of data. For the Hugging Face community, this means:

⏩ Only upload the chunks that changed.
πŸš€ Download just the updates, not the whole file.
🧠 We store your file as deduplicated chunks

In our benchmarks, we found that using CDC to store iterative model and dataset version led to transfer speedups of ~2x, but this isn’t just a performance boost. It’s a rethinking of how we manage models and datasets on the Hub.

We're planning on our new storage backend to the Hub in early 2025 - check out our blog to dive deeper, and let us know: how could this improve your workflows?

https://huggingface.co/blog/from-files-to-chunks