Joseph Pollack

Tonic

AI & ML interests

๐Ÿค–Making robots to help people learn things quicker ๐Ÿ‘ฉ๐Ÿปโ€๐Ÿš€๐Ÿš€

Articles

Organizations

Tonic's activity

replied to m-ric's post about 8 hours ago
view reply

the Math one is absolutely incredible , the demo is great :-)

replied to nebazi12's post 1 day ago
replied to Wauplin's post 1 day ago
view reply

thanks for the large_folder_upload it was not necessary in general , but i, as well as the public, kinda needed to be spoon fed , and for this i thank you ๐Ÿค—

posted an update 2 days ago
replied to their post 2 days ago
view reply

... and BIG THANKS for the cool PR on friday night ;-)

replied to their post 2 days ago
view reply

examples welcome if you have a cool one to show off for folks ;-)

replied to jeffboudier's post 2 days ago
view reply

this is AWESOME ! congrats on a cool release and an amazing collaboration ๐Ÿš€

posted an update 3 days ago
posted an update 8 days ago
view post
Post
1083
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ hey there folks ,

made an image similarity demo to test out the mistral-community/pixtral-12b-240910 model .

If anyone knows how to generate captions with it , please do let me know x ๐Ÿš€

here's the demo : Tonic/Pixtral

hope you like it ๐Ÿค—
posted an update 9 days ago
view post
Post
2636
So awesome , now i can deploy a jupyterlab on huggingface and deploy gradio from the jupyterlab
posted an update 14 days ago
posted an update 17 days ago
view post
Post
2513
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธhey there folks ,

โœ’๏ธInkubaLM has been trained from scratch using 1.9 billion tokens of data for five African languages, along with English and French data, totaling 2.4 billion tokens of data. It is capable of understanding and generating content in five African languages: Swahili, Yoruba, Hausa, isiZulu, and isiXhosa, as well as English and French.

model lelapa/InkubaLM-0.4B
demo Tonic/Inkuba-0.4B
posted an update 19 days ago
replied to takeraparterer's post about 2 months ago
view reply

lightning.ai
huggingface.co

many others have jupyterlab available and can scale ... for a price.

hope this helps !

replied to Ameeeee's post about 2 months ago
view reply

May i ask : has the huggingface image been updated accordingly ?

replied to merve's post about 2 months ago
view reply

your posts and demos are always sooooo cool

replied to vilarin's post about 2 months ago
view reply

I really enjoy it , but it's just been released and we've not really seen the full extent of how cool it is yet i think .

With their text to video model announced, i'm really excited about the future

image (30).webp
image (29).webp
image-_28_.png
image-_27_.png
image-_26_.png
image-_25_.png
image-_24_.png

posted an update about 2 months ago
view post
Post
731
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธHey there folks ,

I found this cool (new?) thing by Docker called Testcontainers , and there's an @ollama object that you can use to programmatically serve ephemeral containers and LLMs.

I made a post about it here : https://huggingface.co/blog/Tonic/localai-testcontainers

It's really useful, powerful and fun !

Demo coming soon ๐Ÿค—
posted an update about 2 months ago
view post
Post
1713
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ Hey there folks

made a demo for Nvidia Minitron on an A100.

Minitron is a family of small language models (SLMs) obtained by pruning NVIDIA's Nemotron-4 15B model. We prune model embedding size, attention heads, and MLP intermediate dimension, following which, we perform continued training with distillation to arrive at the final models.

Deriving the Minitron 8B and 4B models from the base 15B model using our approach requires up to 40x fewer training tokens per model compared to training from scratch; this results in compute cost savings of 1.8x for training the full model family (15B, 8B, and 4B). Minitron models exhibit up to a 16% improvement in MMLU scores compared to training from scratch, perform comparably to other community models such as Mistral 7B, Gemma 7B and Llama-3 8B, and outperform state-of-the-art compression techniques from the literature. Please refer to our arXiv paper for more details.

Minitron models are for research and development only.

source : nvidia/Minitron-8B-Base
demo : Tonic/Minitron
  • 1 reply
ยท
posted an update 2 months ago
replied to dvilasuero's post 3 months ago
view reply

i'm very happy for the huggingface team , it really makes sense to get closer together and not only for data ;-)

posted an update 3 months ago
view post
Post
2455
appreciation post for @osanseviero + huggingface staff ( @reach-vb , @merve , many others many many others) , that fight hard for many weeks / months to fix the releases in many organisations to make it easier for us to test out so many things ... ๐Ÿค—๐Ÿค—๐Ÿค— thanks for that folks !
  • 1 reply
ยท
posted an update 4 months ago
view post
Post
1938
my ๐Ÿค—huggingface activity for 2024 so far ...

dont tell my boss...

check yours too now, it's fun ๐Ÿค—
ยท
replied to lunarflu's post 4 months ago
view reply

love the link github/git idea (not sure if it's actually possible)

replied to MonsterMMORPG's post 4 months ago
view reply

it would be so much nicer if we kept posts about open source, machine learning and releases ... not like promotions and stuff... just my perspective on what kind of technical content i prefer to see on hugginface.

replied to qq8933's post 4 months ago
view reply

i was visting your v1 webapp this week , i cant wait for v2 now !

posted an update 4 months ago
view post
Post
854
all these GPU bourgeois tryna act cool like the GPU poor kids...
- what's your number for real ?
+ and did it work at parties for you ?
replied to lunarflu's post 4 months ago
view reply

it's not a bad idea, would be nice to have a bridge with git based on verified email, but i guess you know that already.

would be nice to track datasets and models more than demos . something like a design that's not an exact copy of micro$$oftgithub would be nice... but i dont have a solution for you...

my main ask about the hub is better control over notifications , that would be tremendously useful...

replied to abidlabs's post 4 months ago
view reply

interesting ! i'm quite curious ... i was already struggling to keep the pace with gradio v4+ and now i'm looking forward to v.1 once again , meanwhile i really really think there depth and breadth of gradio deserves a comprehensive tutorial/course , and no the docs (they are fine, they are good) are not enough about it ... not a criticism , just a request from a big fan :-)

posted an update 4 months ago
view post
Post
1020
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ Hey there folks ,

@tiiuae released Falcon 11B Vision Model !

๐Ÿฆ…๐Ÿฆ…๐Ÿ‘€๐Ÿ‘€

it's quite good , and you can try it here : Tonic/Falcon-Vision
replied to pangjh3's post 4 months ago
view reply

this is actually amazing + very cool/interesting i'm very happy i found the paper and the models.

congratulations on the tencent collaboration , i'm looking forward to the future.

posted an update 4 months ago
view post
Post
1242
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธhey there folks !

i got an email my space was down so now my space is back up ! StarCoder2 (Raw) on A100 , for your enjoyment and apache2 research purposes ๐Ÿ‘๐Ÿป๐Ÿ‘๐Ÿป

Tonic/starcoder2

check my profile for more cool GPUzero demos, i'll cycle them with some new overlooked models soon ๐Ÿ’–๐Ÿค—
replied to xianbao's post 5 months ago
replied to Locutusque's post 5 months ago
replied to Sentdex's post 5 months ago
view reply

i've had "best" results mushing everything into a single context window with a single "final"/"next" answer , i think i remember @teknium saying they often do that and they may have published that research , but i cant speak for them, i just remember them saying that and feeling validated :-)

posted an update 6 months ago
view post
Post
there were only 5 Major Releases last week !

๐Ÿ˜ฑ it's so over
  • 1 reply
ยท
posted an update 6 months ago
view post
Post
LAST CHANCE TO TRY ๐ŸŒŸSTARCODER2

After today it's gone !

actually not - just joking ! it's <3 open source !

just trying to get folks' attention to my featured "Spaces of the Week" :

Tonic/starcoder2

drop a like for your boy and join us next week for making fine tunes !
posted an update 6 months ago
view post
Post
Last day on Spaces of the Week ,
and we made it to last place on trending.
i really thought it couldnt get any better, but i'm crying ! ๐Ÿ˜ญ

The thing i like the most about ZeroGPU , import spaces , is that i dont have to always check to see if someone decided to test if i have hard character limits , and it reloads the application flawlessly .

drop a like on my spaces here :
Spaces of the Week : https://huggingface.co/spaces/tonic/starcoder2
9 other ZeroGPU demos : https://huggingface.co/tonic
posted an update 7 months ago
view post
Post
hey there folks new Yi model just came out , and i had a gradio interface ready since their last releases.

it's just a base model but you can check it out here : Tonic/Yi-9B

cant wait to fine tune this one ๐Ÿค—๐Ÿš€
posted an update 7 months ago
view post
Post
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธ hey there folks ,

Star coder came out and it's really fascinating in more ways than one !

first off it codes well already. but secondly it's reported to "know" 101 programming languages !

that actually means it's ripe for fine tunes, so if you're like me you've been bookmarking cool datasets and cant wait to get started !

that said , here's a cool demo where you can try it out now : Tonic/starcoder2

turns out it can program a T5 demo using gradio !
posted an update 7 months ago
view post
Post
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธhey there folks ,

๐Ÿค—Aya has been released ! It's an absolutely massive undertaking to create a huge multilingual dataset and multilingual model of very high quality.

Papers :
https://cohere.com/research/papers/aya-dataset-paper-2024-02-13
https://cohere.com/research/papers/aya-model-paper-2024-02-13

Model : CohereForAI/aya-101
Dataset : CohereForAI/aya_dataset


I am proud to be one of 3,000 humans who built Aya - a new massively multilingual, generative LLM that outperforms existing open-source models and covers 101 different languages. Together, we are accelerating multilingual AI. ๐Ÿค—
  • 1 reply
ยท
posted an update 8 months ago
replied to their post 8 months ago
replied to their post 8 months ago
view reply

hey thanks for pointing that out, there's so much to organise and build for this, help is really welcome if you like the subject :-)

posted an update 8 months ago
view post
Post
๐Ÿ‘‹hi there folks !

check out this Chest X-Ray model from AIMIStanford : Tonic/CheXRay

thanks to @lunarflu for kicking me a bit to get the examples in there !

would be great to get even more examples and even more downstream functions , so contributions are very welcome, or if you have a dataset source, please do share it in the discussions !

posted an update 8 months ago
view post
Post
hey there folks , work in progress, but basically celebrating the release of whisperspeech

just :

pip install whisperspeech

to get started and check out my demo to do multilingual text to speech including making voice prints using whisperspeechreverse engineering of whisper here : Tonic/whisperspeech
and the model card here : https://huggingface.co/collabora/whisperspeech

i met collabora on LAION check out LAION here :
https://huggingface.co/laion
  • 1 reply
ยท
posted an update 8 months ago
view post
Post
๐Ÿ‘‹ Hi there folks,

I launched my first competition !

Goal : Use AI to beat the Math Olympics within the set time

Basically we're looking for adventurous teams and individuals to make a common submission to the AI Math Olympics by the MLCommons.

Althought the ultimately there can only be one winner and there must always be a winner, the ultimate goal is to get together for a common solution.

check it out here :
Tonic1/mathathon
ยท
replied to KnutJaegersberg's post 8 months ago
replied to gsarti's post 8 months ago
replied to dhuynh95's post 8 months ago
view reply

i'm a fan of this community project : to train sector-specific 32K-context BERT embedding models ๐Ÿค—

replied to abhishek's post 8 months ago
replied to abhishek's post 8 months ago
view reply

cant wait to participate in yours and host some of mine :-)

posted an update 8 months ago
view post
Post
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธhey there folks,

Everyone's๐Ÿ—ฃ๏ธtalking about microsoft's new e5mistral embeddings model

๐Ÿค”๐Ÿค” but did you actually try it yet ?

Well , now you can, just check it out. it's a new way to serve and create embeddings.

try it hosted on GPUZero : Tonic/e5
or served on an A10G : Tonic/e5

you get best results actually building with it though, so use it in your app !

Our demo is coming soon too so let's work together if you want :-)
  • 1 reply
ยท
replied to their post 8 months ago
view reply

our target is to pursue LowRes animated waifus and husbandos and be the leading frontrunners of anime related content ( โ€ขฬ€ ฯ‰ โ€ขฬ )y

So right now they're gathering cool datasets, soon we'll make and serve some LORAs, then we'll build with these for a little bit more interesting and simple anime applications , actually we already started at least that part :-)

you should check it out, it's a wild and massive community of i think of a quarter million folks on facebook - if you add up all the parts.

actually i should tag @not-lain because i basically take directions from him, perhaps he can say more :-)

posted an update 8 months ago
view post
Post
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธHey there folks ,

i wanted to share with you a really cool new organisation called

https://huggingface.co/lowres

In just one week it has gathered almost 150 members !

Check them out if you love anime , SDLX, LORAs and cool datasets.

can we make this one reach 200 members? ๐Ÿš€
ยท
replied to victor's post 8 months ago
posted an update 8 months ago
view post
Post
๐Ÿคฆ๐Ÿปโ€โ™‚๏ธwell, day before yesterday i was so happy about **gpuzero** that i made a bunch of demos : https://huggingface.co/posts/Tonic/802671427380916

- one for YI-200K , but it actually doesnt quite fit on a GPUZero... U_U
- one for SDXL style align, but omg i didnt even realize at the time it wasnt my demo of it (lol)
- one for texify (which works great btw, keep an eye on texify, it's about to blow up... in a couple of months!)

so yeah, i ran back and tried to get my demos working at least for sdxl which i love , but i simply couldnt get the CPU stuff working, or the refactored code working. no wonder i was thinking "wow this is so easy" on @osanseviero 's demo : yeah , it's not my code that's why it works ๐Ÿ˜…๐Ÿ™๐Ÿป

anyway spent the day unsuccessfully experimenting, but starting tomorrow i'll try to serve some cool and overlooked models so ๐Ÿค—huggingface appreciators can try them out ๐Ÿš€
  • 1 reply
ยท
posted an update 8 months ago
view post
Post
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธhey there folks , ๐ŸŒŸTonic here
- just a ๐Ÿ› ๏ธbuilder from ๐Ÿ—ผParis !
Everyone is making something special for their first post , so since i got access to **GPUZero** , well, my first post is about **GPUZero**

### GPUZero is here !

This one's great for builders like me that are often making and serving models to their community.
- demos get popular then fade away
- they retain interest over the next three months as folks have questions

**GPUZero** lets you serve demos to your community over time while optimizing for costs .
Believe it or not it's actually impossible to pay for everything over a whole month if you have even one GPU running at a time.
I'm so excited for this because it lets me serve a complete stack of specialized models and to build with them too.
- all optimized for efficiency in dollar cost.

check out some demos that are available on GPUZero :
- Tonic/marker-texify : this one is the first one i made it's for an image to latex formula model.
- https://huggingface.co/spaces/Tonic/YI-6B-200k : this one probably actually works better on GPUZero than on a standard A10, but dont take my word for it , try it out ๐Ÿค—
- https://huggingface.co/spaces/Tonic/style-aligned_sdxl : this one was my greatest technical achievement, check the dates and times on it too, there's a backstory to this one so i'll maybe tell it in another post
  • 1 reply
ยท
replied to dvilasuero's post 9 months ago
view reply

i love the "posting" from arguilla , what a fantastic way to share ๐Ÿค—