madmaxbr5's activity
Hi Victor, the playground is working, but it appears to be only for LLMs. I need to compare embedding models. What I want is to select a task (clustering, for example), provide my own dataset, and then automatically find and run a batch of models against that task so I can compare the results. Ideally I could find the top 50 models, see how they perform, and then narrow them down to a few to study in more detail.
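To make that concrete, here is a minimal sketch of the comparison loop I have in mind, assuming sentence-transformers and scikit-learn; the texts, labels, and candidate model list below are placeholder examples:

```python
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans
from sklearn.metrics import adjusted_rand_score

# Placeholder dataset: in practice this would be my own domain data.
texts = [
    "the cat sat on the mat", "dogs are loyal pets",
    "stock prices fell sharply", "the market rallied today",
]
labels = [0, 0, 1, 1]  # known cluster assignments, used for scoring

# Placeholder candidates: ideally the hub would supply the top ~50 for the task.
candidates = [
    "sentence-transformers/all-MiniLM-L6-v2",
    "BAAI/bge-small-en-v1.5",
]

results = {}
for name in candidates:
    model = SentenceTransformer(name)
    embeddings = model.encode(texts)
    predicted = KMeans(n_clusters=len(set(labels)), n_init=10).fit_predict(embeddings)
    # Adjusted Rand Index: 1.0 means the clustering exactly recovers my labels.
    results[name] = adjusted_rand_score(labels, predicted)

# Rank candidates by how well their embeddings support the clustering task.
for name, score in sorted(results.items(), key=lambda kv: -kv[1]):
    print(f"{score:.3f}  {name}")
```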
Here's the problem. This is the current task list on HF:
And then within a single task, classification for example, there are 68,000+ models:
So finding the right subset to start with is a huge barrier to actually using these models in practice. The best tool we have is the MTEB leaderboard, which, while it gives some idea of relative performance, may not match the specific task or domain one is trying to accomplish at all. Right now, one must go to the leaderboard, pick a model that looks promising, try it, then pick another and try it, and so on. It's a very manual process, and you never know whether yet another model might be better.
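For reference, the closest workaround today is scripting the discovery step yourself. A sketch assuming huggingface_hub's `list_models`, where "sentence-similarity" is my guess at the relevant task filter:

```python
from huggingface_hub import list_models

# Narrow the tens of thousands of models down to one task's candidates,
# using download count as a crude proxy for quality.
candidates = list_models(
    filter="sentence-similarity",  # task tag; assumed to be the right one here
    sort="downloads",
    direction=-1,                  # descending
    limit=50,                      # the "top 50" I'd want to screen
)
for m in candidates:
    print(m.id)
```

This only gives popularity, though, not performance on my task, which is exactly the gap.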
Can you try https://hf.co/playground and tell me if this helps for your use case?
I get a 403 error saying I need to be a "hugging face member." However, I'm logged in and have an active billing method, so I'm not sure what the issue is.
I want to create a pipeline signature and provide a few examples, then have the hub test a batch of models against that scenario and pick the best ones for me. For example, say I want to build a username-matching function. I should be able to provide some example positive and negative matches, expand those examples into a small eval dataset using one-click synthetic data expansion, and then have the hub try hundreds of models against that eval dataset and surface the top performers in several parameter size classes.
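A minimal sketch of the eval loop that idea implies, assuming sentence-transformers; the seed pairs, candidate models, and the 0.5 threshold are hypothetical, and the one-click synthetic expansion step is elided:

```python
from sentence_transformers import SentenceTransformer, util

# Hand-written seed examples: (username_a, username_b, should_match).
# In the imagined workflow, these would be expanded into a larger eval set.
pairs = [
    ("j.smith42", "jsmith_42", True),
    ("john.smith", "jonsmith", True),
    ("j.smith42", "alice_w", False),
    ("dev_mary", "mary.dev", True),
    ("dev_mary", "bob1990", False),
]

# Placeholder candidates: ideally hundreds, grouped by parameter size class.
candidates = [
    "sentence-transformers/all-MiniLM-L6-v2",
    "BAAI/bge-small-en-v1.5",
]

for name in candidates:
    model = SentenceTransformer(name)
    correct = 0
    for a, b, should_match in pairs:
        emb_a, emb_b = model.encode([a, b])
        similarity = util.cos_sim(emb_a, emb_b).item()
        # 0.5 is an arbitrary cutoff; a real eval would sweep the threshold.
        if (similarity >= 0.5) == should_match:
            correct += 1
    print(f"{name}: {correct}/{len(pairs)} pairs correct")
```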