[FEEDBACK] Inference Providers

#49
by julien-c HF staff - opened
Hugging Face org

Any inference provider you love, and that you'd like to be able to access directly from the Hub?

Hugging Face org
•
edited 2 days ago

Love that I can call DeepSeek R1 directly from the Hub 🔥

from huggingface_hub import InferenceClient

client = InferenceClient(
    provider="together",
    api_key="xxxxxxxxxxxxxxxxxxxxxxxx"
)

messages = [
    {
        "role": "user",
        "content": "What is the capital of France?"
    }
]

completion = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-R1", 
    messages=messages, 
    max_tokens=500
)

print(completion.choices[0].message)
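For readers new to the client: the completion follows the OpenAI-compatible chat schema, so the assistant's text sits at `choices[0].message.content`. A minimal sketch of that shape using a hypothetical payload (a real response also carries extra fields such as `id`, `model`, and `usage`):

```python
import json

# Hypothetical response payload in the OpenAI-compatible shape;
# the actual completion object exposes the same structure as attributes.
raw = json.dumps({
    "choices": [
        {"message": {"role": "assistant",
                     "content": "The capital of France is Paris."}}
    ]
})

payload = json.loads(raw)
# Pull out just the assistant's text rather than printing the whole message object.
answer = payload["choices"][0]["message"]["content"]
print(answer)
```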

Is it possible to set a monthly payment budget or rate limits for all the external providers? I don't see such options in the billing tab. In case a key or session token is stolen, it can be quite dangerous to my thin wallet :(

Hugging Face org

@benhaotang you already get spending notifications when crossing important thresholds ($10, $100, $1,000), but we'll add spending limits in the future

Thanks for your quick reply, good to know!

Would be great if you could add Nebius AI Studio to the list :) New inference provider on the market, with the absolute cheapest prices and the highest rate limits...

Could be good to add featherless.ai

TitanML !!

OpenRouter!

Hi everyone, and first of all, thank you to the Hugging Face team for releasing this feature.
One more suggestion:
give users more granular parameters and let them share a deployment configuration file (e.g. a Terraform HCL configuration), so they can define a more optimal infrastructure for inference and choose the inference model themselves.

requesty.ai
groq

TypeError: InferenceClient.__init__() got an unexpected keyword argument 'provider'

Add RunPod as an inference provider!

Hyperbolic!

Runpod!

Hugging Face org

@llamameta please, make sure you use the latest version of huggingface_hub

pip install --upgrade huggingface_hub
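The `provider` argument only exists in recent releases of huggingface_hub (to my knowledge it shipped with the Inference Providers launch in 0.28.0, but verify against the release notes). A stdlib-only sketch for checking the installed version before constructing the client; the 0.28.0 floor is an assumption:

```python
# Minimum huggingface_hub version assumed to support the `provider` kwarg
# (believed to be 0.28.0; check the official release notes to be sure).
MIN_VERSION = (0, 28, 0)

def parse_version(v: str) -> tuple:
    """Parse '0.28.1' into (0, 28, 1), stopping at suffixes like 'rc1'."""
    parts = []
    for piece in v.split("."):
        digits = ""
        for ch in piece:
            if ch.isdigit():
                digits += ch
            else:
                break  # drop pre-release suffixes such as "rc1" or "dev0"
        parts.append(int(digits) if digits else 0)
    return tuple(parts)

def supports_provider(installed: str) -> bool:
    """True if the installed version is new enough for the `provider` kwarg."""
    return parse_version(installed) >= MIN_VERSION

# In practice, read the installed version with importlib.metadata, e.g.:
#   from importlib.metadata import version
#   supports_provider(version("huggingface_hub"))
print(supports_provider("0.27.1"))  # an older release raises the TypeError above
```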

Hello. Congratulations on a great feature. How can we proceed with adding deepinfra.com to the providers list?

That's great news! And if you provide more measurable details on what the free inference tier with a small quota offers signed-in free users, you'll build more trust in the community, and hence more PRO subscribers!

Runpod is excellent.
Let's add it to the inference provider list

Runpod would be very welcome!

Yes, Runpod.io would be great indeed


@julien-c

Is there a way to add https://nineteen.ai/ as a provider? It allows free access to top models like DeepSeek R1 and the Llama family.
There is also https://chutes.ai/, which lets users deploy any model on demand.

Can we add https://avian.io as a provider, @julien-c?

They currently have the fastest inference on Nvidia hardware, as well as the highest throughput, and let users deploy any model on demand.

The Inference API is not working for HF Inference (∞ requests).

My Settings:
[Screenshot: settings, Jan 30 09:36]

The Issue:
[Screenshot: the error, Jan 30 09:37]

ai/ml api - dudes have a lot of models from HF!

Try Runware.

Runware plz – it's ~5x cheaper than other providers and still one of the fastest

Try Scaleway.
