Hugo Larcher's picture
3 6

Hugo Larcher

hlarcher

AI & ML interests

None yet

Recent Activity

Articles

Organizations

Hugging Face's profile picture HuggingFaceBR4's profile picture Hugging Face Smol Cluster's profile picture Optimum Nvidia's profile picture smol-explorers's profile picture

Posts 1

view post
Post
912
We are introducing multi-backend support in Hugging Face Text Generation Inference!
With new TGI architecture we are now able to plug new modeling backends to get best performances according to selected model and available hardware. This first step will very soon be followed by the integration of new backends (TRT-LLM, llama.cpp, vLLM, Neuron and TPU).

We are polishing the TensorRT-LLM backend which achieves impressive performances on NVIDIA GPUs, stay tuned 🤗 !

Check out the details: https://huggingface.co/blog/tgi-multi-backend