How can we access this model via a HuggingFace Inference Endpoint, rather than by downloading it locally? I am on a Macbook, so I cannot run the model locally due to the flash_attn dependency
Your need to confirm your account before you can post a new comment.