V3 with longer context
1
#9 opened 4 months ago
by
Hypersniper
Improve language tag
#8 opened 7 months ago
by
lbourdois
Draft Model of Speculative Decoding
3
#6 opened 9 months ago
by
nagug
Can I use any Inference Engine(like vllm、ollama) applicable to qwen2.5 to infer Athene-V2-Chat?
1
#5 opened 11 months ago
by
wangdafa
32 B coding model please
👍
9
3
#4 opened 12 months ago
by
gopi87
inference api not working
2
#3 opened 12 months ago
by
llamameta
Smaller versions incoming?
👍
3
#2 opened 12 months ago
by
phly95