-
-
-
-
-
-
Inference Providers
Active filters:
trl
maisamabbas/results
Updated
lewtun/gemma-7b-dpo-full-mix1-beta-0.1
Text Generation
•
Updated
•
12
lewtun/gemma-7b-dpo-full-mix1-beta-0.1-epoch-3
Text Generation
•
Updated
•
13
sampraxi/v1
lewtun/gemma-7b-dpo-full-mix1-beta-0.2
Text Generation
•
Updated
•
8
lewtun/gemma-7b-dpo-full-mix2-beta-0.1
Text Generation
•
Updated
•
8
lewtun/gemma-7b-dpo-full-mix1-beta-0.4
Text Generation
•
Updated
•
7
lewtun/gemma-7b-dpo-full-mix1-beta-0.6
Text Generation
•
Updated
•
8
lewtun/gemma-7b-dpo-full-ultrafeedback-beta-0.01
Text Generation
•
Updated
•
23
lewtun/gemma-7b-dpo-full-mix1-beta-0.05
Text Generation
•
Updated
•
12
lewtun/gemma-7b-dpo-full-mix1-beta-0.01
Text Generation
•
Updated
•
8
lewtun/gemma-7b-dpo-full-mix1-beta-0.4-epoch-3
Text Generation
•
Updated
•
13
Nayan4HF/code-mistral-7b-text-to-python
shaposhnikov/qlora
Updated
lewtun/gemma-7b-sft-full-openhermes-v0
Text Generation
•
Updated
•
9
Yaxin1992/zephyr-llama-merge-7b-dpo-multi
Weni/ZeroShot-3.3.17-Mistral-7b-Multilanguage-3.2.0
TheRadDani/tinystarcoder-rlhf-model
Text Generation
•
Updated
•
56
ChenWu98/skills_metaphor_chat-then-skills_red_herring_chat-lora
EddyGiusepe/zephyr-7b-sft-lora
Updated
cvzion/lora-tinyllama-dqg-v4
Updated
ashikshaffi08/outputs
ChenWu98/skills_red_herring_chat-then-skills_metaphor_chat-lora
cvzion/lora-msistral7b-dqg-v5
Updated
llm-finetune/results
Updated
•
169
Andyrasika/code-llama-7b-text-to-sql
sandy37/mistral-finetuned-samsum
lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-2
Text Generation
•
Updated
•
13
lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-3
Text Generation
•
Updated
•
16
•
1
lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.1
Text Generation
•
Updated
•
9