Models: 58,586

Active filters: trl

maisamabbas/results

Updated Feb 29, 2024

lewtun/gemma-7b-dpo-full-mix1-beta-0.1

Text Generation • Updated Feb 29, 2024 • 12

lewtun/gemma-7b-dpo-full-mix1-beta-0.1-epoch-3

Text Generation • Updated Feb 29, 2024 • 13

sampraxi/v1

Updated Feb 29, 2024 • 1

lewtun/gemma-7b-dpo-full-mix1-beta-0.2

Text Generation • Updated Feb 29, 2024 • 8

lewtun/gemma-7b-dpo-full-mix2-beta-0.1

Text Generation • Updated Feb 29, 2024 • 8

lewtun/gemma-7b-dpo-full-mix1-beta-0.4

Text Generation • Updated Feb 29, 2024 • 7

lewtun/gemma-7b-dpo-full-mix1-beta-0.6

Text Generation • Updated Feb 29, 2024 • 8

lewtun/gemma-7b-dpo-full-ultrafeedback-beta-0.01

Text Generation • Updated Feb 29, 2024 • 23

lewtun/gemma-7b-dpo-full-mix1-beta-0.05

Text Generation • Updated Feb 29, 2024 • 12

lewtun/gemma-7b-dpo-full-mix1-beta-0.01

Text Generation • Updated Feb 29, 2024 • 8

lewtun/gemma-7b-dpo-full-mix1-beta-0.4-epoch-3

Text Generation • Updated Feb 29, 2024 • 13

Nayan4HF/code-mistral-7b-text-to-python

Updated Apr 18, 2024 • 2

shaposhnikov/qlora

Updated Mar 1, 2024

lewtun/gemma-7b-sft-full-openhermes-v0

Text Generation • Updated Mar 1, 2024 • 9

Yaxin1992/zephyr-llama-merge-7b-dpo-multi

Updated Mar 1, 2024 • 1

Weni/ZeroShot-3.3.17-Mistral-7b-Multilanguage-3.2.0

Updated Mar 1, 2024 • 6

TheRadDani/tinystarcoder-rlhf-model

Text Generation • Updated Mar 1, 2024 • 56

ChenWu98/skills_metaphor_chat-then-skills_red_herring_chat-lora

Updated Mar 1, 2024 • 4

EddyGiusepe/zephyr-7b-sft-lora

Updated Mar 1, 2024

cvzion/lora-tinyllama-dqg-v4

Updated Mar 1, 2024

ashikshaffi08/outputs

Updated Mar 1, 2024 • 1

ChenWu98/skills_red_herring_chat-then-skills_metaphor_chat-lora

Updated Mar 1, 2024 • 18

cvzion/lora-msistral7b-dqg-v5

Updated Mar 1, 2024

llm-finetune/results

Updated Mar 15, 2024 • 169

Andyrasika/code-llama-7b-text-to-sql

Updated Mar 1, 2024 • 8

sandy37/mistral-finetuned-samsum

Updated Mar 1, 2024 • 1

lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-2

Text Generation • Updated Mar 1, 2024 • 13

lewtun/gemma-7b-dpo-full-mix1-beta-0.05-epoch-3

Text Generation • Updated Mar 1, 2024 • 16 • 1

lewtun/gemma-7b-dpo-full-openhermes-mix1-beta-0.1

Text Generation • Updated Mar 1, 2024 • 9