mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05 Text Generation • Updated about 21 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10 Text Generation • Updated about 21 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.15 Text Generation • Updated about 21 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_inverse_sqrt Text Generation • Updated about 21 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_linear_warmup0.05 Text Generation • Updated about 21 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_linear_warmup0.10 Text Generation • Updated about 21 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-6 Text Generation • Updated about 18 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr5e-7 Text Generation • Updated about 18 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr1e-7 Text Generation • Updated about 18 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr1e-7 Text Generation • Updated about 18 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.05_minlr5e-7 Text Generation • Updated about 19 hours ago
mlfoundations-dev/hp_ablations_gemma_scheduler_cosine_warmup0.10_minlr1e-6 Text Generation • Updated about 18 hours ago