Text Generation
scaling
umup-research-3b-bf16 / model_state_layer_9_TransformerLayer.pt

Commit History