Model Details
Model Description
Finetune of LLaMa 3.2 1B model to include flashnormalization (https://arxiv.org/abs/2407.09577)
- Developed by: OpenMachine Labs
- License: MIT
- Finetuned from model Meta LLaMa 3.2 1B
Model Sources [optional]
- Repository: https://github.com/meta-llama/llama-models/tree/main/models/llama3_2
- Paper https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/
Uses
How to Get Started with the Model
Use the code below to get started with the model.
Speeds, Sizes, Times
[More Information Needed]
Evaluation
[More Information Needed]
Metrics
[More Information Needed]
Results
[More Information Needed]
Summary
Model Examination [optional]
Model Card Authors
Nils Graef (nils@openmachine.ai)
Drew Wasielewski (drewwas@berkeley.edu)
- Downloads last month
- 31
Model tree for drewwas/OpenMachine_FlashNorm
Base model
meta-llama/Llama-3.2-1B