Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
JuncaiL
/
llama-265m
like
1
Text Generation
Transformers
PyTorch
wikipedia
allenai/c4
English
llama_moe
custom_code
arxiv:
2305.09781
Model card
Files
Files and versions
Community
1
Train
Use this model
3240d88
llama-265m
Commit History
fix state_dict loading in MoE model
3240d88
verified
JuncaiL
commited on
Mar 25
update config.json
0b1dfd4
verified
JuncaiL
commited on
Mar 25
upload llama-265m model checkpoint
e567dee
verified
JuncaiL
commited on
Mar 24
initial commit
6dda61f
verified
JuncaiL
commited on
Mar 24