Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
akswelh
/
NEOX
like
0
arxiv:
29 papers
Model card
Files
Files and versions
Community
d90b3a8
NEOX
/
megatron
1 contributor
History:
1 commit
akswelh
Upload 251 files
d90b3a8
verified
26 days ago
data
Upload 251 files
26 days ago
fused_kernels
Upload 251 files
26 days ago
gradient_noise_scale
Upload 251 files
26 days ago
model
Upload 251 files
26 days ago
mpu
Upload 251 files
26 days ago
neox_arguments
Upload 251 files
26 days ago
tokenizer
Upload 251 files
26 days ago
__init__.py
929 Bytes
Upload 251 files
26 days ago
checkpointing.py
17.6 kB
Upload 251 files
26 days ago
devutil.py
1.28 kB
Upload 251 files
26 days ago
initialize.py
8.58 kB
Upload 251 files
26 days ago
learning_rates.py
5.22 kB
Upload 251 files
26 days ago
logging.py
16.4 kB
Upload 251 files
26 days ago
mup_substitute.py
7.8 kB
Upload 251 files
26 days ago
optimizers.py
18.1 kB
Upload 251 files
26 days ago
text_generation_utils.py
42.3 kB
Upload 251 files
26 days ago
training.py
64.4 kB
Upload 251 files
26 days ago
utils.py
17.6 kB
Upload 251 files
26 days ago