Furu Wei

thegenerality

AI & ML interests

None yet

Recent Activity

Organizations

None yet

thegenerality's activity

Reacted to shumingma's post with 🚀 8 months ago
view post
Post
2604
The Era of 1-bit LLMs: Training Tips, Code and FAQ

https://github.com/microsoft/unilm/blob/master/bitnet/The-Era-of-1-bit-LLMs__Training_Tips_Code_FAQ.pdf

We present details and tips for training 1-bit LLMs. We also provide additional experiments and results that were not reported and responses to questions regarding the "The-Era-of-1-bit-LLM" paper. Finally, we include the official PyTorch implementation of BitNet (b1.58 and b1) for future research and development of 1-bit LLMs.
  • 2 replies
·