Om

asach

AI & ML interests

None yet

Recent Activity

liked a model 6 days ago
stepfun-ai/GOT-OCR2_0
liked a model 6 days ago
microsoft/Phi-3.5-vision-instruct
liked a Space 6 days ago
opencompass/open_vlm_leaderboard

asach's activity

reacted to grimjim's post with ❤️ 6 days ago
I'm (finally) releasing a Python script that trims excess weights from Gemma2 full-weight models that were bloated by ~1B parameters due to an early mergekit bug.
https://github.com/jim-plus/Gemma2-mergekit-remediation

I'd noticed something was off when merges of Gemma2 9B models ended up having ~10B parameters. The current mergekit package is fine, but there are still bloated models on HF that could stand to be fixed.

The script assumes that it will be run from the same directory as the model weights, and will trim the unnecessary lm_head.weight tensor and corresponding index entry.
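As an illustration only (this is not the linked script), the index half of that cleanup could look roughly like the sketch below. It assumes the sharded-checkpoint index is a JSON object with a `weight_map` dictionary and an optional `metadata.total_size` field, as in `model.safetensors.index.json`. The helper name `drop_index_entry` and the `tensor_nbytes` parameter are hypothetical, and removing the tensor from the shard file itself is not shown.

```python
import copy


def drop_index_entry(index, tensor_name="lm_head.weight", tensor_nbytes=None):
    """Return (new_index, shard) with tensor_name removed from the weight map.

    Hypothetical helper: `index` is the parsed index JSON. `shard` is the
    file name that held the tensor, or None if the tensor was not listed.
    """
    new_index = copy.deepcopy(index)  # leave the caller's dict untouched
    shard = new_index.get("weight_map", {}).pop(tensor_name, None)

    # Assumption: total_size counts bytes, so it is only adjusted when the
    # caller supplies the byte size of the removed tensor.
    if shard is not None and tensor_nbytes is not None:
        metadata = new_index.get("metadata", {})
        if "total_size" in metadata:
            metadata["total_size"] -= tensor_nbytes

    return new_index, shard


# Toy index with made-up sizes and shard names, for demonstration only.
example_index = {
    "metadata": {"total_size": 1000},
    "weight_map": {
        "lm_head.weight": "model-00002-of-00002.safetensors",
        "model.embed_tokens.weight": "model-00001-of-00002.safetensors",
    },
}

trimmed, shard = drop_index_entry(example_index, tensor_nbytes=400)
```

The returned shard name tells you which weight file still contains the tensor data and would need to be rewritten separately; working on a copy keeps the original index available in case the rewrite fails.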
liked a Space 11 months ago