AI & ML interests
Energy-aware Computing, Low Power Design, EDA, Dark Silicon, Efficient Deep Learning
Recent Activity
View all activity
Papers
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models
models
37
ut-enyac/quamba2-8b-converted-w4aX
Text Generation
•
Updated
•
14
ut-enyac/quamba-chat-w4a8
Text Generation
•
Updated
•
14
ut-enyac/quamba2-2.7b-w4a8
Text Generation
•
Updated
•
18
ut-enyac/quamba2-8b-converted-w4a8
Text Generation
•
Updated
•
12
•
1
ut-enyac/quamba-chat-w8a8
Updated
•
6
ut-enyac/quamba-chat-w4a16
Updated
•
5
ut-enyac/quamba2-8b-converted-w4a16
Updated
•
2
ut-enyac/quamba-790m-w8a8
Updated
•
2
ut-enyac/quamba-790m-w4a8
Updated
•
1
ut-enyac/quamba-790m-w4a16
Updated
•
2
datasets
0
None public yet