Jialiang Cheng
Julius-L
AI & ML interests
None yet
Recent Activity
upvoted a paper 10 days ago
SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models
upvoted a paper 10 days ago
EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models
authored a paper 25 days ago
SERE: Similarity-based Expert Re-routing for Efficient Batch Decoding in MoE Models