Using Apollo-MoE in Transformers

#2
by glancioni - opened

Have you any example on how to use a model of the Apollo-MoE family in Transformers?
Something short like placing a query and getting an answer would suffice. Unfortunately, no example is available on your model cards and the github page has instruction to train and evaluate, not on using the model for inference.
Thank you!

FreedomAI org

Thanks for your feedback. We have updated the Model Card, and you can find the Model Download and Inference sections in it. When you run the inference example, remember to put configuration_upcycling_qwen2_moe.py and modeling_upcycling_qwen2_moe.py in the same directory with the inference file.

Thank you so much!

Sign up or log in to comment