Inquiries regarding implementation details

#1
by LemonNoel - opened

Thanks for your excellent work! I attempted to run the fine-tuning code in GLM, but I noticed a slight difference in the implementation of the GELU function. Specifically, GLM uses the "approximate" (tanh) variant, whereas the HuggingFace implementation uses the default (exact) mode. I'm unsure which one I should use when fine-tuning the model.
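For context, the two variants in question can be sketched in plain Python (the formulas below are the standard exact GELU and its tanh approximation; the function names are illustrative, not taken from either codebase):

```python
import math

def gelu_exact(x: float) -> float:
    # Exact GELU: 0.5 * x * (1 + erf(x / sqrt(2)))
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))

def gelu_tanh(x: float) -> float:
    # Tanh ("approximate") GELU:
    # 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3)))
    return 0.5 * x * (1.0 + math.tanh(
        math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)))

# The two curves differ only slightly for typical activation values.
for x in (-2.0, -0.5, 0.0, 0.5, 2.0):
    print(f"x={x:+.1f}  exact={gelu_exact(x):.6f}  tanh={gelu_tanh(x):.6f}")
```

The numerical difference between the two is small (on the order of 1e-3 or less for moderate inputs), but mixing variants between pretraining and fine-tuning could in principle introduce a mismatch with the pretrained weights.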
