Added GLUMLP, changed config accordingly, added code to convert state_dict 0211324 Markus28 commited on Mar 22, 2024
feat: choose flash attention heuristically if not set explicitly 2e2b8d0 Markus28 commited on Mar 6, 2024