better handling and logging of empty sharegpt turns (#603) a363604 unverified winglian commited on Sep 22, 2023
improve handling for empty text on the tokenization step (#502) 1eebbd0 unverified winglian commited on Sep 19, 2023
support custom field for completion from yml (#580) f7a2263 unverified winglian commited on Sep 15, 2023
better handling of empty input ids when tokenizing (#395) 85cf4f8 unverified winglian commited on Aug 15, 2023
better handling since xgen tokenizer breaks with convert_tokens_to_ids 2a428e8 winglian commited on Jul 21, 2023
WIP large refactor to make finetune script a little more manageable (#3) 6045345 unverified winglian commited on Apr 18, 2023
suppport for alpaca-like instruction datasets without inputs e107643 winglian commited on Apr 18, 2023
config chooser, update readme instructions, device config, llama flash attention, debug out the labels, fix config key checks, other bugfixes f2a2029 winglian commited on Apr 14, 2023