adamo1139
/

Yi-1.5-34B-32K-rebased-1406


Still active?

#1

by DazzlingXeno - opened Oct 22, 2024

Oct 22, 2024

Are you still working on this?

Owner Oct 22, 2024

I have some plans to finetune Yi 1.5 34B 32K on Magpie Ultra or something similar, using either this model or 01's base model as the starting point for the finetune.

Oct 22, 2024

Nice, I look forward to seeing it.

Owner Nov 18, 2024

FYI, I published this model here: adamo1139/Yi-1.5-34B-32K-Magpie-Ultra-0611

Nov 19, 2024

Will have to give this a go tonight! Has it been tuned with your AEZAKMI dataset as well?

Owner Nov 19, 2024

No, it's just Magpie Ultra. The AEZAKMI dataset is in a bit of an existential crisis: I like v2 more than the newer versions. I'll try to make a new version of AEZAKMI that I like and then train Yi 34B 200K / Yi 1.5 34B / Yi 1.5 9B on it.

Owner Nov 19, 2024

I'm open to ideas about the direction of the AEZAKMI dataset. I'm thinking about adding some distilled multi-turn conversations from Hermes 3 70B, plus some non-synthetic Reddit and 4chan data.

Nov 19, 2024 (edited Nov 19, 2024)

I think that would be cool. Maybe add some Gutenberg and LimaRP stuff too?

https://huggingface.co/datasets/Dampfinchen/Creative_Writing_Multiturn

Owner Nov 19, 2024

I'll look into it. I know LimaRP has some ERP samples with small kids, so I am hesitant to use it anywhere unless those samples are filtered out properly.

Nov 19, 2024 (edited Nov 19, 2024)

I had no idea! Jesus!

I remember using a model for creative writing once and it suggested I include some paedophilia in it. Never deleted a model so fast in my life!
