Ostrich-70B / README.md
some1nostr's picture
Update README.md
73f3a59 verified
|
raw
history blame
1.4 kB
metadata
license: apache-2.0

Model Card for Ostrich

Contentious, judgemental, uncensored, can't agree with itself 32% of the time! Trained a bit about nostr Trained a bit about bitcoin Trained a bit in the health domain

I am having success with chat template: [INST] ... It may also work with ChatML format, though I see more repetitions when I use that.

Model Details

Based on https://huggingface.co/crestf411/daybreak-miqu-1-70b-v1.0-hf because it is one of the most uncensored according to https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard.

Uses

Ask any question, compared to other models this may know more about Nostr and Bitcoin. You can use llama.cpp to chat with it. You can also use llama-cpp-python package to use it in a Python script.

Warning

Users (both direct and downstream) should be aware of the risks, biases and limitations of the model. The trainer, developer or uploader of this model does not assume any liability. Use it at your own risk.

Training Details

Training Data

Nostr notes, kind=1, longer notes are taken from reputable accounts. Number of notes: 300k

Training Procedure

LLaMa-Factory is used to train on 2x3090! fsdp_qlora is the technique. It took ~170 hours for a dataset of 120MB.