lbathen

AI & ML interests

None yet

Organizations

None yet

lbathen's activity

New activity in mgoin/Nemotron-4-340B-Instruct-hf 2 months ago

Reward model also possible?

1
#1 opened 4 months ago by noamgat
New activity in nvidia/Nemotron-4-340B-Base 3 months ago

Hf safetensors version

9
#3 opened 5 months ago by ehartford
New activity in nvidia/Nemotron-4-340B-Reward 3 months ago

Conversion to HF

3
#7 opened 3 months ago by lbathen
New activity in mistralai/Mistral-Nemo-Instruct-2407 3 months ago

NAN when training

1
#29 opened 4 months ago by nthehai01
New activity in nvidia/Mistral-NeMo-12B-Instruct 4 months ago
New activity in nvidia/Nemotron-4-340B-Reward 4 months ago
New activity in mistralai/Mistral-Nemo-Instruct-2407 4 months ago

NeMo Format

2
#9 opened 4 months ago by lbathen
New activity in mistralai/Mixtral-8x22B-v0.1 4 months ago

Use V1 tokenizer instead

7
#10 opened 4 months ago by Rocketknight1

vocab size mismatch

4
#9 opened 4 months ago by mradermacher