LB's picture

9

LB

lbathen

AI & ML interests

None yet

Organizations

None yet

lbathen's activity

New activity in mgoin/Nemotron-4-340B-Instruct-hf 2 months ago

Reward model also possible?

#1 opened 4 months ago by

New activity in nvidia/Nemotron-4-340B-Base 3 months ago

Hf safetensors version

#3 opened 5 months ago by

New activity in nvidia/Nemotron-4-340B-Reward 3 months ago

Convertion to HF

#7 opened 3 months ago by

New activity in mistralai/Mistral-Nemo-Instruct-2407 3 months ago

NAN when training

#29 opened 4 months ago by

New activity in nvidia/Mistral-NeMo-12B-Instruct 4 months ago

Can't load model with nemo framework

#7 opened 4 months ago by

New activity in nvidia/Nemotron-4-340B-Reward 4 months ago

Running inference outside of triton

#6 opened 4 months ago by

New activity in mistralai/Mistral-Nemo-Instruct-2407 4 months ago

NeMo Format

#9 opened 4 months ago by

New activity in mistralai/Mixtral-8x22B-v0.1 4 months ago

Use V1 tokenizer instead

#10 opened 4 months ago by

vocab size mismatch

#9 opened 4 months ago by