Yosef Worku Alemneh
rasyosef
AI & ML interests
Pretraining, Supervised Fine Tuning, Direct Preference Optimization, Retrieval Augmented Generation (RAG), Function Calling
Organizations
None yet
rasyosef's activity
[Query-ISSUE] tokenizer.vocab_size is 128000, however len(tokenizer) is 128256, which prevents me from using those other tokens.
1 · #34 opened 11 days ago by HV-Khurdula
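A likely explanation for the numbers in this thread: in the `transformers` library, `tokenizer.vocab_size` counts only the base vocabulary, while `len(tokenizer)` also counts added special tokens, and it is the latter that matches the model's embedding size. A minimal sketch, assuming a Llama 3.1 checkpoint (the thread does not name the repo, so the model id below is a placeholder):

```python
from transformers import AutoTokenizer

# Placeholder checkpoint for illustration; the thread does not name the repo.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.1-8B-Instruct")

# vocab_size covers only the base vocabulary learned by the tokenizer.
print(tokenizer.vocab_size)  # 128000

# len(tokenizer) additionally counts added (special/reserved) tokens
# and matches the size of the model's embedding matrix.
print(len(tokenizer))        # 128256

# The extra ids live in the added vocab rather than the base vocab.
print(sorted(tokenizer.get_added_vocab().values())[:5])
```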
What are the start and stop tokens of this model?
1 · #40 opened 8 days ago by aryaash
Is the BOS token id of 128000 hardcoded into the Llama 3.2 tokenizer?
2 · #17 opened about 1 month ago by rasyosef
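Both of the two preceding threads can be answered by inspecting the tokenizer directly. A minimal sketch, assuming access to the gated meta-llama/Llama-3.2-1B-Instruct repo (a stand-in, since neither thread names the exact checkpoint):

```python
from transformers import AutoTokenizer

# Stand-in repo id; neither thread identifies the exact checkpoint.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B-Instruct")

# The "start" (BOS) and "stop" (EOS) tokens and their ids.
print(tokenizer.bos_token, tokenizer.bos_token_id)  # <|begin_of_text|> 128000
print(tokenizer.eos_token, tokenizer.eos_token_id)

# The id 128000 is not hardcoded in the tokenizer class; it is resolved
# from the bos_token entry in the checkpoint's tokenizer_config.json,
# so a repo with a different config would report a different id.
```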
Phi-2-Instruct-APO: aligned with Anchored Preference Optimization
9 · #3 opened about 2 months ago by rasyosef
Mistral-NeMo-Minitron-8B-Chat
5 · #5 opened 3 months ago by rasyosef
What is the context window size of this model? That is, what are the maximum input and output tokens?
4 · #1 opened about 2 months ago by naveen237
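The usual way to answer a context-window question like this one is to read it off the model config: max_position_embeddings is the total context length, and input plus generated output tokens share that budget. A minimal sketch, using rasyosef/Mistral-NeMo-Minitron-8B-Chat from this feed as an example repo (the thread itself does not say which model it concerns):

```python
from transformers import AutoConfig

# Example repo id from this activity feed; the thread does not name the model.
config = AutoConfig.from_pretrained("rasyosef/Mistral-NeMo-Minitron-8B-Chat")

# The context window; prompt tokens and generated tokens share this budget,
# because generated tokens are appended to the input sequence.
print(config.max_position_embeddings)
```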
APO Trainer in TRL?
1 · #2 opened 2 months ago by rasyosef
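On the TRL question: to my knowledge there is no dedicated APOTrainer class; Anchored Preference Optimization is exposed as a loss variant of DPOTrainer, selected through DPOConfig. A minimal sketch, assuming a TRL version recent enough to include the apo_zero loss type (the base model and dataset ids are illustrative, not taken from the thread):

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_name = "microsoft/phi-2"  # illustrative base model
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Any preference dataset with prompt/chosen/rejected columns works;
# this dataset id is an assumption, not one named in the thread.
dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

# APO is a loss_type on the DPO trainer rather than a separate trainer class.
args = DPOConfig(output_dir="phi-2-apo", loss_type="apo_zero")
trainer = DPOTrainer(model=model, args=args,
                     train_dataset=dataset, processing_class=tokenizer)
trainer.train()
```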
ChatML template does not work properly
10 · #2 opened 3 months ago by WasamiKirua
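When a ChatML template misbehaves, as reported above, the quickest diagnostic is to render the template as text and check the <|im_start|>/<|im_end|> markers before blaming generation. A minimal sketch with a placeholder repo id (substitute the model from the discussion):

```python
from transformers import AutoTokenizer

# Placeholder repo id; use the model the discussion is about.
tokenizer = AutoTokenizer.from_pretrained("some-org/some-chatml-model")

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]

# Render without tokenizing to inspect the exact prompt string the
# template produces, including the ChatML markers.
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(prompt)
```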
Collaboration
1 · #1 opened 3 months ago by deleted
Error when trying to run
1 · #1 opened 2 months ago by ctranslate2-4you
What changed for people using this model in English?
3 · #3 opened 3 months ago by migueltalka
Phi 2 Instruct: an instruction-following Phi 2 SLM that has undergone SFT and DPO
#132 opened 3 months ago by rasyosef
Phi 1.5 Instruct: an instruction-following Phi 1.5 model that has undergone SFT and DPO
#89 opened 3 months ago by rasyosef
Update README.md
1 · #2 opened 4 months ago by seyyaw
Duplicate?
1 · #2 opened 6 months ago by israel
Model card is about Mixtral-8x7B instead of Mixtral-8x22B
1 · #3 opened 7 months ago by rasyosef