Yosef Worku Alemneh
rasyosef
AI & ML interests
Pretraining, Supervised Fine Tuning, Direct Preference Optimization, Retrieval Augmented Generation (RAG), Function Calling
Organizations
None yet
rasyosef's activity
[Query-ISSUE] tokenizer.vocab_size is 128000, however len(tokenizer) is 128256, which prevents me from using those other tokens.
1 · #34 opened 11 days ago by HV-Khurdula
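A likely explanation for the numbers in this thread: in the `transformers` library, `tokenizer.vocab_size` counts only the base vocabulary, while `len(tokenizer)` also counts added special tokens, and it is the latter that matches the model's embedding size. A minimal sketch, assuming a Llama 3.1 checkpoint (the thread does not name the repo, so the model id below is a placeholder):

```python
from transformers import AutoTokenizer

# Placeholder checkpoint for illustration; the thread does not name the repo.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.1-8B-Instruct")

# vocab_size covers only the base vocabulary learned by the tokenizer.
print(tokenizer.vocab_size)  # 128000

# len(tokenizer) additionally counts added (special/reserved) tokens
# and matches the size of the model's embedding matrix.
print(len(tokenizer))        # 128256

# The extra ids live in the added vocab rather than the base vocab.
print(sorted(tokenizer.get_added_vocab().values())[:5])
```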
What are the start and stop tokens of this model?
1 · #40 opened 8 days ago by aryaash
Is the BOS token id of 128000 hardcoded into the Llama 3.2 tokenizer?
2 · #17 opened about 1 month ago by rasyosef
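Both of the two preceding threads can be answered by inspecting the tokenizer directly. A minimal sketch, assuming access to the gated meta-llama/Llama-3.2-1B-Instruct repo (a stand-in, since neither thread names the exact checkpoint):

```python
from transformers import AutoTokenizer

# Stand-in repo id; neither thread identifies the exact checkpoint.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B-Instruct")

# The "start" (BOS) and "stop" (EOS) tokens and their ids.
print(tokenizer.bos_token, tokenizer.bos_token_id)  # <|begin_of_text|> 128000
print(tokenizer.eos_token, tokenizer.eos_token_id)

# The id 128000 is not hardcoded in the tokenizer class; it is resolved
# from the bos_token entry in the checkpoint's tokenizer_config.json,
# so a repo with a different config would report a different id.
```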
Phi-2-Instruct-APO: aligned with Anchored Preference Optimization
9 · #3 opened about 2 months ago by rasyosef
Mistral-NeMo-Minitron-8B-Chat
5 · #5 opened 3 months ago by rasyosef
What is the context window size of this model? That is, what are the maximum input and output tokens?
4 · #1 opened about 2 months ago by naveen237
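The usual way to answer a context-window question like this one is to read it off the model config: max_position_embeddings is the total context length, and input plus generated output tokens share that budget. A minimal sketch, using rasyosef/Mistral-NeMo-Minitron-8B-Chat from this feed as an example repo (the thread itself does not say which model it concerns):

```python
from transformers import AutoConfig

# Example repo id from this activity feed; the thread does not name the model.
config = AutoConfig.from_pretrained("rasyosef/Mistral-NeMo-Minitron-8B-Chat")

# The context window; prompt tokens and generated tokens share this budget,
# because generated tokens are appended to the input sequence.
print(config.max_position_embeddings)
```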
APO Trainer in TRL?
1 · #2 opened 2 months ago by rasyosef
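On the TRL question: to my knowledge there is no dedicated APOTrainer class; Anchored Preference Optimization is exposed as a loss variant of DPOTrainer, selected through DPOConfig. A minimal sketch, assuming a TRL version recent enough to include the apo_zero loss type (the base model and dataset ids are illustrative, not taken from the thread):

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_name = "microsoft/phi-2"  # illustrative base model
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Any preference dataset with prompt/chosen/rejected columns works;
# this dataset id is an assumption, not one named in the thread.
dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

# APO is a loss_type on the DPO trainer rather than a separate trainer class.
args = DPOConfig(output_dir="phi-2-apo", loss_type="apo_zero")
trainer = DPOTrainer(model=model, args=args,
                     train_dataset=dataset, processing_class=tokenizer)
trainer.train()
```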
ChatML template does not work properly
10 · #2 opened 3 months ago by WasamiKirua
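When a ChatML template misbehaves, as reported above, the quickest diagnostic is to render the template as text and check the <|im_start|>/<|im_end|> markers before blaming generation. A minimal sketch with a placeholder repo id (substitute the model from the discussion):

```python
from transformers import AutoTokenizer

# Placeholder repo id; use the model the discussion is about.
tokenizer = AutoTokenizer.from_pretrained("some-org/some-chatml-model")

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]

# Render without tokenizing to inspect the exact prompt string the
# template produces, including the ChatML markers.
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(prompt)
```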
Collaboration
1 · #1 opened 3 months ago by deleted
Error when trying to run
1 · #1 opened 2 months ago by ctranslate2-4you
What changed for people using this model in English?
3 · #3 opened 3 months ago by migueltalka
Phi 2 Instruct: an instruction-following Phi 2 SLM that has undergone SFT and DPO
#132 opened 3 months ago by rasyosef
Phi 1.5 Instruct: an instruction-following Phi 1.5 model that has undergone SFT and DPO
#89 opened 3 months ago by rasyosef
Update README.md
1 · #2 opened 4 months ago by seyyaw
Duplicate?
1 · #2 opened 6 months ago by israel
Model card is about Mixtral-8x7B instead of Mixtral-8x22B
1 · #3 opened 7 months ago by rasyosef