Alina Lozovskaya
alozowski
AI & ML interests
NLP in all aspects
Recent Activity
New activity
2 days ago
open-llm-leaderboard/open_llm_leaderboard:Reproducibility error
New activity
2 days ago
open-llm-leaderboard/open_llm_leaderboard:Model benchmarks degraded after re-evaluation
New activity
2 days ago
open-llm-leaderboard/open_llm_leaderboard:Suggestion: Table refresh timer
alozowski's activity
Reproducibility error
1
#1020 opened 3 days ago
by
cluebbers
Model benchmarks degraded after re-evaluation
1
#1018 opened 9 days ago
by
Etherll
Suggestion: Table refresh timer
1
#1015 opened 12 days ago
by
zelk12
Is the score computed by lm-eval-harness normalized?
1
#1011 opened 16 days ago
by
chenxiaobooo
Re-run failed eval fblgit/cybertron-v4-qw7B-UNAMGS
1
#1021 opened 3 days ago
by
fblgit
Failed model for open llm benchmark
1
#1019 opened 5 days ago
by
legolasyiu
Interpretation of result details?
2
#1 opened 4 months ago
by
nicobuko
add-co2-column
5
#1017 opened 9 days ago
by
alozowski
OOM when using vllm to accelerate compute
1
#1012 opened 16 days ago
by
chenxiaobooo
Update LLM
1
#1013 opened 15 days ago
by
legolasyiu
add-co2-column
5
#1014 opened 15 days ago
by
alozowski
Finished but no result
2
#1010 opened 16 days ago
by
ymcki
Failed evaluation again for `fblgit/TheBeagle-v2beta-32B-MGS`
6
#994 opened 29 days ago
by
fblgit
Still empty main benchmark scores
1
#1009 opened 17 days ago
by
ymcki
Failed test and missing model
2
#1008 opened 17 days ago
by
zelk12
Marked deleted/incomplete
2
#1006 opened 20 days ago
by
CultriX
Failed Model
1
#1005 opened 21 days ago
by
legolasyiu
update model
3
#1002 opened 23 days ago
by
legolasyiu
Empty main benchmark scores
1
#1007 opened 19 days ago
by
ymcki
bump-up-transformers
5
#1003 opened 22 days ago
by
alozowski