SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators Paper • 2502.06394 • Published 7 days ago • 84
denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-7 Sentence Similarity • Updated Dec 23, 2024 • 4
denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-6 Sentence Similarity • Updated Dec 19, 2024 • 148 • 1
denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-5 Sentence Similarity • Updated Dec 19, 2024 • 9
denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-4 Sentence Similarity • Updated Dec 19, 2024 • 6
denis-gordeev/reranker_dialog_items_biencoder_rubert-tiny-turbo-3 Sentence Similarity • Updated Dec 19, 2024 • 5
denis-gordeev/reranker_dialog_items_crossencoder_rubert-tiny-turbo Text Classification • Updated Dec 17, 2024 • 110
Reasoning benchmarks Collection Various benchmarks for reasoning capabilities of LLMs • 1 item • Updated Oct 4, 2024
Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos Paper • 2410.02763 • Published Oct 3, 2024 • 7
PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation Paper • 2409.06820 • Published Sep 10, 2024 • 64