17 39 72

Alireza Mohammadshahi

alirezamsh

AI & ML interests

AI/NLP (NMT,LLMs)

Organizations

upvoted a paper 3 months ago

KAPSO: A Knowledge-grounded framework for Autonomous Program Synthesis and Optimization

Paper • 2601.21526 • Published Jan 29 • 2

upvoted an article 12 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

Feb 4, 2025

•

128

upvoted 3 papers over 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 379

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking

Paper • 2403.09629 • Published Mar 14, 2024 • 79

PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers

Paper • 2406.12430 • Published Jun 18, 2024 • 7

upvoted a collection over 1 year ago

Probably function calling datasets

Collection

Created using the https://huggingface.co/spaces/librarian-bots/dataset-column-search-api Space. • 39 items • Updated Jul 17, 2024 • 39

upvoted a paper almost 2 years ago

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Paper • 2406.12753 • Published Jun 18, 2024 • 17

upvoted 6 papers about 2 years ago

A Careful Examination of Large Language Model Performance on Grade School Arithmetic

Paper • 2405.00332 • Published May 1, 2024 • 33

Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30, 2024 • 118

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30, 2024 • 81

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29, 2024 • 71

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Paper • 2404.16873 • Published Apr 21, 2024 • 29

FlowMind: Automatic Workflow Generation with LLMs

Paper • 2404.13050 • Published Mar 17, 2024 • 34

upvoted a collection about 2 years ago

OpenMath

Collection

A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated about 8 hours ago • 46

upvoted a paper about 2 years ago

Textbooks Are All You Need II: phi-1.5 technical report

Paper • 2309.05463 • Published Sep 11, 2023 • 92

upvoted 2 articles about 2 years ago

Article

Synthetic data: save money, time and carbon with open source

Feb 16, 2024

•

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20, 2024

•

113

upvoted 3 papers about 2 years ago

Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks

Paper • 2404.14723 • Published Apr 23, 2024 • 10

LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

Paper • 2404.16710 • Published Apr 25, 2024 • 81

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22, 2024 • 126

Alireza Mohammadshahi

AI & ML interests

Organizations

alirezamsh's activity

DABStep: Data Agent Benchmark for Multi-step Reasoning

Synthetic data: save money, time and carbon with open source

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models