2 3 2

Prakhar Dixit

pdx97

AI & ML interests

None yet

Recent Activity

updated a dataset about 2 months ago

pdx97/Schema_Based_Instruction_Dataset

View all activity

Organizations

pdx97's activity

updated a dataset about 2 months ago

pdx97/Schema_Based_Instruction_Dataset

Viewer • Updated Nov 18, 2024 • 360 • 3

authored a paper 3 months ago

SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation

Paper • 2410.13293 • Published Oct 17, 2024 • 2

commented a paper 3 months ago

SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation

Paper • 2410.13293 • Published Oct 17, 2024 • 2 •

upvoted a paper 8 months ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 34

authored a paper 8 months ago

ReProHRL: Towards Multi-Goal Navigation in the Real World using Hierarchical Agents

Paper • 2308.08737 • Published Aug 17, 2023

updated 3 models 8 months ago

upvoted a paper 8 months ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 50

updated a model 8 months ago

pdx97/poca-SoccerTwos

Reinforcement Learning • Updated May 5, 2024 • 14

liked a Space 8 months ago

Running

260

🐶

Huggy

updated 2 models 8 months ago

pdx97/a2c-PandaReachDense-v3_New

Reinforcement Learning • Updated May 4, 2024

pdx97/a2c-PandaReachDense-v3

Reinforcement Learning • Updated May 4, 2024

upvoted an article 9 months ago

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

Jan 18, 2024

• 40

updated 2 models 9 months ago

pdx97/Cartpole_V1_Updated

Reinforcement Learning • Updated Apr 17, 2024

pdx97/ppo-LunarLander-v2-Latest_Upadte

Reinforcement Learning • Updated Apr 17, 2024

New activity in pdx97/ppo-LunarLander-v2-Latest_Upadte 9 months ago

Add `agent.pt` and tag the environment

#1 opened 9 months ago by

qgallouedec

updated 3 models 9 months ago

pdx97/Cartpole_New

Reinforcement Learning • Updated Apr 15, 2024

pdx97/ppo-Pyramid

Reinforcement Learning • Updated Apr 15, 2024 • 1

pdx97/ppo-SnowballTarget

Reinforcement Learning • Updated Apr 15, 2024 • 1