arxiv:2501.03271
Basab Ghosh
basab1142
AI & ML interests
Computer Vision NLP, and RL
Recent Activity
authored
a paper
about 23 hours ago
DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich
Paradigm for Direct Preference Optimization
updated
a model
4 days ago
basab1142/FPO_Gemma_7b_it
Organizations
None yet
Papers
1
models
6
basab1142/FPO_Gemma_7b_it
Updated
•
2
basab1142/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
basab1142/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
basab1142/Taxi-v3-CQ
Reinforcement Learning
•
Updated
basab1142/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
basab1142/ppo-LunarLander-v2
Reinforcement Learning
•
Updated