In a Training Loop 🔄

6 52 60

Lê Võ Quyết Thắng

thangvip

https://vualidon.icu

AI & ML interests

Adapting LLM to specific domain

Recent Activity

updated a model 10 minutes ago

thangvip/qwen3-1.7b-dspo-no-sft-exp

published a model about 2 hours ago

thangvip/qwen3-1.7b-dspo-no-sft-exp

updated a model about 8 hours ago

thangvip/qwen3-1.7b-dspo-exp

View all activity

Organizations

updated a model 10 minutes ago

thangvip/qwen3-1.7b-dspo-no-sft-exp

Text Generation • 2B • Updated 10 minutes ago

published a model about 2 hours ago

thangvip/qwen3-1.7b-dspo-no-sft-exp

Text Generation • 2B • Updated 10 minutes ago

updated a model about 8 hours ago

thangvip/qwen3-1.7b-dspo-exp

Text Generation • 2B • Updated about 8 hours ago

updated a model about 13 hours ago

thangvip/qwen3-1.7b-grpo-exp

Text Generation • 2B • Updated about 13 hours ago

published 2 models 1 day ago

thangvip/qwen3-1.7b-grpo-exp

Text Generation • 2B • Updated about 13 hours ago

thangvip/qwen3-1.7b-dspo-exp

Text Generation • 2B • Updated about 8 hours ago

published a model 2 days ago

thangvip/grpo

Updated 2 days ago

published 2 models 6 days ago

thangvip/dspo_output_run_3

Updated 6 days ago

thangvip/dspo_output_run_2

Updated 6 days ago

published a model 7 days ago

thangvip/dspo_output_run_1

Updated 7 days ago

updated a model 7 days ago

thangvip/Qwen3-1.7B-SFT-math-1500

Text Generation • 2B • Updated 7 days ago • 1.02k

published a model 7 days ago

thangvip/Qwen3-1.7B-SFT-math-1500

Text Generation • 2B • Updated 7 days ago • 1.02k

liked a dataset 2 months ago

PleIAs/SYNTH

Viewer • Updated Nov 11, 2025 • 68M • 35.2k • 229

liked a dataset 3 months ago

BatsResearch/planetarium

Viewer • Updated Feb 25, 2025 • 584k • 182 • 15

updated 2 models 3 months ago

thangvip/qwen3-4b-legal-sft-grpo-phase-2

Text Generation • 4B • Updated Oct 31, 2025

thangvip/qwen3-1.7b-legal-sft-grpo-phase-2

Text Generation • 2B • Updated Oct 30, 2025

published 2 models 3 months ago

thangvip/qwen3-4b-legal-sft-grpo-phase-2

Text Generation • 4B • Updated Oct 31, 2025

thangvip/qwen3-1.7b-legal-sft-grpo-phase-2

Text Generation • 2B • Updated Oct 30, 2025

updated 2 models 3 months ago

thangvip/qwen3-4b-legal-sft-grpo

Text Generation • 4B • Updated Oct 29, 2025 • 5

thangvip/qwen3-1.7b-legal-sft-grpo

Text Generation • 2B • Updated Oct 29, 2025 • 1

Lê Võ Quyết Thắng

AI & ML interests

Recent Activity

Organizations

thangvip's activity