Song Dingjie

songdj · bbsngg

AI & ML interests

None yet

Recent Activity

upvoted a paper 16 days ago

Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models

upvoted a paper about 1 month ago

DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry

authored a paper 7 months ago

Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents

Organizations

None yet

authored 2 papers 7 months ago

Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents

Paper • 2505.23450 • Published May 29, 2025 • 9

SAMed-2: Selective Memory Enhanced Medical Segment Anything Model

Paper • 2507.03698 • Published Jul 4, 2025 • 11

authored a paper 8 months ago

A Survey on Post-training of Large Language Models

Paper • 2503.06072 • Published Mar 8, 2025 • 11

authored a paper 10 months ago

Aligning Multimodal LLM with Human Preference: A Survey

Paper • 2503.14504 • Published Mar 18, 2025 • 26

authored 3 papers about 1 year ago

BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement

Paper • 2412.14203 • Published Dec 16, 2024 • 1

On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Paper • 2412.20070 • Published Dec 28, 2024 • 42

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

Paper • 2411.03823 • Published Nov 6, 2024 • 49

authored 7 papers over 1 year ago

Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs

Paper • 2409.10994 • Published Sep 17, 2024 • 1

AceGPT, Localizing Large Language Models in Arabic

Paper • 2309.12053 • Published Sep 21, 2023

HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

Paper • 2311.09774 • Published Nov 16, 2023 • 1

CMB: A Comprehensive Medical Benchmark in Chinese

Paper • 2308.08833 • Published Aug 17, 2023 • 1

MileBench: Benchmarking MLLMs in Long Context

Paper • 2404.18532 • Published Apr 29, 2024 • 1

TCBERT: A Technical Report for Chinese Topic Classification BERT

Paper • 2211.11304 • Published Nov 21, 2022

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Paper • 2409.02889 • Published Sep 4, 2024 • 54