- 
	
	
	FinTral: A Family of GPT-4 Level Multimodal Financial Large Language ModelsPaper • 2402.10986 • Published • 80
- 
	
	
	Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative PretrainingPaper • 2408.02657 • Published • 35
- 
	
	
	NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at ScalePaper • 2508.10711 • Published • 142
Charles Cai
charlescai2016
		AI & ML interests
None yet
		Recent Activity
						upvoted 
								a
								paper
							
						about 18 hours ago
						
					
						
						
						Video-Thinker: Sparking "Thinking with Videos" via Reinforcement
  Learning
						
						liked
								a model
							
						8 days ago
						
					
						
						
						
						deepseek-ai/DeepSeek-OCR
						
						liked
								a dataset
							
						8 days ago
						
					
						
						
						
						criteo/CriteoClickLogs
						
 
								 
								 
								


