DisPose: Disentangling Pose Guidance for Controllable Human Image Animation Paper • 2412.09349 • Published Dec 12, 2024 • 8
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents Paper • 2401.00812 • Published Jan 1, 2024 • 4
What is the Visual Cognition Gap between Humans and Multimodal LLMs? Paper • 2406.10424 • Published Jun 14, 2024
Mitigating Transformer Overconfidence via Lipschitz Regularization Paper • 2306.06849 • Published Jun 12, 2023
MACP: Efficient Model Adaptation for Cooperative Perception Paper • 2310.16870 • Published Oct 25, 2023
LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs Paper • 2312.04372 • Published Dec 7, 2023
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published Jul 1, 2024 • 57
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression Paper • 2403.15447 • Published Mar 18, 2024 • 16
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data Paper • 2401.17600 • Published Jan 31, 2024
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer Paper • 2312.03724 • Published Nov 27, 2023 • 1
Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion Paper • 2401.12947 • Published Jan 23, 2024 • 2
Copiloting the Copilots: Fusing Large Language Models with Completion Engines for Automated Program Repair Paper • 2309.00608 • Published Sep 1, 2023 • 2
NeuRI: Diversifying DNN Generation via Inductive Rule Inference Paper • 2302.02261 • Published Feb 4, 2023 • 3
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models Paper • 2306.11698 • Published Jun 20, 2023 • 12
Video Pre-trained Transformer: A Multimodal Mixture of Pre-trained Experts Paper • 2304.10505 • Published Mar 24, 2023 • 1