bartowski/cognitivecomputations_Dolphin3.0-R1-Mistral-24B-GGUF Text Generation • Updated 10 days ago • 37.1k • 51
Chat with DeepSeek-VL2-small 🌍 Generate responses using images and text input • Running on Zero • 376
Low-Rank Adapters Meet Neural Architecture Search for LLM Compression Paper • 2501.16372 • Published 26 days ago • 9
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 20 days ago • 34
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Paper • 2501.17433 • Published 19 days ago • 9
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 19 days ago • 54
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published 18 days ago • 81
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 18 days ago • 53
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published 18 days ago • 24