Post
881
Gemma 3 seems to be really good at human preference. Just waiting for ppl to see it.
vision , multimedia , gradio, accessibility & cool demos
@image
: Generates an image using Stable Diffusion xL. @3d
: Generates a 3D mesh. @web
: Web search agents.@rAgent
: Initiates a reasoning chain using Llama mode for coding explanations. @tts1-β
, @tts2-β
: Voice generation (Female and Male voices). @yolo
: Object Detectiongraph TD
A[User Interface] --> B[Chat Logic]
B --> C{Command Type}
C -->|Text| D[FastThink-0.5B]
C -->|Image| E[Qwen2-VL-OCR-2B]
C -->|@image| F[Stable Diffusion XL]
C -->|@tts| G[Edge TTS]
D --> H[Response]
E --> H
F --> H
G --> H