zhang
AI & ML interests
Recent Activity
Organizations
- Running7
Browser only - Screen Capture & OCR
π7One-minute creation by AI Coding Autonomous Agent MOUSE-I
- RunningAgents688
First Agent Template
β‘688Generate code and get AI answers with tool support
- Runtime errorAgentsFeatured128
OctoTools
π128An Agentic Framework with Tools for Complex Reasoning
- Runtime errorFeatured142
smolagents LLM leaderboard
π142A leaderboard for LLMs powering smolagents
-
LiuZichen/MagicQuill-models
Image-to-Image β’ Updated β’ 63 - Running on ZeroAgents616
Leffa
π616Generate realistic person images with new clothes or poses
- PausedAgents61
Dokdo
β‘61Image to Video Generation
- RunningAgents194
Llama-4-Maverick-17B-search
π194Generate detailed answers using web search and AI
- Running on ZeroAgentsFeatured1.75k
Joy Caption Alpha Two
π1.75kGenerate detailed captions or prompts for any image
- Runtime errorAgents40
Florence Llama
π¬40Generate text responses from images and text input
-
trollek/ImagePromptHelper-danube3-500M
Text Generation β’ 0.5B β’ Updated β’ 9 β’ 4 -
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B β’ Updated β’ 182 β’ 3
-
laion/laion-audio-preview
Viewer β’ Updated β’ 4.15M β’ 7.34k β’ 11 - Running on ZeroAgentsFeatured2.87k
F5-TTS
π£2.87kF5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- PausedFeatured2.21k
FacePoke
π2.21kImport a portrait, click to move the head!
- Runtime errorAgentsFeatured697
Fish Audio S1
π697Convert text to natural-sounding speech audio
-
google/shieldgemma-2-4b-it
Image-Text-to-Text β’ 4B β’ Updated β’ 4.72k β’ 164 - RunningAgents39
Joycaption Watermark Detection
π₯39Watermark detection
-
Camais03/camie-tagger-v2
Image Classification β’ Updated β’ 188 β’ 71 -
strangerguardhf/NSFW-MultiDomain-Classification-v2.0
Updated β’ 26 β’ 4
-
allenai/olmOCR-7B-0225-preview
Image-Text-to-Text β’ 8B β’ Updated β’ 25.3k β’ 708 - Running on ZeroAgentsFeatured81
Nanonets OCR
π81Demo for Nanonets-OCR
- Running on ZeroMCP408
Multimodal OCR
π408Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
- Running on ZeroMCPFeatured143
Multimodal OCR2
π»143FireRed / Nanonets / Monkey / Thyme / Typhoon / SmolDocling
-
chflame163/ComfyUI_LayerStyle
Updated β’ 134 β’ 115 -
allenai/Molmo-7B-D-0924
Image-Text-to-Text β’ 8B β’ Updated β’ 42.5k β’ 567 - Running on ZeroAgents248
Chroma
π₯248Generate detailed fantasy and realistic images from text descriptions
- RunningMCP46
Doc Mcp
π46RAG on documentations for your agent
- Running on ZeroAgents1.68k
Flux.1-dev Upscaler
π1.68kUpscale lowβresolution images to highβresolution with AI
- Running on ZeroAgents460
InvSR
π460Image Super-resolution via Diffusion Inversion
- PausedAgents241
FLUX Upsacle Image
π₯241Upscale images with control and customization
- Runtime errorAgentsFeatured283
Thera Arbitrary-Scale Super-Resolution
π₯283Upscale photos to any size with neural superβresolution
-
Djrango/Qwen2vl-Flux
Text-to-Image β’ Updated β’ 509 - Running on ZeroAgentsFeatured941
OminiControl
π941Generate new images from a subject photo and text prompt
- Build errorAgents563
FLUXllama gpt-oss
π563mcp_server & FLUX 4-bit Quantization + Enhanced
- Running on ZeroAgentsFeatured2.25k
MagicQuill
πͺΆ2.25kEdit images with scribbleβbased color and edge control
-
google/shieldgemma-2-4b-it
Image-Text-to-Text β’ 4B β’ Updated β’ 4.72k β’ 164 - RunningAgents39
Joycaption Watermark Detection
π₯39Watermark detection
-
Camais03/camie-tagger-v2
Image Classification β’ Updated β’ 188 β’ 71 -
strangerguardhf/NSFW-MultiDomain-Classification-v2.0
Updated β’ 26 β’ 4
- Running7
Browser only - Screen Capture & OCR
π7One-minute creation by AI Coding Autonomous Agent MOUSE-I
- RunningAgents688
First Agent Template
β‘688Generate code and get AI answers with tool support
- Runtime errorAgentsFeatured128
OctoTools
π128An Agentic Framework with Tools for Complex Reasoning
- Runtime errorFeatured142
smolagents LLM leaderboard
π142A leaderboard for LLMs powering smolagents
-
allenai/olmOCR-7B-0225-preview
Image-Text-to-Text β’ 8B β’ Updated β’ 25.3k β’ 708 - Running on ZeroAgentsFeatured81
Nanonets OCR
π81Demo for Nanonets-OCR
- Running on ZeroMCP408
Multimodal OCR
π408Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
- Running on ZeroMCPFeatured143
Multimodal OCR2
π»143FireRed / Nanonets / Monkey / Thyme / Typhoon / SmolDocling
-
LiuZichen/MagicQuill-models
Image-to-Image β’ Updated β’ 63 - Running on ZeroAgents616
Leffa
π616Generate realistic person images with new clothes or poses
- PausedAgents61
Dokdo
β‘61Image to Video Generation
- RunningAgents194
Llama-4-Maverick-17B-search
π194Generate detailed answers using web search and AI
-
chflame163/ComfyUI_LayerStyle
Updated β’ 134 β’ 115 -
allenai/Molmo-7B-D-0924
Image-Text-to-Text β’ 8B β’ Updated β’ 42.5k β’ 567 - Running on ZeroAgents248
Chroma
π₯248Generate detailed fantasy and realistic images from text descriptions
- RunningMCP46
Doc Mcp
π46RAG on documentations for your agent
- Running on ZeroAgentsFeatured1.75k
Joy Caption Alpha Two
π1.75kGenerate detailed captions or prompts for any image
- Runtime errorAgents40
Florence Llama
π¬40Generate text responses from images and text input
-
trollek/ImagePromptHelper-danube3-500M
Text Generation β’ 0.5B β’ Updated β’ 9 β’ 4 -
trollek/ImagePromptHelper-danube3-500M-GGUF
0.5B β’ Updated β’ 182 β’ 3
- Running on ZeroAgents1.68k
Flux.1-dev Upscaler
π1.68kUpscale lowβresolution images to highβresolution with AI
- Running on ZeroAgents460
InvSR
π460Image Super-resolution via Diffusion Inversion
- PausedAgents241
FLUX Upsacle Image
π₯241Upscale images with control and customization
- Runtime errorAgentsFeatured283
Thera Arbitrary-Scale Super-Resolution
π₯283Upscale photos to any size with neural superβresolution
-
Djrango/Qwen2vl-Flux
Text-to-Image β’ Updated β’ 509 - Running on ZeroAgentsFeatured941
OminiControl
π941Generate new images from a subject photo and text prompt
- Build errorAgents563
FLUXllama gpt-oss
π563mcp_server & FLUX 4-bit Quantization + Enhanced
- Running on ZeroAgentsFeatured2.25k
MagicQuill
πͺΆ2.25kEdit images with scribbleβbased color and edge control
-
laion/laion-audio-preview
Viewer β’ Updated β’ 4.15M β’ 7.34k β’ 11 - Running on ZeroAgentsFeatured2.87k
F5-TTS
π£2.87kF5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
- PausedFeatured2.21k
FacePoke
π2.21kImport a portrait, click to move the head!
- Runtime errorAgentsFeatured697
Fish Audio S1
π697Convert text to natural-sounding speech audio