fp8 scaled version of video2world and text2world

screenshot

setup (once)

  • drag Cosmos-1_0-Diffusion-7B-Video2World_fp8_e4m3fn.safetensors [7.24GB] or/and Cosmos-1_0-Diffusion-7B-Text2World_fp8_e4m3fn.safetensors [7.24GB] to > ./ComfyUI/models/diffusion_models
  • drag oldt5_xxl_fp8_e4m3fn.safetensors [4.9GB] to > ./ComfyUI/models/text_encoders
  • drag cosmos_cv8x8x8_1.0_vae_bf16.safetensors [211MB] to > ./ComfyUI/models/vae

run it straight (no installation needed way)

  • run the .bat file in the main directory (assuming you are using the gguf-node pack below)
  • drag the workflow json file (below) to > your browser

workflow

reference

Prompt
anime style anime girl with massive fennec ears and one big fluffy tail, she has blonde long hair blue eyes wearing a maid outfit with a long black gold leaf pattern dress, walking slowly to the front with sweetie smile, holding a fancy black forest cake with candles on top in the kitchen of an old dark Victorian mansion lit by candlelight with a bright window to the foggy forest
Negative Prompt
The video captures a series of frames showing ugly scenes, static with no motion, motion blur, over-saturation, shaky footage, low resolution, grainy texture, pixelated images, poorly lit areas, underexposed and overexposed scenes, poor color balance, washed out colors, choppy sequences, jerky movements, low frame rate, artifacting, color banding, unnatural transitions, outdated special effects, fake elements, unconvincing visuals, poorly edited content, jump cuts, visual noise, and flickering. Overall, the video is of poor quality.
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference API
Unable to determine this model's library. Check the docs .

Model tree for calcuis/cosmos

Finetuned
(1)
this model