---
license: apache-2.0
language:
- en
pipeline_tag: text-to-image
tags:
- mamba
- generative model
- stable diffusion
- stochastic interpolant
- zigma
- zigzag

---
# ZigMa: A DiT-style Zigzag Mamba Diffusion Model (ECCV 2024)

This is the official checkpoint for the paper ["ZigMa: A DiT-style Zigzag Mamba Diffusion Model"](https://huggingface.co/papers/2403.13802).


[![Website](doc/badges/badge-website.svg)](https://taohu.me/project_zigma)
[![Paper](https://img.shields.io/badge/arXiv-PDF-b31b1b)](https://arxiv.org/abs/2403.13802)
[![Hugging Face Model](https://img.shields.io/badge/🤗%20Hugging%20Face-Model-green)](https://huggingface.co/Bingxin/Marigold)
[![License](https://img.shields.io/badge/License-Apache--2.0-929292)](https://www.apache.org/licenses/LICENSE-2.0)


[Vincent Tao Hu](http://taohu.me),
[Stefan Andreas Baumann](https://stefan-baumann.eu/),
[Ming Gui](https://www.linkedin.com/in/ming-gui-87b76a16b/?originalSubdomain=de),
[Olga Grebenkova](https://www.linkedin.com/in/grebenkovao/),
[Pingchuan Ma](https://www.linkedin.com/in/pingchuan-ma-492543156/),
[Johannes Fischer](https://www.linkedin.com/in/js-fischer/),
[Bjorn Ommer](https://ommer-lab.com/people/ommer/)

We present ZigMa, a scanning scheme that follows a zigzag pattern, considering both spatial continuity and parameter efficiency. We further adapt this scheme to video, separating the reasoning between spatial and temporal dimensions, thus achieving efficient parameter utilization. Our design allows for greater incorporation of inductive bias for non-1D data and improves parameter efficiency in diffusion models.

![teaser](doc/teaser_3col.png)
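
To make the idea of spatial continuity concrete, here is a minimal sketch of a single zigzag (serpentine) scan over a 2D grid of patch tokens. It is an illustration only, not the code from the official repository: the function names `zigzag_order` and `max_step` are hypothetical, and the scheme described in the paper may use additional scan variants that this sketch does not cover.

```python
import numpy as np


def zigzag_order(height: int, width: int) -> np.ndarray:
    """Flat token indices that visit an H x W grid along a serpentine path.

    Even rows are read left to right and odd rows right to left, so every
    pair of consecutive tokens in the 1D sequence is adjacent in 2D.
    (Hypothetical helper for illustration.)
    """
    grid = np.arange(height * width).reshape(height, width)
    grid[1::2] = grid[1::2, ::-1].copy()  # reverse every odd row
    return grid.reshape(-1)


def max_step(order: np.ndarray, width: int) -> int:
    """Largest Manhattan distance between consecutive tokens in a scan order."""
    rows, cols = np.divmod(order, width)
    return int(np.max(np.abs(np.diff(rows)) + np.abs(np.diff(cols))))


order = zigzag_order(3, 4)
print(order.tolist())                 # [0, 1, 2, 3, 7, 6, 5, 4, 8, 9, 10, 11]
print(max_step(order, 4))             # 1: neighbours in the sequence stay neighbours in 2D
print(max_step(np.arange(12), 4))     # 4: a plain row-major scan jumps at each row end

# Reorder a (num_tokens, dim) tensor for the sequence model, then undo it.
tokens = np.random.rand(12, 8)
scanned = tokens[order]
restored = scanned[np.argsort(order)]
```

A plain row-major scan breaks continuity at the end of every row, while the serpentine order keeps each step at distance one. Because the reordering is only an index permutation, it adds no learnable parameters.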



## 🎓 Citation

```bibtex
@InProceedings{hu2024zigma,
      title={ZigMa: A DiT-style Zigzag Mamba Diffusion Model},
      author={Vincent Tao Hu and Stefan Andreas Baumann and Ming Gui and Olga Grebenkova and Pingchuan Ma and Johannes Fischer and Bjorn Ommer},
      booktitle = {ECCV},
      year={2024}
}
```

## 🎫 License

This work is licensed under the Apache License, Version 2.0 (as defined in the [LICENSE](LICENSE.txt)).

By downloading and using the code and model, you agree to the terms in the [LICENSE](LICENSE.txt).

[![License](https://img.shields.io/badge/License-Apache--2.0-929292)](https://www.apache.org/licenses/LICENSE-2.0)