ํ‹ฐ์Šคํ† ๋ฆฌ ๋ทฐ

728x90
๋ฐ˜์‘ํ˜•

https://comfyanonymous.github.io/ComfyUI_examples/wan/

 

Wan 2.1 Models

Examples of ComfyUI workflows

comfyanonymous.github.io

๐Ÿ” WAN ๋ชจ๋ธ์ด๋ž€?

WAN ๋ชจ๋ธ์€ ์ด๋ฏธ์ง€ → ๋น„๋””์˜ค ๋ณ€ํ™˜ (I2V, Image-to-Video) ๋ชจ๋ธ๋กœ, ์ •์ ์ธ ์ด๋ฏธ์ง€๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์›€์ง์ด๋Š” ๋น„๋””์˜ค๋ฅผ ์ƒ์„ฑํ•˜๋Š” AI ๋ชจ๋ธ์ด๋‹ค.
ComfyUI์—์„œ ๋น„๋””์˜ค ์ƒ์„ฑ์šฉ Diffusion ๋ชจ๋ธ์„ ํ…Œ์ŠคํŠธํ•˜๊ณ  ์‹ถ๋‹ค๋ฉด WAN ๋ชจ๋ธ์„ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค.

 

ComfyUI์—์„œ Wan 2.1 ๋ชจ๋ธ ์‚ฌ์šฉํ•˜๊ธฐ

1. Download 

- Text encoder and VAE:

umt5_xxl_fp8_e4m3fn_scaled.safetensors goes in: ComfyUI/models/text_encoders/

wan_2.1_vae.safetensors goes in: ComfyUI/models/vae/

 

- Video Models

The diffusion models can be found here

These files go in: ComfyUI/models/diffusion_models/

(๋‚˜๋Š” i2v 720 14b bf16) image to video , 720 ๋ฝ‘ํžˆ๋Š” ๋ฒ„์ „์œผ๋กœ ๋ฐ›์•˜๋‹ค.

Image to Video

This workflow requires the wan2.1_i2v_480p_14B_bf16.safetensors file (put it in: ComfyUI/models/diffusion_models/) and clip_vision_h.safetensors which goes in: ComfyUI/models/clip_vision/

์–˜๋Š” 480 ์‚ฌ์ด์ฆˆ 

Note this example only generates 33 frames at 512x512 because I wanted it to be accessible, the model can do more than that. The 720p model is pretty good if you have the hardware/patience to run it.

 

 


1๏ธโƒฃ WAN ๋ชจ๋ธ์˜ ํŠน์ง•

โœ… WAN ๋ชจ๋ธ (WAN 2.1)์˜ ํ•ต์‹ฌ ๊ธฐ๋Šฅ

  • ์ •์ง€๋œ ์ด๋ฏธ์ง€(์‚ฌ์ง„)๋กœ๋ถ€ํ„ฐ ์ž์—ฐ์Šค๋Ÿฌ์šด ์›€์ง์ž„์„ ๊ฐ€์ง„ ๋น„๋””์˜ค ์ƒ์„ฑ
  • ์งง์€ ๊ธธ์ด (33 ํ”„๋ ˆ์ž„)์—์„œ ๋” ๊ธด ๋น„๋””์˜ค ์ƒ์„ฑ๊นŒ์ง€ ํ™•์žฅ ๊ฐ€๋Šฅ
  • ๊ธฐ์กด Diffusion ๊ธฐ๋ฐ˜ ๋น„๋””์˜ค ๋ชจ๋ธ๋ณด๋‹ค ๋” ์ž์—ฐ์Šค๋Ÿฌ์šด ์›€์ง์ž„๊ณผ ํ”„๋ ˆ์ž„ ์ผ๊ด€์„ฑ ์œ ์ง€

โžก๏ธ WAN ๋ชจ๋ธ์€ **Static Image(๊ณ ์ • ์ด๋ฏธ์ง€)**๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์›€์ง์ด๋Š” ์š”์†Œ๋ฅผ ์ถ”๊ฐ€ํ•˜์—ฌ ๋น„๋””์˜ค๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๋ฐ ํŠนํ™”๋จ.


2๏ธโƒฃ WAN ๋ชจ๋ธ์„ ComfyUI์—์„œ ์‚ฌ์šฉํ•  ๋•Œ ํ•„์š”ํ•œ ํŒŒ์ผ

WAN ๋ชจ๋ธ์„ ํ…Œ์ŠคํŠธํ•˜๋ ค๋ฉด ๋‹ค์Œ ํŒŒ์ผ์„ ๋‹ค์šด๋กœ๋“œํ•˜๊ณ  ์˜ฌ๋ฐ”๋ฅธ ํด๋”์— ๋ฐฐ์น˜ํ•ด์•ผ ํ•œ๋‹ค.

โœ… ํ•„์ˆ˜ ํŒŒ์ผ

ํŒŒ์ผ ์ด๋ฆ„์—ญํ• ์ €์žฅ ๊ฒฝ๋กœ

wan2.1_i2v_480p_14B_bf16.safetensors WAN ๋ชจ๋ธ ์ž์ฒด (Diffusion ๋ชจ๋ธ) ComfyUI/models/diffusion_models/
clip_vision_h.safetensors ๋น„๋””์˜ค ์ƒ์„ฑ ์‹œ ์ด๋ฏธ์ง€ ๋ถ„์„ ComfyUI/models/clip_vision/
umt5_xxl_fp8_e4m3fn_scaled.safetensors ํ…์ŠคํŠธ ์ธ์ฝ”๋” ComfyUI/models/text_encoders/
wan_2.1_vae.safetensors VAE (์ƒ‰๊ฐ & ๋””ํ…Œ์ผ ๊ฐœ์„ ) ComfyUI/models/vae/

 

https://www.youtube.com/watch?v=SG7ffQZslIw

 


๐Ÿ“ŒWAN ๋ชจ๋ธ๊ณผ ๋‹ค๋ฅธ Diffusion ๋ชจ๋ธ ๋น„๊ต

๋ชจ๋ธ์—ญํ• ํŠน์ง•

Stable Diffusion 1.5 / SDXL ์ด๋ฏธ์ง€ ์ƒ์„ฑ ์ •์ ์ธ ์ด๋ฏธ์ง€ ์ƒ์„ฑ (๋น„๋””์˜ค ๋ถˆ๊ฐ€)
Flux ๊ฒฝ๋Ÿ‰ ์ด๋ฏธ์ง€ ์ƒ์„ฑ VRAM ์ตœ์ ํ™”๋œ Diffusion ๋ชจ๋ธ (๋น„๋””์˜ค ๋ถˆ๊ฐ€)
WAN 2.1 ์ด๋ฏธ์ง€ → ๋น„๋””์˜ค ๋ณ€ํ™˜ ์ •์ง€ ์ด๋ฏธ์ง€๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ๋น„๋””์˜ค ์ƒ์„ฑ ๊ฐ€๋Šฅ

 


๐Ÿ“Œ SDXL (Stable Diffusion XL)๋ž€?

SDXL (Stable Diffusion XL)์€ Stability AI์—์„œ ๊ฐœ๋ฐœํ•œ Stable Diffusion ๋ชจ๋ธ์˜ ์ตœ์‹  ํ™•์žฅ ๋ฒ„์ „

1024x1024 ํ•ด์ƒ๋„

LoRA ๋ฐ ControlNet ์ง€์›

  • SDXL์€ LoRA(์†Œํ˜• ํ•™์Šต ๋ชจ๋ธ) ๋ฐ ControlNet(ํฌ์ฆˆ ์ปจํŠธ๋กค, ์Šค์ผ€์น˜ ์ ์šฉ ๋“ฑ)๊ณผ ํ•จ๊ป˜ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ์–ด ์ปค์Šคํ„ฐ๋งˆ์ด์ง• ๊ฐ•๋ ฅ

 

๐Ÿ“Œ SDXL์„ ์‚ฌ์šฉํ•  ๋•Œ ํ•„์š”ํ•œ ๊ฒƒ

  • SDXL Base ๋ชจ๋ธ (sd_xl_base_1.0.safetensors)
    → ๊ธฐ๋ณธ์ ์ธ ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๋ชจ๋ธ.
  • SDXL Refiner ๋ชจ๋ธ (sd_xl_refiner_1.0.safetensors) (์„ ํƒ ์‚ฌํ•ญ)
    → ์ด๋ฏธ์ง€๋ฅผ ๋”์šฑ ์„ธ๋ฐ€ํ•˜๊ฒŒ ๋‹ค๋“ฌ๊ณ  ๊ณ ํ’ˆ์งˆ๋กœ ๋งŒ๋“ค์–ด์ฃผ๋Š” ํ›„์ฒ˜๋ฆฌ ๋ชจ๋ธ.
  • VAE (์„ ํƒ ์‚ฌํ•ญ)
    → SDXL์— ์ตœ์ ํ™”๋œ VAE๋ฅผ ์ถ”๊ฐ€๋กœ ์ ์šฉํ•˜๋ฉด ์ƒ‰๊ฐ๊ณผ ๋””ํ…Œ์ผ์ด ๋” ์ข‹์•„์งˆ ์ˆ˜ ์žˆ์Œ.

๐Ÿš€ SDXL ๋ชจ๋ธ ๋‹ค์šด๋กœ๋“œ ๋ฐฉ๋ฒ•


 

 

๐Ÿ“Œ Diffusion ๋ชจ๋ธ : SDXL๊ณผ Flux ๋ชจ๋ธ

:๋‘˜์ด ์„œ๋กœ ๋‹ค๋ฅธ Diffusion ๋ชจ๋ธ์ด๋ฏ€๋กœ ๋‘˜์ค‘ ํ•˜๋‚˜ ์„ ํƒํ•ด์„œ ์‚ฌ์šฉ.

(Diffusion ๋ชจ๋ธ์€ ์ด๋ฏธ์ง€ ์ž์ฒด๋ฅผ ์ƒ์„ฑํ•˜๋Š” AI ๋ชจ๋ธ)

โœ… SDXL (Stable Diffusion XL)

  • Stability AI์—์„œ ๊ฐœ๋ฐœํ•œ ๋Œ€ํ˜• Diffusion ๋ชจ๋ธ.
  • 1024x1024 ํ•ด์ƒ๋„ ๊ธฐ๋ณธ ์ง€์›์œผ๋กœ ๊ธฐ์กด ๋ชจ๋ธ๋ณด๋‹ค ๋›ฐ์–ด๋‚œ ํ’ˆ์งˆ์„ ์ œ๊ณต.
  • Base ๋ฐ Refiner ๋ชจ๋ธ๋กœ ๊ตฌ์„ฑ๋จ.

โœ… Flux (by Black Forest Labs)

  • Flux๋Š” Black Forest Labs์—์„œ ๊ฐœ๋ฐœํ•œ Diffusion ๋ชจ๋ธ ํŒจ๋ฐ€๋ฆฌ.
  • ComfyUI์— ์ตœ์ ํ™”๋œ ๊ฒฝ๋Ÿ‰ ๋ชจ๋ธ๋“ค์ด ์ œ๊ณต๋จ.
  • FP8 ๋ฒ„์ „์ด ์žˆ์–ด VRAM ์‚ฌ์šฉ๋Ÿ‰์„ ์ค„์ด๋ฉด์„œ๋„ ๋†’์€ ํ’ˆ์งˆ ์œ ์ง€ ๊ฐ€๋Šฅ.
  • Flux Dev๋Š” ๊ฐœ๋ฐœ ๋ฒ„์ „์œผ๋กœ, ์ƒˆ๋กœ์šด ๊ธฐ๋Šฅ์ด ์ถ”๊ฐ€๋  ์ˆ˜ ์žˆ์Œ.

https://github.com/black-forest-labs/flux

 

GitHub - black-forest-labs/flux: Official inference repo for FLUX.1 models

Official inference repo for FLUX.1 models. Contribute to black-forest-labs/flux development by creating an account on GitHub.

github.com

 

1๏ธโƒฃ ๋ชจ๋ธ๊ธฐ๋ณธ Text Encoder

SDXL t5xxl_fp16.safetensors
Flux umt5_xxl_fp8_e4m3fn_scaled.safetensors

2๏ธโƒฃ VAE (Variational Autoencoder)๋ชจ๋ธ

(VAE๋Š” ์ด๋ฏธ์ง€์˜ ๋””ํ…Œ์ผ๊ณผ ์ƒ‰๊ฐ์„ ์กฐ์ •ํ•˜๋Š” ์—ญํ•  : ์ƒ‰๊ฐ, ๋ช…์•”, ํ•ด์ƒ๋„ ํ–ฅ์ƒ)

SDXL sdxl_vae.safetensors (์ถ”์ฒœ)
Flux wan_2.1_vae.safetensors

 


๐Ÿ” clip_vision_h.safetensors๊ฐ€ ๋ฌด์—‡์ธ๊ฐ€?

clip_vision_h.safetensors๋Š” CLIP (Contrastive Language-Image Pretraining) ๋ชจ๋ธ์˜ Vision Encoder ๋ถ€๋ถ„์ด๋‹ค.

CLIP (Contrastive Language-Image Pretraining)์€ OpenAI์—์„œ ๊ฐœ๋ฐœํ•œ ๋ชจ๋ธ๋กœ, ์ด๋ฏธ์ง€์™€ ํ…์ŠคํŠธ๋ฅผ ์—ฐ๊ฒฐํ•˜๋Š” ์—ญํ• ์„ ํ•œ๋‹ค.

 


๐Ÿ” LoRA(๋กœ๋ผ) ๋ชจ๋ธ์ด๋ž€?

โœ… LoRA (Low-Rank Adaptation)๋Š” ํŠน์ • ์Šคํƒ€์ผ์ด๋‚˜ ์บ๋ฆญํ„ฐ๋ฅผ ์ถ”๊ฐ€ ํ•™์Šตํ•˜๋Š” ์ž‘์€ ๋ชจ๋ธ
โœ… ๊ธฐ์กด Diffusion ๋ชจ๋ธ(WAN, SDXL ๋“ฑ)์— ์ถ”๊ฐ€์ ์œผ๋กœ ์ ์šฉํ•˜๋Š” ๋ฐฉ์‹
โœ… ๋ฉ”๋ชจ๋ฆฌ ์‚ฌ์šฉ๋Ÿ‰์ด ์ ๊ณ  ๋น ๋ฅด๊ฒŒ ๋กœ๋“œ ๊ฐ€๋Šฅ

โžก๏ธ ์ฆ‰, LoRA๋Š” "์ถ”๊ฐ€์ ์ธ ํ•™์Šต ๋ฐ์ดํ„ฐ"๋ฅผ ๊ธฐ์กด ๋ชจ๋ธ์— ๊ฒฐํ•ฉํ•˜๋Š” ์—ญํ• ์„ ํ•œ๋‹ค.
โžก๏ธ WAN ๋ชจ๋ธ + LoRA๋ฅผ ๊ฒฐํ•ฉํ•˜๋ฉด ํŠน์ • ์Šคํƒ€์ผ์˜ ๋น„๋””์˜ค๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๋‹ค!

๋ชจ๋ธ ์œ ํ˜• ์—ญํ•  ์ ์šฉ ๋ฐฉ์‹
Diffusion ๋ชจ๋ธ (WAN, SDXL ๋“ฑ) ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑ (WAN์€ ๋น„๋””์˜ค ์ƒ์„ฑ) ํ•„์ˆ˜
VAE ์ƒ‰๊ฐ, ๋””ํ…Œ์ผ ๊ฐœ์„  ์„ ํƒ ์‚ฌํ•ญ (ํ’ˆ์งˆ ํ–ฅ์ƒ ๊ฐ€๋Šฅ)
LoRA ์Šคํƒ€์ผ ๋ฐ ์บ๋ฆญํ„ฐ ์ถ”๊ฐ€ ์„ ํƒ ์‚ฌํ•ญ (๊ฐœ์„ฑ ์žˆ๋Š” ๊ฒฐ๊ณผ๋ฌผ ์ƒ์„ฑ ๊ฐ€๋Šฅ)

โžก๏ธ Diffusion ๋ชจ๋ธ์€ ๊ธฐ๋ณธ์ ์ธ ๊ตฌ์กฐ, VAE๋Š” ํ’ˆ์งˆ ํ–ฅ์ƒ, LoRA๋Š” ์Šคํƒ€์ผ์„ ์ถ”๊ฐ€ํ•˜๋Š” ์—ญํ• !
โžก๏ธ WAN ๋ชจ๋ธ์— LoRA๋ฅผ ์ถ”๊ฐ€ํ•˜๋ฉด ๋น„๋””์˜ค์˜ ์Šคํƒ€์ผ์„ ๋”์šฑ ๊ฐ•ํ•˜๊ฒŒ ์ปค์Šคํ„ฐ๋งˆ์ด์ง• ๊ฐ€๋Šฅ!

 

728x90
๋ฐ˜์‘ํ˜•
250x250
๊ณต์ง€์‚ฌํ•ญ
์ตœ๊ทผ์— ์˜ฌ๋ผ์˜จ ๊ธ€
์ตœ๊ทผ์— ๋‹ฌ๋ฆฐ ๋Œ“๊ธ€
Total
Today
Yesterday
๋งํฌ
ยซ   2025/03   ยป
์ผ ์›” ํ™” ์ˆ˜ ๋ชฉ ๊ธˆ ํ† 
1
2 3 4 5 6 7 8
9 10 11 12 13 14 15
16 17 18 19 20 21 22
23 24 25 26 27 28 29
30 31
๊ธ€ ๋ณด๊ด€ํ•จ
๋ฐ˜์‘ํ˜•