So I tried following this guide:
https://comfyui-wiki.com/en/tutorial/advanced/video/wan2.1/wan2-1-video-model?utm_source=chatgpt.com
specifically: "2. Wan2.1 Image-to-Video Workflow"
I use
wan2.1_i2v_480p_14B_bf16.safetensors
umt5_xxl_fp8_e4m3fn_scaled.safetensors
and chose those in the workflow.
I basically just followed the guide, downloaded the files, and added them where it said (the "text_encoders/" folder is the "clip" folder).
I didn't change any other parameters and did all of this in the Comfy workflow tab.
I hit run and get:
**"WanImageToVideo
input must be 4-dimensional "**
with these logs:
https://paste.denizenscript.com/View/137410
My server backend arguments: "--directml --lowvram"
I tried removing those and just adding "--cpu" instead. With that, the error no longer appeared, but the server simply disconnected and restarted. I have 32 GB of RAM; from what I saw, the CPU was only at about 50% load and only about 10 GB of RAM was in use.
Graphics card
AMD Radeon RX 5700 XT 8GB
Sapphire Nitro+ Special Edition
Drive
Samsung 990 PRO NVMe M.2 SSD
Processor
AMD Ryzen 9 3900X 12-core processor
RAM
Corsair DIMM 32GB DDR4-3600 kit
Mainboard
ASUS ROG Strix B550-F Gaming
Let me know if any other information is needed.
Content of Swarm Debug Log Paste #137410: SwarmUI v0.9.7.2 Server Log - 2025-11-14 21:11:56... pasted 2025/11/14 12:11:56 UTC-08:00, Paste length: 31177 characters across 279 lines, Content: `2025-11-14 21:10:41.755 [Init] === SwarmUI v0.9.7.2 Starting at 2025-11-14 21:10:41 === 2025-11-14 21:10:41.799 [Init] Prepping extension: SwarmUI...