Wan 2.1 comes in four versions: T2V-1.3B, T2V-14B, I2V-14B-720P, and I2V-14B-480P. The T2V models generate videos from text, while the I2V models convert images into videos. Notably, the T2V-1.3B model can run on consumer-grade GPUs with just 8.19GB of VRAM, generating a five-second 480p video in four minutes on an Nvidia RTX 4090.