Dreamix: Video Diffusion Models are General Video Editors

Ok it’s not there yet but wow.

Have you met Dreamix? They are using video diffusion models, that can create videos based on image and text inputs. In one of the examples, they were able to take a static image of a turtle and make it move. Per Dreamix, they are able to extract the visual features and then animate them to lift weights while maintaining fidelity and temporal consistency.

By just giving a video and a text prompt, Dreamix can edit the video while maintaining fidelity to color, posture, object size, and camera pose, resulting in a temporally consistent video.

If anything it’s worth a watch.

#ai
#stablediffusion
#chatgpt – next step text to video…

—————————–
Notice: The views expressed above are my own. The views within any of my posts, or articles are not those of my employer or the employers of any contributing experts. this post? for regular insights. Click icon to be notified when I post.

Picture of Doug Shannon

Doug Shannon

Doug Shannon, a top 50 global leader in intelligent automation, shares regular insights from his 20+ years of experience in digital transformation, AI, and self-healing automation solutions for enterprise success.