At this point, it’s safe to say that AI technology is advancing at a rapid pace. Microsoft, with the help of OpenAI, is one of the leading companies in the field. Its latest tool is called VASA-1, a powerful model that generates lifelike talking faces in real time.
This is evidence of AI’s growing ability to mimic human beings based on minimal input. For example, TikTok is working on a tool that will let people make an AI-generated clone of their voice with only 10 seconds of audio input. At the time of writing, TikTok’s tool is not available to the public. However, we expect it to launch relatively soon.
Microsoft’s VASA-1 allows users to create lifelike talking faces in real time
We’ve seen examples of this in hundreds of advertisements for apps that let you animate a portrait to make it seem like you’re singing a Billie Eilish song. However, the technology behind VASA-1 is much more advanced and refined. The tool needs only a single picture. From that picture, it generates realistic movement to make it appear that the person is speaking.
This is impressive as is, but it goes further than that. VASA-1 can actually create subtle facial movements and convey a wide range of emotions. This is something that has been lacking with similar tools over the years. Its main focus is realism, and it gets really close to that.
The company showed off a few examples of this technology on its website, and they’re very impressive. On top of that, the talking faces can lip-sync to audio in real time, which is another great quality of this tool.
Microsoft’s VASA-1 can generate 512×512 videos at up to 40 FPS in its online streaming mode, where Microsoft boasts a latency of only 170 ms.
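To put those numbers in perspective, here is a quick back-of-the-envelope calculation in Python. It uses only the figures quoted above. The variable names are ours, and expressing the 170 ms latency as a number of frame times is our own illustrative framing, not something Microsoft states.

```python
# Figures quoted in the article; everything below them is simple arithmetic.
WIDTH, HEIGHT = 512, 512   # output resolution in pixels
FPS = 40                   # maximum frame rate quoted
LATENCY_MS = 170           # quoted latency in milliseconds

# Time available to produce a single frame at the quoted frame rate.
frame_budget_ms = 1000 / FPS

# Raw number of pixels the model has to produce every second.
pixels_per_second = WIDTH * HEIGHT * FPS

# The quoted latency expressed in frame times (illustrative framing only).
latency_in_frames = LATENCY_MS / frame_budget_ms

print(f"Time per frame: {frame_budget_ms:.1f} ms")
print(f"Pixels per second: {pixels_per_second:,}")
print(f"Latency in frame times: {latency_in_frames:.1f}")
```

In other words, the model has 25 ms to produce each frame, and the quoted latency is equivalent to a little under seven frames of delay.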
At this point, we don’t know when Microsoft plans on releasing this feature to the masses. However, when it does, we’re pretty sure that Microsoft will monetize it. It could be a feature in one of the company’s subscription services. We’ll have to wait for it to come out to be sure.
2024-04-22 15:04:28