日韩区一区二区三区四|交换国产精品视频一区|国产口爆吞精在线观视频|午夜无码大尺度福利视频|二区亚洲国产精品一区久久|精品国产一级二级三级在线|欧美日韩精品久久久免费观看|亚洲高清一区二区三区高清不卡

position: EnglishChannel  > News> Upload a Photo, Get a Video

Upload a Photo, Get a Video

Source: Science and Technology Daily | 2025-06-11 11:29:11 | Author: LI LInxu

The rapid developments in AI have unlocked new possibilities for digital representation. With the help of AI models, you can now achieve a remarkable feat: bringing characters to life with just an image and an audio clip.

Jointly developed by Tencent Hunyuan and Tencent Music, the newly released HunyuanVideo-Avatar, a multimodal diffusion transformer-based model, is capable of simultaneously generating dynamic, emotion-controllable, and multi-character dialogue videos. This capability supports head-and-shoulder, half-body, and full-body views, encompassing multiple styles, species, and even dual-character scenes.

To put it simply, you just upload a photo and a voice clip, and the model figures out the context, emotion and lip movements to create a realistic animated video.

For instance, if you upload an image of a woman sitting on a beach with a guitar, along with a piece of lyrical music,  the model understands the scene as "a woman playing the guitar and singing a lyrical song by the sea," and subsequently generates a video of the woman performing the song.

The model provides video creators with highly consistent and dynamic video generation capabilities. Its versatility can unlock a myriad of applications in fields like entertainment, media, e-commerce, advertising and education.

It has already been applied in multiple scenarios within Tencent Music, such as AI companions for music listening, long-form audio podcasts, and music videos (MVs).

For example, on the app QQ Music, when users listen to songs by "AI Leehom" (a fully AI-driven singer created by Tencent Music and Team Leehom), a lively and adorable AI Leehom image synchronizes its singing in real-time on the player.

On WeSing, a popular karaoke singing app, users can upload their images to generate personalized MVs of themselves singing.

In subject consistency and audio-video synchronization, the HunyuanVideo-Avatar shows top-tier industry performance. For video dynamics and natural body movements, it exceeds open-source solutions and rivals closed-source ones.

Currently, the model supports audio uploads of up to 14 seconds for video generation, with more capabilities to be released and open-sourced in the future.

Editor:李林旭

抱歉,您使用的瀏覽器版本過低或開啟了瀏覽器兼容模式,這會影響您正常瀏覽本網(wǎng)頁

您可以進行以下操作:

1.將瀏覽器切換回極速模式

2.點擊下面圖標升級或更換您的瀏覽器

3.暫不升級,繼續(xù)瀏覽

繼續(xù)瀏覽
库伦旗| 沿河| 霍山县| 丽江市| 郓城县| 吉木萨尔县| 阳泉市| 临高县| 股票| 莎车县| 吐鲁番市| 乌兰浩特市| 焉耆| 通许县| 阳曲县| 商洛市| 桐乡市| 枝江市| 武邑县| 晋城| 左权县| 石嘴山市| 西城区| 武强县| 泊头市| 花垣县| 镇沅| 云梦县| 灵台县| 日土县| 永平县| 郎溪县| 玛多县| 赞皇县| 上杭县| 陆川县| 固镇县| 周至县| 安西县| 襄垣县| 连南|