position: EnglishChannel  > News> Text-to-Video Generator Sora a Mixed Blessing

Text-to-Video Generator Sora a Mixed Blessing

Source: Science and Technology | 2024-02-21 15:55:30 | Author: Tang Zhexiao

OpenAI recently announced Sora artificial intelligence, which can transforms text into video of up to 1 minute. (PHOTO: VCG)

OpenAI, the creator of ChatGPT and image generator DALL-E, launched a new artificial intelligence (AI) tool that enables users to create short videos from text prompts on February 15.

Named "Sora," this AI-video tool can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions, OpenAI said.

However, the San Francisco-based startup admitted that the new tool still has some limitations, such as possibly "mixing up left and right", according to AFP.

The technology that supports Sora is an adaptation of DALL-E. It generates a video by starting off with noise and "gradually transforms it by removing the noise over many steps," the company explained. It recognizes objects and concepts listed in the written prompt and pulls them out of the noise, so to speak, until a coherent series of video frames emerge.

The impact of Sora in shaping video generation and its implications for various industries has been seen through factors like enhanced text-to-video capabilities and exploration of novel applications.

According to AFP, the French video game giant Ubisoft hailed the tool as a "quantum leap forward" with the potential to let players and development teams express their imaginations.

"For professions like marketing or creative, multimodal models could be a game changer and could create significant cost savings for film and television makers, and may contribute to the proliferation of AI-generated content rather than using actors," Reece Hayden, senior analyst at a tech intelligence company ABI Research, told CBS MoneyWatch.

Besides the praise by some AI researchers, concerns about security were also raised.

"The video generation model is spurring excitement about advancing AI technology, along with growing concerns over how artificial deepfake videos worsen misinformation and disinformation during a pivotal election year worldwide," said New Scientist.

Hany Farid, professor at the University of California, Berkeley, specializing in image analysis and digital forensics, said "text-to-video will continue to rapidly improve — moving us closer and closer to a time when it will be difficult to distinguish the fake from the real."

The new video tool is not yet publicly available. OpenAI has restricted its use to "red teamers" and some visual artists, designers and filmmakers to test the product and deliver feedback before it is released more widely.

Editor:湯哲梟

Top News

  • The "Charming Guangzhou" online channel, the first comprehensive information service platform for foreign talent in Guangzhou, was launched at the 2024 Convention on Exchange of Overseas Talents and the 26th Guangzhou Convention of Overseas Chinese Scholars in Science and Technology on December 24, 2024.

AI Rescue Robot Offers 24/7 Service

Chinese scientists have unveiled an AI-powered rescue robot which can be operated without human intervention. It employs AI, big data and advanced tracking technologies and provides 24/7 all-weather monitoring, early warnings, and rapid rescue operation.

Milestone in Offshore Wind Power Test

China's first national offshore wind power research and testing base transmission chain platform in east China's Fujian province began operation on December 26, 2024.

抱歉,您使用的瀏覽器版本過(guò)低或開(kāi)啟了瀏覽器兼容模式,這會(huì)影響您正常瀏覽本網(wǎng)頁(yè)

您可以進(jìn)行以下操作:

1.將瀏覽器切換回極速模式

2.點(diǎn)擊下面圖標(biāo)升級(jí)或更換您的瀏覽器

3.暫不升級(jí),繼續(xù)瀏覽

繼續(xù)瀏覽