OpenAI announces Sora, its text-to-video generative AI model

OpenAI has announced Sora, its newest diffusion model, which generates video from text prompts. The latest AI model from the ChatGPT maker can generate videos in various resolutions and aspect ratios, and it can edit existing videos, changing the scenery, lighting and shooting style from a text prompt alone. Sora can also generate videos from a still image or extend existing videos by filling in missing frames.

OpenAI says Sora can currently generate up to a minute of Full HD video, and the examples we’ve seen look promising. You can check out Sora’s landing page for more generated video samples.

Sora can generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background. The model understands not only what the user has asked for in the prompt, but also how those things exist in the physical world.

Sora uses a transformer architecture similar to the one behind ChatGPT, with videos and images represented as smaller units of data called patches. Each generated video starts as static noise, which the model gradually removes to form the final result.

Noisy input patches transformed to high quality video
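
To make the two ideas above more concrete, here is a minimal Python sketch of cutting a video into patches and of a loop that starts from noise and removes it step by step. This is not Sora's code and OpenAI has not published an implementation; the function names patchify and toy_denoise, the patch sizes and the step count are all hypothetical. In particular, the toy loop cheats by using the known target video to decide how much noise to remove, whereas a real diffusion model would use a trained network guided by the text prompt.

```python
import random


def patchify(video, pt, ph, pw):
    """Split a video into flat spacetime patches.

    `video` is a nested list indexed as video[frame][row][column].
    Each patch covers pt frames, ph rows and pw columns.
    """
    frames, height, width = len(video), len(video[0]), len(video[0][0])
    if frames % pt or height % ph or width % pw:
        raise ValueError("video dimensions must be divisible by the patch size")

    patches = []
    for t0 in range(0, frames, pt):
        for y0 in range(0, height, ph):
            for x0 in range(0, width, pw):
                patch = [
                    video[t][y][x]
                    for t in range(t0, t0 + pt)
                    for y in range(y0, y0 + ph)
                    for x in range(x0, x0 + pw)
                ]
                patches.append(patch)
    return patches


def toy_denoise(target_patches, steps=50, seed=0):
    """Start from pure noise and remove part of it at every step.

    ASSUMPTION: the "model" here is a stand-in that already knows the
    target patches. A real diffusion model has no such target and
    instead predicts the noise with a trained network.
    """
    rng = random.Random(seed)
    # Step 0: every patch is static (Gaussian) noise.
    current = [[rng.gauss(0.0, 1.0) for _ in patch] for patch in target_patches]

    for step in range(steps):
        remaining = steps - step
        # Move each value a fraction of the way towards the clean value,
        # so that the last step (remaining == 1) lands on the target.
        current = [
            [c + (t - c) / remaining for c, t in zip(cur, tgt)]
            for cur, tgt in zip(current, target_patches)
        ]
    return current


if __name__ == "__main__":
    # A tiny 4-frame, 4x4 "video" with made-up brightness values.
    video = [[[(t + y + x) / 10 for x in range(4)] for y in range(4)]
             for t in range(4)]
    patches = patchify(video, pt=2, ph=2, pw=2)
    restored = toy_denoise(patches, steps=20)
    print(len(patches), "patches of", len(patches[0]), "values each")
```

Working on patches rather than whole frames is what would let one model handle different resolutions, durations and aspect ratios, since a longer or larger video simply yields more patches.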

OpenAI says it is applying the safety protocols it already uses for DALL·E 3. Sora is currently being tested by “red teamers”, experts who will probe the model and assess it for potential risks ahead of its official launch.

OpenAI will also hold talks with policymakers, artists, and educators to understand their concerns and identify potential use cases for Sora. No official launch date has been announced yet.

Source

Reader comments

  • Adam

I am looking forward to being able to generate videos about dinosaurs, other galaxies and deep mysterious oceans and see how the AI generates the video.

  • Anonymous

Actually, Sora was in the works for a year. They just didn't publish it, as it requires a ton of processing power for even a 30-second video.

  • tech lover

I'm optimistic. Just as social platforms give individuals an equal opportunity to express themselves to the masses without the approval of old cable TV stations, we can have millions of high-quality movies or short films based on novel idea...