OpenAI’s flagship video and audio generation model built on a diffusion transformer architecture, featuring realistic physics simulation, synchronized audio-visual generation, and multi-shot controllability for cinematic content creation.
| Provider | OpenAI |
| Tasks | text-to-video · image-to-video |
| Starting from | 0.3400 USD / call · Pricing details |
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.