Introducing Veo 3.1 and new creative capabilities in the Gemini API

Summary

The article introduces Veo 3.1, an advanced video generation model available through the Gemini API, enhancing developers' ability to create engaging content. Key features include improved audio generation, narrative control, and the ability to use reference images for maintaining character consistency. The model supports scene extensions and smooth transitions between images, making it suitable for various creative applications. Developers can access these features via the Gemini API and are encouraged to explore the documentation for implementation details.

Key Learnings

1Veo 3.1 enhances video generation by allowing the use of reference images to maintain character consistency across scenes.
2The model introduces scene extension capabilities, enabling the creation of longer videos by generating new clips that connect seamlessly to previous content.
3Developers can control video transitions by providing a first and last frame, ensuring smooth narrative flow and synchronized audio.
4The integration with the Gemini API allows for easy access to advanced video generation features, promoting creative storytelling in applications.

Who Should Read This

Senior AI Developers implementing advanced video generation features in creative applications using the Gemini API

Test Your Knowledge

What are the trade-offs between using reference images versus relying solely on textual prompts for video generation?

How does the scene extension feature impact the overall narrative structure of a generated video?

In what scenarios might the transition generation between two images fail, and how can developers mitigate these issues?

What design decisions were made in the development of Veo 3.1 to enhance audio quality and narrative control?

Why is maintaining character consistency important in video generation, and how does Veo 3.1 achieve this?

Topics

Gemini Generative AI Machine Learning Video Generation AI Tools

Read Full Article at Google

More from Google Engineering

View Google engineering blogs →

Google

Introducing Veo 3.1 and new creative capabilities in the Gemini API

Summary

Key Learnings

Who Should Read This

Test Your Knowledge

Topics

More articles about Gemini

How we built the Google I/O 2026 Save the Date experience

Turn creative prompts into interactive XR experiences with Gemini

Making Gemini CLI extensions easier to use

Tailor Gemini CLI to your workflow with hooks

Real-World Agent Examples with Gemini 3

More from Google Engineering

Introducing Finish Changes and Outlines, now available in Gemini Code Assist extensions on IntelliJ and VS Code

Unleash Your Development Superpowers: Refining the Core Coding Experience

Introducing Wednesday Build Hour

What's new in TensorFlow 2.21

You can't stream the energy: A developer's guide to Google Cloud Next '26 in Vegas

Related topics