MediaTek NPU and LiteRT: Powering the next generation of on-device AI

Summary

The article covers on-device AI on MediaTek's Neural Processing Units (NPUs) and introduces the LiteRT NeuroPilot Accelerator. It explains why deploying AI models on NPUs is hard for developers: SoC variants are diverse, and tailored infrastructure has been lacking. The LiteRT NeuroPilot Accelerator aims to simplify deployment with a unified API that supports both Ahead-of-Time (AOT) and on-device compilation workflows. Key features include generative AI support, efficient cross-platform development, and integration with existing ML pipelines, with the goal of enabling high-performance applications across a range of devices.

Key Learnings

1. The LiteRT NeuroPilot Accelerator streamlines the deployment of AI models on MediaTek NPUs, addressing the complexities of hardware fragmentation.
2. Developers can choose between AOT and on-device compilation strategies, optimizing for either initialization speed or flexibility in model distribution.
3. The integration of a new C++ API enhances the efficiency of building ML pipelines, particularly for real-time applications involving camera and video processing.
4. The collaboration with MediaTek enables the use of state-of-the-art generative AI models like the Gemma family, significantly improving on-device capabilities.

Who Should Read This

Senior Embedded Systems Engineers implementing on-device AI solutions for diverse hardware platforms

Test Your Knowledge

What are the trade-offs between using AOT and on-device compilation for deploying AI models on NPUs?

How does the LiteRT NeuroPilot Accelerator improve the developer experience compared to previous solutions?

What specific optimizations are required for running generative AI models efficiently on MediaTek NPUs?

In what scenarios might a developer prefer to use the new C++ API over the previous C API for building ML applications?

What challenges do developers face when managing the diversity of SoC variants in the context of on-device AI deployment?

Topics

Neural Networks, Machine Learning, Generative AI, Deep Learning, Embedded Systems

Read Full Article at Google
