
Google's speed-optimized multimodal model with a 1M-token context window, designed for high-volume, high-frequency tasks at scale. It natively processes text, image, audio, and video inputs at low cost, making it a workhorse for applications that need fast multimodal processing.
Released: May 14, 2024
Parameters: Unknown
Context: 1M tokens
Pricing: Free/Paid
Last updated: March 15, 2026
Benchmark scores may vary based on evaluation methodology and conditions.