
Amazon's cost-efficient multimodal model for image, video, and text inputs, built for fast responses. It handles up to 300K input tokens and can analyze multiple images or up to 30 minutes of video. A practical choice for multimodal applications at scale on AWS.
Released: December 3, 2024
Parameters: Unknown
Context: 300K tokens
Pricing: Paid
Last updated: March 15, 2026
Benchmark scores may vary based on evaluation methodology and conditions.