Google Unveils Multimodal Capabilities in AI Mode for Lens

Google has announced an update to Lens that introduces multimodal capabilities in AI Mode for Google One AI Premium subscribers. The update lets users combine visual search with AI-powered interactions, so they can ask questions about what they see around them.

With the upgraded Lens, users can capture an image and explore it with Gemini, Google's AI model. According to Google, users can now ask complex questions about the image and receive relevant links in response, which changes how visual information can be accessed and interpreted.

The company said that AI Mode queries were ‘twice as long’ as standard search queries, which suggests that users are asking more detailed questions.

Google elaborated on the new functionality: ‘With Gemini’s multimodal capabilities, AI Mode can understand the entire scene in an image, including the context of how objects relate to one another.’ The aim is to give responses that take the whole image into account rather than a single object in it.

Users can apply the feature while shopping, planning vacations, or researching to get detailed information about their surroundings. The company explained, ‘Using our query fan-out technique, AI Mode issues multiple queries regarding the overall image and its individual components.’
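
To make the fan-out idea concrete, here is a minimal sketch in Python. It is an illustration only, not Google's implementation: the function names, the query wording, and the stubbed search backend are all hypothetical, and the list of detected objects is assumed to come from some earlier image-analysis step. The sketch builds one query about the whole image and one per object, runs them concurrently, and collects the results.

```python
from concurrent.futures import ThreadPoolExecutor


def fan_out_queries(scene, objects, question):
    """Build one query for the overall image plus one per detected object.

    `scene` and `objects` are hypothetical inputs standing in for the output
    of an image-understanding step that is not shown here.
    """
    queries = [f"{question} (scene: {scene})"]
    queries += [f"{question} (object: {obj})" for obj in objects]
    return queries


def stub_search(query):
    # Placeholder for a real search backend (assumption for illustration).
    return f"result for: {query}"


def answer(scene, objects, question, search=stub_search):
    """Issue all fan-out queries concurrently and map each query to its result."""
    queries = fan_out_queries(scene, objects, question)
    with ThreadPoolExecutor() as pool:
        # pool.map returns results in the same order as the input queries.
        results = list(pool.map(search, queries))
    return dict(zip(queries, results))


if __name__ == "__main__":
    combined = answer(
        "a bookshelf",
        ["book A", "book B"],
        "What should I read next?",
    )
    for query, result in combined.items():
        print(query, "->", result)
```

A real system would presumably also rank and merge the per-query results into a single response; that step is left out here because the announcement does not describe it.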

AI Mode is still in testing. Google encourages users on both Android and iOS to sign up for Labs to try the beta version and provide feedback.

Published – April 08, 2025 12:26 pm IST
