Amoghavarsha: Google Integrates Native Computer Use into Gemini 3.5 Flash for Enhanced AI Agents

Thursday, June 25, 2026

Google Integrates Native Computer Use into Gemini 3.5 Flash for Enhanced AI Agents

Google has announced the integration of native computer use capabilities directly into its Gemini 3.5 Flash model, a feature previously restricted to a standalone Gemini 2.5 version.

This upgrade enables developers to build highly capable AI agents that can seamlessly interact across diverse digital platforms, including browser, desktop, and mobile environments.

According to Mateo Quiros, Product Manager at Google DeepMind, embedding this feature into the core Flash model delivers Google's strongest performance yet for agentic computer use tasks.

By transitioning this capability into the main Gemini 3.5 Flash model, Google expands upon existing features like function calling and grounding with Search and Maps.

The native integration allows custom AI agents to effectively see, reason, and execute actions across professional applications.

Google emphasizes that this upgrade significantly boosts performance for long-horizon and enterprise automation tasks, such as continuous software testing and complex knowledge work across multiple applications.

To address security risks associated with agents operating in live environments, Google has implemented targeted adversarial training specifically designed to mitigate prompt injection vulnerabilities.

Recognizing that automated actions require strict guardrails, the company is focusing heavily on defense-in-depth protocols.

These protocols ensure that as AI agents gain more autonomy over digital workspaces, the underlying models remain resilient against malicious manipulation and unauthorized system commands.

In addition to core model defenses, Google is introducing two optional enterprise safeguard systems to protect organizational workflows.

The first system requires explicit user confirmation before the AI agent can execute sensitive or irreversible actions.

The second safeguard automatically terminates ongoing tasks if an indirect prompt injection attempt is detected.

Google advises enterprises to combine these features with secure sandboxing, human-in-the-loop verification, and strict access controls.

The newly integrated computer use capability is currently available for developers and enterprises through the Gemini API and the Gemini Enterprise Agent Platform.

To facilitate early adoption and testing, Google is providing reference implementations, comprehensive documentation, and a Browserbase-hosted demo environment.

These resources are designed to help organizations safely evaluate, build, and deploy the next generation of computer-use AI agents.

Google ने बेहतर AI एजेंट्स के लिए Gemini 3.5 Flash में नेटिव कंप्यूटर इस्तेमाल को इंटीग्रेट किया

Google ने अपने Gemini 3.5 Flash मॉडल में सीधे नेटिव कंप्यूटर इस्तेमाल की क्षमताओं को इंटीग्रेट करने की घोषणा की है, यह फीचर पहले सिर्फ़ स्टैंडअलोन Gemini 2.5 वर्शन तक ही सीमित था।

यह अपग्रेड डेवलपर्स को बहुत काबिल AI एजेंट्स बनाने में मदद करता है जो ब्राउज़र, डेस्कटॉप और मोबाइल एनवायरनमेंट सहित अलग-अलग डिजिटल प्लेटफॉर्म पर आसानी से इंटरैक्ट कर सकते हैं।

Google DeepMind के प्रोडक्ट मैनेजर, माटेओ क्विरोस के अनुसार, इस फीचर को कोर Flash मॉडल में एम्बेड करने से एजेंटिक कंप्यूटर इस्तेमाल के कामों के लिए Google की अब तक की सबसे मज़बूत परफॉर्मेंस मिलती है।

इस क्षमता को मुख्य Gemini 3.5 Flash मॉडल में बदलकर, Google मौजूदा फीचर्स जैसे कि सर्च और मैप्स के साथ फंक्शन कॉलिंग और ग्राउंडिंग को बढ़ाता है।

नेटिव इंटीग्रेशन कस्टम AI एजेंट्स को प्रोफेशनल एप्लिकेशन्स में असरदार तरीके से देखने, रीज़न करने और एक्शन को एग्जीक्यूट करने की सुविधा देता है।

Google इस बात पर ज़ोर देता है कि यह अपग्रेड लॉन्ग-हॉराइज़न और एंटरप्राइज़ ऑटोमेशन कामों, जैसे कि कई एप्लिकेशन्स में लगातार सॉफ्टवेयर टेस्टिंग और कॉम्प्लेक्स नॉलेज वर्क के लिए परफॉर्मेंस को काफी बढ़ाता है।

लाइव एनवायरनमेंट में काम करने वाले एजेंट्स से जुड़े सिक्योरिटी रिस्क को दूर करने के लिए, Google ने खास तौर पर प्रॉम्प्ट इंजेक्शन वल्नरेबिलिटी को कम करने के लिए डिज़ाइन की गई टारगेटेड एडवरसैरियल ट्रेनिंग लागू की है।

यह मानते हुए कि ऑटोमेटेड एक्शन के लिए सख्त गार्डरेल की ज़रूरत होती है, कंपनी डिफेंस-इन-डेप्थ प्रोटोकॉल पर बहुत ज़्यादा फोकस कर रही है।

ये प्रोटोकॉल यह पक्का करते हैं कि जैसे-जैसे AI एजेंट्स को डिजिटल वर्कस्पेस पर ज़्यादा ऑटोनॉमी मिलती है, अंदरूनी मॉडल मैलिशियस मैनिपुलेशन और अनऑथराइज़्ड सिस्टम कमांड के खिलाफ़ मज़बूत बने रहते हैं।

कोर मॉडल डिफेंस के अलावा, Google ऑर्गेनाइज़ेशनल वर्कफ़्लो को प्रोटेक्ट करने के लिए दो ऑप्शनल एंटरप्राइज़ सेफ़गार्ड सिस्टम ला रहा है।

पहले सिस्टम में AI एजेंट के सेंसिटिव या इर्रिवर्सिबल एक्शन करने से पहले साफ़ यूज़र कन्फर्मेशन की ज़रूरत होती है।

अगर इनडायरेक्ट प्रॉम्प्ट इंजेक्शन की कोशिश का पता चलता है, तो दूसरा सेफ़गार्ड चल रहे टास्क को ऑटोमैटिकली खत्म कर देता है।

Google एंटरप्राइज़ को इन फ़ीचर्स को सिक्योर सैंडबॉक्सिंग, ह्यूमन-इन-द-लूप वेरिफिकेशन और सख्त एक्सेस कंट्रोल के साथ मिलाने की सलाह देता है।

नई इंटीग्रेटेड कंप्यूटर इस्तेमाल करने की कैपेबिलिटी अभी डेवलपर्स और एंटरप्राइज़ के लिए Gemini API और Gemini Enterprise Agent Platform के ज़रिए उपलब्ध है।

जल्दी अपनाने और टेस्टिंग को आसान बनाने के लिए, Google रेफरेंस इम्प्लीमेंटेशन, पूरा डॉक्यूमेंटेशन और एक ब्राउज़रबेस-होस्टेड डेमो एनवायरनमेंट दे रहा है।

ये रिसोर्स ऑर्गनाइज़ेशन को कंप्यूटर इस्तेमाल करने वाले AI एजेंट की अगली पीढ़ी को सुरक्षित रूप से इवैल्यूएट करने, बनाने और डिप्लॉय करने में मदद करने के लिए डिज़ाइन किए गए हैं।

మెరుగైన AI ఏజెంట్ల కోసం జెమిని 3.5 ఫ్లాష్‌లో నేటివ్ కంప్యూటర్ వినియోగాన్ని గూగుల్ అనుసంధానించింది

ఇంతకుముందు కేవలం స్టాండ్‌అలోన్ జెమిని 2.5 వెర్షన్‌కు మాత్రమే పరిమితమైన ఒక ఫీచర్‌ను, అంటే నేటివ్ కంప్యూటర్ వినియోగ సామర్థ్యాలను, నేరుగా తన జెమిని 3.5 ఫ్లాష్ మోడల్‌లో అనుసంధానిస్తున్నట్లు గూగుల్ ప్రకటించింది.

ఈ అప్‌గ్రేడ్, బ్రౌజర్, డెస్క్‌టాప్ మరియు మొబైల్ పరిసరాలతో సహా విభిన్న డిజిటల్ ప్లాట్‌ఫారమ్‌లలో సజావుగా పనిచేయగల అత్యంత సామర్థ్యం గల AI ఏజెంట్లను రూపొందించడానికి డెవలపర్‌లకు వీలు కల్పిస్తుంది.

గూగుల్ డీప్‌మైండ్‌లోని ప్రొడక్ట్ మేనేజర్ అయిన మాటియో క్విరోస్ ప్రకారం, ఈ ఫీచర్‌ను కోర్ ఫ్లాష్ మోడల్‌లో పొందుపరచడం ద్వారా, ఏజెంటిక్ కంప్యూటర్ వినియోగ పనుల కోసం గూగుల్ ఇప్పటివరకు సాధించిన అత్యంత బలమైన పనితీరును అందిస్తోంది.

ఈ సామర్థ్యాన్ని ప్రధాన జెమిని 3.5 ఫ్లాష్ మోడల్‌లోకి మార్చడం ద్వారా, సెర్చ్ మరియు మ్యాప్స్‌తో ఫంక్షన్ కాలింగ్ మరియు గ్రౌండింగ్ వంటి ఇప్పటికే ఉన్న ఫీచర్లను గూగుల్ మరింత విస్తరిస్తోంది.

ఈ నేటివ్ ఇంటిగ్రేషన్, కస్టమ్ AI ఏజెంట్లు ప్రొఫెషనల్ అప్లికేషన్‌లలో సమర్థవంతంగా చూడటానికి, తర్కించడానికి మరియు చర్యలను అమలు చేయడానికి అనుమతిస్తుంది.

నిరంతర సాఫ్ట్‌వేర్ టెస్టింగ్ మరియు బహుళ అప్లికేషన్‌లలో సంక్లిష్టమైన నాలెడ్జ్ వర్క్ వంటి దీర్ఘకాలిక మరియు ఎంటర్‌ప్రైజ్ ఆటోమేషన్ పనుల కోసం ఈ అప్‌గ్రేడ్ పనితీరును గణనీయంగా పెంచుతుందని గూగుల్ నొక్కి చెబుతోంది.

ప్రత్యక్ష వాతావరణాలలో పనిచేసే ఏజెంట్లతో ముడిపడి ఉన్న భద్రతాపరమైన ప్రమాదాలను పరిష్కరించడానికి, ప్రాంప్ట్ ఇంజెక్షన్ బలహీనతలను తగ్గించడానికి ప్రత్యేకంగా రూపొందించిన టార్గెటెడ్ అడ్వర్సేరియల్ ట్రైనింగ్‌ను గూగుల్ అమలు చేసింది.

ఆటోమేటెడ్ చర్యలకు కఠినమైన రక్షణ వ్యవస్థలు అవసరమని గుర్తించి, ఈ సంస్థ డెఫెన్స్-ఇన్-డెప్త్ ప్రోటోకాల్స్‌పై ఎక్కువగా దృష్టి సారిస్తోంది.

డిజిటల్ వర్క్‌స్పేస్‌లపై ఏఐ ఏజెంట్లు మరింత స్వయంప్రతిపత్తిని పొందుతున్న కొద్దీ, వాటి అంతర్లీన నమూనాలు దురుద్దేశపూర్వక తారుమారు మరియు అనధికార సిస్టమ్ ఆదేశాలకు వ్యతిరేకంగా పటిష్టంగా ఉండేలా ఈ ప్రోటోకాల్స్ నిర్ధారిస్తాయి.

ప్రధాన నమూనా రక్షణలతో పాటు, సంస్థాగత వర్క్‌ఫ్లోలను రక్షించడానికి గూగుల్ రెండు ఐచ్ఛిక ఎంటర్‌ప్రైజ్ సేఫ్‌గార్డ్ సిస్టమ్‌లను పరిచయం చేస్తోంది.

మొదటి సిస్టమ్‌లో, ఏఐ ఏజెంట్ సున్నితమైన లేదా మార్చలేని చర్యలను అమలు చేయడానికి ముందు వినియోగదారు నుండి స్పష్టమైన నిర్ధారణ అవసరం.

పరోక్ష ప్రాంప్ట్ ఇంజెక్షన్ ప్రయత్నం కనుగొనబడితే, రెండవ సేఫ్‌గార్డ్ కొనసాగుతున్న పనులను స్వయంచాలకంగా నిలిపివేస్తుంది.

ఈ ఫీచర్లను సురక్షిత శాండ్‌బాక్సింగ్, హ్యూమన్-ఇన్-ది-లూప్ వెరిఫికేషన్ మరియు కఠినమైన యాక్సెస్ నియంత్రణలతో కలిపి ఉపయోగించాలని గూగుల్ సంస్థలకు సలహా ఇస్తోంది.

కొత్తగా ఇంటిగ్రేట్ చేయబడిన కంప్యూటర్ వినియోగ సామర్థ్యం ప్రస్తుతం జెమిని ఏపీఐ మరియు జెమిని ఎంటర్‌ప్రైజ్ ఏజెంట్ ప్లాట్‌ఫామ్ ద్వారా డెవలపర్‌లు మరియు సంస్థలకు అందుబాటులో ఉంది.

ప్రారంభ దశలోనే స్వీకరించడాన్ని మరియు పరీక్షించడాన్ని సులభతరం చేయడానికి, గూగుల్ రిఫరెన్స్ ఇంప్లిమెంటేషన్‌లు, సమగ్ర డాక్యుమెంటేషన్, మరియు బ్రౌజర్‌బేస్-హోస్ట్ చేసిన డెమో ఎన్విరాన్‌మెంట్‌ను అందిస్తోంది.

తదుపరి తరం కంప్యూటర్-ఉపయోగ AI ఏజెంట్‌లను సంస్థలు సురక్షితంగా మూల్యాంకనం చేయడానికి, నిర్మించడానికి మరియు అమలు చేయడానికి సహాయపడేలా ఈ వనరులు రూపొందించబడ్డాయి.

Amoghavarsha

Pages

Thursday, June 25, 2026

Google Integrates Native Computer Use into Gemini 3.5 Flash for Enhanced AI Agents

No comments:

Post a Comment

Popular Posts