Hard Fork AI

Google Rolls Out AI System That Handles Threats Instantly

8 snips
Dec 9, 2025
Discover how Chrome's latest AI security system tackles online threats instantly, transforming user experience. The discussion reveals why browsers are prime hosts for AI agents and how Google's innovative critic aligns actions with user goals. Learn about the design features that minimize risks like prompt injection and the intriguing decision for agents to ignore ads. Delve into the balance of user permissions for sensitive sites and the impact of ongoing defenses in the industry. It's a fascinating look at the future of safe browsing!
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Metadata Critic Guards Agent Actions

  • Google uses a separate critique model (Gemini) to verify each agent action against the user's original goal.
  • The critic sees only metadata, not page content, to avoid prompt-injection attacks and enforce alignment.
INSIGHT

Planner‑Critic Loop Enforces Alignment

  • The critic model checks planned steps against the original goal and requests replanning if misaligned.
  • This loop reduces the chance an agent follows malicious or irrelevant instructions during execution.
INSIGHT

Origin Sets Limit Data Exposure

  • Chrome restricts which origins an agent can read from or write to using agent origin sets to limit cross-origin data leaks.
  • The browser can withhold model access to data outside the readable set and block interactions with disallowed frames.
Get the Snipd Podcast app to discover more snips from this episode
Get the app