When Your Agent Warns You About a Threat You Did Not Ask It to Watch For

AI Agents for Small Business Owners

Watch the live training this came from

This article is drawn from Shanee Moret's Day 2 live training on Codex, websites, agent-ready infrastructure, and real business-owner implementation.

Watch the replay →

How Codex's unprompted security awareness changes what it means to give an agent access to your business

There is a moment in Heather Mackay's dahlia weekend test that most people walk past because they are focused on the 490 tubers sold.

While Codex was running autonomously — posting to Facebook, emailing customers, reporting order counts every five minutes — a phishing email arrived in Heather's Gmail. It was disguised as a Shopify message. Heather did not flag it. She did not ask Codex to monitor her inbox for threats. She was not even at her computer.

Codex flagged it anyway.

In its next five-minute status report, it noted: there is a fake email from a fake Shopify address in your Gmail. Do not click on it. 98% chance it's spam. Shopify wouldn't do this, this, or this.

Heather did not configure that behavior. The agent noticed the threat because it had full context of the environment it was operating in — and because something in that environment looked wrong.

What Most Business Owners Assume About Agent Access

When business owners think about giving an agent access to their email, their Shopify account, their Facebook page, they are usually thinking about what the agent can do with that access. Post updates. Email customers. Pull order data. The framing is outbound: the agent uses access to take action on your behalf.

Heather's test revealed a different dimension of the same access. When an agent is connected to your environments and operating inside them, it does not just execute — it observes. Give an agent genuine business context, and it can identify when something in that environment is abnormal.

The phishing email was abnormal. Codex recognized it because it was actively working inside Heather's Shopify account, understood what Shopify communication looks like, and the email arriving in Gmail did not match that pattern.

No human in the loop would have caught that faster.

The Difference Between a Task Executor and an Active Observer

Most business owners who use AI are using it as a task executor. Give it a job. Receive an output. That model produces real results. But it leaves the most valuable capability unreached.

An agent operating with full business context — connected to your email, your storefront, your communications — functions as a second set of eyes that never sleeps, never gets distracted, and carries none of the cognitive load you bring into every interaction.

The phishing email is a small example. The principle scales.

Task Executor Model	Active Observer Model
Does what you ask	Does what you ask and flags what you didn't think to ask about
Operates within the scope of the current goal	Operates with awareness of the full environment
Requires you to notice problems and report them	Notices problems in parallel with executing goals
Access functions as a permission granted	Access functions as a responsibility the agent takes seriously
Useful when you are present	Valuable precisely when you are not

The active observer model emerges from the same access that makes goal execution possible. It requires no separate configuration. Full context is the prerequisite — everything else follows from it.

Why This Matters More Than It Appears

Heather runs an HR consulting firm. Her Codex has access to environments that touch sensitive client data. When she talks about deploying agents in that context, the first question any responsible business owner asks is: what is the risk of giving something access to everything?

Heather's experience redirects that question. An agent operating with partial access executes inside an environment it cannot fully evaluate. It sends emails from a compromised account. It processes orders from a spoofed storefront. It acts in a space it cannot assess because it was given only enough access to complete the task, not enough to understand the context around it.

Full access, with proper controls configured before deployment, produces an agent that can recognize threats a task-limited one will never see.

Heather's Codex is also configured to require her direct laptop input for sensitive approvals. It will not accept third-party authorization. During testing, Codex was told "Heather approved" by someone else. It responded that it needed Heather to type it on her laptop. The approval requirement and the phishing flag are the same architecture: an agent that understands the difference between authorized and unauthorized action, inside an environment it has been given genuine responsibility for.

What This Requires From You

The unprompted security flag happened because Codex had access to Heather's Gmail. Fully connected, as part of setting up the dahlia weekend test.

An agent that only has access to a Shopify storefront cannot notice a threat arriving in Gmail. An agent with read-only access on specific folders cannot flag something and instruct you not to click it. The capability scales directly with the access, and the access requires trust in the agent's design that most business owners have not yet established.

What the setup actually requires:

Connect the environments the agent will operate in — fragmented permissions produce a fragmented view
Configure approval controls for sensitive actions before expanding access, not after something goes wrong
Review the agent's status reports, not just its outputs — the flags live in the reports
Treat the agent's questions during setup as a map of what context it still needs
Expect a calibration period — the first tests are how the agent learns the baseline of your environment

The time investment is front-loaded. Heather spent roughly two weeks troubleshooting setup conflicts before the dahlia test was possible. After that, the agent's contextual awareness operated without additional configuration.

For the complete framework on how access determines everything before work begins, read the full guide.

The Practical Implication

If you are a business owner handling confidential client data — consulting, HR, professional services, financial advisory — AI deployment usually gets framed as a risk management problem. How do you contain the agent's exposure to sensitive information?

The better question, based on Heather's experience: how do you configure the agent to participate in your risk management, rather than simply be subject to it?

The 98% spam confidence, the three specific reasons the email was suspicious, the instruction not to click — those emerged from context, not from a security setting. When an agent has enough context to recognize an anomaly and enough trust built into its architecture to report it, the security awareness appears as a natural byproduct of the deployment.

Any business owner who has ever clicked a phishing link, processed a fraudulent order, or responded to a spoofed vendor email knows what inattention costs. An agent with partial context cannot protect you from those moments. An agent with full context, properly configured, can catch them mid-execution — even when you are not watching.

Watch me explain this live — the dahlia test and the phishing flag both happened in real time during the session.

The Principle

When you give an agent full access and configure it correctly, it observes the environment your goals depend on — not just the narrow task you assigned.

Every business owner who grants access is making a decision about scope. Narrow scope produces narrow protection. Full context, with proper approval controls in place, produces an agent capable of catching what you would have missed.

For context on how Heather's approval controls work alongside this — and why requiring direct authorization for sensitive actions is what makes full deployment defensible — read the next step on access control and identity verification.

For the complete framework on mental models, environments, and agent-ready infrastructure, read the full guide.

This is Part 13 of a 43-part series. Start from the beginning.

This article will be promoted by LinkedIn post.

Codex Skills for SMBs

Use the skills behind this system

The Growth Academy Skills Dashboard includes 100+ Codex skills and prompts for SMB owners, including website audits, GitHub and Cloudflare setup, permissions, business intelligence, sales, and operations workflows.

See the Skills Dashboard →