Q: What is an AI agent firewall?

An AI agent firewall is a security layer that sits between your local machine and AI API endpoints, inspecting outbound requests and inbound tool responses in real time. Unlike network firewalls that operate on IP/port rules, an AI agent firewall understands the structure of AI API payloads — scanning message content, tool inputs, tool results, and system prompts for sensitive data patterns and injection attacks before they reach the model.

Question 1

What is CoworkGuard?

Accepted Answer

CoworkGuard is a local privacy and security layer for AI agents running on macOS. It sits between your machine and AI APIs (Claude, ChatGPT, Gemini, Cursor, Copilot and others), scanning every outbound request for sensitive data — PII, credentials, API keys, internal URLs — before it leaves your machine. It also monitors MCP tool responses for indirect prompt injection attacks.

Question 2

What sensitive data does CoworkGuard detect and block?

Accepted Answer

CoworkGuard detects over 40 pattern types including: Social Security Numbers (SSN), credit card numbers, AWS and cloud provider credentials, API keys (OpenAI, Anthropic, GitHub, Stripe, Twilio and more), private keys and certificates, database connection strings, internal hostnames and IP ranges, email addresses, passport numbers, and custom patterns you define. CRITICAL severity findings (SSNs, private keys, AWS keys) are blocked by default. HIGH severity findings (JWTs, API keys) can be configured to block or flag.

Question 3

How does CoworkGuard protect against MCP prompt injection attacks?

Accepted Answer

CoworkGuard intercepts MCP (Model Context Protocol) tool responses before they reach the language model. It runs three scanners on every tool response: an injection scanner that detects instruction-override attempts, a metadata scanner that checks tool descriptions for suspicious permissions, and a unicode scanner that catches homoglyph and invisible character attacks. It also scans tool_result content blocks in Anthropic API payloads — the primary vector for indirect prompt injection.

Question 4

Does CoworkGuard send any data to external servers?

Accepted Answer

No. CoworkGuard runs entirely on your local machine. The proxy runs on localhost:8080, the dashboard on localhost:7070, and all audit logs are stored locally. No telemetry, no cloud sync, no account required. The only outbound connections are the AI API requests you make yourself — and those are scanned before they leave.

Question 5

Which AI tools and providers does CoworkGuard support?

Accepted Answer

CoworkGuard monitors requests to: Anthropic (Claude), OpenAI (ChatGPT, GPT-4), Google Gemini, Perplexity AI, Cursor, GitHub Copilot, Mistral AI, Cohere, Groq, and xAI (Grok). The Chrome extension also detects Chrome's built-in Prompt API (Gemini Nano via window.ai / LanguageModel) and suspicious extension behaviour on AI provider pages.

Question 6

What is the Confirm Before Send feature?

Accepted Answer

Confirm Before Send holds a blocked request open instead of immediately returning a 403. The request appears as PENDING in the audit dashboard with an Allow Once button. If you click Allow Once within 60 seconds, the original request is forwarded to the AI API unmodified. If the timer expires without action, the request is blocked.

This gives you a human-in-the-loop review step for sensitive requests — useful when you know a request contains something that looks like a secret but is actually safe to send.

Question 7

How does CoworkGuard detect malicious browser extensions harvesting AI conversations?

Accepted Answer

The CoworkGuard Chrome extension injects a detector into AI provider pages (Claude, ChatGPT, Gemini, etc.) that compares the page's fetch() and XMLHttpRequest implementations against iframe-isolated native references. If another extension has wrapped these APIs — the technique used by Urban VPN and similar extensions to harvest complete AI conversations — CoworkGuard flags it as CRITICAL and fires a notification. This is harder to spoof than toString() checks.

Question 8

Is CoworkGuard free and open source?

Accepted Answer

Yes. CoworkGuard is free and open source under MIT with Commons Clause. Personal and non-commercial use is unrestricted. Commercial use requires a separate license. The source code is available on GitHub at github.com/Katherine-Holland/ClaudeCoworkGuard.

Question 9

How do I install CoworkGuard on macOS?

Accepted Answer

Download the macOS app from the GitHub releases page, open the .dmg, drag CoworkGuard to Applications, and open it. The setup wizard installs the mitmproxy certificate and configures your system proxy. Alternatively, install via the shell script: curl -sSL https://raw.githubusercontent.com/Katherine-Holland/ClaudeCoworkGuard/main/install.sh | bash, then run start.sh.

Question 10

What is an AI agent firewall?

Accepted Answer

An AI agent firewall is a security layer that inspects outbound AI API requests and inbound tool responses in real time. Unlike network firewalls that operate on IP and port rules, an AI agent firewall understands the structure of AI API payloads — scanning message content, tool inputs, tool results, and system prompts for sensitive data patterns and injection attacks.

CoworkGuard is the first open-source AI agent firewall for local machines, operating as a transparent mitmproxy interceptor with no changes required to your AI tools or workflows.

Question 11

How does CoworkGuard compare to using a VPN for AI privacy?

Accepted Answer

A VPN encrypts traffic in transit but does nothing to prevent sensitive data from being sent to AI providers in the first place. CoworkGuard operates at the payload level — it inspects what is inside the request before it leaves your machine, regardless of whether a VPN is active.

The two tools address different threat models and can be used together. A VPN protects against network-level eavesdropping. CoworkGuard protects against accidental or malicious data exfiltration through AI APIs.

Question 12

Can CoworkGuard protect AI agents running in Cursor or Claude Code?

Accepted Answer

Yes. CoworkGuard intercepts all outbound requests to api.anthropic.com and api.cursor.sh, which covers Claude Code, Cursor, and any other tool using those endpoints. Because it operates as a system proxy, it works with any application on your machine — not just browsers.

Frequently Asked Questions

What is CoworkGuard?

What problem does CoworkGuard solve?

What is an AI agent firewall?

What sensitive data does CoworkGuard detect?

What happens when sensitive data is detected?

Does CoworkGuard produce false positives?

How does CoworkGuard protect against MCP prompt injection?

What is indirect prompt injection and why does it matter?

Does CoworkGuard send any data to external servers?

Does CoworkGuard store the content of my AI prompts?

What is the Confirm Before Send feature?

How does the Chrome extension work alongside the proxy?

Which AI tools and providers does CoworkGuard support?

How do I install CoworkGuard on macOS?

Is CoworkGuard free and open source?

How does CoworkGuard compare to using a VPN for AI privacy?

Can CoworkGuard protect AI agents in Cursor or Claude Code?

Does CoworkGuard work with agentic AI workflows that use many tools?

Ready to protect your AI sessions?