Now Open for Sign Ups

The manager your
AI agents are missing.

Real-time feedback while you iterate. Automatic drift detection once you're stable. Foil learns how your agents should behave — and tells you when they don't.

$ npx @getfoil/wizard1-command setup

Trusted by teams shipping AI agents in production

Add Foil with 1 command to your existing codebase.

~/my-projectfoil wizard
$npx @getfoil/foil-js wizard
 
Foil Wizard Setup
 
?API Key:········
?Agent name(default: support-agent):
?Agent description:Customer support chatbot
?Dry run?(y/N):N
 
Foil Wizard
 
Scanned project structure
Detected:OpenAI·Express.js
Pattern:Conversational HTTP server
Createdfoil.js
Modifiedindex.js— added Foil import
Modifiedroutes/chat.js— added tracing
Updated.env.example
 
Setup complete!1 agent instrumented

Foil in Action

Real teams, real agents, real results

External-facing

Customer Support Agent

A SaaS company deploys an AI support agent handling thousands of conversations daily. Foil monitors every conversation in real time.

Key Insight

After a docs update, profile learning detected hallucinated answers. Alerting caught a 3x spike in "I don't know" responses within minutes.

TracingProfilesAlerting
Internal

Document Processing Agent

A financial services firm processes documents through an AI pipeline - parsing, classification, extraction, and compliance checks.

Key Insight

Anchored invariants enforce 97% accuracy and <30s processing. Change detector flagged 12% accuracy drift after a model update.

TracingEvaluationsAnchors
External-facing

AI Onboarding Agent

An agent walks new users through product setup - answering questions, configuring settings, and escalating to humans when stuck.

Key Insight

Drift detection caught the agent routing 3x more users to human escalation after a docs update changed the setup flow.

TracingAlertingDrift Detection
Internal

Code Review Agent

An engineering team uses AI to review PRs and triage CI failures. Foil monitors invocations and developer feedback.

Key Insight

A Sunday volume spike revealed a broken dependency causing cascading CI failures - burning API credits unnoticed.

SignalsAlertingAnalytics

Agent Learning

Your agent gets better with every trace

Quantitative metrics update in real-time while AI-generated behavioral profiles deepen with every learning cycle.

From trace 1

Live Metrics, Instantly

The moment your agent sends its first trace, Foil tracks latency, error rates, tool usage, and volume - updating every 60 seconds.

Real-time latency & errorsTool usage distributionVolume & temporal patterns
From 50 traces

First Profile & Anchors

Foil generates your agent's first behavioral profile - identity, tool patterns, error analysis - alongside health anchors that are continuously validated.

Behavioral identityFalsifiable health anchorsAI-generated insights
50 → convergence

Rapid Refinement

The profile re-learns at geometric intervals (125, 313, 783+ traces). When 2 consecutive cycles find no material changes, it converges.

Geometric learning cyclesConvergence detectionDeeper analysis
Converged

Drift Detection & Self-Healing

Learning becomes change-driven. Foil monitors for behavioral drift and automatically re-learns when something meaningful changes.

Automated drift detectionChange-driven re-learningSelf-healing
Control Center

See what needs your attention

Control Center surfaces the traces that matter most. Review flagged invocations, give feedback, and improve your agent in a single workflow.

  • Real-time feed of flagged traces across all agents
  • Filter by evaluation failures, latency, errors, or user signals
  • Give feedback directly on traces to improve future behavior
  • Track resolution status across your team
Control Center
3 Critical5 Warnings
Last 7dAll agents
support-agenta8f3c21e...hallucinationquality
18.9s2m ago

"Your refund has been processed and $247 will be returned..." - fabricated refund amount

FlagNeeds ActionLooks Good
support-agentc4e91b7f...quality
3.2s8m ago

"I don't have access to that information" - repeated 4x in conversation

review-agentf7d28a3e...hallucination
6.1s12m ago

"The PR modifies the auth middleware" - file not in changeset

support-agent91b4e5c2...frustration
4.8s15m ago

User sentiment dropped after 3rd redirect to documentation

review-agente3a7d9f1...qualityslow
45.2s22m ago

"LGTM" response with no code analysis on 500-line PR

Agent Profiles

Every agent, understood

Foil builds a behavioral profile for each agent - separating always-fresh quantitative metrics from AI-generated behavioral intelligence. Live metrics update continuously. AI insights deepen as your agent processes more traces.

  • Live metrics from the first trace - latency, errors, tool usage updated in real-time
  • AI behavioral profiles bootstrap from just 50 traces
  • Health anchors: falsifiable claims continuously validated
  • Drift detection alerts when behavior shifts from baseline
support-agentSteady State
240 traces · 2/2 anchors passing
Identity

Customer support agent for vehicle inquiries, test drives, and dealership operations

Key Metrics
240
Daily Volume
0.0%
Error Rate
1.2s
Med. Latency
9
Active Hours
Tool Usage
search_inventory 46.7%gpt-4o-mini 29.8%check_slots 13.3%transfer_call 9.6%
InsightsAI-derived
High daily volume (240 traces) indicates robust engagement
Predominant search_inventory usage suggests most interactions check vehicle availability
Health Anchors
Error rate stays below 1%
Daily volume exceeds 200 traces
Smart Evaluations

Evaluations that know your agent

Every trace is evaluated against 9 built-in checks — hallucination, PII, prompt injection, and more. Unlike generic tools, Foil's evaluations use your agent's behavioral profile as context. A response that's normal for one agent might be anomalous for another.

  • 9 built-in evaluations: hallucination, PII, injection, jailbreak, quality, and more
  • Evaluations use live agent context - tool patterns, error baselines, behavioral norms
  • Create custom evaluations with few-shot examples from your own traces
  • Anomaly detection powered by agent profiles
Evaluation Pipeline
Incoming Tracetrc_9f2a3b

support-agent · "How do I track my order?"

Agent Profile Context

tool patterns, error baselines, behavioral norms

Built-in Evaluations
Hallucination
PII
Injection
Quality
NSFW
Jailbreak
Stuck/Loop
Frustration
Satisfaction
Custom EvaluationsPro
Brand Voice CompliancePass
Refund Policy AccuracyWarning

Platform Features

Everything you need. Nothing you don't.

Purpose-built for AI agents. See what no other tool can show you.

Full Thought Tracing

See every step your agent takes: each LLM call, tool invocation, memory read, and branching decision.

Hallucination Detection

Catch the moment your agent makes things up. Flag responses that contradict context in real-time.

Safety Monitoring

Detect policy violations, harmful outputs, and prompt injections before they reach users.

Cost Analytics

Track token usage and API spend down to individual decisions. Find what's burning through your budget.

User Signals

Link user feedback directly to agent decisions. Understand which reasoning paths lead to outcomes.

Failure Replay

Replay any failed execution step-by-step. Rewind to the exact moment things went wrong.

Capabilities

What Foil sees that you can't

From support bots to research agents, trace every thought and decision. Know exactly what went wrong and why.

Pinpoint the exact moment your agent went wrong

Trace every thought, every decision, every retrieval. See the complete chain of reasoning and find exactly where the failure occurred.

Thought chain replayHallucination detectionContext tracing

See what your agent actually reads and remembers

Watch your agent's memory in action. See which documents it retrieves, which passages it focuses on, and whether it's using the context you gave it.

Memory inspectionGrounding scoresSource attribution

Follow every step your agent takes

Visualize decision trees, track tool calls, and understand why it chose one path over another. Catch runaway loops before they drain your budget.

Decision tree viewTool call trackingLoop detection

Inspect every generation before it ships

Understand how your agent crafts each output. Catch hallucinations, safety issues, and quality problems with deep inspection.

Generation breakdownSafety checksFeedback collection

Integration

One command. Fully instrumented.

Our AI-powered wizard scans your codebase and adds tracing automatically. Built on OpenTelemetry - zero code changes to your LLM calls.

npx @getfoil/wizard

Scans your code, installs the SDK, adds tracing - done.

01

Run the wizard

One command analyzes your codebase, detects LLM calls, and adds Foil instrumentation automatically.

npx @getfoil/wizard
02

Review & deploy

The wizard creates a branch with all changes. Review the diff, merge, and deploy as normal.

git diff main...foil-setup
03

Traces flow automatically

Every OpenAI, Anthropic, or other LLM call is traced. Agent profiles build from real usage.

// Calls are traced automatically!
terminal
$ npx @getfoil/wizard

  Foil Wizard Setup

  ? API Key: sk_live_••••••••
  ? Agent name: support-agent
  ? Target directory: ./my-app

  # Auto-detects your LLM provider and app pattern
 Provider: OpenAI (gpt-4o-mini)
 Pattern:  One-shot script

  # Instruments your code automatically
  foil.js  — created (Foil config & shutdown handlers)
  index.js — modified
  ├─ Foil import added as first require
  ├─ LLM calls wrapped in ctx.llmCall()
  └─ main() wrapped in agent.trace()

 Branch created: foil-setup

$ npm install @getfoil/foil-js && node index.js

Pricing

Simple, transparent pricing

Try the full evaluation suite free. Pay only for what you use.

Pro Trial

Free

14 days of Pro features

  • All Pro features included
  • 10,000 spans included
  • No credit card required
Start Free Trial

Starter

$49/mo

$0.001 / interaction

  • Unlimited spans, agents & retention
  • Alerts
  • Exports
  • Deep Search
  • Email support
Get Started

Pro

14 day free trial
$149/mo

$0.005 / interaction

  • Everything in Starter, plus:
  • Model training on your data
  • Smart Search + Semantic Search
  • Prompts
  • SSO & RBAC
  • Priority support
Start Free Trial

Your agents are running right now.
Shouldn't something be watching?

Real-time metrics from trace one. Behavioral intelligence that deepens over time. Evaluations that know your agent.