flightcheck / ai quality monitor
STATUS: OPERATIONAL
Google Cloud Rapid Agent Hackathon · Arize Track

An agent that watches your other agents. And acts.

flightcheck is an autonomous monitoring agent. It inspects a live LLM application through the Arize Phoenix observability platform, detects when output quality degrades, and files a real alert — no human in the loop.

01
patient-app
A monitored LLM app emits traces of every call.
02
Arize Phoenix
Traces land in the Phoenix observability platform.
03
MCP server
A custom MCP server on Cloud Run exposes the trace data.
04
flightcheck agent
A Gemini agent reads traces, reasons, judges quality.
05
Discord alert
If quality is degraded, an alert is filed automatically.
// MULTI-STEP

Plans a real mission

On every check it sequences multiple tool calls — stats, traces, judgment, action — not a single canned reply.

// MCP-NATIVE

Powered by Arize Phoenix

A purpose-built Model Context Protocol server wraps the Arize Phoenix client and gives the agent its observability superpowers.

// BEYOND CHAT

Takes action, not notes

When it finds degraded quality, flightcheck files a severity-tagged alert to the team — autonomously.

flightcheck@cx-agent-studio — interactive session
> Try: "Check the app and take action if anything looks wrong."
> The agent will inspect Phoenix, judge Lineup's quality, and file an alert if needed.