Latency + LLM observability

Know what broke —
and what it costs.

Drop in two lines. SentriNode tracks latency, errors, and LLM spend, predicts incidents before they hit, and explains the root cause in plain English. No collector to babysit. No query language to learn.

Core capabilities

Everything you need to ship with confidence.

From your AI bill to your p95 — see it, predict it, and fix it. AI-native insight, zero ops.

LLM cost & usage

Track every model call — cost, tokens, latency, per-model spend. Catch the expensive calls that should've used a cheaper model.

Predict incidents before they break

Forecasts a latency breach from the trend — with the culprit service and an ETA — and warns you before the anomaly even fires.

One-click Fix It

AI writes the root cause, a ready-to-file Jira ticket with full context, and the exact commands to run — straight from the anomaly.

Ghost mode incident replay

Replay any incident like a movie — watch latency spike and the failure cascade frame by frame, scrub the timeline, and see exactly how it broke.

Adaptive Telemetry

Recommends what to sample or drop to cut your bill — with savings estimates. Ingest only the telemetry that actually matters.

Drop-in, any stack

Two lines for Python or Node, or raw OTLP for anything else. No collector cluster to run, no PromQL to memorize. Paste, run, see data.

Real-time engine

Your whole stack, computed live.

Every span feeds a real-time model — latency percentiles, error rates, per-model LLM cost, and two-window anomaly detection — recomputed continuously across every service, the moment it arrives.

Two lines, no account

See your LLM costs in under a minute.

Install, add two lines, and every Anthropic or OpenAI call is tracked — model, tokens, latency, and cost. Run it fully local with no signup, or send it to the cloud for team dashboards, budgets, and alerts.

Start free →
# install pip install sentrinode-llm import sentrinode_llm sentrinode_llm.instrument(local=True) # no account # use openai / anthropic as normal — # every call tracked: cost, tokens, latency # then see your costs locally: $ sentrinode dashboard # localhost:7000
Private by design

What the big platforms can't promise.

Datadog and friends are heavyweight SaaS built for someone else's threat model. SentriNode is built for teams shipping AI — private, portable, and predictable.

We never see your prompts

The SDK records tokens, model, latency, and cost — never prompt or response content. Your sensitive data never leaves your app. Nothing to leak, redact, or explain to legal.

Self-host in your own cloud

One docker compose up and SentriNode runs entirely inside your VPC. Perfect for regulated, defense, or air-gapped teams who can't send telemetry to a third party.

Spend caps that actually stop

Set a monthly LLM budget and the SDK refuses calls when you're over — a real hard cap, not just an alert after the bill. No passive observer can do this.

Run it yourself

Self-host in minutes.

Prefer to keep everything inside your own infrastructure? The full stack runs on one box with Docker. Bring a free Supabase project for auth and you are live — no telemetry ever leaves your servers.

# clone and configure git clone https://github.com/rg309/sentrinode cd sentrinode/sentrinode cp .env.example .env # add your Supabase keys # run the whole stack locally docker compose -f docker-compose.hetzner.yml up -d --build # then open http://localhost:8080

Full self-host guide →

Ready when you are

Start watching your stack in two minutes.

Free to start. No credit card. See your latency and AI spend today.

1 Create your workspace 2 Grab your API key 3 pip install & see data