Operational
Endpoints are reachable and returning expected auth responses during health checks.
Live provider status, response latency, and rolling uptime trends, updated every 5 minutes from independent health checks. Built for teams that need a fast public signal before routing workloads.
Latest observed status and latency from our independent monitoring pipeline.
Independent status monitoring for every major model API platform.
Endpoints are reachable and returning expected auth responses during health checks.
Service is reachable but shows throttling or unusual latency that can affect production throughput.
Requests fail due to transport errors, repeated server failures, or endpoint unavailability.
No check data yet — usually right after a fresh deployment or before the first scheduled run.
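As a rough illustration of how the four states above could be derived from a single check result, here is a minimal sketch. The `CheckResult` type, the `classify` function, and the latency threshold are hypothetical and are not taken from the monitoring pipeline. The sketch also treats any single 5xx response as down, whereas the definition above refers to repeated server failures.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical cutoff; the real threshold is not stated on this page.
LATENCY_DEGRADED_MS = 3000.0


@dataclass
class CheckResult:
    """Outcome of one health check (illustrative shape, not the real schema)."""
    http_status: Optional[int]    # None when the request failed at the transport layer
    latency_ms: Optional[float]   # None when no response was received


def classify(result: Optional[CheckResult]) -> str:
    """Map one check result to operational / degraded / down / unknown."""
    if result is None:
        # No check data yet, e.g. before the first scheduled run.
        return "unknown"
    if result.http_status is None:
        # Transport error or endpoint unavailable.
        return "down"
    if result.http_status >= 500:
        # Simplification: a single server failure counts as down here.
        return "down"
    if result.http_status == 429:
        # Reachable but throttled.
        return "degraded"
    if result.latency_ms is not None and result.latency_ms > LATENCY_DEGRADED_MS:
        # Reachable but unusually slow.
        return "degraded"
    # Anything else, including an expected auth response such as 401,
    # means the endpoint is reachable and behaving as expected.
    return "operational"
```

For example, `classify(CheckResult(401, 120.0))` returns `"operational"`, because an unauthenticated probe is expected to receive an auth response from a healthy endpoint.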
Written for production incident response. Start here if you are seeing API failures right now.
Diagnose rate-limit failures and implement a safe retry/backoff strategy without retry storms.
Troubleshoot slow responses, tune timeout budgets, and harden fallback paths for latency spikes.
Production strategy with routing pseudocode, readiness checklists, and failure-mode handling.
Root causes, severity levels, and concrete remediation steps for common AI API errors.
Independent uptime and latency monitoring for every major AI API provider.
Independent OpenAI uptime, latency, incidents, endpoint status, and region-level health signals.
Independent Anthropic uptime and latency snapshot with actionable reliability interpretation.
Independent Gemini uptime, latency trend, and recent incident windows.
Independent uptime and latency context for Mistral API operations.
Fast outage diagnosis with live checks and practical next-step actions for engineering teams.
Fast outage diagnosis for Anthropic API with live monitor signals.
Compare uptime, latency, error rate, and cost across providers with 30/90-day views.
Timeline, root cause, mitigation, and preventive actions from recent disruption windows.
Historical incident summaries and rolling uptime trends for planning and postmortems.
Filterable 24h/7d/30d reliability view with provider-specific incident windows.
Estimate monthly spend, failed-request exposure, and fallback overhead before incidents happen.
Analyze text with transparent AI-likelihood indicators and confidence scoring.
These numbers come from independent HTTP checks run every 5 minutes against each provider's endpoints — not from official status pages. Reliability varies by region, request shape, and provider-side policy. Use this as an external baseline alongside your own internal telemetry. Full details on endpoint selection, status classification, and known caveats are on the Methodology page.
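To make the rolling uptime figures concrete, here is a minimal sketch of how a percentage could be computed from a series of 5-minute check results. The `rolling_uptime` function is hypothetical, and two choices in it are assumptions rather than documented behavior: degraded checks count as up, and checks with no data are excluded from the denominator. The Methodology page remains the authority on the actual calculation.

```python
from typing import Optional, Sequence

CHECK_INTERVAL_MINUTES = 5  # check cadence stated on this page


def rolling_uptime(statuses: Sequence[str], window_hours: int) -> Optional[float]:
    """Return uptime percent over the most recent window, or None without data.

    `statuses` is ordered oldest to newest, one entry per check, using the
    labels "operational", "degraded", "down", and "unknown".
    """
    checks_in_window = window_hours * 60 // CHECK_INTERVAL_MINUTES
    # Assumption: "unknown" checks carry no signal, so they are skipped.
    recent = [s for s in statuses[-checks_in_window:] if s != "unknown"]
    if not recent:
        return None
    # Assumption: a degraded endpoint is still reachable, so it counts as up.
    up = sum(1 for s in recent if s != "down")
    return 100.0 * up / len(recent)
```

With three operational checks followed by one down check, `rolling_uptime(["operational"] * 3 + ["down"], 24)` returns `75.0`.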