Kubernetes-native IT operations platform. Self-hosted. Helm install in 5 minutes. SLA monitoring that your management team will actually understand.
From service discovery to SLA reporting, all integrated in a single self-hosted platform.
Real-time uptime tracking with error budgets. 5-minute snapshots feed daily reports (auto-generated at 07:00), monthly rollups in the dashboard. Know exactly where you stand.
Agent auto-discovers Kubernetes services from ConfigMaps. No manual registration. GitOps-native, zero config drift.
ITIL-compliant incident management with visual workflow builder. Auto-creates tickets on service outage. SLA timers on every ticket.
Standalone status page for stakeholders. No login needed. Share a link, and your clients see real-time uptime data.
Push-based health monitoring for VMs and physical hardware. Not just Kubernetes — one platform for your entire infrastructure.
Webhook-based disk usage and backup status tracking. Any backup tool calls a single endpoint. Alerts if backup is overdue.
Every service is identified by a single path: org/platform/env/cluster/service. That’s the only required field. Everything else is optional and progressively unlocks more features.
# it-ops.yaml in a ConfigMap
path: "mlops-app/itops/prod/eu-west-1/payment-api"
Service appears in the Operations tree, K8s workload health flows automatically.
path: "mlops-app/itops/prod/eu-west-1/payment-api"
slaGroup: "payment-system"
SLA monitoring switches on. Criticality is inherited from the group tier.
path: "mlops-app/itops/prod/eu-west-1/payment-api"
slaGroup: "payment-system"
dependencies:
requires:
- path: "mlops-app/itops/prod/eu-west-1/payment-db"
critical: true
Every reference is globally unique. Dependency chips become clickable in the UI.
“Start with one line. Grow the config only when you want a new feature.”
From zero to full SLA dashboard in under 5 minutes.
Add the chart repo and install the agent on your cluster. One command, one namespace.
The agent watches ConfigMaps and reports service status every 30 seconds. No manual setup needed.
Uptime percentages calculated automatically. Share the dashboard with management. Done.
Your clients expect 99.9% uptime. SLA means Service Level Agreement — the uptime promise you make to your customers. Can you prove you are meeting it?
"Your clients expect 99.9% uptime. Can you prove it?"
Monthly uptime reports generated automatically. No more spreadsheets.
Kubernetes clusters and bare metal servers in a single pane of glass.
Stop manually calculating uptime in spreadsheets. ITOps gives your team real-time visibility across every cluster and every environment.
"Stop manually calculating uptime in spreadsheets."
See how ITOps stacks up against popular monitoring and operations tools.
| Feature | ITOps | Datadog | PagerDuty | Uptime Robot |
|---|---|---|---|---|
| Self-hosted | ✓ | ✗ | ✗ | ✗ |
| Kubernetes-native | ✓ | ✓ | ✗ | ✗ |
| SLA Tracking | ✓ | Partial | ✗ | Basic |
| Ticketing Built-in | ✓ | ✗ | ✗ | ✗ |
| Service Discovery | ✓ | ✓ | ✗ | ✗ |
| Bare Metal Support | ✓ | ✓ | ✗ | ✗ |
| Pricing | Free / Licensed | $$$ | $$ | Free / $ |
| Setup Time | 5 minutes | Hours | Hours | Minutes |
| Data Ownership | 100% yours | Cloud | Cloud | Cloud |
Self-hosted. Your data stays yours. No per-seat surprises.
For small teams getting started
All plugins, unlimited services
Dedicated support, custom integrations
Deploy in under 5 minutes. No credit card required.