Platform Operations

Day-2 operations for maintaining and scaling kMetal in production.

Observability

Monitoring - Prometheus monitoring and alerting setup

Resilience

Backup & Recovery - Platform and tenant backup procedures

Disaster Recovery - DR planning and recovery procedures

Lifecycle Management

Upgrades - Platform upgrade procedures

Maintenance - Regular maintenance tasks and schedules

Scaling - Horizontal and vertical scaling strategies