Achievements
- Automated infrastructure management for 750+ Linux servers (Selectel VPS + bare metal) using Ansible playbooks and Kubernetes, reducing manual operations by ~80%
- Reduced deployment time from 8 to 2 minutes (−75%) by optimizing GitLab CI caching, build parallelization, and Docker layer structure
- Reduced MTTR by 60% through a centralized monitoring stack (Prometheus, Grafana, Alertmanager)
- Reduced Docker image size from 500 MB to 200 MB (−60%) by switching to an Alpine base image, using multi-stage builds, and removing debug tools from production images
- Replaced manual service health checks with an automated SaaS monitoring service (Go + PostgreSQL + React) supporting configurable check intervals (10s–24h)
- Responded to production incidents, including SSH brute-force attacks, within a 1.5–2 minute SLA using live KVM access and automated blocking