Key Responsibilities:
Stakeholder Management Working with key technology stakeholders to deliver SRE strategies and capabilities to drive and support the digital transformation agenda.
Architect and Optimize CI/CD Pipelines Design and maintain cloud-native CI/CD workflows using tools like GitHub Actions, Jenkins, or ArgoCD. Automate build, test, and deployment processes for microservices across Kubernetes clusters and multi-cloud environments.
Implement DevSecOps Practices Integrate security into every stage of the pipeline—automating vulnerability scans, secrets management, and policy enforcement using tools like Snyk, HashiCorp Vault, and OPA.
Ensure High Availability and Resilience Build fault-tolerant systems using cloud-native patterns (e.g., self-healing, auto-scaling, blue/green deployments). Leverage Kubernetes, service meshes, and distributed tracing to maintain performance and uptime.
Monitor, Alert, and Respond Deploy observability stacks (Prometheus, Grafana, ELK, OpenTelemetry) to monitor system health. Define SLOs/SLIs, set up intelligent alerting, and lead incident response and postmortems.
Manage Infrastructure as Code (IaC) Using Terraform and cloud vendor tools to provision and manage cloud resources. Maintain version-controlled infrastructure and enforce change management practices.
Enforce Compliance and Governance Ensure systems meet regulatory and organizational standards (e.g., SOC 2, HIPAA, ISO 27001). Automate audit trails and implement continuous compliance checks.
Collaborate Across Engineering Teams Partner with developers, QA, and platform engineers to embed reliability and security into the SDLC. Advocate for cloud-native best practices and drive adoption of scalable patterns.
Mentor and Lead by Example Guide junior engineers, conduct technical reviews, and foster a culture of ownership, automation, and continuous learning.
Continuously Improve Systems and Processes Identify performance bottlenecks, reduce toil through automation, and evolve infrastructure to support rapid innovation and growth.
Skills Required: