We are seeking a highly skilled DevOps / SRE Engineer to join our cloud engineering teams supporting large‑scale, mission‑critical platforms. You will work with modern cloud‑native technologies, build and maintain CI/CD pipelines, manage Kubernetes workloads, automate deployments, improve reliability, and ensure secure and scalable operations across Azure environments.
Must‑Have Skills
-
5+ years of DevOps / SRE experience.
-
Strong hands‑on experience with Azure Cloud services.
-
Proficiency with Kubernetes (AKS mandatory).
-
Experience designing and maintaining CI/CD pipelines (Azure DevOps, GitHub Actions).
-
Strong scripting abilities: PowerShell / Bash / Python.
-
Knowledge of Azure CLI, REST APIs, YAML, Git, GitHub, and version‑control best practices.
-
Experience with containerization (Docker) and Kubernetes networking/configuration.
-
Exposure to monitoring & observability tools (Grafana, Prometheus, Azure Monitor).
-
Experience supporting production environments and troubleshooting cloud deployments.
-
Understanding of cloud networking (VNETs, routing, private endpoints, firewalls).
-
Experience with GitOps tools (ArgoCD).
-
Familiarity with Ansible, Terraform, Helm.
-
Knowledge of API gateway tools (Azure APIM, Apigee, etc.).
-
Experience with large distributed systems in enterprise environments.
-
On‑call support experience.
-
Certifications: AZ‑104, AZ‑400, CKA/CKAD (plus).
Key Responsibilities
-
Design, build, and maintain CI/CD pipelines using Azure DevOps and/or GitHub Actions.
-
Administer and optimize Azure Cloud infrastructure (App Services, AKS, Functions, Storage, VNets, Private Endpoints).
-
Manage and operate Kubernetes (AKS) clusters for high‑availability, reliability, and security.
-
Implement automation using IaC tools (ARM/Bicep/Terraform) and configuration management (Ansible).
-
Apply GitOps practices using tools like ArgoCD for continuous delivery.
-
Develop and maintain automation scripts using PowerShell, Bash and/or Python.
-
Work closely with dev teams to support application deployments, API integrations, and debugging.
-
Build and maintain monitoring dashboards (Grafana, Azure Monitor, Application Insights).
-
Troubleshoot production issues related to performance, deployments, networking, and container workloads.
-
Provide support and participate in on‑call rotations.
-
Produce clear documentation, operations runbooks, architecture diagrams, and deployment workflows.