About Arcadion
Arcadion is a Canadian innovation company focused on enterprise-grade cloud infrastructure, AI systems integration, cybersecurity, and next-generation managed services. Our engineering culture is driven by automation, zero-trust principles, and deeply integrated platform design. We build and support mission-critical environments for businesses across Canada, the U.S., and globally.
We are expanding our Cloud & Infrastructure division and are looking for a Kubernetes DevOps Specialist to join our team of senior engineers responsible for designing, deploying, and optimizing complex containerized workloads across private and public cloud environments.
Position Overview
The Kubernetes DevOps Specialist will architect, deploy, automate, and maintain Kubernetes-based platforms across Arcadion’s datacenters and client hybrid cloud environments. You will work closely with our cloud, security, SRE, and AI engineering teams to ensure scalable, secure, resilient, and observable infrastructure.
This role requires deep technical expertise, strong problem-solving skills, and the ability to design production-ready systems that meet the highest standards of reliability, performance, and security.
Key Responsibilities
Kubernetes & Cloud Architecture
- Design and deploy Kubernetes clusters (K8s, K3s, OpenShift, Rancher, Harvester, etc.) in private cloud, multi-cloud, and hybrid environments.
- Implement service mesh technologies (Istio, Linkerd) and container networking (CNI, Calico, Cilium).
- Manage Helm charts, Kustomize configs, cluster lifecycle (GitOps), and automated deployment workflows.
DevOps & Automation
- Build CI/CD pipelines (GitLab, GitHub Actions, ArgoCD, Tekton) for containerized applications.
- Automate infrastructure deployments using IaC (Terraform, Ansible, Pulumi).
- Implement infrastructure observability using Prometheus, Grafana, Loki, ELK/EFK, Jaeger, etc.
Security & Compliance
- Enforce zero-trust design patterns and secure cluster configuration (CIS Benchmarks, NIST frameworks).
- Integrate container security scanning, runtime protection, and secret management (Vault, Sealed Secrets, SOPS).
- Work with Arcadion’s SOC and security teams to ensure compliance and threat monitoring for all cluster workloads.
Operations & Reliability
- Manage cluster upgrades, patching, node scaling, and backup/restore operations.
- Improve reliability, performance tuning, workload scheduling, and cost optimization.
- Support production workloads, troubleshoot distributed systems, and participate in on-call rotations.
Required Skills & Experience
- 5+ years of experience in DevOps, SRE, platform engineering, or cloud infrastructure roles.
- Expert-level understanding of Kubernetes (CKA / CKAD / CKS certifications highly preferred).
- Strong experience with Docker, container architecture, registries, and image pipelines.
- Hands-on experience with Terraform, Ansible, GitOps frameworks, and declarative infrastructure.
- Strong command of Linux systems (Ubuntu, RHEL, SUSE, Debian).
- Deep knowledge of cloud platforms: Azure, AWS, GCP, or OpenStack.
- Experience with load balancers, ingress controllers, networking, DNS, TLS, and certificate automation.
- Solid understanding of DevSecOps, RBAC, IAM, OPA/Gatekeeper, and compliance practices.
Nice-to-Have
- Experience with VMware Tanzu, Nutanix Karbon, SUSE Rancher/Harvester, or Red Hat OpenShift.
- Knowledge of HPC and GPU workloads, NVIDIA GPU Operator, MIG slicing, or AI/ML pipelines.
- Experience with distributed databases (MongoDB, CockroachDB, PostgreSQL HA).
- Familiarity with backup platforms (Velero, Kasten) and DR orchestration.
- Exposure to Arcadion-aligned technologies: CrowdStrike, Acronis, Fortinet, Veeam, Dell/HPE hardware.
Who You Are
- A builder, innovator, and continuous learner.
- Someone who thrives in high-autonomy engineering cultures.
- Comfortable with complex systems and elegant automation.
- Security-driven and detail-oriented.
- Able to work independently and collaborate tightly with global teams.
What We Offer
- Competitive compensation aligned with senior engineering roles.
- Remote-first flexibility with optional access to Arcadion offices.
- Cutting-edge projects involving Kubernetes, AI/ML, HPC, and zero-trust cloud architecture.
- Access to training, certifications, and continuous learning opportunities.
- A culture focused on innovation, engineering excellence, and career growth.