We are looking for a Senior DevOps Engineer with strong expertise in incident and request management, along with experience using tools such as Dynatrace, Grafana, and Splunk.
The position covers monitoring setup, tool administration, and break-fix activities for medium-complexity tickets.
Responsibilities
- Lead technical initiatives by providing guidance and oversight to team members
- Architect, design, develop, test, deploy, and maintain software solutions aligned with project requirements
- Cooperate with other technical and non-technical teams to ensure successful project outcomes
- Contribute to the development and delivery of solution requirements, estimates, and timelines
- Uphold the quality of technical deliverables while following development standards and best practices
- Handle incidents and requests using ServiceNow or JIRA as the tracking platform
- Stay available for monitoring and escalation during off-hours and weekends, including carrying pager duty for after-hours emergencies
- Triage tickets, update ticket details, and assess urgency accordingly
- Be open to participating in on-call rotations in India and during CST hours
Requirements
- A minimum of 3 years of relevant professional experience
- Hands-on background with Microsoft Azure and Azure Log Analytics for cloud infrastructure and monitoring
- Experience with Dynatrace administration and Dynatrace Workflows for observability and automation
- Practical knowledge of event management, extensions, and integrations within Dynatrace
- Skills in infrastructure as code deployments using Terraform
- Capability in Kubernetes automation for container orchestration and management
- Familiarity with additional tools such as GitHub Actions for CI/CD, ServiceNow for IT service management, and Confluence for documentation and collaboration
- Ability to work both independently and as part of a team
- Strong analytical, strategic-thinking, and complex problem-solving skills, with proven experience troubleshooting under pressure
- Solid organizational and interpersonal skills, including experience fostering a culture of operational maturity
- Capacity to adjust quickly to new technologies
- Excellent oral and written English communication skills (C2 level)
We offer
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn