We are looking for a highly skilled Senior Data Engineer to join a team responsible for the development, maintenance, and evolution of the data platforms that power a specialized clinical trial management application. This role calls for a technically experienced professional who can work autonomously to design, build, optimize, and support modern data pipelines, ensuring the quality, reliability, and availability of information for critical analytical and operational processes.
Responsibilities
-
Design, develop, and maintain scalable and reliable data pipelines for data ingestion, transformation, and consumption
-
Build ETL/ELT processes using modern data engineering technologies
-
Develop and optimize data transformations using Python, PySpark, and dbt
-
Design and manage data storage and processing solutions in Snowflake
-
Apply best practices in data modeling, data architecture, and information governance
-
Contribute to support, monitoring, and incident resolution activities related to data integration processes and data platforms
-
Define automation, testing, and deployment strategies through CI/CD practices
-
Collaborate with cross-functional teams across development, product, analytics, and business to understand requirements and translate them into efficient technical solutions
-
Ensure data quality, consistency, and traceability throughout the entire information lifecycle
-
Identify opportunities to improve the performance, scalability, security, and maintainability of data solutions
-
Document architectures, technical processes, and development standards
Requirements
-
3+ years of experience in Data Engineering
-
Strong development experience with Python
-
Advanced experience with PySpark for distributed data processing
-
Hands-on experience with Snowflake (modeling, optimization, development, and administration of data objects)
-
Experience implementing transformations using dbt (Data Build Tool)
-
Knowledge and practical experience applying CI/CD practices for data solutions
-
Solid command of the following concepts: Data Modeling, Data Warehousing, Data Lake / Lakehouse, Data Architecture, Data Integration and Orchestration
-
Experience with version control using Git
-
Ability to design technical solutions independently and lead complex data engineering initiatives
-
Strong analytical and problem-solving skills
-
Self-driven, results-oriented approach
-
Effective communication with both technical and functional teams
-
Ability to manage multiple priorities simultaneously
-
Architectural mindset with a focus on scalability
-
Proactivity in identifying risks and improvement opportunities
-
Attention to detail and commitment to quality
-
Strong English communication skills (B2 level or higher)
Nice to have
-
Experience with cloud environments (AWS, Azure, or GCP)
-
Familiarity with orchestration tools such as Airflow or similar
-
Experience working in regulated environments, preferably in healthcare, pharmaceutical, or clinical research sectors
-
Knowledge of data quality and pipeline observability
-
Experience with Agile/Scrum methodologies
We offer
-
International projects with top brands
-
Work with global teams of highly skilled, diverse peers
-
Healthcare benefits
-
Employee financial programs
-
Paid time off and sick leave
-
Upskilling, reskilling and certification courses
-
Unlimited access to the LinkedIn Learning library and 22,000+ courses
-
Global career opportunities
-
Volunteer and community involvement opportunities
-
EPAM Employee Groups
-
Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn
EPAM is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, age, sexual orientation, gender identity or expression, disability, protected veteran status, or any other characteristic protected by applicable law.