EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multinational teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

We are looking for a Lead Cloud Platform Support Engineer to join our high-powered team. As a Lead Cloud Platform Support Engineer, you will lead and mentor a team providing 24/7 support for customer platforms in public clouds. You will be instrumental in defining best practices, driving process improvements, and ensuring the stability, reliability, and performance of cloud environments. You will also play a critical role in collaborating with stakeholders, initiating innovative solutions, and leading the technical efforts for our most complex challenges. If you are a seasoned professional with a passion for cloud technologies and an eagerness to lead by example, we invite you to apply!
Responsibilities
- Lead and oversee Event, Incident, Problem, and Change Management processes
- Mentor and guide team members to ensure their professional development and alignment with team objectives
- Act as the primary escalation point for critical technical issues and facilitate resolutions during major incident handling (bridge) calls
- Support customer environments in public clouds (~1000+ hosts) and provide expertise on advanced technical aspects of installation, scaling, availability, and performance tuning
- Collaborate with global technical and management teams, providing strategic input and alignment on cloud operations and innovations
- Automate routine tasks and workflows through advanced scripting
- Develop and enforce best practices for Infrastructure as Code and configuration management automation
- Conduct vulnerability assessments, resolve identified security gaps, and ensure ongoing compliance
- Maintain comprehensive documentation for supported systems, tools, and team processes
- Provide advanced systems troubleshooting and root cause analysis for applications
- Drive operational improvements and efficiencies through the adoption of new tools and technologies

Requirements
- 5+ years of experience as a Systems Engineer or Systems Administrator on Linux and Windows platforms
- 1+ years of leadership experience in relevant roles
- Advanced proficiency in scripting languages such as Bash, Python, and PowerShell
- Extensive cloud experience in AWS (VPC, EC2, Lambda, S3, RDS, Route53, ACM, IAM, CloudWatch, Config) with an understanding of advanced cloud-native services
- Strong theoretical and practical understanding of computer networks, including the OSI model and TCP/IP stack
- Significant experience with middleware for Java-based applications, including deployment and tuning
- Proven experience working with WAF, DDoS mitigation, and penetration test assessments
- Expertise in systems and application performance tuning in complex environments
- Proficiency with Git, Bitbucket, and version control workflows
- Strong troubleshooting skills across networks, systems, and distributed applications
- Demonstrated ability to lead a team while balancing individual technical responsibilities
- Excellent communication abilities in spoken and written English (B2+ level)

Nice to have
- Cloud experience in Azure and familiarity with multi-cloud environments
- Experience in designing and writing advanced scripts/templates/playbooks for automating cloud infrastructure, configuration management, and application deployments (e.g., AWS CloudFormation, Puppet, Ansible, Terraform)
- Deep knowledge of advanced cloud practices and patterns (Load Balancing, Auto-Scaling, Monitoring, Blue-Green Deployment, Zero Downtime, Security Hardening)
- Understanding of CI/CD principles and ITIL best practices, with hands-on experience implementing these frameworks
- Experience with containerized applications using Docker, as well as Kubernetes for orchestration

We offer
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Business Development, Information Technology, and Engineering
Industries: Software Development, IT Services and IT Consulting, and Media and Telecommunications