SRE ENGINEER

80.000.000 - 120.000.000

Job purpose

The Advanced Analytics area at Liberty Latin America (LLA) is responsible for delivering measurable and actionable value to all the group's business functions (commercial, finance, operations, networks, etc.) through data and analytics-based enhancements. Those enhancements will be a mix of AI/ML models and operationalized strategies that allow the different areas to improve their decisions and actions.

To achieve this goal, the area needs strong DevOps methodologies to speed up and automate the development, testing, and release of machine learning models, allowing the continuous delivery of models and software updates. This lets the organization address the architectural and information availability challenges of a complex multi-market, multi-system telecom company. We therefore count on our site reliability engineers (SREs) and DevOps engineers to empower our users with a rich feature set, high availability, and stellar performance in pursuit of their missions.

As we expand our machine learning model deployments, we are seeking an experienced SRE to deliver insights from massive-scale data. Specifically, we are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a cross-functional team to develop real-world solutions and positive user experiences at every interaction.

The SRE Specialist will be responsible for bridging the gap between development and operations, focusing on automation, monitoring, and infrastructure management. The ideal candidate will have a deep understanding of both DevOps practices and Site Reliability Engineering (SRE) principles, with the ability to drive continuous improvement and ensure the stability and performance of our systems.

Objectives of this Role

- Design, implement, and manage CI/CD pipelines to automate deployment processes.
- Develop and maintain infrastructure as code (IaC) using tools such as AWS CDK, Terraform, Ansible, or similar.
- Monitor and ensure the reliability, availability, and performance of our systems and applications.
- Implement and manage logging, monitoring, and alerting solutions to detect and respond to issues proactively.
- Collaborate with development teams to ensure applications are designed for reliability and scalability.
- Conduct root cause analysis and post-mortem reviews to learn from incidents and prevent recurrence.
- Optimize system performance and manage capacity planning.
- Lead and mentor junior and senior team members, fostering a culture of continuous learning and improvement.
- Stay current with industry trends and best practices to drive innovation and efficiency.

Key Accountabilities

The ideal candidate will have a very strong technical background in cloud architecture and CI/CD pipeline automation, practical experience in AWS, and will fulfil the following main functions:

- Gather and analyze metrics from our applications to assist in performance tuning and fault finding.
- Partner with development teams to improve services through rigorous testing and release procedures.
- Participate in system design consulting, platform management, and capacity planning.
- Create sustainable systems and services through automation and uplifts.
- Balance feature development speed and reliability with well-defined service level objectives.

Knowledge and Experience

Experience:

- 2+ years of experience in DevOps, SRE, or related roles.
- Knowledge of Python programming.
- Proficiency in command-line usage and scripting in Linux.
- Experience with the AWS cloud platform.
- Experience with CI/CD tools (Jenkins, GitLab CI, CircleCI).

Skills & Abilities:

- Bachelor's degree in computer science or another highly technical, scientific discipline.
- Ability to program (structured and OO) with Python.
- A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
- Awareness of DevOps and Agile principles.
- Strong problem-solving skills.
- Knowledge of cloud platforms and security.
- Ability to communicate complicated technical problems to both technical and business audiences.
- Ability to quickly understand business objectives and to recognize and capitalize on data-driven solutions that follow from them.
- A dynamic, competitive edge to personal style, with the ability to assist and sympathize with peers' and other colleagues' business circumstances and pressures.
- Personal drive and ambition to progress within the organization.

Preferred education/qualifications:

- Master's degree in Computer Science, Engineering, or related field.
- Experience with observability tools (Prometheus, Grafana, ELK Stack).
- Familiarity with configuration management tools (Chef, Puppet).

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Telecommunications

Vacancy posted 1 day ago
