(HJW-183) SITE RELIABILITY ENGINEER (MIDDLE)

Agileengine


Job Description: We are looking for a Site Reliability Engineer to join our team at AgileEngine. As a SRE, you will be responsible for ensuring the reliability and scalability of our systems. Key Responsibilities: - Manage alerts daily, check systems, and escalate issues as needed - Be part of a team that provides 24×7 on-call support for critical SaaS events - Be available in case of emergencies when team members are not available or need help - Document issues and remediation steps Requirements: - 2+ years of professional experience - Experience working with Datadog - Hands-on experience as an AWS Cloud Engineer - Working knowledge of EKS/Terraform/Helm Preferred Qualifications: - Experience with Docker and Docker Swarm - Good understanding of AWS IAM roles and policies - Experience logging and monitoring AWS resources using CloudWatch logs Benefits: - Professional growth: Accelerate your professional journey with mentorship, TechTalks, and personalized growth roadmaps - Competitive compensation: We match your ever-growing skills, talent, and contributions with competitive USD-based compensation and budgets for education, fitness, and team activities

trabajosonline.net © 2017–2021
Más información