SYSTEM INFRASTRUCTURE SPECIALIST - (NY303)

Bebeeengineer


Are you looking for a challenging role in our high-scale, fast-changing production environment? As a Senior Site Reliability Engineer, you will play a vital part in ensuring the availability, performance, and efficiency of our systems. Key Responsibilities: - Ensure production environment availability, performance, and efficiency - Develop scalable infrastructure-as-code (IaC) solutions to improve deployment efficiency and system reliability across our multi-cloud services - Design and maintain web traffic distribution infrastructure handling billions of requests per second and hundreds of gigabytes of traffic - Manage thousands of nodes in Kubernetes clusters while continuously improving containerization strategies and GPU resource management - Build and support a self-service toolset to empower developers and facilitate integration with our infrastructure - Enhance observability, monitoring, and logging to proactively detect and resolve performance issues We are committed to creating an inclusive environment for all employees, believing it is critical for success. Employment decisions are based on qualifications, merit, and business needs. At our company, we value collaboration and teamwork. Our team members work together to achieve common goals and support each other in their career development. We offer a hybrid work schedule with 3 days in-office, with options to come in more often if desired. We work with some of the biggest names in the industry, including leading publishers and advertisers. If you are passionate about building and solving infrastructure challenges with automation, working with cutting-edge technologies, and pushing those technologies to their limits, then this role may be perfect for you.

trabajosonline.net © 2017–2021
Más información