AgileEngine is an award-winning software development company that creates innovative solutions for Fortune 500 brands and trailblazing startups. We excel in application development, AI/ML, and have a people-first culture that has earned us multiple Best Place to Work awards. Job Responsibilities - Design, deploy, and operate scalable and robust Kubernetes environments supporting data and analytics workloads; - Build, automate, and maintain complex data pipelines using Argo Workflows for orchestration, scheduling, and workflow automation; - Lead or support migration of source code repositories and CI/CD pipelines to GitLab or other Git-based platforms; - Develop and manage infrastructure with Terraform and related tools, implementing infrastructure automation and repeatable deployments in AWS and Kubernetes; - Support high-availability S3-based data lake environments and associated data tooling, ensuring robust monitoring, scalability, and security; - Instrument, monitor, and create actionable alerts and dashboards for Kubernetes clusters, Argo workflows, and data platforms to quickly surface and resolve operational issues; - Participate in incident, problem, and change management processes, proactively drive improvements in reliability KPIs; - Work cross-functionally with Data Engineering, SRE, Product, and Business teams to deliver resilient solutions and support key initiatives; Requirements - Bachelor's degree in Computer Science, Engineering, or related field, or equivalent experience; - 5+ years of production experience operating and managing Kubernetes clusters; - Strong hands-on experience with AWS cloud services; - Deep hands-on experience with Argo Workflows, including developing, deploying, and troubleshooting complex pipelines; - Experience with Git, GitLab, and CI/CD, including leading or supporting migration projects and the adoption of GitOps practices; - Effective at developing infrastructure as code with Terraform and related automation tools; - Practical experience in automating data workflows and orchestration in a cloud-native environment; - Proficient in SQL and basic scripting; - Sound understanding of networking, security, and IAM in cloud environments; - Proficient in Linux-based systems administration; - Strong written and verbal communication skills; - Ability to collaborate in cross-functional environments; - Track record delivering reliable, secure, and scalable data platforms; - Experience working with S3-based data lakes or similar large, cloud-native data repositories; - Upper-Intermediate English level.