**Overview** At ForUsAll, we’re revolutionizing the U.S. retirement industry with cutting-edge AI technology. Based in San Francisco, our fintech startup is on a mission to provide cost-efficient retirement solutions for small and mid-sized businesses. Founded by industry pioneers who reimagined 401(k) plans for Fortune 500 companies, we’re supported by top-tier venture capitalists and financial tech experts who share our passion for empowering everyday Americans to achieve financial security. Founded by industry veterans who previously transformed retirement plans for Fortune 500 companies, we’re backed by top venture capital firms and fintech leaders who share our mission to democratize access to modern, diversified retirement portfolios. **About the Role**: You’ll collaborate closely with AI engineers, backend developers, and product teams to streamline development and operational efficiency from code to cloud. **What You’ll Do**: - Architect, maintain, and optimize cloud infrastructure in **AWS**, following security and scalability best practices- Manage and enhance our **CI/CD pipelines** (e.g., GitHub Actions, CodePipeline, CircleCI) to support reliable and fast releases- Build and maintain **MLOps workflows**, including model versioning, training pipelines, testing, and deployment automation- Support container orchestration using **Docker** and **Kubernetes** (EKS preferred)- Define and enforce infrastructure-as-code practices using tools like **Terraform** or **CloudFormation**- Monitor system performance and availability using modern observability stacks (e.g., **Prometheus**, **Grafana**, **Datadog**, **CloudWatch**)- Collaborate with engineering teams to set up staging environments, automate tests, and manage secrets and access control - Drive reliability and incident response practices (alerts, runbooks, root cause analysis, etc.) **Requirements**: - 5+ years of DevOps, Site Reliability, or Infrastructure Engineering experience- Strong hands-on experience with **AWS core services** (EC2, ECS/EKS, S3, IAM, CloudWatch, RDS, Lambda, etc.)- Experience designing and maintaining robust **CI/CD pipelines** with GitHub Actions or similar tools- Solid understanding of **MLOps workflows** and ML model deployment practices- Proficient in infrastructure-as-code tools (Terraform or CloudFormation)- - Proficient with scripting languages (Bash, Python, etc.) for automation and tooling - Familiarity with security best practices (e.g., least-privilege IAM roles, secret rotation, encryption in transit/at rest) **Bonus**: - Experience with AWS sagemaker, Bedrock a plus