← All Jobs
Posted May 2, 2026

Site Reliability Engineer II

Apply Now

POSITION SUMMARY

The ideal candidate will have 7+ years of experience in Linux systems and software management, expertise with Terraform, Ansible, and cloud platforms like AWS, Azure, and GCP. Experience with large-scale distributed systems, monitoring/alerting systems (Prometheus, Grafana), CI/CD pipelines, container orchestration (Docker, Kubernetes), and programming languages (Go, Java, Python) is essential. Because we are an AI-first company, this role also heavily involves engineering scalable infrastructure for machine learning workloads, including GPU provisioning and MLOps integrations. A background in implementing security controls, automating deployments, and troubleshooting complex systems is also required.

‎ 

WHAT YOU'LL DO

‎ 

WHAT YOU'LL NEED

Systems and Tools

Bonus Points If

DISCLOSURE

Our company provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability or genetics.

(Colorado & California Only*): The posted annual salary range provided is of $130,000.00 to $140,000.00. This base pay is for illustrative purposes only and will be determined based on skills and experience comparable to the job requirements. This position may be eligible for additional compensation and benefits including but not limited to: incentive compensation; health benefits; retirement benefits; life insurance; paid time off; parental leave and benefits; and other employee perks and benefits.
• Note: Disclosure as required by sb19-085 (8-5-20) of the minimum salary compensation for this role when being hired in California & Colorado.

‎ 

Interested in this role?Apply on iHire