xAI is seeking a Site Reliability Engineer to ensure the reliability, scalability, and performance o
Job Responsibilities
As a Site Reliability Engineer on the xAI Technical Operations team, you will be responsible for ensuring the reliability, scalability, and performance of our AI infrastructure. Your primary duties include:
Infrastructure Management: Design, implement, and maintain highly available, fault-tolerant systems that support our AI models and services. This involves managing cloud-based and on-premises infrastructure, including compute clusters, storage systems, and networking components.
Qualifications
To be successful in this role, you should have:
Education: Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
Experience: 3 to 5 years of experience in site reliability engineering, DevOps, or a similar role.