Team Background: As a part of the Infrastructure organization, this team enables development and validation of autonomous vehicle software. Our mission is to provide a reliable, scalable and cost-efficient compute platform for rapid AV software development, validation and deployment. We orchestrate a variety of workloads such as simulations, model compilation & deployment, model performance validation, build, labeling, etc. on the Cloud and custom Cruise hardware. As Cruise’s largest compute provider, we strive to maintain a harmony between cost efficiency and developer productivity.
Role: We are seeking a Senior Software Engineer to lead key initiatives within our AI Validation Platform team, focused on scaling, driving efficiency and high utilization of cutting-edge GPUs and custom on-premise hardware, while also leveling up the platform’s reliability and user experience. The successful candidate will have experience building and running scalable distributed systems. They will rapidly test and promote ideas, possess strong problem-solving skills, and demonstrate a bias for action.
If you are looking to solve one of today’s most complex engineering challenges, see the results of your work in hundreds of self-driving cars, and make a positive impact in the world starting in our cities, join us!
What you’ll be doing:
- Own core platform components, services and our runtime environment.
- Analyze and improve efficiency, scalability, and reliability of systems.
- Proactively drive improvements across product areas.
- Work on large scale initiatives to raise the overall Cruise engineering bar.
- Drive community recognition for Cruise engineering.
What you must have:
- 6+ years of relevant industry experience.
- Proven record of building large-scale distributed systems.
- Expertise in Go, or similar programming languages.
- Experience installing, configuring and running applications on Linux.
- Experience leading large cross-functional initiatives.
- Experience with cloud platforms like GCP or Azure.
- Ability to thrive in a dynamic, multi-tasking environment with ever-evolving priorities.
- Highly effective collaboration and influencing skills.
Bonus points!
- Hands-on experience with Kubernetes or Slurm at scale.
- Experience identifying and driving efficiency of GPU workloads.
- Experience with high performance computing (HPC).
- Familiarity with inference and/or simulations.
- Active participation in the open source community.
The salary range for this position is $152,000 - $223,500. Compensation will vary depending on location, job-related knowledge, skills, and experience. You may also be offered a bonus, long-term incentives, and benefits. These ranges are subject to change.