Company Overview
Deepgram is the leading voice AI platform for developers building speech-to-text (STT), text-to-speech (TTS) and full speech-to-speech (STS) offerings. 200,000+ developers build with Deepgram’s voice-native foundational models – accessed through APIs or as self-managed software – due to our unmatched accuracy, latency and pricing. Customers include software companies building voice products, co-sell partners working with large enterprises, and enterprises solving internal voice AI use cases. The company ended 2024 cash-flow positive with 400+ enterprise customers, 3.3x annual usage growth across the past 4 years, over 50,000 years of audio processed and over 1 trillion words transcribed. There is no organization in the world that understands voice better than Deepgram.
Opportunity
We are seeking an experienced Infrastructure Engineer to design, implement, and maintain our large-scale distributed systems infrastructure. You'll be responsible for building and optimizing our network architecture, storage solutions, and compute platforms that power our AI/ML workloads. This role combines expertise in network engineering, storage systems, and modern container orchestration platforms, with a focus on reliability, scalability, and cost-effectiveness.
What You’ll Do
Design and implement reliable, high-performance network architectures for distributed systems
Architect and maintain large-scale storage solutions, including backup systems, distributed caching, and object storage
Build and optimize cost-effective data center infrastructure
Develop and maintain GPU compute clusters for AI inference workloads
Manage large-scale deployments using modern orchestration platforms like Kubernetes and Slurm
Implement monitoring, alerting, and automation solutions for infrastructure management
You’ll Love This Role If You
Are passionate about building reliable, scalable infrastructure systems
Enjoy optimizing complex distributed systems for performance and cost
Love solving challenging problems in networking and storage at scale
Are excited about working with cutting-edge GPU infrastructure
Want to work at the intersection of infrastructure and AI/ML systems
It’s Important To Us That You Have
5+ years of experience in infrastructure engineering or similar roles
Strong background in network engineering and design for reliability
Experience with large-scale storage systems (distributed file systems, caching solutions)
Proven track record of managing data center infrastructure
Expertise in container orchestration and workload management platforms (Kubernetes, Slurm)
Experience with GPU infrastructure management and optimization
Strong automation and scripting skills
It Would Be Great if You Had
Experience with software-defined networking
Knowledge of cost optimization for cloud and on-premises infrastructure
Familiarity with AI/ML workloads and their infrastructure requirements
Experience with multi-region infrastructure deployment
Background in performance optimization for distributed systems
Certification in a relevant cloud platform (AWS, GCP, Azure)
Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!
Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.
We are happy to provide accommodations for applicants who need them.