Blackpoint Cyber is the leading provider of world-class cybersecurity threat hunting, detection and remediation technology. Founded by former National Security Agency (NSA) cyber operations experts who applied their learnings to bring national security-grade technology solutions to commercial customers around the world, Blackpoint Cyber is in hyper-growth mode, fueled by a recent $190m series C round.
Why Blackpoint?
Ready to give some hackers hell? At Blackpoint Cyber, we fight unfair fights, eliminating threats before they strike. Built by former US Department of Defense and Intelligence security experts, our mission is to provide absolute and unified Managed Detection and Response (MDR) services to organizations worldwide.
Company Culture
We value high-quality execution, ownership, and integrity—principles that are never compromised. Our team is collaborative, energetic, and thrives in a high-performance culture, continuously growing by tackling the toughest challenges in cybersecurity.
What You'll Do
As a Senior SRE Manager, you will lead the infrastructure, reliability, and cost optimization efforts for Blackpoint Cyber’s mission-critical services. You will be responsible for ensuring the scalability, availability, and efficiency of our cloud infrastructure, while also optimizing COGS (Cost of Goods Sold) to maintain financial efficiency.
This role requires strong leadership, hands-on infrastructure expertise, and a deep understanding of cost-effective scaling strategies. You will work closely with engineering, security, and product teams to ensure our systems are resilient, secure, and cost-efficient.
Key Responsibilities
Infrastructure & Reliability
Lead the design, implementation, and management of scalable, reliable, and highly available cloud-based infrastructure (AWS/Azure/GCP).
Establish SRE best practices, including monitoring, incident response, capacity planning, and performance tuning.
Improve observability, monitoring, and alerting, ensuring quick detection and resolution of reliability issues.
Drive automation-first approaches, reducing manual intervention through Infrastructure-as-Code (IaC) and CI/CD pipelines.
Lead a team of SREs, applying Blackpoint Cyber's management values of Coach, Model, Care, in defining business-critical outcomes, creating action plans, and supporting the team in achieving them.
Continue hands-on contributions in an SRE role, delivering individual impact up to 50% of the time.
Design, implement, and support key infrastructure, including automated attack infrastructure deployment, isolated identity and productivity environments, and secure data storage.
Establish and apply security hygiene and monitoring policies to meet Blackpoint Cyber security requirements.
COGS Optimization & Cost Efficiency
Monitor and optimize cloud spending, ensuring cost-effective resource utilization without compromising reliability.
Define and implement cost-saving strategies (e.g., right-sizing instances, leveraging spot instances, optimizing storage, etc.).
Work closely with finance and procurement teams to forecast infrastructure costs and align expenses with business objectives.
Leadership & Collaboration
Manage and mentor a team of SREs, DevOps engineers, and cloud infrastructure specialists.
Partner with engineering teams to design reliable and scalable architectures, embedding reliability into development workflows.
Collaborate with security teams to ensure compliance, security hardening, and disaster recovery readiness.
Drive post-incident reviews, ensuring continuous improvement in system resilience.
What You Bring
Must-Have Qualifications
10+ years of experience in SRE, DevOps, or Cloud Infrastructure roles.
3+ years of experience in people management, leading SRE team.
Strong experience with AWS, Azure, or GCP, with expertise in cost management and scaling strategies.
Proficiency in Infrastructure-as-Code (IaC) (e.g., Terraform, CloudFormation, Pulumi).
Hands-on experience with CI/CD pipelines, Kubernetes, and container orchestration.
Expertise in monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk).
Proven ability to optimize cloud costs (COGS) while maintaining reliability and performance.
Strong leadership, collaboration, and problem-solving skills.
Experience with SLA/SLO/SLIs will be valuable.
"Engineering efficiency" through self-serve tooling.
Nice-to-Have
Experience working in a cybersecurity or high-security environment.
Understanding of compliance frameworks (SOC2, ISO 27001, FedRAMP, etc.).
Knowledge of serverless architectures and edge computing.
Experience working with FinOps teams to manage cloud costs effectively.
Blackpoint Cyber welcomes and encourages applications from qualified individuals of all races, colors, religions, sex, sexual orientation, gender identity or expression, national origin, age, marital status, or any other legally protected status. We are committed to equality of opportunity in all aspects of employment. For eligible employees in the US, Blackpoint offers competitive Health, Vision, Dental, and Life Insurance plans, a robust 401k plan, Discretionary Time Off, and other minor perks.