About the Team:
Netlify’s SRE team is scaling to meet the demands of our rapidly growing platform and user base. The team is responsible for ensuring the reliability, scalability, and efficiency of Netlify’s infrastructure while maintaining a focus on innovation and operational excellence. As a Staff Site Reliability Engineer, you will be at the forefront of driving organizational-level reliability strategies, shaping the direction of Netlify’s systems, and tackling complex, systemic challenges. You will collaborate across teams to build a culture of operational excellence and deliver impactful solutions that support our mission to empower the next generation of web developers.
We are a remote-first, globally distributed group that values asynchronous communication, documentation, and a culture of transparency, empowerment, and collective ownership. Diversity and inclusion are at the heart of what we do, and we welcome team members from all backgrounds to bring their unique perspectives to our mission. Whether you’re launching a new phase of your career or growing an established one, Netlify offers a supportive environment where you can thrive while maintaining a healthy work-life balance.
What You’ll Do:
- Champion the architectural vision and technical strategy for Netlify's reliability systems, making pivotal decisions that influence the entire platform's scalability, performance, and operational excellence.
- Foster cross-organizational reliability initiatives, collaborating with multiple engineering teams to implement large-scale infrastructure improvements and standardize SRE practices.
- Cultivate and set technical standards, best practices, and architectural patterns for reliability that will set the foundation for how teams across the organization construct and operate systems.
- Act as the technical authority during major incidents, making critical decisions about system trade-offs and providing guidance to multiple teams during complex outages.
- Cultivate and strengthen relationships with key stakeholders across Engineering, Product, and Executive teams to ensure reliability considerations are integrated into company-wide technical strategy.
- Mentor senior engineers and tech leads across multiple teams, helping them develop their systems thinking and reliability engineering capabilities while fostering a culture of operational excellence.
- Design and spearhead the implementation of reliability frameworks and tooling that can be adopted organization-wide, creating scalable solutions that elevate the entire engineering organization's capabilities.
- Lead architecture reviews and provide technical oversight for critical infrastructure projects, ensuring solutions meet both immediate requirements and long-term strategic goals.
- Develop and evangelize reliability metrics and SLO frameworks that align with business objectives, helping teams make data-driven decisions about reliability investments.
What You’ll Bring:
- A significant history in Site Reliability Engineering or similar roles, with at least two years leading complex technical projects and mentoring senior engineers.
- Deep expertise in cloud architecture with hands-on experience designing and implementing solutions at global scale in providers such as AWS, GCP, or Azure
- Proven track record of driving large-scale technical initiatives that span multiple teams and significant portions of infrastructure.
- Proven expertise in designing and managing CI/CD pipelines using tools such as Jenkins, GitLab CI, CircleCI, or similar.
- Deep expertise in configuration management using tools like Ansible, Chef, or Puppet, with a track record of implementing scalable configuration management solutions across large infrastructure footprints.
- Proficiency with Kafka or other messaging brokers, including deployment, scaling, and maintenance within multi-cloud environments.
- Strong experience in database management, including design, optimization, and maintenance of relational and/or NoSQL databases to support scalable and high-performance applications.
- Proficiency in programming and scripting languages like Python, Go, or Bash to develop automation solutions.
- Strong technical leadership skills with experience influencing engineering decisions across multiple teams without direct authority.
- Exceptional communication skills with experience presenting complex technical strategies to executive leadership and driving consensus among diverse stakeholders.
- Comprehensive understanding of reliability engineering principles and the ability to develop frameworks that help organizations make better reliability decisions.
- Experience establishing technical standards and best practices that have been successfully adopted across large engineering organizations.
- Understanding of security best practices and experience working with compliance frameworks including PCI, ISO 27001, HIPAA, or SOC certifications.
- We welcome candidates based in the UK, Spain, or Poland for this position.
Applying
Not sure you meet 100% of our qualifications? Please apply anyway!
When applying please include:
A resume or short listing of your job history & skills (link to a LinkedIn profile would be fine). We appreciate a cover letter explaining why you would enjoy working in this role at Netlify to get to know you a bit better, though this is not required and will not impact your application. Our mission is to “build a better web” and that cannot be done without a diversity of skill sets, backgrounds and thoughts.
Of everything we've ever built at Netlify, we are most proud of our team. Netlify is an Equal Opportunity Employer. We are devoted to building a team of people with diverse backgrounds and lifestyles. Driving equality empowers our team, enables us to innovate, and helps us maintain a more inclusive environment. We don’t discriminate against employees or applicants based on gender identity or expression, sexual orientation, religion, age, race, military/veteran status, citizenship, pregnancy status, or any other differences. If we can do anything to provide a better interview, i.e. accommodate a disability, then please let us know by emailing accommodations@netlify.com.
About Netlify
At Netlify, we’re on a mission to build a better web by making it easier than ever to build, deploy, and scale web applications. By unifying an entire ecosystem of web development tools, content sources, services, and APIs into one simplified workflow, Netlify empowers top brands to ship campaigns faster, reduce risk, and boost productivity and revenue. At the forefront of the composable web movement, with over 4 million web developers and businesses using the platform, with Netlify, you can connect everything and build anything.
We are a Series D company that has raised over $200M from investors such as Andreessen Horowitz, Kleiner Perkins, EQT, Bessemer, BOND, and Menlo Ventures. As a fully distributed company, we aim to create a company culture where the best idea can come from anywhere and strive to be thoughtful, compassionate, and collaborative in our work. If this sounds like something you’d like to be part of, we’re excited to connect with you!
At Netlify, we are committed to a compensation philosophy that prioritizes fairness and equity, positions our employee compensation competitively in the market, recognizes and rewards performance, and takes a comprehensive approach to our rewards package. We anchor our compensation philosophy on a market-based approach, therefore salary ranges may differ depending on the labor cost in a particular location. The salary provided is in addition to robust benefits and participation in Netlify’s equity plan. Our base compensation for this role is targeted at £96,000 - £130,000 for most UK-based locations. Candidates outside the UK or in premium markets should consult with their Talent Acquisition partner regarding location-based ranges, as they may be higher or lower than the average UK range listed. The starting pay will be determined based on multiple factors, including expertise and skills, market demands, experience, internal equity, and applicable geographic location. These compensation packages and ranges are subject to change and may be modified in the future.