The Developer Productivity team provides internal tools and support that make life easier for every engineer in the company. This includes CI/CD, end-to-end (e2e) test automation, and our internal command line tool for interacting with ClickHouse Cloud. Our primary focus right now is rollout automation: upgrading thousands of ClickHouse clusters in a controlled and safe way is no easy feat. We leverage the excellent Temporal.io framework and extend our existing system to other rollout-related and general maintenance problems. Our team also maintains our ClickHouse operator, a Kubernetes operator built specifically for our Serverless ClickHouse Cloud requirements. This mix of customer-facing and internal projects makes working in the team dynamic and challenging. We are seeking highly skilled and experienced Go software engineers and site reliability engineers to join our team.
What will you do?
- Work deep inside Kubernetes to design and develop new features and improvements for the clickhouse-operator.
- Architect and build robust and scalable automation for rollouts and end-to-end tests.
- Improve and extend our internal command line tool.
- Work closely with other cloud and core database teams to identify areas of improvement. We are our own product managers for internal tooling and developer productivity.
- Educate and support other teams using our internal tools.
- Be part of the on-call rotation and investigate issues in our ClickHouse Cloud production environment.
About you
- You have 5+ years of relevant software development industry experience building and operating scalable, fault-tolerant, distributed systems.
- You have software development experience in Go (or experience with another programming language and a willingness to learn Go).
- You are passionate about developer experience and automation.
- You have experience with cloud technologies such as AWS, Azure, or GCP.
- You have experience with Kubernetes, and specifically with building a Kubernetes operator.
- You have experience leading and shipping large-scope technical projects in collaboration with multiple experienced engineers.
- You have excellent communication skills and the ability to work well within a team and across engineering teams.
- You are a strong problem solver and have solid production debugging skills.
- You thrive in a fast-paced environment and see yourself as a partner to the business, with the shared goal of moving the business forward.
- You have a high level of responsibility, ownership, and accountability.