Description
The Opportunity
We are searching for a talented and motivated Senior Data Engineer to join our growing team. Reporting to our Director of Engineering, you will be instrumental in designing, developing, and maintaining our data infrastructure and pipelines. Your role will ensure the efficient and reliable flow of data throughout our organization. You will collaborate closely with our software development, analytics, and product teams to drive success.
Key Responsibilities
- Data Pipeline Development: Design, implement, and manage robust and scalable data pipelines to ingest, process, and transform data from various sources.
- Data Modeling: Develop and maintain data models to support business intelligence, reporting, and analytics needs.
- Data Warehouse Management: Design and implement data warehousing solutions to store and organize large volumes of data efficiently.
- ETL Development: Develop and optimize ETL (Extract, Transform, Load) processes to ensure data accuracy and integrity.
- Data Quality Assurance: Implement data quality checks and monitoring processes to maintain data integrity and reliability.
- Performance Optimization: Continuously monitor and optimize data pipelines and queries for performance and scalability.
- Collaboration: Work closely with data analysts and other stakeholders to understand their data needs and provide solutions.
- Documentation: Create and maintain clear and comprehensive documentation of data architecture, processes, and data dictionaries.
What You Bring
- Educational Background: Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.
- Experience: 5+ years of experience in data engineering or related areas. You are an end-to-end data engineer whose experience goes beyond creating ETL pipelines.
- Technical Proficiency:
- Expertise in SQL and data manipulation languages.
- Proficiency in data pipeline tools (Airflow, AWS Glue, Spark/PySpark, Pandas).
- Strong programming skills in Python.
- Experience with data storage technologies like warehouses (Snowflake, Redshift) and data lakes (Databricks, Glue Catalog/S3).
- Soft Skills:
- Exceptional problem-solving skills.
- Excellent communication and collaboration abilities.
- Ability to thrive in a fast-paced, agile environment.
- Ownership mindset.
Bonus Points
- Deep familiarity with AWS
- Familiarity with IAC (Pulumi)
- In-depth knowledge of healthcare compliance and regulations.
- Familiarity with data visualization and reporting tools (e.g., Sigma).
- Proven understanding of data privacy and security best practices.
- Contributions to open-source projects.
- Familiarity with machine learning and data analytics tools.
- Passion for improving patient health through technology.
- Experience working in startup environments.
What You’ll Get (Benefits & Perks)
- Competitive compensation
- Medical, dental, vision, and life insurance
- Flexible PTO, company-paid holidays, parental leave
- 401K, wellness and wifi perks
- Flexible, remote-first work culture