Description
We are seeking a highly skilled and experienced Data Engineer to join our team. The successful candidate will be responsible for building, maintaining, and optimizing our ETL pipelines. You will play a key role in ensuring our data architecture supports our rapid growth and enables us to extract meaningful insights from complex data sets.
Requirements
- Design, build, and maintain scalable and reliable ETL pipelines to support data integration from various sources.
- Develop and manage databases using BigQuery, MySQL & Pinecone, ensuring data integrity, security, and performance.
- Collaborate with cross-functional teams, to gather requirements and deliver data solutions that support business initiatives.
- Implement data warehousing solutions and data modeling practices to support advanced analytical and reporting capabilities.
- Optimize data flow and collection to improve data accuracy and value.
- Ensure compliance with data governance and data security requirements.
- Monitor and troubleshoot performance issues on the data pipelines and databases.
- Stay up-to-date with industry standards and advancements in data engineering practices and technologies.
Qualifications:
- Bachelorโs or Masterโs degree in Computer Science, Engineering, or a related field.
- Minimum of 3 years of experience in a Data Engineering role.
- Proficient in SQL and experience with database management systems, particularly BigQuery and MySQL.
- Demonstrable experience with Shopifyโs REST & GraphQL APIs.
- Experience with data pipeline and workflow management tools.
- Strong understanding of ETL techniques and best practices.
- Proficient in one or more programming languages (Python preferred)
- Experience with cloud services (e.g., AWS, Google Cloud Platform) and understanding of cloud-based ETL services.
- Excellent problem-solving skills and attention to detail.
- Strong communication and collaboration abilities to work with team members and stakeholders.
Nice to Have:
- Experience with data visualization tools and dashboard development.
- Experience with vector databases
- Knowledge of machine learning and statistical modeling is a plus.