Description
Job Summary
Our client has an immediate opening for a Data Scientist with a strong background in data engineering and a solid understanding of data science principles. The ideal candidate will play a critical role in designing, developing, and maintaining our data infrastructure, while also contributing expertise to enable advanced analytics and machine learning initiatives.
This position is based in Redmond, WA, and we are able to hire remote candidates in the following states: CA, CO, FL, GA, MO, NY, OR, SC, TN, TX, WA.
Job Responsibilities
Data Pipeline Development
Design, implement, and maintain scalable data pipelines to collect, process, and store data from various sources.
Ensure data quality, accuracy, and consistency throughout the pipeline.
Data Modeling
Design and implement data models for predictive analytics, machine learning, and data exploration.
Optimize data structures and storage to support efficient querying and analysis.
Data Integration
Work closely with cross-functional teams to integrate data from diverse sources, including databases, APIs, and external data providers.
Develop and maintain ETL processes to transform and enrich raw data into actionable insights.
Performance Tuning
Monitor and optimize the performance of data pipelines and databases to meet business requirements.
Identify and resolve bottlenecks and performance issues.
Continuous Learning and Mentoring
Stay up to date with the latest advancements in data engineering and data science technologies.
Share knowledge with team members.
Requirements
Must Have
3+ years of experience in SQL Query Design, SQL Performance Tuning and Query Optimization
3+ years of relevant experience in Data Warehouse Design, Data Warehouse Technical Architectures, Development and Implementation
3+ years of relevant experience in ETL Development, ETL Implementation, Unit Testing, Troubleshooting and Support of ETL Processes
3+ years of relevant experience with the application of Data Science principles and data modeling
Knowledge and Skills
Proficiency in SQL Query Design and Implementation
Strong Experience with Relational Data Warehouse Systems
Data Warehouse Management Systems Optimization by Indexing, Partitioning and Denormalization
Strong Ability to build and optimize data sets, ‘big data’ data pipelines and architecture
Knowledge of data science concepts, machine learning algorithms, and statistical analysis
Programming skills: Python required (bonus for Java or C#)
Strong analytical and problem-solving skills
BONUS: Experience with Pandas, scikit-learn and Multi-agent systems (MAS)
BONUS: Experience working at scale in a production environment with Personally Identifiable Information (PII) data
Benefits:
Compensation: $140,000-160,000 base pay
Paid Vacation Time and Paid Holidays
Medical/Vision/Dental Insurance, Voluntary Life & AD&D Insurance, Short-Term & Long-Term Disability, Critical Illness & Accident Insurance
401(k) with employer matching
Hybrid/remote with flexible work schedule