Responsibilities :
- Develop, implement, and manage data quality processes to ensure accuracy, completeness, and reliability of data from machine learning models, data science processes, and external sources.
- Create and maintain data validation checks and automated quality monitoring systems.
- Collaborate closely with data scientists, data engineers, and product teams to understand and proactively address data quality challenges.
- Perform root cause analysis and resolve complex data issues, ensuring continuous improvement of data quality processes.
- Establish quality metrics and KPIs to track and report data quality status and progress to stakeholders.
- Design and document standards and best practices for data quality management.
- Mentor junior team members, providing technical leadership and guidance on data quality strategies and tools.
Qualifications :
Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or related technical discipline.5+ years of experience in data engineering, data quality assurance, or related roles.Strong proficiency in SQL and experience with data querying, profiling, and validation.Experience with automated testing frameworks and quality assurance methodologies.Familiarity with big data platforms and tools (e.g., Databricks, Snowflake, Hadoop, Spark).Experience in cloud environments (Azure, AWS, GCP).Excellent analytical and problem-solving skills with meticulous attention to detail.Strong communication and collaboration skills, able to effectively engage with technical and non-technical stakeholders.Preferred Skills :
Experience with Machine Learning operations (MLOps) and model validation techniques.Knowledge of Python or similar scripting languages for automation and quality testing.Familiarity with CI / CD processes and tools for data pipelines.Experience working specifically with Databricks is highly preferred.Role Description :
We are seeking a highly skilled Senior Data Quality Engineer to join our team and ensure the integrity and accuracy of data flowing through our advanced analytics, machine learning pipelines, and data-driven products. The ideal candidate is detail-oriented, technically proficient, and passionate about data excellence and quality assurance.
#J-18808-Ljbffr