Data Engineer - Remote About us Creative Information Technology Inc (CITI) is an esteemed IT enterprise renowned for its exceptional customer service and innovation. We serve both government and commercial sectors, offering a range of solutions such as Healthcare IT, Human Services, Identity Credentialing, Cloud Computing, and Big Data Analytics. With clients in the US and abroad, we hold key contract vehicles including GSA IT Schedule 70, NIH CIO-SP3, GSA Alliant, and DHS-Eagle II. Join us in driving growth and seizing new business opportunities. Role and Responsibilities Assess feasibility and technical requirements for LINKS → DataLake integration. Collaborate with OPH Immunization Program, OPH Bureau of Health Informatics and STChealth on data specifications and recurring ingestion pipelines. Build and optimize ETL workflows for LINKS and complementary datasets (Vital Records, labs, registries). Design scalable data workflows to improve data quality, integrity, and identity resolution. Implement data governance, observability, and lineage tracking across all pipelines. Mentor engineers, support testing, and enforce best practices in orchestration and architecture. Document and communicate technical solutions to technical and non-technical stakeholders. Minimum Qualification Assess feasibility and technical requirements for LINKS → DataLake integration. Collaborate with OPH Immunization Program, OPH Bureau of Health Informatics and STChealth on data specifications and recurring ingestion pipelines. Build and optimize ETL workflows for LINKS and complementary datasets (Vital Records, labs, registries). Design scalable data workflows to improve data quality, integrity, and identity resolution. Implement data governance, observability, and lineage tracking across all pipelines. Mentor engineers, support testing, and enforce best practices in orchestration and architecture. Document and communicate technical solutions to technical and non-technical stakeholders. Preferred Qualification 5+ years of experience in data engineering roles Experience integrating or developing REST / JSON or XML APIs Familiarity with CI / CD pipelines (GitHub Actions, Azure DevOps, etc.). Exposure to Infrastructure as Code experience (Terraform, CloudFormation). Experience with data governance and metadata tools (Atlan, OpenMetadata, Collibra). Public health / healthcare dataset or similar experience, including PHI / PII handling. Familiarity with SAS and R workflows to support epidemiologists and analysts. Experience with additional SQL platforms (Postgres, Snowflake, Redshift, BigQuery). Familiarity with data quality frameworks (Great Expectations, Deequ). Experience with real-time / streaming tools (Kafka, Spark Streaming). Familiarity with big data frameworks for large-scale transformations (Spark, Hadoop). Knowledge of data security and compliance frameworks (HIPAA, SOC 2, etc.). Agile / SCRUM team experience.
Data Engineer Remote • VA, VA, US