Overview
- The Data Engineer will play a crucial role in advancing the CDC Foundation's mission by designing, building, and maintaining data infrastructure for a public health organization. This role is aligned to the Workforce Acceleration Initiative (WAI), a federally funded CDC Foundation program that helps the nation’s public health agencies by providing them with the technology and data experts they need to accelerate their information system improvements.
- Working within the City of El Paso Department of Public Health Public Health Infrastructure Grant (PHIG) program, the Data Engineer will be responsible for rebuilding projects within REDCap to incorporate standardization and validation. Once the rebuild is complete, the existing data will be imported into the newly created project to build a research tool for sharing notifiable condition data with area hospitals and clinics.
- The Data Engineer will also be tasked with developing and delivering training for epidemiology section staff on the proper use and building of REDCap surveys to collect data for analysis, and on how to create projects that incorporate standardization and validation.
- In addition, the Data Engineer will assist the PHIG team in gathering requirements for, planning, and developing a department-wide system to support the tracking of department goals and performance.
- The Data Engineer will be hired by the CDC Foundation and assigned to the City of El Paso Department of Public Health Public Health Infrastructure Grant program. This position is eligible for a fully remote work arrangement for U.S. based candidates.
Responsibilities
- Create and manage the systems and pipelines that enable efficient and reliable flow of data, including ingestion, processing, and storage.
- Collect data from various sources, transforming and cleaning it to ensure accuracy and consistency. Load data into storage systems or data warehouses.
- Optimize data pipelines, infrastructure, and workflows for performance and scalability.
- Monitor data pipelines and systems for performance issues, errors, and anomalies, and implement solutions to address them.
- Implement security measures to protect sensitive information.
- Collaborate with the data modernization coordinator, program managers, and other partners to understand their data needs and requirements and to ensure that the data infrastructure supports the organization's goals and objectives.
- Collaborate with cross-functional teams to understand data requirements and design scalable solutions that meet business needs.
- Implement and maintain ETL (Extract, Transform, Load) processes to ensure the accuracy, completeness, and consistency of data.
- Design and manage REDCap projects and data storage systems, including relational databases, NoSQL databases, and data warehouses.
- Design and manage dashboards for executive leadership, program managers, and city leadership.
- Stay knowledgeable about industry trends, best practices, and emerging technologies in data engineering, and incorporate them into the organization's data infrastructure.
- Provide technical guidance to other staff.
- Communicate effectively with partners at all levels of the organization to gather requirements, provide updates, and present findings.
Qualifications
- Bachelor's degree in computer science, information technology, data science, social science, health, or a related field.
- Minimum of 5 years of relevant professional experience.
- Proficiency in programming languages commonly used in data engineering, such as Python, Java, Scala, or SQL. Candidates should be able to implement data automations within existing frameworks rather than writing one-off scripts.
- Minimum of 2 years of experience with translational research database systems, such as REDCap or similar systems.
- Knowledge of relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra).
- Experience with engineering best practices such as source control, automated testing, continuous integration and deployment, and peer review.
- Knowledge of data warehousing concepts and tools.
- Knowledge of cloud computing platforms.
- Expertise in data modeling, ETL processes, and data integration techniques.
- Strong analytical thinking and problem-solving abilities.
- Excellent verbal and written communication skills, including the ability to convey technical concepts to non-technical partners effectively.
- Flexibility to adapt to evolving project requirements and priorities.
- Outstanding interpersonal and teamwork skills and the ability to develop productive working relationships with colleagues and partners.
- Experience working in a virtual environment with remote partners and teams.
- Proficiency in Microsoft Office.
- Experience in developing and administering training.
Special Notes
This role is involved in a dynamic public health program. As such, roles and responsibilities are subject to change as situations evolve. To best support the public health programming, the roles and responsibilities listed above may be expanded upon or updated to match priorities and needs once written approval is received by the CDC Foundation.