Job Description: This role will be focused on coordinating responses to the Department's Data Inventory tasker. The tasker includes 1) Data Steward Identification, 2) Responding to a sensitive data categorization survey, and 3) Providing descriptive information about every Bureau's data asset. The individual will be responsible for analyzing the input and metadata provided by bureaus, and potentially building some sort of non-complex bureau compliance scorecard. This role will sit in the Enterprise Data Management group.
Required Skills: - Bachelor's degree
- 3 years of experience
- Experience building ETL pipelines using Azure Data Factory and Azure Databricks (2+ years of project experience with ADF & Databricks)
- Experience working with Cloud Infrastructure (Azure)
- Experience with web scraping and APIs
- Basic understanding of Cloud Networking
- Fluency in SQL and NoSQL database technologies such as SQL Server, Oracle SQL
- Fluency in programming languages such as Python and R; and query languages such as SQL
- Experience implementing data solutions that use the following: Azure Cosmos DB, Azure SQL Database, Azure Data Lake Storage, Azure Data Factory, Azure Databricks, Elastic
- Proficiency in data extraction, transformation, wrangling, and ETL to support advanced analytics
Desired Skills: - Familiarity with Department of State
- Familiarity with analytics techniques, such as statistics, simulation modeling, optimization, machine learning, or natural language processing
- Make requested changes, updates, and modifications to the database structure and data
- Experience with data-related implementation tasks that include provisioning data storage services, ingesting streaming and batch data, transforming data, implementing security requirements, implementing data retention policies, identifying performance bottlenecks, and accessing external data sources (cloud, Azure preferred).
- Experience working in Agile
- Experience testing and deploying new infrastructure (e.g., Git repo) components into a cloud tech stack