Senior Research Data Engineer
Climate LLC’s St. Louis, MO office seeks a Senior Research Data Engineer. This is a hybrid work-from-home position. The engineer will work closely with team members to develop and implement research data pipelines for producing data layers and storing research data.
Will also:
- (i) implement and maintain scalable, data-intensive processing pipelines that apply geospatial data to ML / DL models;
- (ii) assist in the automation and processing of internal geospatial and environmental products and datasets using large amounts of geospatially referenced data;
- (iii) utilize Python and its libraries, such as SciPy, NumPy, Pandas, GeoPandas, utm, pingouin, and Xarray, and work with geospatial / imagery formats such as GeoTIFF, GeoJSON, Avro, and Parquet;
- (iv) access data over the AWS cloud and process it using distributed computing environments such as PySpark and Scala; and
- (v) develop POCs when necessary for new pipelines to be integrated into the science data pipeline in collaboration with diverse research partners.
This is a hybrid office / work-from-home position that requires the employee to be able to work in our St. Louis office three (3) days per week.
Must have a Master’s degree (or foreign equivalent degree) in Information Science, Computer Science, Data Science, Data Analytics, or a directly related field plus three (3) years of experience in a related position.
Experience must include three (3) years with:
- (i) designing, developing, and testing geospatial pipelines that encompass statistical yield analysis and report generation, with proven experience in the design and implementation of scalable data integration and transformation for geospatial applications and interactive visualization dashboards;
- (ii) working with large raster and vector geospatial datasets applied to machine learning model generation and deployment in a big data environment;
- (iii) designing and developing packages and procedures / methods using Python and SQL / PL/SQL in a distributed environment;
- (iv) working with RDBMS, SQL, and NoSQL databases to perform complex transformations using tools such as Oracle, Postgres, and MSSQL, with solid knowledge of SQL, query optimization, and OLAP;
- (v) using geospatial and remote sensing tools such as QGIS, ArcGIS, and PostGIS, and pre-processing workflows such as yield analysis, report generation, and field prescription creation, applying GDAL, proj4, CRS, and projections for geospatial experimental design;
- (vi) working in a matrix / hybrid organization structure with cross-functional leadership skills, as well as troubleshooting and making quick and effective decisions;
- (vii) Big Data technologies including Spark and Arrow, with fluency in file formats such as Avro, Parquet, CSV, GeoTIFF, and GeoJSON;
- (viii) Amazon Web Services S3, EMR, ECS, and RDS, and orchestration platforms such as AWS Step Functions and AWS EventBridge;
- (ix) code versioning and dependency management systems such as GitHub, Git, and GitLab with CI / CD pipelines, SharePoint, and / or other repositories used by the team; and
- (x) waterfall or Agile methodologies in Atlassian suite or JIRA.
Experience can be concurrent.
Apply at , #3534 or Careers climate [email protected] with reference to #3534 (Senior Research Data Engineer).