Data Engineer

Resource Informatics Group Inc

Seattle, WA, US

30+ days ago

Job type

Full-time

Job description

Responsibilities:

1. Work with big data. Optimize existing solutions and build high-performance algorithms. Scale algorithms to terabytes of data. Improve the time and space complexity of data pre-processing. Recommend ways to improve data reliability, efficiency, and quality.

2. Process structured and unstructured data, validate data quality, and help design data quality tests in a big data environment.

3. Help develop and support data products. Develop data set processes for data modeling, mining, and production.

4. Work closely with engineers and data scientists. Help the data science team and engineers improve Spark performance. Be involved in the deployment of data products.

5. Create custom software components (e.g., specialized UDFs) and analytics applications using Spark or PySpark.

6. Help create visualization tools for tracking model performance and data quality. Integrate new data management technologies and software engineering tools into existing structures.

7. Collaborate with data architects, engineers, data scientists, and business team members on project goals.

Minimum qualifications:

BS degree in a quantitative field such as statistics, operations research, computer science, mathematics, physics, electrical engineering, or industrial engineering.
2 years of relevant work experience in big data analysis or a related field (data engineer / developer).
Expert in Spark, Python or R, PySpark or SparkR, Scala, and SQL / Hive.
Familiar with Spark MLlib and Spark SQL.
Accomplished in Hadoop-based data mining frameworks.

Preferred qualifications:

MS or PhD degree in a quantitative field.

2-3 years of relevant work experience, including deep expertise in Spark.
