Job Title : Sr Hadoop Developer
Location : Pittsburgh, PA / Strongsville, OH - Hybrid, minimum 3 days per week in office (subject to change)
Duration : Long term
Potential for Contract Extension : yes
W2 only
This position is a contract role with the right to hire if a need becomes available. The manager will only consider candidates who are open to converting to full-time employment. The client will not sponsor work visas if the decision is made to hire the contingent worker.
Industry Background : Technical
Team Dynamic : 12 people, including BSAs, a Scrum Master, Jr. and Sr. engineers, and DevOps; a full-stack team
Roles and Responsibilities :
Optimize and maintain large-scale feature engineering jobs using PySpark, Pandas, and PyArrow on Hadoop-based infrastructure (a brief illustrative sketch follows this list).
Refactor and modularize ML codebases to improve reusability, maintainability, and performance.
Collaborate with platform teams to manage compute capacity, resource allocation, and system updates.
Integrate with the existing Model Serving Framework to support testing, deployment, and rollback of ML workflows.
Monitor and troubleshoot production ML pipelines, ensuring high reliability, low latency, and cost efficiency.
Contribute to the internal Model Serving Framework by sharing insights, proposing and implementing improvements, and documenting best practices.
(Nice to Have) Experience implementing near-real-time ML pipelines using Kafka and Spark Streaming for low-latency use cases. Experience with AWS and the SageMaker MLOps ecosystem.
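For illustration only, a minimal sketch of the kind of batch feature-engineering job described above, written in PySpark. The HDFS paths, table layout, and column names are invented for this example and do not reflect the client's actual codebase:

    from pyspark.sql import SparkSession, functions as F

    # Hypothetical feature-engineering job; paths and columns are invented.
    spark = (
        SparkSession.builder
        .appName("feature-engineering-sketch")
        # Arrow-backed conversion speeds up Spark <-> Pandas interchange.
        .config("spark.sql.execution.arrow.pyspark.enabled", "true")
        .getOrCreate()
    )

    # Read raw events from a Hadoop-backed Parquet dataset (assumed to exist).
    events = spark.read.parquet("hdfs:///data/raw/events")

    # Aggregate per-customer features with built-in Spark SQL functions.
    features = (
        events.groupBy("customer_id")
        .agg(
            F.count("*").alias("event_count"),
            F.avg("amount").alias("avg_amount"),
            F.max("event_ts").alias("last_event_ts"),
        )
    )

    # Write the output for downstream ML training jobs.
    features.write.mode("overwrite").parquet("hdfs:///data/features/customer")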
Must Have Technical Skills :
Expert-level proficiency in Python, with strong experience in Pandas, PySpark, and PyArrow.
Expert-level proficiency in the Hadoop ecosystem, distributed computing, and performance tuning.
5+ years of experience in software engineering, data engineering, or MLOps roles.
Experience with CI/CD tools and best practices in ML environments.
Experience with monitoring tools and techniques for ML pipeline health and performance.
Strong collaboration skills, especially in cross-functional environments involving platform and data science teams.
Flex Skills / Nice to Have :
Experience contributing to internal MLOps frameworks or platforms.
Familiarity with SLURM clusters or other distributed job schedulers.
Exposure to Kafka, Spark Streaming, or other real-time data processing tools (see the streaming sketch after this list).
Knowledge of model lifecycle management, including versioning, deployment, and drift detection.
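As a point of reference for the streaming nice-to-have, a minimal Spark Structured Streaming sketch reading from Kafka. The broker address, topic name, and schema are hypothetical, and the spark-sql-kafka connector package is assumed to be available on the cluster:

    from pyspark.sql import SparkSession, functions as F
    from pyspark.sql.types import StructType, StructField, StringType, DoubleType

    # Hypothetical near-real-time pipeline; broker, topic, and schema are invented.
    spark = SparkSession.builder.appName("streaming-sketch").getOrCreate()

    schema = StructType([
        StructField("customer_id", StringType()),
        StructField("amount", DoubleType()),
    ])

    # Consume JSON events from a Kafka topic.
    raw = (
        spark.readStream.format("kafka")
        .option("kafka.bootstrap.servers", "broker:9092")
        .option("subscribe", "events")
        .load()
    )

    # Kafka delivers bytes; cast the value to string and parse the JSON payload.
    parsed = (
        raw.select(F.from_json(F.col("value").cast("string"), schema).alias("e"))
        .select("e.*")
    )

    # Maintain running per-customer totals across micro-batches.
    totals = parsed.groupBy("customer_id").agg(F.sum("amount").alias("total_amount"))

    # Console sink is used here purely for demonstration.
    query = totals.writeStream.outputMode("complete").format("console").start()
    query.awaitTermination()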
Soft Skills : Strong written and verbal communication skills
Education / Certifications : Bachelor's degree minimum
Screening Questions :