Talent.com
Principal Data Engineer (PySpark) (Remote - US)
Principal Data Engineer (PySpark) (Remote - US)Jobgether • US
Principal Data Engineer (PySpark) (Remote - US)

Principal Data Engineer (PySpark) (Remote - US)

Jobgether • US
3 days ago
Job type
  • Full-time
  • Remote
  • Quick Apply
Job description

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Principal Data Engineer (PySpark) in the United States .

We are seeking a highly skilled Principal Data Engineer to lead the design, development, and optimization of scalable data systems that power advanced analytics and AI initiatives. In this hands-on role, you will architect both batch and real-time data pipelines, collaborate closely with product and AI teams, and influence the overall data strategy. You will mentor other engineers, enforce best practices, and ensure high-quality, reliable, and accessible data for the organization. The ideal candidate thrives in a fast-paced, high-impact environment, enjoys solving complex problems, and is passionate about building robust, efficient, and scalable data infrastructure.

Accountabilities :

  • Design, implement, and evolve distributed, cloud-based data infrastructure for batch and real-time workloads.
  • Build and maintain scalable data pipelines supporting analytics and AI / ML applications.
  • Integrate with third-party e-commerce platforms to expand and enrich the data ecosystem.
  • Ensure data reliability, availability, and quality through automated monitoring and auditing.
  • Collaborate with engineering, AI, and product teams to deliver data solutions that meet business-critical needs.
  • Mentor and support data engineers, promoting coding standards, best practices, and professional growth.
  • Drive innovation by identifying opportunities for improved data processing and architecture.

Requirements

  • 10+ years of experience in software development and data engineering with ownership of production-grade systems.
  • Deep expertise in PySpark and scaling Spark in production environments (Databricks, EMR, etc.).
  • Strong knowledge of distributed computing and modern data modeling for scalable systems.
  • Proficiency in Python and implementation of software engineering best practices.
  • Hands-on experience with both relational (PostgreSQL, MySQL) and NoSQL (MongoDB, DynamoDB, Cassandra) databases.
  • Excellent problem-solving skills and ability to communicate effectively across teams.
  • Bachelor’s degree in Computer Science or related field, or equivalent practical experience.
  • Experience mentoring and influencing cross-functional teams.
  • Familiarity with MLOps pipelines and integrating ML models into data workflows is a plus.
  • Previous experience in early-stage, high-growth environments is advantageous.
  • Benefits

  • 💻 Competitive salary : $235,000–$285,000 depending on experience and skills.
  • 🌎 Flexible remote work within the United States.
  • 🏖 Generous paid time off and holidays.
  • 🏥 Comprehensive medical, dental, and vision coverage options.
  • 💼 401(k) plan and employer match.
  • 💡 Professional development support, coaching, and mentorship opportunities.
  • 🧩 Employee assistance programs for work-life balance and wellness.
  • Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.

    When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly :

  • 🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.
  • 📊 It compares your profile to the job’s core requirements and past success factors to determine your match score.
  • 🎯 Based on this analysis, we automatically shortlist the 3 candidates with the highest match to the role.
  • 🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.
  • The process is transparent, skills-based, and free of bias—focusing solely on your fit for the role. Once the shortlist is completed, it is shared directly with the company. The final decision and next steps, such as interviews or additional assessments, are then made by their internal hiring team.

    Thank you for your interest!

    #LI-CL1

    Create a job alert for this search

    Principal Data Engineer • US