Talent.com
Principal Data Processing Engineer

Principal Data Processing Engineer

DatapelagoMountain View, CA, United States
5 hours ago
Job type
  • Full-time
Job description

Principal Data Processing Engineer

Mountain View, CA

About DataPelago :

DataPelago is at the forefront of revolutionizing data processing for traditional analytics and cutting-edge GenAI preprocessing. We are building an innovative data processing engine that is transforming how Apache Spark, Apache Flink, Ray and others operate on diverse, large-scale data. Our team of engineers drive and adopt advances in hardware-accelerated computing, parallel processing of large-scale data, query optimization, distributed systems, compilers, machine learning, and cloud-native computing. We are looking for specialists to join our engineering team and shape the future of accelerated data processing.

The Opportunity :

As a Principal Data Processing Engineer, you will be a key technical leader of core execution engine components of our data processing engine. You will lead architecture, design, and implementation that will enhance functional breadth, performance, scale, and reliability of the engine to deliver a product that will redefine how users extract intelligence from their data. This is a unique opportunity to make a significant impact on a category-defining product and work with a talented team of engineers.

What You'll Do :

  • Architectural Leadership : Drive the evolution of our parallel and distributed execution en-

gine architecture, with a strong focus on leveraging accelerated computing technologies.

  • End to End Ownership : Lead the execution engine team in the complete lifecycle of design,
  • implementation, and rollout of an enterprise-grade product.

  • Core Development : Individually design, implement, test, and maintain critical components
  • of the data processing execution engine.

  • Innovation and Differentiation : Analyze technology advances from industry and academia to
  • identify opportunities for the engine to enhance technology and product leadership.

  • Collaboration and Mentorship : Partner effectively with engineering, product management,
  • and customer success teams. Guide and mentor engineers on the execution engine team.

  • Continuous Improvement : Foster best practices in design and code reviews, testing, CI / CD,
  • and issue resolution to maintain highest product quality, security, efficiency, & productivity.

    What You'll Bring :

  • Bachelor's degree in Computer Science, or a related field with 15+ years of relevant experience OR a Master's degree in Computer Science or a related field with 10+ years of relevant experience.
  • 10+ years of deep technical experience in developing core components of enterprise-grade
  • database or analytics execution engines designed for large-scale data processing.

  • Proven expertise in developing high-performance parallel implementations of data processing operators and functions on rich data types.
  • Significant experience developing for and understanding the internals of platforms such as
  • Apache Spark, Apache Flink, Apache Doris, Apache Gluten, Velox, Apache DataFusion, or

    Apache DataFusion Comet is highly preferred.

  • Demonstrated experience leading teams of 10+ engineers in the design, development, and
  • successful release of high-performance data processing engines for large production de-

    ployments.

  • Exceptional programming skills in C, C++, and Rust.
  • Extensive development experience in Linux environments.
  • Strong analytical and problem-solving skills with a passion for performance optimization.
  • Excellent communication and collaboration skills, with the ability to articulate complex
  • technical concepts to both technical and non-technical audiences.

    Location Considerations :

    We value face-to-face collaboration, but recognize that talent can be found anywhere. Our engineering team works at our headquarters in Mountain View, CA, at our India office in Hyderabad, and atremote locations. This specific position will be based at our headquarters in Mountain View, CA.

    Why Join DataPelago?

  • Technical Leadership : Take a leadership role in shaping the architecture and development of
  • our core parallel analytics engine.

  • Cutting-Edge Innovation : Work on challenging problems at the forefront of accelerated
  • computing and data processing.

  • Significant Impact : Your contributions will directly impact the performance and scalability
  • of our mission-critical platform.

  • Mentorship and Growth : Mentor and guide other talented engineers while expanding your
  • own technical expertise.

  • Competitive compensation, stock options, comprehensive benefits package, leadership de-
  • velopment opportunities.

    Create a job alert for this search

    Principal Data Engineer • Mountain View, CA, United States

    Related jobs
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    ShopmonkeyMorgan Hill, California, United States
    Full-time
    We are seeking a highly skilled and motivated Senior Data Engineer to join our team at Shopmonkey.This role is critical to building, maintaining and improving our data infrastructure, ensuring that...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Data Engineer

    Principal Data Engineer

    TendSan Francisco, CA, United States
    Full-time
    As a Principal Data Engineer, you will work within the Engineering team and contribute to Tendo’s strategic data engineering solutions by ingesting, transforming, and warehousing healthcare-related...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Data Engineer

    Principal Data Engineer

    Tendo SystemsSan Francisco, CA, United States
    Full-time
    As a Principal Data Engineer, you will work within the Engineering team and contribute to Tendo’s strategic data engineering solutions by ingesting, transforming, and warehousing healthcare-related...Show moreLast updated: 4 days ago
    • Promoted
    • New!
    Data Engineer -(GCP,Airflow,Pyspark) Hybrid-Sunnyvale,CA

    Data Engineer -(GCP,Airflow,Pyspark) Hybrid-Sunnyvale,CA

    Kaav Inc.Sunnyvale, CA, United States
    Full-time
    Bachelor or master's degree in computer science, Software Engineering, or a related field.Proven experience (7+ years) as a data Engineer, preferably with a focus on software development, distribut...Show moreLast updated: 7 hours ago
    • Promoted
    Perception Data Engineer

    Perception Data Engineer

    ZiplineSouth San Francisco, California, United States
    Full-time
    Do you want to change the world? Zipline is on a mission to transform the way goods move.Our aim is to solve the world’s most urgent and complex access challenges by building, manufacturing and ope...Show moreLast updated: 30+ days ago
    • Promoted
    Principal, Data Engineer

    Principal, Data Engineer

    Gemini Trust CompanySan Francisco, CA, United States
    Full-time
    Gemini is a global crypto and Web3 platform founded by Cameron and Tyler Winklevoss in 2014, offering a wide range of simple, reliable, and secure crypto products and services to individuals and in...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Data Engineer

    Principal Data Engineer

    TendoSan Francisco, California, United States
    Full-time
    As a Principal Data Engineer, you will work within the Engineering team and contribute to Tendo’s strategic data engineering solutions by ingesting, transforming, and warehousing healthcare-related...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer - Open on W2 only

    Data Engineer - Open on W2 only

    DataflixSan Jose, CA, United States
    Remote
    Full-time
    We are looking for a Data Engineer to build out and scale our Analytics platform.As a member of the team, you will be responsible for building and scaling a robust platform that will act as the dri...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Data Engineer

    Principal Data Engineer

    SanasPalo Alto, CA, United States
    Full-time
    Founded by a team of Stanford researchers and entrepreneurs with deep industry experience, Sanas has developed the world’s first real-time speech transformation platform capable of accent translati...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Data Engineer

    Principal Data Engineer

    UberSan Francisco, CA, United States
    Full-time
    This is a Technical Data Leader position.The Data Engineering team focuses on building core Business Intelligence and Data Solutions for multiple business verticals at Uber, like Uber Eats, Grocery...Show moreLast updated: 18 days ago
    • Promoted
    Platform Data Engineer

    Platform Data Engineer

    OhaloSan Francisco, California, United States
    Full-time
    Ohalo is seeking an experienced.This role involves building and maintaining data pipelines to support our machine learning engineering activities and managing data related to plant phenotypes and g...Show moreLast updated: 30+ days ago
    • Promoted
    Principal AI Engineer

    Principal AI Engineer

    TENEX.AISan Jose, California, United States
    Full-time
    TENEX is an AI-native, automation-first, built-for-scale Managed Detection and Response (MDR) provider.We are a force multiplier for defenders, helping organizations enhance their cybersecurity pos...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Engineer - AI / ML

    Principal Engineer - AI / ML

    QuizletSan Francisco, California, United States
    Full-time
    At Quizlet, our mission is to help every learner achieve their outcomes in the most effective and delightful way.We’re a $1B+ learning platform used by two-thirds of U. We blend cognitive science wi...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Principal Data Engineer

    Principal Data Engineer

    TENDOSan Francisco, CA, United States
    Full-time
    As a Principal Data Engineer, you will work within the Engineering team and contribute to Tendo's strategic data engineering solutions by ingesting, transforming, and warehousing healthcare-related...Show moreLast updated: 7 hours ago
    • Promoted
    • New!
    Principal Data Engineer for AI Platform

    Principal Data Engineer for AI Platform

    MasterCardSan Francisco, CA, United States
    Full-time +1
    Mastercard powers economies and empowers people in 200+ countries and territories worldwide.Together with our customers, we’re helping build a sustainable economy where everyone can prosper.We supp...Show moreLast updated: 7 hours ago
    • Promoted
    Big Data Engineer

    Big Data Engineer

    UberSan Francisco, California, United States
    Full-time
    Work with tech and product teams to develop new tools and systems to support the growth of the business.Create and optimize our data pipeline architecture. Build data access platform for our data sc...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Principal Data Engineer

    Principal Data Engineer

    Wilson Sonsini Goodrich and RosatiPalo Alto, CA, United States
    Full-time
    Wilson Sonsini is the premier legal advisor to technology, life sciences, and other growth enterprises worldwide.We represent companies at every stage of development, from entrepreneurial start-ups...Show moreLast updated: 7 hours ago
    • Promoted
    Principal Data Engineer I - QuantumBlack

    Principal Data Engineer I - QuantumBlack

    McKinsey & CompanyPalo Alto, CA, United States
    Full-time
    As a Principal Data Engineer I, you will lead technical teams, architect data platforms, and build scalable pipelines while mentoring colleagues and collaborating with clients at all levels.You’ll ...Show moreLast updated: 7 days ago