Talent.com
Principal Data Engineer

Principal Data Engineer

StartXPalo Alto, CA, United States
21 days ago
Job type
  • Full-time
Job description

Overview

Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity. Pioneered by seasoned startup founders with a proven track record of creating and steering multiple unicorn companies, our groundbreaking GDP-shifting technology sets a gold standard.

Sanas is a 200-strong team, established in 2020. In this short span, we’ve successfully secured over $100 million in funding. Our innovation has been supported by the industry’s leading investors, including Insight Partners, Google Ventures, Quadrille Capital, General Catalyst, Quiet Capital, and other influential investors. Our reputation is further solidified by collaborations with numerous Fortune 100 companies. With Sanas, you’re not just adopting a product; you’re investing in the future of communication.

Role

We’re looking for an experienced and forward-thinking Principal Data Engineer to lead the design and implementation of our end-to-end data infrastructure for industry-leading Voice AI products. This is a high-impact role where you will shape the technical vision, own strategic architecture decisions, and mentor a growing team of Data engineers focused on delivering reliable and scalable data systems for Machine Learning at scale.

You’ll work cross-functionally with AI research scientists, infrastructure and product teams to ensure that data—from raw audio to training-ready features—is consistently accessible, compliant and optimized for speed and scale. You’ll help push the boundaries of real-time Voice AI!

Key Responsibilities

  • Architect and lead the development of large-scale data pipelines and data lakes to ingest, transform and serve high-quality data for AI model training, product telemetry and analytics.
  • Drive long-term data infrastructure strategy across streaming and batch, feature store extensions, Iceberg / Delta lake choices, metadata management, and lakehouse evolution.
  • Drive platform and infrastructure decisions, optimizing compute fleets (e.g., Ray, Spark clusters), orchestration tooling (Airflow, Dagster), and streaming stacks (Kafka, Flink).
  • Collaborate with AI research scientists, engineering leads, product, finance, marketing, and legal to align data architecture with business and regulatory requirements.
  • Advocate best practices in data governance, lineage, observability, testing, tooling, and disaster recovery across pipelines and data stores.
  • Act as a mentor and technical leader—review design and code, share patterns, elevate team capability, and support recruitment and hiring.
  • Drive build vs buy decisions for tools to implement data quality and observability solutions to achieve high data quality.

Qualifications

  • 10+ years of experience in Data Engineering, Infrastructure, or ML Systems, with at least 2+ years in a technical leadership capacity.
  • Expertise in building distributed batch and real-time data systems.
  • Expertise in databases (like Postgres) and data lakes (like Snowflake, Databricks and ClickHouse).
  • Experience using data processing frameworks like Spark, Flink and Ray.
  • Deep experience with cloud platforms AWS / GCP, object storage (e.g., S3), and orchestration tools like Airflow and Dagster.
  • Strong knowledge of data lifecycle management, including privacy, security, compliance and reproducibility.
  • Comfortable working in a fast-paced startup environment.
  • Strategic mindset and proven ability to collaborate across engineering, ML and product teams to deliver infrastructure that scales with the business.
  • Nice to Have

  • Familiarity with audio data and its unique challenges, like large file sizes and time-series features; metadata handling is a strong plus.
  • Experience with Voice AI models like ASR, TTS and speaker verification.
  • Familiarity with real-time data processing frameworks like Kafka, Flink, Druid and Pinot.
  • Familiarity with ML workflows including MLOps, feature engineering, model training and inference.
  • Experience with labeling tools, audio annotation platforms, or human-in-the-loop annotation pipelines.
  • Joining us means contributing to the world’s first real-time speech understanding platform revolutionizing Contact Centers and Enterprises alike.

    Our technology empowers agents, transforms customer experiences, and drives measurable growth. But this is just the beginning. You'll be part of a team exploring the vast potential of an increasingly sonic future.

    #J-18808-Ljbffr

    Create a job alert for this search

    Principal Data Engineer • Palo Alto, CA, United States

    Related jobs
    • Promoted
    Principal GTM Engineer

    Principal GTM Engineer

    VirtualVocationsOakland, California, United States
    Full-time
    A company is looking for a Principal GTM Engineer to architect and execute the go-to-market engine for their AI product suite. Key Responsibilities Design and implement multi-touch campaigns to ac...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Stefanini GroupSan Francisco, CA, United States
    Full-time
    Stefanini is looking for a Data Engineer for various location across USA (Hybrid).For quick apply, please connect with Prakhar Goel : 248 263 5255 / Prakhar. As a Data Engineer, this CW will be respon...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    FigmaSan Francisco, CA, United States
    Full-time
    Figma is growing our team of passionate creatives and builders on a mission to make design accessible to all.Figma's platform helps teams bring ideas to life-whether you're brainstorming, creating ...Show moreLast updated: 3 days ago
    • Promoted
    Principal Data Engineer

    Principal Data Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Principal Platform Data Engineer (Databricks Platform).Key Responsibilities Collaborate with stakeholders to translate business requirements into technical specificatio...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Mercor IncSan Francisco, CA, United States
    Full-time
    Mercor is training models that predict how well someone will perform on a job better than a human can.We use our platform to source, vet, and onboard expert contractors who help train AI models in ...Show moreLast updated: 3 days ago
    • Promoted
    Data Engineer II

    Data Engineer II

    VirtualVocationsHayward, California, United States
    Full-time
    A company is looking for a Data Engineer II.Key Responsibilities Produce high-quality data models and maintain data integrity for analytics products Develop scalable ELT pipelines and business i...Show moreLast updated: 30+ days ago
    • Promoted
    Principal, Data Engineer

    Principal, Data Engineer

    Gemini Trust CompanySan Francisco, CA, United States
    Full-time
    Gemini is a global crypto and Web3 platform founded by Cameron and Tyler Winklevoss in 2014, offering a wide range of simple, reliable, and secure crypto products and services to individuals and in...Show moreLast updated: 2 days ago
    • Promoted
    Data Engineer - Data Platform

    Data Engineer - Data Platform

    Tik TokSan Jose, CA, United States
    Full-time
    As a data engineer in the data platform team, you will have the opportunity to build, optimize and grow one of the largest data platforms in the world. You'll have the opportunity to gain hands-on e...Show moreLast updated: 3 days ago
    • Promoted
    Lead Data Engineer

    Lead Data Engineer

    Inizio PartnersFremont, CA, United States
    Full-time
    About the job Lead Data Engineer.Location : Fremont, CA(3 days onsite, 2 days work from home).Candidate should Provide technical expertise in needs identification, data modelling, data movement, tra...Show moreLast updated: 30+ days ago
    • Promoted
    Data Platform Engineer

    Data Platform Engineer

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Data Platform Engineer, Data Capture.Key Responsibilities Expand developer tools for capturing business events and operational data into the Lakehouse Enhance self-ser...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Data Architect

    Principal Data Architect

    VirtualVocationsFremont, California, United States
    Full-time
    A company is looking for a Principal Data Architect responsible for shaping enterprise data strategy, architecture, and governance. Key Responsibilities Define and maintain enterprise data archite...Show moreLast updated: 29 days ago
    • Promoted
    Principal Data Engineer

    Principal Data Engineer

    SanasPalo Alto, CA, United States
    Full-time
    Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm, designed to modulate accents, eliminate background noises, and magnify speech clarity.Our GDP-shifting te...Show moreLast updated: 23 days ago
    • Promoted
    Principal Sales Engineer

    Principal Sales Engineer

    IntercomSan Francisco, CA, United States
    Full-time
    Intercom is the AI Customer Service company on a mission to help businesses provide incredible customer experiences.Our AI agent Fin, the most advanced customer service AI agent on the market, lets...Show moreLast updated: 3 days ago
    • Promoted
    Principal Data Architect

    Principal Data Architect

    QualysFoster City, CA, United States
    Full-time
    Come work at a place where innovation and teamwork come together to support the most exciting missions in the world!.Define standards and best practices for data modeling, governance, integrations ...Show moreLast updated: 19 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Ekman AssociatesMenlo Park, CA, United States
    Full-time
    Data Engineer (Data Pipelines & Visualization).Ekman Associates is a management consulting firm that specializes in developing business, digital, and technology strategy, delivering solutions, and ...Show moreLast updated: 3 days ago
    • Promoted
    Data Platform Engineer

    Data Platform Engineer

    Cogent SecuritySan Francisco, CA, United States
    Full-time
    Cogent Security is on a mission to stop breaches and prevent cybercrime by innovating at the frontier of generative AI systems. We are building the world's first AI cyber taskforce, composed of AI a...Show moreLast updated: 3 days ago
    • Promoted
    Lead Data Engineer

    Lead Data Engineer

    VirtualVocationsSanta Clara, California, United States
    Full-time
    A company is looking for a Lead Data Engineer to design, build, and manage enterprise-grade data pipelines.Key Responsibilities Design, develop, and optimize metadata-driven data pipelines in Fab...Show moreLast updated: 30+ days ago
    • Promoted
    Databricks Data Engineer

    Databricks Data Engineer

    Tekfortune IncPleasanton, CA, United States
    Permanent
    Tekfortune is a fast-growing consulting firm specialized in permanent, contract & project-based staffing services for world's leading organizations in a broad range of industries.In this quickly ch...Show moreLast updated: 3 days ago