Talent.com
Sr. Data Engineer, New Venture

Sr. Data Engineer, New Venture

SanitySan Francisco, CA, United States
22 hours ago
Job type
  • Full-time
Job description

At Sanity.io, we're building the future of AI-powered Content Operations. Our AI Content Operating System gives teams the freedom to model, create, and automate content the way their business works, accelerating digital development and supercharging content operations efficiency. Companies like SKIMS , Figma , Riot Games , Anthropic , COMPLEX , Nordstrom , and Morningbrew are using Sanity to power and automate their content operations.

As part of our new venture, your work will center on addressing one of AI's toughest problems : how to help machines truly understand and use human-created content. You'll build systems that structure and enrich large volumes of information to enable AI agents and LLMs to access the right context at the right time. This means designing and developing tools and pipelines that shape, structure, and connect information and content in innovative ways, and creating new methods to ensure AIs reflect the most accurate, authentic, and up-to-date representation of a business, its brand, products, and knowledge base.

As a Senior Data Engineer you'll architect and optimize the data infrastructure that powers our next generation of AI capabilities. You'll be the engine behind our AI systems, building scalable, efficient data pipelines that process massive volumes of content while maintaining low latency and managing costs intelligently. Your work will directly enable AI agents and LLMs to access the right data at the right time. You'll join a small, cross-functional team where your expertise in data engineering and ML infrastructure will be critical to turning ambitious AI concepts into production-ready systems. If you're passionate about building robust data systems that power cutting-edge AI, obsess over performance optimization, and love solving complex scaling challenges, we'd love to have you on the team.

What you will do :

  • Design, build, and optimize scalable data pipelines for AI and ML workloads, handling large volumes of structured and unstructured content data.
  • Architect data processing systems that transform, enrich, and prepare content for LLM consumption, with a focus on latency optimization and cost efficiency.
  • Build ETL / ELT workflows that extract, transform, and load data from diverse sources to support real-time and batch AI operations.
  • Implement data quality monitoring and observability systems to ensure pipeline reliability and data accuracy for AI models.
  • Collaborate with engineers and product teams to understand data requirements and design optimal data architectures that support AI features.
  • Optimize data storage strategies across data lakes, warehouses, and vector databases to balance performance, cost, and scalability.
  • Build automated data validation and testing frameworks to maintain data integrity throughout the pipeline.
  • Stay at the forefront of LLM research, understanding model behaviors, limitations, and capabilities to inform system design decisions.
  • Monitor and optimize pipeline performance, identifying bottlenecks and implementing solutions to improve throughput and reduce latency.
  • Create clear documentation of data architectures, pipeline logic, and operational procedures.

About you :

  • Based in the San Francisco Bay Area and able to work at least 2 days per week in our San Francisco office.
  • 5+ years of data engineering experience, with at least 2 years focused on AI / ML data pipelines or supporting machine learning workloads.
  • High level of proficiency in Python and SQL.
  • Strong experience with distributed data processing frameworks like Apache Spark, Dask, or Ray.
  • Proficiency with GCP and their data services.
  • Experience with real-time data streaming technologies like Kafka, Redpanda or NATS.
  • Familiarity with vector databases (e.g., Milvus, ElasticSearch, Vespa) and their role in AI applications.
  • Experience with data modeling, schema design, and working with both relational and NoSQL databases (PostgreSQL, MongoDB, Cassandra).
  • Strong focus on performance optimization, cost management, and building systems that scale efficiently.
  • Experience implementing data observability and monitoring solutions (e.g., Prometheus, ClickHouse).
  • Ability to write clean, well-documented, maintainable code with proper testing practices.
  • Excellent problem-solving skills and a data-driven approach to decision making.
  • Strong communication skills and ability to collaborate effectively with cross-functional teams.
  • Comfortable with ambiguity and excited about working on undefined problems that require creative solutions.
  • Familiarity with data pipeline orchestration tools such as Airflow, Dagster, Prefect, or similar frameworks is a nice to have.
  • What we can offer :

  • A highly skilled, inspiring, and supportive team.
  • Positive, flexible, and trust-based work environment that encourages long-term professional and personal growth.
  • A global, multi-culturally team of colleagues and customers.
  • Comprehensive health plans and perks.
  • A healthy work-life balance that accommodates individual and family needs.
  • Competitive salary and stock options program.
  • Base Salary Range : $210,000 - $265,000 annually. Final compensation within this range will be determined based on the candidate's experience and skill set.
  • Who we are :

    Sanity.io is a modern, flexible content operating system that replaces rigid legacy content management systems. One of our big differentiators is treating content as data so that it can be stored in a single source of truth, but seamlessly adapted and personalized for any channel without extra effort. Forward-thinking companies choose Sanity because they can create tailored content authoring experiences, customized workflows, and content models that reflect their business.

    Sanity recently raised a $85m Series C led by GP Bullhound and is also backed by leading investors like ICONIQ Growth, Threshold Ventures, Heavybit and Shopify, as well as founders of companies like Vercel, WPEngine, Twitter, Mux, Netlify and Heroku. This funding round has put Sanity in a strong position for accelerated growth in the coming years.

    You can only build a great company with a great culture. Sanity is a 200+ person company with highly committed and ambitious people. We are pioneers , we exist for our customers , we are hel ved , and we love type two fun ! Read more about our values here!

    Sanity.io pledges to be an organization that reflects the globally diverse audience that our product serves. We believe that in addition to hiring the best talent, a diversity of perspectives, ideas, and cultures leads to the creation of better products and services. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, or gender identity.

    Create a job alert for this search

    Sr Data Engineer • San Francisco, CA, United States

    Related jobs
    • Promoted
    Sr. Data Engineer

    Sr. Data Engineer

    Diverse LynxSan Francisco, CA, United States
    Full-time
    Location : San Francisco, CA (Remote).Architect and design metadata-driven, scalable, secure, and cost-efficient Lakehouse solutions on Azure that leverage Delta Lake for ACID transactions, schema e...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Data Engineer

    Sr. Data Engineer

    Contact Government Services, LLCSan Francisco, CA, United States
    Full-time
    Employment Type : Full-Time, Mid-level.Department : Business Intelligence.CGS is seeking a passionate and driven Data Engineer to support a rapidly growing Data Analytics and Business Intelligence pl...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Sr. Data Engineer

    Sr. Data Engineer

    USMPleasanton, CA, United States
    Full-time
    Visa Types Green Card, US Citiz.Location : Pleasanton, California (hybrid work).As a Senior / Lead Data Engineer, you will lead the design, development, and ownership of core data infrastructure-from ...Show moreLast updated: 22 hours ago
    • Promoted
    Sr Data Engineer

    Sr Data Engineer

    Diverse LynxSan Francisco, CA, United States
    Full-time
    Key Skills : Tableau, Python, Snowflake.The ideal candidate will excel at collaborating with business users, cross-functional teams, and offshore teams to deliver high-quality data solutions.A solid...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Distinguished, Data Engineer - Enterprise Data Storage and Consumption Platforms Data Engineer

    Sr. Distinguished, Data Engineer - Enterprise Data Storage and Consumption Platforms Data Engineer

    Information Technology Senior Management ForumSan Francisco, CA, United States
    Full-time
    Distinguished, Data Engineer - Enterprise Data Storage and Consumption Platforms.Distinguished Data Engineers are individual contributors who strive to be diverse in thought so we visualize the pro...Show moreLast updated: 12 days ago
    • Promoted
    Sr. Data Engineer

    Sr. Data Engineer

    VisaFoster City, CA, United States
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Data Engineer

    Sr Data Engineer

    VisaFoster City, CA, United States
    Full-time
    Commercial Money Movement Solutions (CMS) division's charter is to capture new sources of money movement through card and non-card flows, including Visa Business Solutions, Government Solutions and...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Sr. Data Engineer

    Sr. Data Engineer

    AkkodisPleasanton, CA, United States
    Full-time
    You will be responsible for building scalable data pipelines and managing data models across cloud-based storage systems like S3. The rate may be negotiable based on experience, education, geographi...Show moreLast updated: 22 hours ago
    • Promoted
    • New!
    Sr. Data Engineer :

    Sr. Data Engineer :

    AkrayaSan Francisco, CA, United States
    Permanent
    Primary Skills : Python (Advanced), PySpark (Advanced), Databricks (Intermediate), AWS (Intermediate), ETL (Intermediate) Contract Type : W2 Only Duration : 12+ Months Location : San Francisco, CA Pay ...Show moreLast updated: 22 hours ago
    • Promoted
    • New!
    Sr Engineer Data Engineering - US Based Remote

    Sr Engineer Data Engineering - US Based Remote

    Anywhere Real EstateSan Francisco, CA, United States
    Remote
    Full-time
    At Anywhere, we work to build and improve a platform that helps real estate professionals work effectively and helps delight home buyers and sellers with an excellent experience.We do that by combi...Show moreLast updated: 22 hours ago
    • Promoted
    • New!
    Sr Data Engineer

    Sr Data Engineer

    LendingClubSan Francisco, CA, United States
    Full-time
    At LendingClub, our mission is to empower people striving to achieve better financial health and data plays a critical role in making that mission possible. The Data and Analytics team builds the sy...Show moreLast updated: 22 hours ago
    • Promoted
    Sr Customer Data Engineer

    Sr Customer Data Engineer

    Tailored Brands IncDublin, CA, United States
    Full-time
    At Tailored Brands, we help people love the way they look and feel for their most important moments.Our Technology team loves the way they feel and thrive at work, with : . Flexible work opportunities...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Sr Data Engineer

    Sr Data Engineer

    Early Warning ServicesSan Francisco, CA, United States
    Full-time
    At Early Warning, we've powered and protected the U.As a trusted name in payments, we partner with thousands of institutions to increase access to financial services and protect transactions for hu...Show moreLast updated: 22 hours ago
    • Promoted
    • New!
    Sr Data Engineer

    Sr Data Engineer

    Early Warning Services, LLCSan Francisco, CA, United States
    Full-time
    At Early Warning, we've powered and protected the U.Zelle®, Paze℠, and so much more.As a trusted name in payments, we partner with thousands of institutions to increase access to financial services...Show moreLast updated: 22 hours ago
    • Promoted
    Sr. Data Engineer

    Sr. Data Engineer

    SS&C TechnologiesSan Francisco, CA, United States
    Full-time
    As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries.Some 20,000 financial se...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    JobotSan Jose, CA, US
    Full-time
    REMOTE Senior Data Solutions Engineer / Lead Data Engineer Needed for Industry-Leader!.This Jobot Job is hosted by : Reed Kellick. Are you a fit? Easy Apply now by clicking the "Apply Now" button and...Show moreLast updated: 10 days ago
    • Promoted
    • New!
    Sr. Data Engineer - High-Growth Tech Startup

    Sr. Data Engineer - High-Growth Tech Startup

    AndiamoSan Francisco, CA, United States
    Full-time
    Data Engineer - High-Growth Tech Startup.Data Engineer - High-Growth Tech Startup.Senior Data Engineer High-Growth Tech Startup. M+ tech startup in the data management space.You will play a critical...Show moreLast updated: 22 hours ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Mercury Insurance Services, LLCSan Francisco, CA, United States
    Full-time
    As a Sr Data Engineer you will create production data pipelines for our advanced analytics and data science teams - as well as collaborate with other technical personnel on internal & external data...Show moreLast updated: 30+ days ago