Talent.com
Software Engineer, Data Infrastructure

Thinking Machines Lab
San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Thinking Machines Lab's mission is to empower humanity through advancing collaborative general intelligence. We're building a future where everyone has access to the knowledge and tools to make AI work for their unique needs and goals.

We are a small team of scientists, engineers, and builders who've created some of the most widely used AI products, like ChatGPT, Character.ai, Mistral, PyTorch, OpenAI Gym, Fairseq, and Segment Anything.

About The Role

We're looking for a Staff Software Engineer with deep expertise in Data Infrastructure to help build the systems that power our foundation models.

You'll join a small, high-impact team responsible for architecting and scaling the core infrastructure behind distributed training pipelines, multimodal data catalogs, and intelligent processing systems that operate over petabytes of data.

Infrastructure is critical to us: it's the bedrock that enables every breakthrough. You'll work directly with researchers to accelerate experiments, develop new datasets, improve infrastructure efficiency, and enable key insights across our data assets.

If you're excited by distributed systems, large-scale data mining, open-source tools like Spark, Kafka, Beam, Ray, and Delta Lake, and enjoy building from the ground up, we'd love to hear from you.

What You'll Do

  • Design, build, and operate scalable, fault-tolerant infrastructure for LLM research: distributed compute, data orchestration, and storage across modalities.
  • Develop high-throughput systems for data ingestion, processing, and transformation, including training data catalogs, deduplication, quality checks, and search.
  • Build systems for traceability, reproducibility, and robust quality control at every stage of the data lifecycle.
  • Implement and maintain monitoring and alerting to support platform reliability and performance.
  • Collaborate with research teams to unlock new features, improve data quality, and accelerate training cycles.

Required Qualifications

  • Have 5+ years of experience in data infrastructure, ideally supporting ML or research use cases.
  • Are fluent in distributed compute frameworks such as Apache Spark and Ray.
  • Have hands-on experience with Kafka, dbt, Terraform, and Airflow.
  • Have experience building a web crawler.
  • Have extensive experience studying and scaling deduplication, data mining, and search.
  • Are deeply familiar with cloud infrastructure, data lake architectures, and batch + streaming pipelines.
  • Have strong knowledge of file formats and storage systems (e.g., Parquet, Delta Lake) and how they impact performance and scalability.
  • Are proactive about documentation, testing, and empowering your teammates with good tooling.

Strong Candidates May Also Have

  • 5+ years of industry experience building large-scale distributed systems.
  • Strong proficiency in Python and SQL; Rust is a bonus.
  • Familiarity with performance tuning and memory management in high-volume data systems.
  • Track record of scaling infrastructure and debugging complex systems in production.
  • Excellent communication and collaboration skills.

Logistics

  • Location: This role is based in San Francisco, California.
  • Visa sponsorship: We sponsor visas. While we can't guarantee success for every candidate or role, if you're the right fit, we're committed to working through the visa process together.
  • Benefits: Thinking Machines offers competitive health, dental, and vision benefits, unlimited PTO, paid parental leave, and relocation support as needed.
  • Compensation: Depending on background, skills, and experience, the expected annual salary range for this position is $300,000-$350,000 USD.

We encourage you to apply even if you do not believe you meet every single qualification.

As set forth in Thinking Machines' Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law.