Talent.com
Software Engineer - Large Scale Inference
Software Engineer - Large Scale InferenceSF Compute • San Francisco, CA, United States
Software Engineer - Large Scale Inference

Software Engineer - Large Scale Inference

SF Compute • San Francisco, CA, United States
Hace 15 días
Tipo de contrato
  • A tiempo completo
Descripción del trabajo

We're going to secure the financial risk of the largest infrastructure build-out in the history of the world.

When people finance clusters, the data centers that house them, and the power that powers them, they need "offtake". Offtake just means someone who's signed a contract to buy the thing they're building for some period of time. It's pretty hard to write a loan against a cluster that isn't sold to someone. The lender doesn't want to take on the risk that the cluster developer can't sell it, and the cluster developer really does not take on that risk. So instead, they ship it to the customer, by forcing folks to buy risky long-term contracts. Since margins are thin & volumes are large, if they mess up that purchase, that's game over. A 20% utilization gap might mean the difference between profit & bankruptcy.

If we don't solve that risk, there's a bubble. Application layer companies sign multi-year contracts for compute and inference, but sell to their customers on monthly subscriptions. This isn't SaaS anymore. A minor shift in your growth rate could cause the revenue to fail to come in the door, the compute bill to still come due, at the same time the venture capital slows down.

As AI scales, the folks who can most effectively take on that risk become the largest players. After all, a 2-person startup in a San Francisco Victorian can't realistically sign a 5-year take or pay contract on $100m supercomputer. But they may be able to buy the month of liquidity that someone else sold back.

So that's what we make : a liquid market for GPU offtake.

About the Role

As an engineer working on Large Scale Inference you will build and scale software systems that accept, distribute, and purchase compute for inference work loads. We're working with a cutting edge technology company as our partner. You will play a key role in building out and defining the inference products at SFC. You will work closely with a small competent team driving high impact revenue generating changes. You will have the opportunity to deliver the highest quality, most affordable inference products at scale.

On a day-to-day basis, you might...

  • Quantify, optimize, and visualize the unit economics of inference
  • Design solutions to maximize compute utilization
  • Create automated compute purchasing software to optimally fulfill inference job demand
  • Scale inference software pipeline systems to hundred trillion token scale

This role may be a good fit for you if...

  • You enjoy the craftsmanship of software
  • You're a thoughtful high-agency engineer
  • Have a deep appreciation for reliable fault tolerant systems
  • Are a strong communicator and work well with others
  • Have SQL experience
  • Bonus points if you...

  • Have previously worked at a startup
  • Have interest in market dynamics
  • Have previously scaled distributed work scheduling systems
  • Are comfortable working in Rust
  • Have some Kubernetes or terraform experience
  • Benefits

    Generous equity grant

    Team members are offered a competitive salary along with equity in the company

    Visa Sponsorships

    Yes, we sponsor visas and work permits

    Retirement matching

    We match 401(k) plans up to 4%

    Medical, dental & vision

    We offer competitive medical, dental, vision insurance for employees and dependents and cover 100% of premiums

    Time off

    We offer unlimited paid time off as well as 10+ observed holidays

    Parental leave

    We offer biological, adoptive, and foster parents paid time off to spend quality time with family

    Daily lunch

    We cover lunch daily for employees

    Unlimited office book budget

    You can buy as many books for the office as you want

    The San Francisco Compute Company is committed to maintaining a workplace free from discrimination and harassment.

    We make employment decisions based on business needs, job requirements, and individual qualifications, without regard to race, color, religion, belief, national origin, social or ethical origin, age, physical, mental, or sensory disability, sexual orientation, gender identity or expression, marital status, civil union or domestic partnership status, past or present military service, HIV status, family medical history or genetic information, family or parental status including pregnancy, or any other status protected by law.

    We welcome the opportunity to consider qualified applicants with prior arrest or conviction records. Our commitment to diversity includes hiring talented individuals regardless of their criminal history, in accordance with local, state, and federal laws, including San Francisco's Fair Chance Ordinance and California's ban-the-box laws.

    If you require reasonable accommodation for any reason, please reach out to us at team@sfcompute.com.

    Crear una alerta de empleo para esta búsqueda

    Software Engineer • San Francisco, CA, United States

    Ofertas relacionadas
    Software Engineer

    Software Engineer

    General Medicine • Fremont, CA, United States
    A tiempo completo
    As a software engineer at General Medicine, you’ll help build and scale a healthcare store that makes it delightfully simple to shop for any type of care. We provide upfront cash and insurance price...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Controls Software Engineer

    Controls Software Engineer

    Lawrence Berkeley National Laboratory • Berkeley, CA, United States
    A tiempo completo
    Berkeley Lab's Engineering Division is seeking an innovative and creative.Beamline Controls Group at the Advanced Light Source (ALS). The ALS is on the brink of an expansive equipment upgrade that w...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Software Engineer - Large Vision Model Systems

    Senior Software Engineer - Large Vision Model Systems

    Tik Tok • San Jose, CA, United States
    A tiempo completo
    Team Introduction The Intelligent Creation - AI Platform team is a team focusing on building advanced end-to-end AI production pipelines, including deep learning model training, optimization, deplo...Mostrar más
    Última actualización: hace 1 día • Oferta promocionada
    Software Engineer, Inference

    Software Engineer, Inference

    algojobs • San Francisco, CA, United States
    A tiempo completo
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group ...Mostrar más
    Última actualización: hace 6 días • Oferta promocionada
    Senior Inference Software Engineer

    Senior Inference Software Engineer

    Etched • San Jose, CA, United States
    A tiempo completo
    Etched is building AI chips that are hard-coded for individual model architectures.Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower laten...Mostrar más
    Última actualización: hace 6 días • Oferta promocionada
    Software Engineer, Enterprise AI

    Software Engineer, Enterprise AI

    Scale AI, Inc. • San Francisco, CA, United States
    A tiempo completo
    Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a strong enginee...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Software Engineer, Model Inference

    Software Engineer, Model Inference

    OpenAI • San Francisco, CA, United States
    A tiempo completo
    Our Inference team brings OpenAI's most capable research and technology to the world through our products.We empower consumers, enterprise and developers alike to use and access our start-of-the-ar...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Firmware EngineerSoftware Engineering • Berkeley, CA • Full time • On-site

    Senior Firmware EngineerSoftware Engineering • Berkeley, CA • Full time • On-site

    Form Energy • Berkeley, CA, United States
    A tiempo completo
    Are you ready to build America's energy future? Form Energy is an American manufacturing and energy technology company.We're revolutionizing energy storage with cost-effective, multi-day technology...Mostrar más
    Última actualización: hace 29 días • Oferta promocionada
    Inference Software Engineer - Collectives

    Inference Software Engineer - Collectives

    ETCHED LLC • San Jose, CA, United States
    A tiempo completo
    Etched is building the world's first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200.With Etched ASIC...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Software Engineer, Detect

    Senior Software Engineer, Detect

    Clutch Canada • Mountain View, CA, United States
    A tiempo completo
    Resemble AI is at the forefront of voice AI technology, pioneering the field of Custom Voices.Our advanced Deep Learning models enable us to produce highly realistic Speech Synthesis, marking a sig...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Software Engineer - Large Vision Model Systems

    Software Engineer - Large Vision Model Systems

    Tik Tok • San Jose, CA, United States
    A tiempo completo
    Team Introduction The Intelligent Creation - AI Platform team is a team focusing on building advanced end-to-end AI production pipelines, including deep learning model training, optimization, deplo...Mostrar más
    Última actualización: hace 1 día • Oferta promocionada
    Staff Software Platform EngineerSoftware Engineering • Berkeley, CA; Somerville, MA; Weirton, WV • Full time • On-site

    Staff Software Platform EngineerSoftware Engineering • Berkeley, CA; Somerville, MA; Weirton, WV • Full time • On-site

    Form Energy • Berkeley, CA, United States
    A tiempo completo
    Are you ready to build America's energy future? Form Energy is an American manufacturing and energy technology company.We're revolutionizing energy storage with cost-effective, multi-day technology...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Software Engineer - Intelligence

    Senior Software Engineer - Intelligence

    Hard Yaka • San Francisco, CA, United States
    A tiempo completo
    We exist to accelerate innovation.We do this by giving more people the opportunity to participate in the venture economy by building the financial infrastructure that makes it possible for more peo...Mostrar más
    Última actualización: hace 5 días • Oferta promocionada
    Software Engineer, Foundation Inference Infrastructure

    Software Engineer, Foundation Inference Infrastructure

    Tesla • Palo Alto, CA, United States
    A tiempo completo
    As a member of the Foundation Inference Infrastructure team, you will design & implement a diverse set of backend services and tools that power autonomy software and hardware development processes....Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Software Engineer, Optimus Inference Co Design

    Software Engineer, Optimus Inference Co Design

    Tesla • Palo Alto, CA, United States
    A tiempo completo
    The AI inference co-design team's goal is to take research models and make them run efficiently on our AI-ASIC to power real-time inference for Optimus humanoid robot programs, with applications ex...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Software Engineer - AI Agent Infrastructure (Healthcare)

    Senior Software Engineer - AI Agent Infrastructure (Healthcare)

    Honey Health • San Jose, CA, United States
    A tiempo completo
    Honey Health is the all-in-one AI back office for primary and specialty care.Our AI agents autonomously handle core back-office jobs, such as aggregating patients data, processing orders and prescr...Mostrar más
    Última actualización: hace 11 días • Oferta promocionada
    Inference Software Engineer

    Inference Software Engineer

    ETCHED LLC • Cupertino, CA, United States
    A tiempo completo
    Etched is building AI chips that are hard-coded for individual model architectures.Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower laten...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Software Engineer, Inference Platform

    Senior Software Engineer, Inference Platform

    MongoDB • Palo Alto, CA, US
    A tiempo completo
    MongoDB's mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and...Mostrar más
    Última actualización: hace 26 días • Oferta promocionada