Software Engineer - Large Scale Inference
SF Compute • San Francisco, CA, United States
17 days ago
Job type
  • Full-time
Job description

We're going to secure the financial risk of the largest infrastructure build-out in the history of the world.

When people finance clusters, the data centers that house them, and the power that powers them, they need "offtake". Offtake just means someone who's signed a contract to buy the thing they're building for some period of time. It's pretty hard to write a loan against a cluster that isn't sold to someone. The lender doesn't want to take on the risk that the cluster developer can't sell it, and the cluster developer really doesn't want to take on that risk either. So instead, they shift it to the customer, by forcing folks to buy risky long-term contracts. Since margins are thin & volumes are large, if they mess up that purchase, that's game over. A 20% utilization gap might mean the difference between profit & bankruptcy.

If we don't solve that risk, there's a bubble. Application layer companies sign multi-year contracts for compute and inference, but sell to their customers on monthly subscriptions. This isn't SaaS anymore. A minor shift in your growth rate could mean the revenue fails to come in the door while the compute bill still comes due, just as the venture capital slows down.

As AI scales, the folks who can most effectively take on that risk become the largest players. After all, a 2-person startup in a San Francisco Victorian can't realistically sign a 5-year take-or-pay contract on a $100m supercomputer. But they may be able to buy the month of liquidity that someone else sold back.

So that's what we make: a liquid market for GPU offtake.

About the Role

As an engineer working on Large Scale Inference, you will build and scale software systems that accept, distribute, and purchase compute for inference workloads. We're working with a cutting-edge technology company as our partner. You will play a key role in building out and defining the inference products at SFC. You will work closely with a small, competent team driving high-impact, revenue-generating changes. You will have the opportunity to deliver the highest quality, most affordable inference products at scale.

On a day-to-day basis, you might...

  • Quantify, optimize, and visualize the unit economics of inference
  • Design solutions to maximize compute utilization
  • Create automated compute purchasing software to optimally fulfill inference job demand
  • Scale inference software pipeline systems to hundred-trillion-token scale

This role may be a good fit for you if...

  • You enjoy the craftsmanship of software
  • You're a thoughtful, high-agency engineer
  • You have a deep appreciation for reliable, fault-tolerant systems
  • You're a strong communicator and work well with others
  • You have SQL experience

Bonus points if you...

  • Have previously worked at a startup
  • Have interest in market dynamics
  • Have previously scaled distributed work scheduling systems
  • Are comfortable working in Rust
  • Have some Kubernetes or Terraform experience

Benefits

    Generous equity grant

    Team members are offered a competitive salary along with equity in the company

    Visa Sponsorships

    Yes, we sponsor visas and work permits

    Retirement matching

    We match 401(k) plans up to 4%

    Medical, dental & vision

    We offer competitive medical, dental, and vision insurance for employees and dependents and cover 100% of premiums

    Time off

    We offer unlimited paid time off as well as 10+ observed holidays

    Parental leave

    We offer biological, adoptive, and foster parents paid time off to spend quality time with family

    Daily lunch

    We cover lunch daily for employees

    Unlimited office book budget

    You can buy as many books for the office as you want

    The San Francisco Compute Company is committed to maintaining a workplace free from discrimination and harassment.

    We make employment decisions based on business needs, job requirements, and individual qualifications, without regard to race, color, religion, belief, national origin, social or ethnic origin, age, physical, mental, or sensory disability, sexual orientation, gender identity or expression, marital status, civil union or domestic partnership status, past or present military service, HIV status, family medical history or genetic information, family or parental status including pregnancy, or any other status protected by law.

    We welcome the opportunity to consider qualified applicants with prior arrest or conviction records. Our commitment to diversity includes hiring talented individuals regardless of their criminal history, in accordance with local, state, and federal laws, including San Francisco's Fair Chance Ordinance and California's ban-the-box laws.

    If you require reasonable accommodation for any reason, please reach out to us at team@sfcompute.com.
