Location
San Francisco
Employment Type
Full time
Location Type
Hybrid
Department
Engineering
Inference.net is hiring a Senior Full-Stack (Frontend-Focused) Engineer
Help us build beautiful, performant web experiences that give users super-powers over our globally distributed LLM inference platform. If you love shipping React apps that feel snappy at planet-scale, we’d love to meet you.
About Inference.net
We combine idle GPU capacity from around the world into a single cohesive plane of compute capable of serving models like DeepSeek and Llama 4. At any moment, 5,000+ GPUs and hundreds of terabytes of VRAM are connected to our network.
We’re a small, well-funded team working in-person from downtown San Francisco (hybrid flexibility as needed). Investors include a16z CSX and Multicoin . We’re high-agency, collaborative, and obsessed with craft—whether that’s a distributed scheduler or a pixel-perfect UI.
What you’ll do
What we’re looking for
Must-Have
Nice-to-Have
You don’t need to tick every “nice-to-have” box—curiosity and the ability to learn quickly matter more.
Compensation
How we work
We iterate fast, test in prod (safely!), and celebrate small wins. You’ll demo work twice a week, pair with systems engineers, and ship to users continuously. Most of us are in the office 3–4 days a week; remote candidates considered if time-zone compatible with Pacific hours.
Equal Opportunity
Inference.net is an equal opportunity employer. We value diversity and do not discriminate on the basis of race, color, religion, gender identity, sexual orientation, national origin, veteran status, disability, age, or any other protected status.
Ready to build the front door to planet-scale AI?
Send a short note and a link to something you’ve shipped (code, demo, or Dribbble shots) to jobs@inference.net . We can’t wait to chat!
#J-18808-Ljbffr
Frontend Engineer • San Francisco, CA, United States