Talent.com
Senior Software Engineer, Observability

Together AI, San Francisco, California, United States
20 hours ago
Job type
  • Full-time
Job description

Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastructure.

The AI Infrastructure team at Together AI builds and scales the foundational systems that power our generative AI platform. Within it, the storage and observability team designs, implements, and maintains robust distributed storage solutions that ensure seamless data access and management. The team also develops comprehensive observability platforms that provide critical insights into system performance and GPU utilization, and it proactively identifies and resolves issues.

Requirements

  • 5+ years of demonstrated experience building large-scale, fault-tolerant distributed systems and API microservices
  • Experience designing, analyzing, and improving the efficiency, scalability, and stability of various system resources
  • Excellent communication skills: able to write clear design docs and work effectively with both technical and non-technical team members
  • Demonstrated experience building and operating high-performance and/or globally distributed microservice architectures across one or more cloud providers (AWS, Azure, GCP)

Responsibilities

  • Identify, design, and develop foundational backend services that power Together’s cloud platform
  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs
  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems
  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance
  • Participate in an on‑call rotation to address critical incidents when necessary

About Together AI

Together AI is a research‑driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co‑designing software, hardware, algorithms, and models. We have contributed to leading open‑source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers on our journey to build the next generation of AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full‑time position is: $160,000 - $260,000 + equity + benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job‑related knowledge.

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

