Talent.com
Senior Software Engineer, Observability
Senior Software Engineer, ObservabilityTogether AI • San Francisco, CA, United States
Senior Software Engineer, Observability

Senior Software Engineer, Observability

Together AI • San Francisco, CA, United States
17 days ago
Job type
  • Full-time
Job description

Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastructure.

The AI Infrastructure team at Together AI is at the forefront of building and scaling the foundational systems that power our generative AI platform. The storage and observability team is crucial for designing, implementing, and maintaining robust distributed storage solutions, ensuring seamless data access and management. They are also responsible for developing comprehensive observability platforms, providing critical insights into system performance and GPU utilization, and proactively identifying and resolving issues.

Requirements

  • 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems and API microservices
  • Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources
  • Excellent communication skills – able to write clear design docs and work effectively with both technical and non-technical team members
  • Demonstrated experience with building and operating high-performance and / or globally distributed microservice architectures across one or more cloud providers (AWS, Azure, GCP)

Responsibilities

  • Identify, design, and develop foundational backend services that power Together’s cloud platform
  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs
  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems
  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance
  • Participate in an on‑call rotation to address critical incidents when necessary
  • About Together AI

    Together AI is a research‑driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co‑designing software, hardware, algorithms, and models. We have contributed to leading open‑source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.

    Compensation

    We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full‑time position is : $160,000 - $260,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job‑related knowledge.

    Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

    #J-18808-Ljbffr

    Create a job alert for this search

    Engineer Observability • San Francisco, CA, United States

    Related jobs
    Senior Software Engineer, Unification

    Senior Software Engineer, Unification

    IXL Learning • San Mateo, CA, United States
    Full-time
    IXL Learning, a leading developer of personalized learning products used by millions worldwide, is looking for a Senior Software Engineer to join our Unification Team-a high-impact group driving te...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer I

    Senior Software Engineer I

    Qualia • San Francisco, California, United States
    Full-time
    At Qualia, we've built the leading B2B real estate technology that transforms the home buying and selling experience into a simple, secure, and enjoyable process. Our SMB and Enterprise products bri...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    FLEETWORKS INC. • San Francisco, CA, United States
    Full-time
    Every year, companies spend over a trillion dollars moving freight across the U.We're building voice agents that transform the chaotic freight booking process into a modern, intelligent marketplace...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Forage • San Francisco, CA, United States
    Full-time
    Forage is building the modern payments stack that powers inclusive commerce.Our technology enables grocers, delivery platforms, and point-of-sale systems to seamlessly accept EBT payments both onli...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Nautilus Biotechnology • San Carlos, California, United States
    Full-time
    At Nautilus, we have a big and important mission : improve the health of millions by unleashing the potential of the proteome to accelerate drug development and enable a new world of precision and p...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Scale AI, Inc. • San Francisco, CA, United States
    Full-time
    Join the team shaping the future of AI at Scale.Software is eating the world, but AI is eating software.We live in unprecedented times – AI has the potential to exponentially augment human intellig...Show more
    Last updated: 12 days ago • Promoted
    Senior Software Engineer, Observability

    Senior Software Engineer, Observability

    Together Ai • San Francisco, California, United States
    Full-time
    Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastruct...Show more
    Last updated: 17 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Assort Health • San Francisco, California, United States
    Full-time
    Assort’s vision is to make exceptional healthcare accessible anytime, anywhere, for everyone.We are building the most trusted patient-facing multimodal AI agent with industry-leading safety, accura...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Platform Observability

    Senior Software Engineer, Platform Observability

    Everlaw • Oakland, CA, United States
    Full-time
    Everlaw is looking for a Senior Software Engineer that brings experience in building robust observability tooling, humility in their approach, and interest in expanding their skills in new directio...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Weightwatchers • San Francisco, California, United States
    Full-time
    WeightWatchers is a global digital health company.We are the #1 doctor-recommended – and most clinically studied – behavioral weight health program in the world. For sixty years, WeightWatchers has ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Observability

    Senior Software Engineer, Observability

    Airtable • San Francisco, CA, United States
    Full-time
    San Francisco, CA; New York, NY; Remote (Seattle, WA only).Airtable is the no-code app platform that empowers people closest to the work to accelerate their most critical business processes.More th...Show more
    Last updated: 13 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Pubmatic • Redwood City, CA, United States
    Full-time
    PubMatic (Nasdaq : PUBM) is an independent technology company maximizing customer value by delivering digital advertising's supply chain of the future. PubMatic's sell-side platform empowers the worl...Show more
    Last updated: 17 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Shakudo • San Francisco, California, United States
    Full-time
    At Shakudo, we are building the world’s first operating system for data and AI.We use the term operating system in the truest sense of the word. Like iOS, Windows and Linux, Shakudo’s end-to-end OS ...Show more
    Last updated: 30+ days ago • Promoted
    Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

    Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

    Epoch Biodesign • San Francisco, CA, United States
    Full-time
    We are looking for a highly skilled engineer with deep expertise in building and operating observability platforms at scale. You will design, develop, and run Crusoe’s next-generation observability ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Observability

    Senior Software Engineer, Observability

    Together AI • San Francisco, CA, United States
    Full-time
    Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastruct...Show more
    Last updated: 17 days ago • Promoted
    Senior Software Engineer, Capabilities

    Senior Software Engineer, Capabilities

    Amplitude • San Francisco, California, USA
    Full-time
    Amplitude is the leading Amplitude is the leading digital analytics platform helping over 4300 customersincluding Atlassian Burger King NBCUniversal Square and Under Armourbuild better products and...Show more
    Last updated: 12 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Securitypal, Inc. • San Francisco, California, United States
    Full-time
    SecurityPal is on a mission to accelerate trust on the internet.We streamline and automate the security review process for growing companies, helping them close deals faster, build buyer confidence...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Kargo • San Francisco, CA, United States
    Full-time
    At Kargo, our mission is to build a connective tissue between the physical world of freight and the digital ecosystem used to manage it. We believe that advancements in smart infrastructure are crit...Show more
    Last updated: 30+ days ago • Promoted