Talent.com
Senior Software Engineer, Observability
Together AI • San Francisco, CA, United States
16 days ago
Job type
  • Full-time
Job description

Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastructure.

The AI Infrastructure team at Together AI is at the forefront of building and scaling the foundational systems that power our generative AI platform. The storage and observability team designs, implements, and maintains robust distributed storage solutions that ensure seamless data access and management. The team also develops comprehensive observability platforms that provide critical insights into system performance and GPU utilization, and it proactively identifies and resolves issues.

Requirements

  • 5+ years of demonstrated experience in building large-scale, fault-tolerant distributed systems and API microservices
  • Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources
  • Excellent communication skills – able to write clear design docs and work effectively with both technical and non-technical team members
  • Demonstrated experience building and operating high-performance and/or globally distributed microservice architectures across one or more cloud providers (AWS, Azure, GCP)

Responsibilities

  • Identify, design, and develop foundational backend services that power Together’s cloud platform
  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs
  • Write clear, well-tested, and maintainable software and IaC for both new and existing systems
  • Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance
  • Participate in an on‑call rotation to address critical incidents when necessary

About Together AI

Together AI is a research‑driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co‑designing software, hardware, algorithms, and models. We have contributed to leading open‑source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers on our journey to build the next-generation AI infrastructure.

Compensation

We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full‑time position is $160,000 - $260,000 + equity + benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job‑related knowledge.

Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
