Talent.com
Founding Site Reliability Engineer

Founding Site Reliability Engineer

ReductoSan Francisco, CA, United States
9 hours ago
Job type
  • Full-time
Job description

About Reducto

Nearly 80% of enterprise data is in unstructured formats like PDFs. PDFs are the status quo for enterprise knowledge in nearly every industry. Reducto helps extract data from complex documents, enabling digital workflows.

We empower ingestion pipelines for hundreds of AI teams, from startups to Fortune 10 enterprises. We’ve grown quickly and are funded by tier 1 investors.

The Opportunity

You’ll be the first dedicated SRE at Reducto, influencing every aspect of our infrastructure from the ground up. You can expect to architect and scale resilient systems for AI and ML workloads, automate cloud infrastructure, and implement monitoring and incident response practices that set the standard for reliability. This role requires technical leadership, hands-on systems engineering, and strong collaboration with our founders and product teams as we build a company around reliability, rapid iteration, and high-impact product delivery.

The Core Work Will Include

  • Designing, building, and maintaining highly available, scalable infrastructure to support intensive AI / ML workloads and real-time model deployments.
  • Implementing robust monitoring, alerting, and observability systems to ensure system health, performance, and uptime across cloud and on-prem environments.
  • Debugging, optimizing, and automating infrastructure for fast iteration and rapid deployment cycles, focusing on both reliability and developer velocity.
  • Proactively identifying, investigating, and resolving incidents to minimize downtime and maintain world-class service levels for enterprise customers.
  • Collaborating closely with engineers, ML specialists, and founders to shape product, infrastructure, and security strategies.

We Would Love To Meet You If You

  • Are your own worst critic—have an extremely high bar for quality and always aim for robust solutions rather than quick fixes.
  • Have 5+ years of hands-on experience in building or supporting production-grade infrastructure and reliability processes for high-throughput systems.
  • Are comfortable with Python or similar languages, and exceptional at working across cloud platforms, container orchestration (e.g., Kubernetes), networking, and storage technologies.
  • Build your own tools on the fly to diagnose, experiment, and address reliability problems—whether it’s an internal dashboard or an automated remediation workflow.
  • Bring a quantitative, hands-on approach to system operations, automation, and continuous improvement.
  • Bonus Points If You

  • Have prior experience founding a company or building products / infrastructure in early-stage, high-growth environments.
  • Are excited about automating incident management processes with LLMs / AI.
  • Are driven, ambitious, and deeply care about both technical excellence and collaborative problem-solving.
  • Keep up with the latest trends in cloud, observability, and SRE best practices.
  • Are passionate about open-source and have contributed tools or automation to reliability communities.
  • Have built or optimized monitoring, incident response, or high-performance computing systems for demanding AI / ML, fintech, or enterprise clients.
  • This is an in person role at our office in SF. We’re an early stage company which means that the role requires working hard and moving quickly. Please only apply if that excites you.

    About Reducto

    Reducto Breaks Document Layouts Into Subsections And Then Contextually Parses Each Depending On The Type Of Content, Using Vision Models, LLMs, and a suite of heuristics. We can help you :

  • Accurately extract text and tables even with nonstandard layouts
  • Automatically convert graphs to tabular data and summarize images in documents
  • Extract important fields from complex forms with simple, natural language instructions
  • Build powerful retrieval pipelines using document metadata
  • Intelligently chunk information using the document’s layout data
  • Benefits at Reducto

  • Unlimited PTO : We believe great work requires recharging.
  • Lunch : Free lunch daily at the office.
  • Reimbursed Transportation : We’ll cover your transportation costs with receipts.
  • Insurance : Generous health insurance covering medical, dental, and vision.
  • Health and Wellness Budget : Up to $150 / mo for health and wellness spending.
  • Parental Leave : Flexible leave schedules to accommodate family needs.
  • Reducto is an Equal Opportunity Employer committed to diversity and inclusion in the workplace. All qualified applicants will receive consideration for employment without regard to sex, race, color, national origin, religion, disability, genetic information, marital status, sexual orientation, gender identity / assignment, citizenship, pregnancy or maternity, protected veteran status, or any other status prohibited by law.

    #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Engineer • San Francisco, CA, United States

    Related jobs
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Redwood Materials, Inc.San Francisco, CA, United States
    Full-time
    Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling—keeping critical minerals in circulation and driving the energy transition.Founded in 2...Show moreLast updated: 9 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    criteoPalo Alto, CA, United States
    Full-time
    At Criteo we face challenging problems in the IT industry at scale.Our data is large and our systems require speed and complexity handling. We have about 40 petabytes in Hadoop storage and respond t...Show moreLast updated: 9 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    WritemedSan Francisco, CA, United States
    Full-time
    Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care ...Show moreLast updated: 9 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Together AISan Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a soft...Show moreLast updated: 9 hours ago
    • Promoted
    • New!
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    JPMorganChasePalo Alto, CA, United States
    Full-time
    Join a globally recognized financial organization and advance your profession to new heights by contributing to revolutionary projects. You've discovered the perfect environment to have a major impa...Show moreLast updated: 9 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PacerProSan Francisco, CA, United States
    Full-time
    You’ll be joining the engineering team responsible for delivering PacerPro’s SaaS and on-premise solutions that orchestrate case data workflows and provide data driven legal insights for our client...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PerplexitySan Francisco, CA, United States
    Full-time
    Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world’s leading AI platforms. Perplexity has raised over $1B in venture investment from some of t...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ZipRecruiterBerkeley, CA, United States
    Full-time
    Job DescriptionJob Description.We are seeking a Site Reliability Engineer to join our Operations Group.This role plays a key part in advancing scientific discovery by supporting high-performance co...Show moreLast updated: 9 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    AlchemySan Francisco, CA, United States
    Full-time
    Our mission is to bring web3 to a billion people, by providing builders with the tools they need to build exceptional onchain products. Alchemy is the only complete developer platform that offers th...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    xAIPalo Alto, CA, United States
    Full-time
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...Show moreLast updated: 9 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Jobs via DiceRedwood City, CA, United States
    Full-time
    Dice is the leading career destination for tech experts at every stage of their careers.Our client, Kforce Technology Staffing, is seeking a Reliability Engineer in Redwood City, CA.Deliver high-le...Show moreLast updated: 9 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PrimerSan Francisco, CA, United States
    Full-time
    Primer helps B2B products break out of the B2C-centric marketing box.Our platform turns consumer ad channels, data streams, and emerging AI workflows into measurable growth engines for go-to-market...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Redwood MaterialsSan Francisco, CA, United States
    Full-time
    Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling — keeping critical minerals in circulation and driving the energy transition.Founded in...Show moreLast updated: 9 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    HiveSan Francisco, CA, United States
    Full-time
    Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ZapierSan Francisco, CA, United States
    Full-time
    We're humans who simply think computers should do more work.At Zapier, we’re not just making software—we’re building a platform to help millions of businesses globally scale with automation and AI....Show moreLast updated: 9 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Bits to AtomsSan Francisco, CA, United States
    Full-time
    Site Reliability Engineer (SRE).You’ll work at the intersection of infrastructure, AI / ML systems, and mission-critical physical operations. You’ll collaborate directly with engineering, AI, and oper...Show moreLast updated: 9 hours ago
    • Promoted
    • New!
    Senior Site Reliability Engineer Denver, Colorado, United States; San Francisco, California, Un[...]

    Senior Site Reliability Engineer Denver, Colorado, United States; San Francisco, California, Un[...]

    CheckrSan Francisco, CA, United States
    Full-time
    Checkr is building the data platform to power safe and fair decisions.Established in 2014, Checkr’s innovative technology and robust data platform help customers assess risk and ensure safety and c...Show moreLast updated: 9 hours ago
    • Promoted
    Associate Site Reliability Engineer / Site Reliability Engineer

    Associate Site Reliability Engineer / Site Reliability Engineer

    MedStar HealthRedwood City, CA, United States
    Full-time
    C3 AI (NYSE : AI), is the Enterprise AI application software company.C3 AI delivers a family of fully integrated products including the C3 Agentic AI Platform, an end-to-end platform for developing,...Show moreLast updated: 30+ days ago