Talent.com
Site Reliability Engineer
Site Reliability EngineerFractal • San Francisco, CA, United States
Site Reliability Engineer

Site Reliability Engineer

Fractal • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

This range is provided by Fractal. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

$110,000.00 / yr - $160,000.00 / yr

Site Reliability Engineer

Fractal Analytics is a strategic AI partner to Fortune 500 companies with a vision to power every human decision in the enterprise. Fractal is building a world where individual choices, freedom, and diversity are the greatest assets. An ecosystem where human imagination is at the heart of every decision. Where no possibility is written off, only challenged to get better. We believe that a true Fractalite empowers imagination with intelligence. And that it will be such Fractalites that will continue to build the company for the next 100 years.

  • Please Note : This role is specifically located in the Bay Area of San Francisco. You will need to work onsite Monday - Friday. We offer paid relocation.

Role Overview

As a Site Reliability Engineer with Fractal, you will be dedicated to ensuring the highest system availability and performance levels. This role involves comprehensive monitoring, addressing complex technical issues, automating solutions to recurring problems, and contributing to developing resilient system architectures and deployment strategies. You will work closely with our Services and Engineering teams, playing a crucial role in optimizing our platforms and infrastructures.

Responsibilities

  • Ensure maximum uptime and system availability to meet or exceed functional and performance SLAs.
  • Implement thorough end-to-end monitoring and alerting on all critical components to ensure quick detection and response.
  • Tackle complex challenges affecting critical services, focusing on automating problem resolution to prevent future occurrences.
  • Drive the development of innovative designs, architectures, standards, and methodologies to support and enhance our platform.
  • Lead in scripting and automation efforts, aiming to refine system updates and upgrade processes.
  • Design and configure essential infrastructure, tools, and frameworks to enhance the deployment lifecycle.
  • Collaborate effectively with cross-functional teams within Services and Engineering.
  • Qualifications

  • Have interest and ability to become certified on the end client AI platform. (We will provide all the necessary training and support)
  • Bachelor’s or master’s degree in computer science, a related field, or equivalent professional experience.
  • Minimum of 10 years of relevant experience.
  • Proven experience in deploying, managing, and optimizing scalable, fault-tolerant Linux / Kubernetes / JVM infrastructure across various cloud platforms like AWS, GCP, and Azure.
  • Deep expertise in Linux Operating Systems, Networking principles, and Database management.
  • Practical experience with Cassandra or similar NoSQL technologies.
  • Proficiency with major cloud services providers, notably AWS, Azure, and GCP.
  • Familiarity with configuration management tools such as Ansible or Terraform.
  • Proficiency in programming languages like Ruby or Python, particularly for system automation and monitoring.
  • Strong problem-solving abilities, critical thinking skills, and effective communication capabilities.
  • Prior experience in a DevOps or system administration role, ideally supporting commercial SaaS solutions.
  • Pay :

    The wage range for this role takes into account the wide range of factors that are considered in making compensation decisions, including but not limited to skill sets; experience and training; licensure and certifications; and other business and organizational needs. A reasonable estimate of the current range is : $110,000 - $160,000. In addition, you may be eligible for a discretionary bonus for the current performance period.

    As a full-time employee of the company or as an hourly employee working more than 30 hours per week, you will be eligible to participate in the health, dental, vision, life insurance, and disability plans in accordance with the plan documents, which may be amended from time to time. You will be eligible for benefits on the first day of employment with the Company. In addition, you are eligible to participate in the Company 401(k) Plan after 30 days of employment, in accordance with the applicable plan terms. The Company provides for 11 paid holidays and 12 weeks of Parental Leave. We also follow a “free time” PTO policy, allowing you the flexibility to take the time needed for either sick time or vacation.

    Fractal provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.

    Seniority level

  • Mid-Senior level
  • Employment type

  • Full-time
  • Job function

  • Information Technology, Consulting, and Engineering
  • Industries

  • Technology, Information and Media, IT Services and IT Consulting, and Business Consulting and Services
  • #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Engineer • San Francisco, CA, United States

    Related jobs
    Site Reliability Engineer

    Site Reliability Engineer

    ConductorOne • San Francisco, CA, United States
    Full-time
    Shape the future of identity with the highest-caliber team.If you’re amazing at what you do and want to solve big challenges in identity and security, come on board. Identity is how companies are be...Show more
    Last updated: 11 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Fortinet • Sunnyvale, CA, United States
    Full-time
    At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess...Show more
    Last updated: 3 days ago • Promoted
    Site Reliability Engineer II

    Site Reliability Engineer II

    VirtualVocations • San Jose, California, United States
    Full-time
    A company is looking for a Site Reliability Engineer II- Process Automation.Key Responsibilities Optimize and automate incident and change management processes to enhance system efficiency and re...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer I

    Site Reliability Engineer I

    prosper.com • San Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...Show more
    Last updated: 6 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Bits to Atoms • San Francisco, CA, United States
    Full-time
    Site Reliability Engineer (SRE).You’ll work at the intersection of infrastructure, AI / ML systems, and mission-critical physical operations. You’ll collaborate directly with engineering, AI, and oper...Show more
    Last updated: 26 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PsiQuantum • Palo Alto, CA, United States
    Full-time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer Lead

    Site Reliability Engineer Lead

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Site Reliability Engineer, Team Lead.Key Responsibilities Ensure 24x7 availability of production application systems and drive operational efficiency initiatives Ident...Show more
    Last updated: 2 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Redwood Materials, Inc. • San Francisco, CA, United States
    Full-time
    Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling—keeping critical minerals in circulation and driving the energy transition.Founded in 2...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Sigmaways Inc • San Francisco, CA, United States
    Full-time
    As a Site reliability engineer, you will partner with development and IT teams to implement CI / CD pipelines, develop automation and monitoring solutions to ensure our platforms are secure, scalable...Show more
    Last updated: 2 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WorkOS • San Francisco, CA, United States
    Full-time
    About WorkOS 🚀 WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. We’re a fully distributed team with ...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer I

    Site Reliability Engineer I

    Prosper Marketplace • San Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...Show more
    Last updated: 2 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Alchemy • San Francisco, CA, United States
    Full-time
    Our mission is to bring web3 to a billion people, by providing builders with the tools they need to build exceptional onchain products. Alchemy is the only complete developer platform that offers th...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Together AI • San Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a soft...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer - Technical Lead

    Site Reliability Engineer - Technical Lead

    ZipRecruiter • San Francisco, CA, United States
    Full-time
    Veryon is a leading software and technology company that enables aviation teams around the world to improve efficiency and safety. Our products maximize uptime for aircraft maintenance teams through...Show more
    Last updated: 19 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Redwood Materials • San Francisco, CA, United States
    Full-time
    Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling — keeping critical minerals in circulation and driving the energy transition.Founded in...Show more
    Last updated: 28 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a Site Reliability Engineer to join a Cloud Services team in a remote role.Key Responsibilities Serve as a cloud SME for clients, providing expertise in design, architect...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    VirtualVocations • San Francisco, California, United States
    Full-time
    A company is looking for a Senior Site Reliability Engineer.Key Responsibilities Design, develop, and implement software to enhance system availability, scalability, latency, and efficiency Lead...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer Team Lead

    Site Reliability Engineer Team Lead

    VirtualVocations • Concord, California, United States
    Full-time
    A company is looking for a Site Reliability Engineer, Team Lead.Key Responsibilities Ensure 24x7 availability of production application systems Drive initiatives to improve operational efficienc...Show more
    Last updated: 2 days ago • Promoted