Talent.com
Site Reliability Engineer

Site Reliability Engineer

FortinetSunnyvale, California, United States
30+ days ago
Job type
  • Full-time
Job description

At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers.

Our team members enjoy solving complex problems, and obsess over getting the details right. We love what we do and are proud of our work to secure clouds and container environments for thousands of b2b customers worldwide.

Our team is growing, and we are looking for engineers with passion for automation. You will help support the Lacework platform and play a key role in building, operating, and improving the Lacework Cloud Security Platform, the world's best real-time cloud-native threat detection system.

Our team develops and supports the infrastructure layers spanning our cloud accounts, network / connectivity, workload management, observability, and storage services. We build tooling to perform automated operations in order to scale the Lacework infrastructure and service. To be successful you will design, define, develop, deploy and operate internal tooling, APIs, and frameworks which streamline our workflows and automate our infrastructure.

The Role :

  • Automate as much as reasonable to significantly improve operational efficiency of the Lacework platform
  • Design, build and improve our infrastructure to enhance service scalability, resiliency, and efficiency across the company.
  • Identify mission-critical problems and solve them via automation, tooling, communication, and informed design.
  • Build and improve monitoring and instrumentation to predict future scalability or failure risks and solve them before they manifest into customer-facing issues.
  • Facilitate company-wide visibility into key metrics, SLAs, and milestones so that scale and resiliency are a part of every conversation.
  • Develop best practices alongside engineering / operations teams to improve the scalability and reliability of internal processes.
  • Participate in an on-call rotation.

Minimum Qualifications :

  • 3 years of Devops / SRE experience with production systems (depending on level)
  • Strong development and automation skills.
  • Extensive experience with Infrastructure as Code (Terraform, etc), as well as supporting tooling (Atlantis, ArgoCD, etc)
  • Extensive experience with Kubernetes and supporting tooling (Helm, operators, etc)
  • Extensive experience with a variety of cloud managed services and providers

  • AWS : EKS, EC2, S3, RDS, Secrets Manager, etc.
  • Experience building production quality cloud infrastructure that enables reliable and rapid deployment of microservices with effective monitoring and built in high availability and / or fault tolerance.
  • Strong passion for using automation to create simple repeatable dev and ops patterns that ensures a stable, reliable experience for customers.
  • Strong cross-team communication skills.
  • Experience with the building blocks of large-scale systems including load balancing, distributed / cloud computing, containers, instrumentation, and monitoring.
  • Knowledge of cloud networking, including VPC configuration and cross-cloud connectivity.
  • Familiarity with one or more programming languages (Python, Golang, etc.).
  • Preferred Qualifications :

  • Experience with monitoring and observability systems and tools (Prometheus, Grafana, New Relic, DataDog, etc.)
  • Believe everything should be "as code"
  • Experience with Java application servers and JVM configuration
  • Create a job alert for this search

    Site Reliability Engineer • Sunnyvale, California, United States

    Related jobs
    • Promoted
    Senior Engineer, Site Reliability

    Senior Engineer, Site Reliability

    VirtualVocationsSan Francisco, California, United States
    Full-time
    A company is looking for a Senior Engineer in Site Reliability Engineering for Digital Banking.Key Responsibilities Ensure the reliability, availability, and performance of applications in produc...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ZapierSan Francisco, CA, United States
    Full-time
    We're humans who simply think computers should do more work.At Zapier, we’re not just making software—we’re building a platform to help millions of businesses globally scale with automation and AI....Show moreLast updated: 8 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    DTEX SystemsFremont, CA, US
    Full-time
    Quick Apply
    We are excited that you’ve taken the time to explore our business and potentially join us on this incredible journey.We are already the leader in the Insider Risk Management, but our story do...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Itlearn360Los Altos, CA, United States
    Full-time
    Site Reliability Engineer - SRE job at Descope.Descope R&D group is a skilled team of developers with a unique DNA of creativity,flexibility,anopen mindset. We are looking for a passionate SRE to jo...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer I

    Site Reliability Engineer I

    prosper.comSan Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...Show moreLast updated: less than 1 hour ago
    • Promoted
    Senior / Staff Site Reliability Engineer, Storage

    Senior / Staff Site Reliability Engineer, Storage

    FluidstackSan Francisco, CA, United States
    Full-time
    Fluidstack is building GPU supercomputers for top AI labs, governments, and enterprises.Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more. Our team is small, highly motivate...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PsiQuantumPalo Alto, CA, United States
    Full-time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    VirtualVocationsOakland, California, United States
    Full-time
    A company is looking for a Lead Site Reliability Engineer (M365).Key Responsibilities Lead the team in identifying system and service issues, and manage the deployment of new versions Oversee pr...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - Technical Lead

    Site Reliability Engineer - Technical Lead

    ZipRecruiterSan Francisco, CA, United States
    Full-time
    Veryon is a leading software and technology company that enables aviation teams around the world to improve efficiency and safety. Our products maximize uptime for aircraft maintenance teams through...Show moreLast updated: 13 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Foxconn Industrial Internet - FIISan Jose, CA, US
    Full-time +1
    Quick Apply
    Site Reliability Engineer Foxconn Industrial Internet (Fii), is a world leading professional design and manufacturing service provider of communication network equipment, cloud service equipment, p...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CanonicalSan Francisco, CA, United States
    Full-time
    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer (SRE) - grok.com & API

    Site Reliability Engineer (SRE) - grok.com & API

    Pantera CapitalPalo Alto, CA, United States
    Full-time
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...Show moreLast updated: 11 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WritemedSan Francisco, CA, United States
    Full-time
    Would you like to join one of the fastest-growing organizations with a goal of using the latest AI, GenAI, LLM, Cloud, and Digital Technologies to advance drug development and improve patient care ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Air AppsSan Francisco, CA, United States
    Full-time
    At Air Apps, we believe in thinking bigger—and moving faster.We’re a family-founded company on a mission to create the world’s first AI-powered Personal & Entrepreneurial Resource Planner (PRP), an...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer II

    Site Reliability Engineer II

    PinterestSan Francisco, CA, United States
    Full-time
    Millions of people around the world come to our platform to find creative ideas, dream about new possibilities and plan for memories that will last a lifetime. At Pinterest, we're on a mission to br...Show moreLast updated: 11 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Gridware Technologies Inc.San Francisco, CA, United States
    Full-time
    Gridware is a San Francisco-based technology company dedicated to protecting and enhancing the electrical grid.We pioneered a groundbreaking new class of grid management called active grid response...Show moreLast updated: less than 1 hour ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    BasetenSan Francisco, CA, United States
    Full-time
    Site Reliability Engineer (SRE).Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed.By uniting a...Show moreLast updated: 30+ days ago
    • Promoted
    Founding Site Reliability Engineer

    Founding Site Reliability Engineer

    Relevance AISan Francisco, CA, United States
    Full-time
    San Francisco, USA (Hybrid 3 days / week).At Relevance AI, our mission is to empower anyone to delegate work to the AI workforce. We’re building a new category of AI automation, enabling teams to crea...Show moreLast updated: 1 day ago