Talent.com
Senior Site Reliability Engineer (SRE) CloudVision as a Service (CVaaS)
Senior Site Reliability Engineer (SRE) CloudVision as a Service (CVaaS)Arista Networks, Inc. • Santa Clara, CA, United States
Senior Site Reliability Engineer (SRE) CloudVision as a Service (CVaaS)

Senior Site Reliability Engineer (SRE) CloudVision as a Service (CVaaS)

Arista Networks, Inc. • Santa Clara, CA, United States
12 hours ago
Job type
  • Full-time
Job description

Senior Site Reliability Engineer (SRE) CloudVision as a Service (CVaaS)

Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets us apart is our relentless pursuit of innovation. We leverage the latest advancements in cloud computing, artificial intelligence, and software-defined networking to provide our clients with a competitive edge in an increasingly interconnected world. Our solutions are designed to not only meet the current demands of the digital landscape but to also anticipate and adapt to future challenges.

At Arista we value the diversity of thought and perspectives that each employee brings to the table. We believe that fostering an inclusive environment, where individuals from various backgrounds and experiences feel welcome, is essential for driving creativity and innovation.

Our commitment to excellence has earned us several prestigious awards, such as Best Engineering Team, Best Company for Diversity, Compensation, and Work-Life Balance. At Arista, we take pride in our track record of success and strive to maintain the highest standards of quality and performance in everything we do.

Job Description

Who You'll Work With

SRE's at Arista combine strong software and systems engineering with a passion for operating production systems at scale. As an SRE you'll be part of the team responsible for our global service fleet.

What You'll Do As a Senior SRE, you'll be responsible for our global CloudVision service fleet. This includes :

  • Building the CI / CD lifecycle for services, from inception and design to deployment and scaling
  • Improving operational processes through automation
  • Identifying key service indicators to be used in capacity planning
  • Owning disaster recovery and management
  • Driving infrastructure and cloud-based application security design
  • Leading sustainable incident response and blameless postmortems
  • Being an active member of our globally distributed on-call team

Arista's CloudVision is an enterprise network management and streaming telemetry SaaS offering. CloudVision is deployed on Kubernetes across global regions using Spinnaker for our CI / CD pipeline. Our tech stack runs on GKE, using HBase / Hadoop as main distributed database and storage layer, ElasticSearch for powering search data, ClickHouse for fast real time queries of flow data, our own Kafka-based distributed real time stream processing layer for analytics, and TensorFlow for ML analysis. Our monitoring system is built on top of Prometheus, Grafana, Loki, and other OSS tools.

Qualifications

  • BS / MS degree in Computer Science or a relevant experience subject.
  • 5+ years software engineering experience.
  • Experience developing or managing deployments of distributed database systems or scale out applications for a SaaS environment.
  • Compensation Information The new hire base pay for this role has a salary range of $101,000 to $161,000. Arista offers different pay ranges based on work location, so that we can offer consistent and competitive pay appropriate to the market. The actual base pay offered will be based on a wide range of factors, including skills, qualifications, relevant experience, and work location. The pay range provided reflects base pay only and in addition certain roles may also be eligible for discretionary Arista bonuses and equity. Employees in Sales roles are eligible to participate in Arista's Sales Incentive Plan, which pays commissions calculated as a percentage of eligible sales. US-based employees are also entitled to benefits including medical, dental, vision, wellbeing, tax savings and income protection. The recruiting team can share more details during the hiring process specific to the role and location.

    #LI-GR1

    Additional Information

    Arista Networks is an equal opportunity employer. Arista makes all hiring and employment-related decisions in a non-discriminatory manner without regard to race, color, religion, sex, sexual orientation, gender identity, national origin or any other factor determined to be unlawful under applicable federal, state, or law law. All your information will be kept confidential according to EEO guidelines.

    Create a job alert for this search

    Site Reliability Engineer Sre • Santa Clara, CA, United States

    Related jobs
    Firmware Test Lead – Hardware Systems (NPI)

    Firmware Test Lead – Hardware Systems (NPI)

    OSI Engineering • Cupertino, CA, US
    Full-time
    Summary : Are you ready to lead in a fast-paced, innovation-driven environment? We’re seeking a Test Design Lead with strong firmware experience to join a top consumer electronics company in Cuperti...Show more
    Last updated: 30+ days ago • Promoted
    Staff Software Engineer - SRE, Backend (Reliability Engineering)

    Staff Software Engineer - SRE, Backend (Reliability Engineering)

    Affirm • Palo Alto, CA, United States
    Full-time
    Staff Software Engineer - SRE, Backend (Reliability Engineering).Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without ...Show more
    Last updated: 1 day ago • Promoted
    Cloud Service Reliability Engineer in San Francisco

    Cloud Service Reliability Engineer in San Francisco

    Energy Jobline ZR • San Francisco, CA, United States
    Full-time
    Energy Jobline is the largest and fastest growing global Energy Job Board and Energy Hub.We have an audience reach of over 7 million energy professionals, 400,000+ monthly advertised global energy ...Show more
    Last updated: 1 day ago • Promoted
    AWS Integration Lead

    AWS Integration Lead

    Reveille Technologies,Inc • Fremont, CA, United States
    Full-time
    This is to support the migration activities between.Medical Super Search portal and to deliver the below with good collaboration with the customer stakeholders. Provide technical expertise in softwa...Show more
    Last updated: 1 day ago • Promoted
    BMS Systems Integrator

    BMS Systems Integrator

    University of California - Santa Cruz • Santa Cruz, CA, United States
    Full-time +1
    For full consideration, applicants should attach their resume and cover letter when applying for a job opening.For guidance related to the application process or if you are experiencing difficultie...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer, BCM - DGX Cloud

    Senior Site Reliability Engineer, BCM - DGX Cloud

    NVIDIA • Santa Clara, CA, United States
    Full-time
    NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for over 25 years.It’s a unique legacy of innovation fueled by great technology—and dynamic people.Today, we’re ta...Show more
    Last updated: 1 day ago • Promoted
    Lead Software Engineer- Middleware Reliability Engineering

    Lead Software Engineer- Middleware Reliability Engineering

    Visa • Foster City, CA, United States
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show more
    Last updated: 30+ days ago • Promoted
    HPC Technical Systems Support Analyst - DoE Q or TS clearance

    HPC Technical Systems Support Analyst - DoE Q or TS clearance

    Jobot • Livermore, CA, US
    Full-time
    This Jobot Job is hosted by : Kurt Holzmuller.Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. Salary : $130,000 - $180,000 per year.We are a leading global...Show more
    Last updated: 30+ days ago • Promoted
    Integration Solutions Architect, eero

    Integration Solutions Architect, eero

    Amazon • San Francisco, CA, United States
    Full-time
    At eero, our mission is to serve as the central nervous system of the home.While we began by revolutionizing home WiFi, we aim to create comprehensive solutions that serve both wireless and wired c...Show more
    Last updated: 30+ days ago • Promoted
    Cloud Storage Engineer

    Cloud Storage Engineer

    VirtualVocations • San Francisco, California, United States
    Full-time
    A company is looking for a Cloud Storage Engineer who specializes in designing, implementing, and managing cloud-based and hybrid storage solutions. Key Responsibilities Design and implement scala...Show more
    Last updated: 30+ days ago • Promoted
    L3 Application Support Engineer

    L3 Application Support Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Senior Application Support Engineer.Key Responsibilities Troubleshoot data-platform workflows and investigate incidents related to Snowflake and Airflow Manage high-se...Show more
    Last updated: 5 days ago • Promoted
    Slurm Administration & Systems Architecture

    Slurm Administration & Systems Architecture

    Midjourney • Hayward, CA, US
    Full-time
    We are seeking a highly skilled HPC / AI / ML Cluster Engineer to support the design, deployment, and ongoing operations of large-scale HPC environments powered by Slurm. This role centers on cluster en...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer, Tesla Energy Services Platform

    Site Reliability Engineer, Tesla Energy Services Platform

    Tesla • Palo Alto, CA, United States
    Full-time
    Tesla is looking for a Site Reliability Engineer to build a software and hardware platform for Tesla Megapack service and local data. Our team is defining the future of edge data and infrastructure ...Show more
    Last updated: 1 day ago • Promoted
    Senior System Validation Engineer - SerDes / Ethernet (PAM4)

    Senior System Validation Engineer - SerDes / Ethernet (PAM4)

    Astera Labs • San Jose, CA, United States
    Full-time
    Astera Labs (NASDAQ : ALAB) provides rack-scale AI infrastructure through purpose-built connectivity solutions grounded in open standards. By collaborating with hyperscalers and ecosystem partners, A...Show more
    Last updated: 30+ days ago • Promoted
    Sr. Network Validation Engineer

    Sr. Network Validation Engineer

    Supermicro • San Jose, CA, United States
    Full-time
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE) / DevOps Engineer

    Site Reliability Engineer (SRE) / DevOps Engineer

    Diverse Lynx • Sunnyvale, CA, United States
    Full-time
    BS / MS in Computer Science or Equivalent • At least 8+ years in a Reliability Engineering, DevOps or infrastructure focused role • Advanced experience with programming languages (Python, Java) • Passio...Show more
    Last updated: 1 day ago • Promoted
    Distinguished Engineer, Regulated Cloud Environments

    Distinguished Engineer, Regulated Cloud Environments

    NVIDIA • Santa Clara, CA, United States
    Full-time
    NVIDIA is a key player in the emerging space of sovereign computing, delivering in high security environments, and providing solutions for confidential computing. We’re a team of innovative engineer...Show more
    Last updated: 1 day ago • Promoted
    Senior Site Reliability Engineer - DGX Cloud

    Senior Site Reliability Engineer - DGX Cloud

    NVIDIA • Santa Clara, CA, United States
    Full-time
    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of...Show more
    Last updated: 1 day ago • Promoted