Talent.com
Senior Site Reliability Engineer
No longer accepting applications
Shein • Bellevue, Washington, United States
30+ days ago
Job type
  • Full-time
Job description

About SHEIN

SHEIN is a global online fashion and lifestyle retailer, offering SHEIN branded apparel and products from a global network of vendors, all at affordable prices. Headquartered in Singapore, with more than 15,000 employees operating from offices around the world, SHEIN is committed to making the beauty of fashion accessible to all, promoting its industry-leading, on-demand production methodology, for a smarter, future-ready industry.

Position Summary

We are looking for an experienced Senior Site Reliability Engineer (official title: Senior Site Reliability Engineer I) to join our Site Reliability Engineering team. Site Reliability Engineers at SHEIN are hybrid software/systems engineers whose overarching goal is to ensure that production services are “always on.” They strive to build the most reliable and performant systems on the planet.

SREs work closely with cross-functional teams to ensure we have the right set of tools to generate, collect, analyze, visualize and alert on operational data; knowing exactly what happens across the ecosystem, identifying problems before they occur and addressing them as quickly as possible. They are also responsible for improving operational efficiency, utilization and system resiliency of the platform. They own critical open-source software that our platform relies on and they are core participants in every significant engineering effort underway in the platform.

Additionally, SREs are tasked with improving the operability of the platform to reduce both the number of incidents and MTTR. To accomplish this, the team combines software development, networking and systems engineering expertise with a desire to be challenged by issues of scale and complexity, making our service better for our customers.

Job Responsibilities

  • Participate in an on-call rotation to ensure 24/7/365 availability of SHEIN's production systems.
  • Supervise capacity and utilization, and work closely with cross-functional teams to orchestrate scaling services up and down.
  • Own and operate critical open-source services like Elasticsearch, Kafka, RabbitMQ, Redis.
  • Build tools and design processes that help improve observability and system resiliency of the platform.
  • Triage site availability incidents and proactively work towards reducing MTTR for customer-impacting incidents.
  • Partner with service owners to implement service level metrics and service level objectives that act as service level health indicators.
  • Establish design patterns for monitoring, benchmarking and deploying new features for the backend services.
  • Develop and maintain technical documentation, network diagrams, runbooks, and procedures.
  • Drive initiatives to evolve our current platform to increase efficiency and keep it in line with current standards and best practices.
  • Respond to production incidents, leveraging experience in software development, systems engineering, and networking to prevent recurring issues.
  • Provide relief and sustainable resolution to issues within our infrastructure.
  • Drive initiatives with partner teams to improve the reliability and performance of the infrastructure through improved system design.
  • Join a culture of intolerance for manual activity, which results in a highly automated environment delivering scalable solutions.
  • Drive efficiencies through software improvement and root cause analysis resulting in service delivery, maturity, and scalability.

Job Requirements

  • Bachelor's degree in Computer Science or Information Systems or equivalent technical discipline.
  • 5+ years of working experience in an enterprise 24/7 production environment supporting mission-critical, real-time, high-traffic applications, especially in cloud environments.
  • Systematic problem-solving approach, combined with a sense of ownership and drive.
  • Full-stack debugging and performance optimization ability, including knowledge of cloud systems (load balancing, caching, content distribution, etc.), continuous integration/build systems, Java, and SQL and NoSQL databases.
  • Track record of monitoring and analyzing system performance and isolating issues or bottlenecks that could impact reliability, performance and scalability.
  • Strong experience with observability tools such as Grafana, Prometheus, Zabbix, etc.
  • Experience in a scripting or programming language such as Python, Go, etc.
  • Familiar with container technology, such as Docker, Kubernetes, Mesos, etc.
  • Strong verbal and written communication skills; able to work effectively with geographically remote teams.
  • Experience with one or more OSS technologies like Elasticsearch, Kafka and Redis.
  • Proficient with SRE concepts and practices, including advocating for the elimination of toil and driving simple solutions.

Nice to Have

  • Experience operating and maintaining big data components (Hadoop, YARN, HBase, Hive, Spark, etc.).
  • Solid understanding of Linux systems.

Benefits and Perks

  • Bonus and RSU eligible.
  • Healthcare (medical, dental, vision, prescription drugs)
  • Health Savings Account with Employer Funding
  • Flexible Spending Accounts (Healthcare and Dependent care)
  • Company-Paid Basic Life / AD&D insurance
  • Company-Paid Short-Term and Long-Term Disability
  • Voluntary Benefit Offerings (Voluntary Life / AD&D, Hospital Indemnity, Critical Illness, and Accident)
  • Employee Assistance Program
  • Business Travel Accident Insurance
  • 401(k) Savings Plan with discretionary company match and access to a financial advisor
  • Vacation, paid holidays, floating holiday and sick days
  • Employee discounts
  • Free weekly catered lunch
  • Dog-friendly office (available at select locations)
  • Free gym access (available at select locations)
  • Free swag giveaways
  • Annual Holiday Party
  • Invitations to pop-ups and other company events
  • Complimentary daily office snacks and beverages

Pay Range

$107,600 - $180,200 USD
