Talent.com
Site Reliability Engineer

Site Reliability Engineer

TP-Link Systems Inc.Irvine, CA, US
30+ days ago
Job type
  • Full-time
Job description

Job Description

Job Description

At the forefront of the future of connected living, TP-Link's Systems Inc. R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generation networking, IoT smart home products, and software services. Our team of passionate engineers are constantly innovating, engineering solutions that transform the end user experience with simpler, smarter, and more reliable connectivity.

We're looking for a passionate and experienced Site Reliability Engineer to join our team and play a crucial role in ensuring our cloud platform's security, Reliability, scalability, and operational excellence.

About Us :

Headquartered in the United States, TP-Link Systems Inc. is a global provider of reliable networking devices and smart home products, consistently ranked as the world’s top provider of Wi-Fi devices. The company is committed to delivering innovative products that enhance people’s lives through faster, more reliable connectivity. With a commitment to excellence, TP-Link serves customers in over 170 countries and continues to grow its global footprint.

We believe technology changes the world for the better! At TP-Link Systems Inc, we are committed to crafting dependable, high-performance products to connect users worldwide with the wonders of technology.

Embracing professionalism, innovation, excellence, and simplicity, we aim to assist our clients in achieving remarkable global performance and enable consumers to enjoy a seamless, effortless lifestyle.

Responsibilities :

  • Assist in implementing and operating Microservices on Kubernetes cloud-based platforms.
  • Collaborate with the Cloud Technical Development and DevOps teams to deploy services to the Multi-Cloud Platform.
  • Conduct Load Tests and Chaos Tests to ensure the scalability and reliability of microservices.
  • Build observability for Microservices and cloud platforms like AWS, OCI, Azure, and GCP.
  • Contribute to writing and executing disaster recovery plans in collaboration with the Development and DevOps teams.
  • Help analyze and resolve production risks caused by insufficient resources, such as node groups, CPU, memory, HPA scheduling, JVM pre-warming, etc.
  • Write and maintain scripts for automation using languages like Python, Go, or Bash.
  • Assist in defining and maintaining the KPIs (SLA / SLO / SLI) for all cloud microservices with development teams to better understand the business.
  • Create and maintain technical documentation, including architecture diagrams, design documents, and standard operating procedures.
  • Ensure adherence to security and compliance standards, including ISO27001, SOC2, and GDPR.
  • Participate in incident response efforts to troubleshoot and resolve production issues quickly.
  • Conduct post-incident analysis to identify root causes and potential workarounds / solutions.
  • Contribute to product / technology selection, including implementation of POCs.
  • Be adaptable to change and evolving processes and tools.
  • Participate in mentoring and training less senior members of the team.
  • Be part of the on-call rotation and provide support after work hours and on weekends.
  • Other duties as assigned.

Requirements

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 1-3 years of experience as a Site Reliability Engineer or in a related role.
  • Proficiency in programming and scripting languages like Java, Python, Bash, or PowerShell.
  • Hands-on experience in SRE, DevOps, cloud operations, and cloud security best practices.
  • Basic knowledge of security technologies, including Identity and access management, Network security, Application security, and Data protection.
  • Strong problem-solving and analytical skills, with the ability to work independently and as part of a team.
  • Experience in developing and maintaining technical documentation and implementing compliance requirements.
  • Additional Skills (Preferred) :

  • Relevant cloud certifications include AWS Solutions Architect, Azure Solutions Architect Expert, or GCP Professional Cloud Architect.
  • Experience with container orchestration technologies (e.g., Kubernetes)
  • Benefits

    Salary range : $100,000 - $140,000

  • Free snacks and drinks, and provided lunch on Fridays
  • Fully paid medical, dental, and vision insurance (partial coverage for dependents)
  • Contributions to 401k funds
  • Bi-annual reviews, and annual pay increases
  • Health and wellness benefits, including free gym membership
  • Quarterly team-building events
  • At TP-Link Systems Inc., we are continually searching for ambitious individuals who are passionate about their work. We believe that diversity fuels innovation, collaboration, and drives our entrepreneurial spirit. As a global company, we highly value diverse perspectives and are committed to cultivating an environment where all voices are heard, respected, and valued. We are dedicated to providing equal employment opportunities to all employees and applicants, and we prohibit discrimination and harassment of any kind based on race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. Beyond compliance, we strive to create a supportive and growth-oriented workplace for everyone. If you share our passion and connection to this mission, we welcome you to apply and join us in building a vibrant and inclusive team at TP-Link Systems Inc.

    Please, no third-party agency inquiries, and we are unable to offer visa sponsorships at this time.

    Create a job alert for this search

    Site Reliability Engineer • Irvine, CA, US

    Related jobs
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    TP-Link North America, Inc.Irvine, CA, United States
    Full-time
    At the forefront of the future of connected living, TP-Link's Systems Inc.R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generation networki...Show moreLast updated: 30+ days ago
    • Promoted
    System Analyst

    System Analyst

    BOOZ, ALLEN & HAMILTON, INC.Camp Pendleton North, CA, US
    Full-time +1
    Do you want to use your creativity, problem-solving, and storytelling skills to improve organizational mission performance in global defense? You understand there is no single or easy solution to p...Show moreLast updated: 3 days ago
    • Promoted
    Senior System Reliability Engineer

    Senior System Reliability Engineer

    Ford Motor CompanyIrvine, CA, United States
    Full-time
    We are the movers of the world and the makers of the future.We get up every day, roll up our sleeves and build a better world together. At Ford, we're all a part of something bigger than ourselve...Show moreLast updated: 3 days ago
    • Promoted
    Reliability Engineer, Advanced Effects

    Reliability Engineer, Advanced Effects

    Anduril IndustriesCosta Mesa, CA, United States
    Full-time
    Anduril Industries is a defense technology company with a mission to transform U.By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the def...Show moreLast updated: 3 days ago
    • Promoted
    Senior Site Reliability Engineer (Remote)

    Senior Site Reliability Engineer (Remote)

    ExperianCosta Mesa, CA, United States
    Remote
    Full-time
    Experian is a global data and technology company, powering opportunities for people and businesses around the world.We help to redefine lending practices, uncover and prevent fraud, simplify health...Show moreLast updated: 30+ days ago
    • Promoted
    DevOps / Site Reliability Engineer (SRE) US

    DevOps / Site Reliability Engineer (SRE) US

    ChannelwillPasadena, CA, United States
    Full-time
    Pasadena, California (Remote or Hybrid).SaaS company based in Pasadena, California, providing innovative post-purchase solutions for eCommerce brands. Our products help merchants improve customer ex...Show moreLast updated: 3 days ago
    • Promoted
    Senior Site Reliability Engineer - Developer, Connected Warfare

    Senior Site Reliability Engineer - Developer, Connected Warfare

    Anduril IndustriesCosta Mesa, CA, United States
    Full-time
    Anduril Industries is a defense technology company with a mission to transform U.By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the def...Show moreLast updated: 3 days ago
    • Promoted
    Sr. Site Reliability Engineer, Infrastructure Engineering

    Sr. Site Reliability Engineer, Infrastructure Engineering

    AmazonIrvine, CA, United States
    Full-time
    Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their ...Show moreLast updated: 3 days ago
    • Promoted
    Sr Site Reliability Engineer (Python)

    Sr Site Reliability Engineer (Python)

    Ledgent TechIrvine, CA, United States
    Full-time
    No Corp-to-Corp, No 3rd party firms.Hiring on behalf of our client for Sr Site Reliability Engineer (SRE).Location : 100% onsite in Irvine, CA. Salary Range : $140,000 to $180,000.Partnered with a cli...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer (Temp to Perm)

    Site Reliability Engineer (Temp to Perm)

    LeidosVista, CA, United States
    Temporary
    This position will require up to 75% travel • • • • •.Come put your Site Reliability Engineer (SRE) skills into action! Leidos has openings for talented SREs to join our team and work real-time hands-o...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    Diverse LynxNewport Coast, CA, United States
    Full-time
    DevOps Engineer With Strong Site Reliability Engineering Capabilities.Experienced DevOps Engineers with strong Site Reliability Engineering (SRE) capabilities who can work independently, think crit...Show moreLast updated: 3 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    TP-Link Systems Inc.Irvine, CA, US
    Full-time
    At the forefront of the future of connected living, TP-Link's Systems Inc.R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generat...Show moreLast updated: 30+ days ago
    • Promoted
    Stationary Engineer

    Stationary Engineer

    Facility Services Management Inc.Camp Pendleton, CA, US
    Full-time
    Now Hiring : Stationary Engineer.Facility Services Management, Inc.Full-Time | On-Site | Federal Contract.Are you a seasoned engineer with the expertise to keep critical systems running flawlessly?....Show moreLast updated: 9 days ago
    • Promoted
    Combat Engineer

    Combat Engineer

    United States ArmyRiverside, CA, US
    Temporary
    Combat Engineer Job Overview : Jump start your career in engineering with our world class training program earning up to 45 advanced certifications. As a Combat Engineer, you will gain construction a...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer (Temp to Perm)

    Site Reliability Engineer (Temp to Perm)

    Leidos IncVista, CA, United States
    Temporary
    This position will require up to 75% travel • • •.Come put your Site Reliability Engineer (SRE) skills into action! Leidos has openings for talented SREs to join our team and work real-time hands-on ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TP-Link North America, Inc.Irvine, CA, United States
    Full-time
    At the forefront of the future of connected living, TP-Link's Systems Inc.R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generation networki...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer (SRE)

    Senior Site Reliability Engineer (SRE)

    StubHubAliso Viejo, CA, United States
    Full-time
    StubHub is on a mission to redefine the live event experience on a global scale.Whether someone is looking to attend their first event or their hundredth, we're here to delight them all the way fro...Show moreLast updated: 3 days ago
    • Promoted
    Reliability Engineer in Irvine

    Reliability Engineer in Irvine

    Energy Jobline ZRIrvine, CA, United States
    Full-time
    Energy Jobline is the largest and fastest growing global Energy Job Board and Energy Hub.We have an audience reach of over 7 million energy professionals, 400,000+ monthly advertised global energy ...Show moreLast updated: 3 days ago