Software Engineer - E5 (Kubernetes)

Whatfix - San Jose, California, United States
30+ days ago
Job type
  • Full-time
Job description

Who are we?

Founded in 2014 by Khadim Batti and Vara Kumar, Whatfix is a leading global B2B SaaS provider and the largest pure-play enterprise digital adoption platform (DAP). Whatfix empowers companies to maximize the ROI of their digital investments across the application lifecycle, from ideation to training to the deployment of software, driving user productivity, ensuring process compliance, and improving the user experience of internal and customer-facing applications.

Spearheading the category with serial innovation and unmatched customer-centricity, Whatfix is the only DAP innovating beyond the category, positioning itself as a comprehensive suite for GenAI-powered digital adoption, analytics, and application simulation.

The Whatfix product suite consists of three products: DAP, Product Analytics, and Mirror. This suite helps businesses accelerate ROI on digital investments by streamlining application deployment across the application lifecycle.

Whatfix has seven offices across the US, India, the UK, Germany, Singapore, and Australia, and a presence in 40+ countries.

Customers : 700+ enterprise customers, including over 80 Fortune 500 companies such as Shell, Microsoft, Schneider Electric, and UPS Supply Chain Solutions.

Investors : Raised a total of ~$270 million, most recently a Series E round of $125 million led by Warburg Pincus, with participation from existing investor SoftBank Vision Fund 2. Other investors include Cisco Investments, Eight Roads Ventures (a division of Fidelity Investments), Dragoneer Investments, Peak XV Partners, and Stellaris Venture Partners.

  • With over 45% YoY sustainable annual recurring revenue (ARR) growth, Whatfix is among the “Top 50 Indian Software Companies” as per G2 Best Software Awards.
  • Recognized as a “Leader” in the digital adoption platforms (DAP) category for the past 4+ years by leading analyst firms like Gartner, Forrester, IDC, and Everest Group.
  • The only vendor recognized as a Customers’ Choice in the 2024 Gartner® Voice of the Customer for Digital Adoption Platforms, Whatfix has once again earned the Customers’ Choice distinction in 2025. We also hold a star rating of 4.6 on G2 Crowd and 4.5 on Gartner Peer Insights, along with a high CSAT of 99.8%.
  • Highest-Ranking DAP on 2023 Deloitte Technology Fast 500™ North America for Fourth Consecutive Year
  • Won the Silver for Stevie's Employer of the Year 2023 – Computer Software category and was also recognized as a Great Place to Work 2022-2023
  • Only DAP to be among the top 35% of companies worldwide in sustainability excellence with EcoVadis Bronze Medal
  • On the G2 peer review platform, Whatfix has received 77 Leader badges across all market segments, including Small, Medium, and Enterprise, in 2024, among numerous other industry recognitions.

Position Overview : We are looking for a highly skilled and experienced Senior Software Engineer (E5) to join our Site Reliability Engineering team and take end‑to‑end ownership of large, business‑critical features. You’ll design, build, ship, and operate reliable, scalable services; break complex work into actionable tasks for yourself and other engineers; set the technical bar through thoughtful design and rigorous reviews; and mentor teammates while partnering with product, platform, and customer‑facing groups to keep our systems fast, observable, and always‑on.

Responsibilities :

Scope & Impact :

This role is critical to enhancing the reliability, availability, and overall resilience of Whatfix’s software products. The role will own these non-functional areas and build automated mechanisms to target gaps in them. These mechanisms should scale to the point where other engineering teams can build their own pipelines to ensure reliability for the services they own. The role should be able to build a framework that democratizes the approach to enhancing the observability, recoverability and self-healing capabilities of the products in the Whatfix ecosystem. This framework should also provide visibility to other engineering systems on the performance of their microservices.

Ownership :

  • Designs and ships scalable platform code that bakes in reliability, fault‑tolerance and self‑healing for all Whatfix products.
  • Owns, designs and develops frameworks, processes and architecture that enhance the availability and reliability of the system, eliminating or significantly reducing manual effort (e.g., through self-healing and auto-scaling systems, and platformization).
  • Serves as a first responder for critical software issues within the team’s domain.
  • Prioritizes and takes ownership of unowned or complex tasks that enable the team to move faster.
  • Ensures that customer issues are not just fixed but that effective long-term solutions are implemented to prevent recurrence.

Technical Execution :

  • Own task breakdown from stories / features, ensuring each task is feasible within five days
  • Detail out design documents for the features being worked on
  • Implement well-tested and documented code based on engineering standards and best practices
  • Own and support the features owned by the team to ensure high availability and compliance
  • Review designs and code written by peers as well as other teams from perspectives of testability, maintainability, reliability, security and cost.
  • Work with other teams to enhance developer experience by improving developer tools, and suggest and implement AI workflows in the areas of observability, availability and reliability
  • Demonstrate expertise in one or more technical areas and contribute to the overall technical direction of the team.

Skillset :

Observability and Alertability of Infrastructure :

The candidate should have proven experience in :

  • Increasing the observability of Software Systems
  • Managing infrastructure in an automated manner (utilizing automated pipelines for CI / CD and frameworks for IaC)
  • Identifying gaps in Monitoring and Observability and fixing such gaps in a sustainable, scalable and automated manner.
  • Defining SLAs for systems, continuously tracking them, and working to improve them
  • Resilience Engineering Practices : driving blameless post‑incident RCAs and converting findings into code, tests and platform improvements

Collaboration & Guidance :

The candidate should have experience in :

  • Working with other teams to help enhance the observability and recoverability (such as through self-healing) of those teams’ features
  • Conducting training sessions or workshops on observability and reliability practices.
  • Providing guidance on best practices for monitoring, alerting, and logging.

Required Technical Skills and Qualifications :

Candidate should have experience in the following technologies :

  • Strong experience in Java.
  • Working experience in Kubernetes, Helm, ArgoCD
  • Ability to work with Java and Python based applications and identify gaps that could result in failures.
  • Familiarity with CI / CD pipelines and infrastructure as code (IaC) practices.

Preferred Skills :

  • Familiarity with log aggregation tools (e.g., ELK Stack).
  • Knowledge of Chaos Engineering principles.

Soft Skills :

  • Strong problem-solving and troubleshooting abilities.
  • Excellent communication and collaboration skills.
  • Ability to mentor and guide cross-functional teams.

Perks / Benefits

  • Uncapped incentives
  • Equity plan
  • Mac shop, work with the newest technologies
  • Unlimited PTO policy
  • Paid maternity / paternity leave
  • Monthly cell phone stipend
  • Daily paid Uber Eats lunches
  • Medical, Dental, and Vision coverage (Whatfix pays 80% of the premium for individuals and their families; for the HSA, Whatfix contributes $1,000 for individuals and $2,000 for a family)
  • Team and company outings
  • Learning and Development benefits

At Whatfix, we value collaboration, innovation, and human connection. We believe that working together in the office five days a week fosters open communication, strengthens our community, and drives innovation, helping us achieve our goals more effectively.

To facilitate global collaboration, our US teams start and end early, while our India teams start and end late. US teams do not have any evening meetings. Relocation and sponsorship are offered.

We strive to live and breathe our Cultural Principles and encourage employees to demonstrate these core values - Customer First; Empathy; Transparency; Fail Fast and scale Fast; No Hierarchies for Communication; Deep Dive and innovate; Trust is the foundation; and Do it as you own it.

Whatfix is an Equal Opportunity Employer and an E-Verify participant. All activities must comply with our Equal Opportunity Laws, ADA, and other regulations, as appropriate.

We are an equal opportunity employer and value diverse people because of, and not in spite of, their differences. We do not discriminate on the basis of race, religion, color, national origin, ethnicity, gender, sexual orientation, age, marital status, veteran status, or disability status.

The salary range for this position is 130K-170K OTE. Compensation will be determined by factors such as level, job-related knowledge, skills, and experience.

Due to our company's global nature and our hiring committee's span of different time zones, the interviews for this role will be recorded for those not in attendance to review.
