Talent.com
Sr. Manager - Site Reliability Engineer

Sr. Manager - Site Reliability Engineer

VisaAshburn, VA, United States
30+ days ago
Job type
  • Full-time
Job description

Company Description

Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose - to uplift everyone, everywhere by being the best way to pay and be paid.

Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa.

Job Description

Job Description and Responsibilities

Visa's Technology Organization is a dynamic community of problem solvers and innovators dedicated to redefining the future of commerce. We manage one of the world's most advanced processing networks, handling over 65,000 secure transactions per second across 80 million merchants, 15,000 financial institutions, and billions of individuals. By joining us, you will engage with complex distributed systems and address large-scale challenges in new payment flows, business and data solutions, cybersecurity, and B2C platforms.

The Opportunity :

As a Senior Manager, you should bring a well-rounded skill set that includes expertise in Site Reliability Engineering (SRE) principles and practices, cloud platforms (AWS, Azure, Google Cloud), and security protocols. You should have experience with cloud migration, automation tools, and applying Generative AI to improve operational efficiencies. Proficiency in containerization technologies (Docker, Kubernetes), observability tools, and distributed caching systems is essential.

As a Technology Manager in Visa's Operations and Infrastructure division, you will join our Global PRE team to design, enhance, and build a highly available, secure, scalable, and resilient infrastructure in an agile environment. You will collaborate with supportive and challenging colleagues daily. You will take lead roles on projects to ensure the reliability and performance of our services, platforms, RESTful APIs, container-based distributed systems, and cloud services.

Key skills for this role include team mentorship, and the ability to foster a culture of collaboration and continuous improvement. Strong communication skills are vital for effective collaboration with cross-functional teams. The Senior Manager should possess problem-solving abilities, drive innovation, and prioritize tasks efficiently. Experience in fast-paced 24x7 environments, demonstrating adaptability and confident decision-making to address the evolving needs of the team and the organization, is crucial. Experience in swiftly resolving live production issues is essential to maintain system reliability and performance.

Responsibilities :

  • Security and Safety : Ensure the security and safety of application services and platforms. Lead efforts to enhance operational practices with a focus on efficiency, security, and excellence.
  • Zero Downtime : Maintain zero downtime by swiftly addressing any issues to ensure environments are always operational. Conduct rapid root cause analysis and implement remediation in production environments after thorough testing.
  • Environment Management : Oversee all activities within the environment, including deploying new code.
  • Team Culture : Foster an inclusive, innovative, and collaborative team culture.
  • Stakeholder Partnerships : Build strong partnerships with key stakeholders, including product management, engineering, design, and operations.
  • Effective Communication : Communicate effectively with both technical and business partners to create frameworks for discussing complex topics.
  • Automation and AI : Regularly analyze the environment and promote the adoption of automation and Generative AI to stay competitive.
  • Cloud Infrastructure : Lead cloud infrastructure adoption and migration, ensuring a seamless transition with minimal downtime.
  • Problem Resolution : Run problem bridges by collaborating with different functional and technical teams, escalating issues as needed for timely resolution.
  • Information Sharing : Proactively share important context and information with relevant stakeholders.
  • By fulfilling these responsibilities, the Senior Manager will ensure the Site Reliability Engineering team delivers a robust, secure, and efficient infrastructure that supports application services and platforms, meeting the organization's evolving needs.

This is a hybrid position. Expectation of days in office will be confirmed by your Hiring Manager.

Qualifications

Basic Qualifications :

  • 8 or more years of relevant work experience with a Bachelor Degree or at least 5 years of experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or 2 years of work experience with a PhD
  • Preferred Qualifications

  • 9 or more years of relevant work experience with a Bachelor Degree or 7 or more relevant years of experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or 3 or more years of experience with a PhD
  • Strong work ethic, self-starter, ability to work in a fast-paced, team-oriented environment, and comfortable working with a global team.
  • 10+ years of relevant work experience and a bachelor's degree in computer science.
  • 8+ years of experience with JAVA, J2EE applications, and a deep understanding of Web Services technologies : REST & SOAP.
  • 4+ years of experience managing applications on Containers (Docker) and Cloud (AWS, GCP, Azure).
  • Good understanding of Linux, Jenkins, .NET or Java-based web applications, MySQL / MSSQL / Oracle database concepts, Tomcat services, and web applications on Apache.
  • Ability to build deployment scripts and automated solutions using scripting languages such as Shell scripting (Bash), JavaScript, Python, or others.
  • Good understanding of infrastructure components like Linux operating systems, virtual machines, MQ, storage, etc.
  • Knowledge of Generative AI capabilities and use cases to enable such capabilities in the environment.
  • Prior experience working in 24x7 environments.
  • Exceptional analytical and problem-solving skills, along with strong oral and written communication abilities. Proven proficiency in troubleshooting, root-cause analysis, application design, and implementing major components for large projects.
  • Experience building tools to automate production support activities, enhancing the efficiency and productivity of service desk and operations groups.
  • Good understanding and knowledge of observability tools.
  • Join us to be part of a dynamic team that ensures the seamless operation of critical services worldwide. Your expertise will help us maintain the high standards of availability, security, and performance that our customers rely on.

    Additional Information

    Work Hours : Varies upon the needs of the department.

    Travel Requirements : This position requires travel5-10% of the time.

    Mental / Physical Requirements : This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, frequently operate standard office equipment, such as telephones and computers.

    Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.

    Visa will consider for employment qualified applicants with criminal histories in a manner consistent with applicable local law, including the requirements of Article 49 of the San Francisco Police Code.

    U.S. APPLICANTS ONLY : The estimated salary range for a new hire into this position is 152,200.00 to 220,850.00 USD per year, which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors which may include knowledge, skills, experience, and location. In addition, this position may be eligible for bonus and equity. Visa has a comprehensive benefits package for which this position may be eligible that includes Medical, Dental, Vision, 401 (k), FSA / HSA, Life Insurance, Paid Time Off, and Wellness Program.

    Create a job alert for this search

    Site Reliability Engineer • Ashburn, VA, United States

    Related jobs
    • Promoted
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    VisaAshburn, VA, United States
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Manager, Site Reliability Engineering

    Sr. Manager, Site Reliability Engineering

    VisaAshburn, VA, United States
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show moreLast updated: 13 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Leidos IncReston, VA, United States
    Full-time
    The Multi Domain Solutions Division at Leidos is looking for a.This role involves supporting the delivery of comprehensive IT and support services to ensure mission success while adhering to DoD st...Show moreLast updated: 16 days ago
    • Promoted
    Site Reliability Engineer III

    Site Reliability Engineer III

    VerisignReston, VA, United States
    Full-time
    Verisign helps enable the security, stability, and resiliency of the internet.We are a trusted provider of internet infrastructure services for the networked world and deliver unmatched performance...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Tax AnalystsFalls Church, VA, US
    Full-time
    Quick Apply
    Tax Analysts is seeking a Site Reliability Engineer (SRE) to help establish and shape our reliability engineering practice from the ground up. This is a unique opportunity to join a mission-driven o...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Federated ITWashington, DC, United States
    Full-time
    Bridge Defense is redefining how modern defense technology is delivered.Department of Defense, the Intelligence Community, and federal law enforcement agencies. We provide full-spectrum national sec...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer (SRE) – TS / SCI Clearance

    Site Reliability Engineer (SRE) – TS / SCI Clearance

    Tech CraticWashington, DC, United States
    Full-time
    Site Reliability Engineer (SRE) – TS / SCI Clearance.Technology has revolutionized how we approach job hunting, and this book streamlines the process into a fast, efficient system that works.Instead ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Reliability Engineer

    Senior Reliability Engineer

    The Johns Hopkins University Applied Physics LaboratoryLaurel, MD, United States
    Full-time
    Are you passionate about applying reliability and system engineering principles to analyze and assess the resilience of future strategic weapon systems?. Do you have a strong technical background in...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CSCI ConsultingQuantico, VA, United States
    Full-time
    CSCI Consulting is looking for a.Site Reliability Engineer (SRE).This role combines deep systems engineering knowledge with DevOps automation, proactive monitoring, and incident response practices....Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Powder River IndustriesWashington, DC, United States
    Full-time
    Conduct analysis of alternatives for configuration tools, make recommendations, work with team to design, develop, test, implement, and maintain tool choice. Responsible for the administration, moni...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    EngFlowWashington, DC, United States
    Full-time
    Join to apply for the Site Reliability Engineer role at EngFlow.At EngFlow, we help developers save time by accelerating software builds and tests. Our cloud-based, distributed service optimizes dev...Show moreLast updated: 4 days ago
    • Promoted
    Principal Site Reliability Engineer (SRE) at Jobgether Washington DC

    Principal Site Reliability Engineer (SRE) at Jobgether Washington DC

    JobgetherWashington, DC, United States
    Full-time
    Principal Site Reliability Engineer (SRE) job at Jobgether.This position is posted by Jobgether on behalf of.We are currently looking for a. Principal Site Reliability Engineer (SRE).Join a high-imp...Show moreLast updated: 30+ days ago
    • Promoted
    Deployment Site Reliability Engineer - Connected Warfare

    Deployment Site Reliability Engineer - Connected Warfare

    Anduril Industries, Inc.Washington, DC, United States
    Full-time
    Senior Deployed Site Reliability Engineer, Connected Warfare.Washington, District of Columbia, United States.Anduril Industries is a defense technology company with a mission to transform U.By brin...Show moreLast updated: 4 days ago
    • Promoted
    • New!
    Site Reliability Engineer — Scale mission-critical platforms

    Site Reliability Engineer — Scale mission-critical platforms

    Anduril IndustriesWashington, DC, United States
    Full-time
    A defense technology company is seeking a Site Reliability Engineer in Washington, DC.The role involves solving challenges in networking and systems integration while working with cross-functional ...Show moreLast updated: 11 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapeWashington, DC, United States
    Full-time
    Cape was founded in early 2022 by Palantir and Anduril alums with deep expertise in privacy and national security.While running Palantir’s US national security business, our CEO became passionate a...Show moreLast updated: 4 days ago
    • Promoted
    Gitlab Site Reliability Engineer

    Gitlab Site Reliability Engineer

    2Prod Technologies Corp.Washington, DC, United States
    Full-time
    Site Reliability Engineer (SRE) with strong GitLab expertise to support and enhance enterprise platforms.This role will focus primarily on GitLab while also maintaining Jira and Confluence in a sec...Show moreLast updated: 4 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Bridge DefenseWashington, DC, United States
    Full-time
    Bridge Defense is redefining how modern defense technology is delivered.Department of Defense, the Intelligence Community, and federal law enforcement agencies. We provide full-spectrum national sec...Show moreLast updated: 4 days ago
    • Promoted
    Staff Site Reliability Engineer (Federal)

    Staff Site Reliability Engineer (Federal)

    Okta for DevelopersWashington, DC, United States
    Full-time
    Staff Site Reliability Engineer (Federal).Be among the first 25 applicants.Okta is The World’s Identity Company.We free everyone to safely use any technology, anywhere, on any device or app.Our fle...Show moreLast updated: 4 days ago