Talent.com
FedNow Principal Site Reliability Engineer

FedNow Principal Site Reliability Engineer

Federal ReserveSan Francisco, CA, United States
17 days ago
Job type
  • Full-time
  • Part-time
Job description

Overview

Federal Reserve Bank of Boston - Federal Reserve Financial Services (FRFS) delivers a suite of payments services to financial institutions via FedLine Solutions, FedNowS M, Fedwire, National Settlement Service (NSS), FedCash, FedACH (Automated Clearing House), and Check Services. We are currently leading a strategic effort to transform FRFS to a national, enterprise-focused organization. Through our evolved structure, we will meet the needs of the marketplace for new products and services more quickly, seek to provide a more robust and unified customer experience across our financial service offerings, and create new career growth opportunities for FRFS staff.

The Federal Reserve has developed a new interbank 24x7x365 real-time gross settlement (RTGS) service with integrated clearing functionality, called the FedNow Service. This service enables financial institutions to provide their customers with the ability to send and receive payments any time, any day, and have full access to those funds within seconds. This position is a unique opportunity to be part of this mission-critical Federal Reserve initiative that is transforming the payments landscape in the United States.

The position will be primarily on-site with residency commutable to one of our offices required.

Responsibilities

  • As a Principal Engineer of the SRE / Production Operations team for FedNow, you will operate the production environment for the program.
  • You will architect, implement, and leverage solution monitoring and tooling to be used for capacity planning, utilization reporting, and scaling.
  • The team uses open source and proprietary software to support Engineering, DevOps, and DevSecOps tools, services, and solutions.
  • CI / CD and IaC Pipeline automation design and development.
  • Resiliency, DR and BCP (including testing).
  • The SRE / Production Operations team is part of the Technical Operations (TechOps) department and has the overall responsibility for the design, management and execution of operations required to support the ongoing technical and delivery needs of the FedNow Program, as well as the transition to production support and operations.
  • This team interfaces with internal stakeholders, customers for planning, delivery, and service management.
  • It owns ongoing ITIL processes, and the implementation and driving of continuous improvement initiatives.
  • You will work closely with Engineers and Architects of the FedNow program in order to maintain seamless automation across the entire platform.
  • Proactively identify suspected gaps in system architecture and design experiments to expose them.
  • The ideal candidate is someone who loves building and maintaining reliable and scalable systems, CI / CD tooling, and automating cloud-based highly available, high performing applications.

Key Skills

  • Strong communication and collaboration skills
  • Extensive knowledge and understanding of working in AWS environments & services
  • EC2, EBS, EKS, RDS, Aurora, S3, Route 53, ELB, IAM, etc.
  • Hashicorp Terraform, Consul, Vault, and Ansible
  • Automation experience preferably using GitLab
  • Experience with scripting languages preferably Python for automated processes
  • Experience working in Linux environment and shell scripting
  • Experience supporting infrastructure for large multi-services applications
  • Experience working with continuous deployment in micro-services architectures
  • Experience working with Docker, Containers, ECR and EKS.
  • Observability - CloudWatch, OpenSearch, Dynatrace, Grafana, Prometheus
  • Familiarity with Fault Injection tooling (i.e. AWS Fault Injection Simulator, Gremlin, ChaosToolkit, Chaos Monkey)
  • Automation mindset to enable consistency and dependability in common actions
  • The Federal Reserve Bank of Boston is committed to provide equal employment opportunities to all persons without regard to race, color, religion, national origin, sex, sexual orientation, gender identity, age, genetic information, disability, or military service.

    All employees assigned to this position will be subject to FBI fingerprint / criminal background and Patriot Act / Office of Foreign Assets Control (OFAC) watch list checks at least once every five years.

    For this job, any offer of employment is contingent upon successfully passing a two-phase security screening. The first phase consists of the satisfactory completion of a physical examination (including a drug screening), reference checks, and a security investigation consisting of credit and criminal history checks.

    The second phase, which might not be complete until after you begin working at the Reserve Bank, is an additional risk-based security screening determined by the risk rating of the position. Depending upon the sensitivity of the position, this phase may include, and is not limited to, work and residency eligibility verification, and personal interviews with the candidate, references, and prior employers.

    All applicants must have resided in the United States for at least three (3) years.

    Full Time / Part Time Full time Regular / Temporary Regular Job Exempt (Yes / No) Yes Job Category Information Technology Family Group Work Shift First (United States of America)

    The Federal Reserve Banks are committed to equal employment opportunity for employees and job applicants in compliance with applicable law and to an environment where employees are valued for their differences.

    Always verify and apply to jobs on Federal Reserve System Careers or through verified Federal Reserve Bank social media channels.

    Privacy Notice

    #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Engineer • San Francisco, CA, United States

    Related jobs
    • Promoted
    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Site Reliability Engineer - SRE at Descope Los Altos, CA

    Itlearn360Los Altos, CA, United States
    Full-time
    Site Reliability Engineer - SRE job at Descope.Descope R&D group is a skilled team of developers with a unique DNA of creativity,flexibility,anopen mindset. We are looking for a passionate SRE to jo...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    FortinetSanta Clara, CA, United States
    Full-time
    At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    VirtualVocationsSan Francisco, California, United States
    Full-time
    A company is looking for an Operations Engineer - (Site Reliability Engineer).Key Responsibilities Design, implement, and maintain scalable systems for production and test environments Identify ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer II

    Site Reliability Engineer II

    VirtualVocationsSan Jose, California, United States
    Full-time
    A company is looking for a Site Reliability Engineer II- Process Automation.Key Responsibilities Optimize and automate incident and change management processes to enhance system efficiency and re...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    Harrison ClarkeSan Francisco, CA, United States
    Full-time
    Principal Site Reliability Engineer (SRE).The ideal candidate should have extensive experience in designing highly scalable infrastructure, building systems, and performing testing, monitoring, and...Show moreLast updated: 17 days ago
    • Promoted
    DevOps Engineer / Site Reliability Engineer

    DevOps Engineer / Site Reliability Engineer

    HyperFiSan Francisco, CA, United States
    Full-time
    We're building the kind of platform we always wanted to use : fast, flexible, and built for making sense of real-world complexity. Behind the scenes is a robust, event-driven architecture that connec...Show moreLast updated: 5 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Bits to AtomsSan Francisco, CA, United States
    Full-time
    Site Reliability Engineer (SRE).You’ll work at the intersection of infrastructure, AI / ML systems, and mission-critical physical operations. You’ll collaborate directly with engineering, AI, and oper...Show moreLast updated: 23 days ago
    • Promoted
    Site Reliability Engineer I

    Site Reliability Engineer I

    prosper.comSan Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer I at Prosper, you will play a crucial role in enhancing the reliability, scalability, and maintainability of our technology platform. This entry-level position is desi...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    PsiQuantumPalo Alto, CA, United States
    Full-time
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    FortinetSunnyvale, CA, United States
    Full-time
    At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WorkOSSan Francisco, CA, United States
    Full-time
    About WorkOS 🚀 WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. We’re a fully distributed team with ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Together AISan Francisco, CA, United States
    Full-time
    As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a soft...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer (Pleasanton)

    Site Reliability Engineer (Pleasanton)

    Rockwoods IncPleasanton, CA, US
    Part-time
    Note : Candidates must have relevant experience in Medical / Healthcare domains, this is mandatory.Senior SRE Engineer - Pleasanton, 5 days office. Primary work : 24x7 On-call support and setting up mo...Show moreLast updated: 1 day ago
    • Promoted
    Site Reliability Engineer - Openstack

    Site Reliability Engineer - Openstack

    FortinetSunnyvale, CA, United States
    Full-time
    Fortinet is recruiting a Site Reliability Engineer- OPENSTACK to join our FortiStack team.This team is responsible for the management, operation and continued development of our Openstack-based pri...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer (SRE)

    Site Reliability Engineer (SRE)

    OPPO US Research CenterPalo Alto, CA, United States
    Full-time
    OPPO US Research Center is seeking a skilled and proactive.Site Reliability Engineer (SRE).In this role, you will be responsible for ensuring the stability, scalability, and performance of our appl...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    VirtualVocationsSan Francisco, California, United States
    Full-time
    A company is looking for a Senior / Staff Software Engineer (SRE).Key Responsibilities Design, automate, and scale secure cloud infrastructure Lead incident response and manage system outages Mai...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Signify TechnologyPalo Alto, CA, United States
    Full-time
    Competitive, based on experience.We are a technology startup advancing healthcare with a safety-focused AI platform that assists medical professionals by managing patient communications, including ...Show moreLast updated: 2 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Rockwoods IncPleasanton, CA, United States
    Full-time
    Note : Candidates must have relevant experience in Medical / Healthcare domains, this is mandatory.Senior SRE Engineer - Pleasanton, 5 days office. Primary work : 24x7 On-call support and setting up mo...Show moreLast updated: 2 days ago