Talent.com
No longer accepting applications
Principal Site Reliability Engineer - Cloud (Remote)

Principal Site Reliability Engineer - Cloud (Remote)

Donnelley Financial, LLCRockville, MD, United States
12 days ago
Job type
  • Full-time
  • Remote
Job description

Description :

Join a dynamic team at the pulse of global markets, where we deliver innovative software and service solutions for essential financial reporting and capital markets transactions. At DFIN, we are a values-driven organization that empowers you to build a fulfilling career while bringing your authentic self to work every day.Our "Win as One" mentality ensures that our team's success is directly linked to Client, Shareholder and Employee Satisfaction.

Recognized by Newsweek as one of AMERICA'S MOST LOVED WORKPLACES for three consecutive years and a Built In Best Places to Work for six years, we are committed to our employees' total wellbeing. Enjoy competitive compensation, a flexible workplace, comprehensive benefits, and opportunities for professional growth. Bring your passion and talents to DFIN - because being YOU thrives here.

Summary :

We are looking for technical team members at all levels who want to push themselves to deliver best in market SaaS solutions. We offer a challenging environment where you will have to grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise.

ThePrincipal Site Reliability Engineer- Cloud is responsiblefor designing, building, securing, monitoring and maintaining our SaaS product cloud infrastructure so it is fast, cost effective, stable and optimized for our customers. SRE's at DFIN take on availability, performance, managing change, monitoring, response and are guardians of non-functional requirements.

You either have a SaaS cloud infrastructure backgroundin Azure or AWS witha programmatic, automated mindset or are someone that comes with a software engineering background with SaaS cloud infrastructure experience in Azure or AWS. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our products up and running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and can lead colleagues independently to deliver solutions to complex problems.

Responsibilities :

  • Champion and implement a culture to maintain performant, reliable, secure, cost-effective platform cloud infrastructurein DFIN SaaS products based on operationalized processes you define
  • Champion security of our cloud infrastructure collaborating with Security and Governance teams and using static and dynamic tooling
  • Champion and implement application and cloud infrastructure monitoring and alerting to prevent client impacting issues by ensuring system availability, performance and scalability to maintain SLOs and SLAs
  • Optimize cloud infrastructure and application performance at scale while maintaining effective cost controls
  • Automate cloud infrastructure buildout and maintenance including system operational runbooks
  • Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into operationalized work processes
  • Perform with broad independence and deliver on project milestones and tasks you define on schedule while communicating progress regularly
  • Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations
  • Learn continuously and apply lessons learned
  • Evangelize best practices, eliminate bottlenecks, and improve process
  • Participate in on-call duties 365 / 24 / 7 and lead the triage and RCA of production incidents

Qualifications :

  • 8+ years experience designing, building, securing, monitoring and maintaining cloud infrastructure in Azure or AWS
  • 5+ years experience creating, configuring, maintaining and monitoring Kubernetes clusters(AKS or EKS) in cloud infrastructure to optimize application performance and reliability
  • 5+ years building and deploying Infrastructure as Code with Terraform or similar technology
  • 5+ years experience with common cloud networking, firewall and load balancing configuration
  • 5+years experiencewriting software in any modern software language such as C# .NET, Java
  • 5+years experiencecreating automated deployments with tools such as Harness, Azure DevOps, Ansible or Jenkins to manage Infrastructure as Code and software build and deployment in a continuous integration (CI) / continuous delivery (CD) environment
  • 5+years experienceimplementing production performance, availability, and scalability monitoring and alerting using a tool such asNew Relic, Dynatrace, DataDog or AppDynamics
  • 5+ years experience supportingpublic client facing revenue generatingsystems
  • Experiencing monitoring and preventing issues with databases and database queries (SQL) using tools like Solarwinds Database Performance Analyzer, Idera SQL Diagnostic Manager, or Redgate SQL Monitor
  • Experience planning, coordinating, developing and executing all stages of post deployment verification test scripts
  • Experience securing Windows or Linux systems in 24x7 production environment
  • Education :

    BS in Computer Scienceor equivalent work experience

    It is the policy of Donnelley Financial Solutions to select, place, and manage all its employees without discrimination based on race, color, national origin, gender, age, religion, actual or perceived disability, veteran status, actual or perceived sexual orientation, genetic information or any other protected status.

    If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or accessjobs.dfinsolutions.com as a result of your disability. You can request a reasonable accommodation by sending an email to Accommodations@dfinsolutions.com.

    At DFIN, protecting your identity is a top priority. Please be aware of scammers impersonating DFIN recruiters. DFIN recruiters will never request personal information via email or text. You will only receive a text from us if you've already been in contact. All automated messages will come fromnoreply@dfinsolutions.com. If you ever have doubts about the legitimacy of any communication from us, please do not hesitate to reach out for verification via talentacquisition@dfinsolutions.com (this email is for general TA questions and is not used for updates on your application status).#BI-Remote

    Create a job alert for this search

    Site Reliability Engineer • Rockville, MD, United States

    Related jobs
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    Black Rock GroupsWashington, DC, United States
    Full-time
    Quick Apply
    The Principal Site Reliability Engineer will be a critical technical leader responsible for driving the operational excellence, resilience, and security of our core systems for a key Randstad clien...Show moreLast updated: 5 days ago
    • Promoted
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    VisaAshburn, VA, United States
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Principal Cloud Engineer (AWS)

    Senior Principal Cloud Engineer (AWS)

    Omniscius ConsultingWashington, DC, United States
    Full-time
    Senior Principal Cloud Engineer onsite in Washington, DC.AWS to lead cloud architecture, migration, and optimization initiatives. This role will support federal agency programs by designing, deployi...Show moreLast updated: 22 days ago
    • Promoted
    Senior Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

    Senior Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

    Capital OneMclean, Virginia, US
    Full-time +1
    Senior Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering) Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer, Home

    Site Reliability Engineer, Home

    Google Inc.Washington, DC, United States
    Full-time
    Experience completing work as directed, and collaborating with teammates; developing knowledge of relevant concepts and processes. At Google, we have a vision of empowerment and equitable opportunit...Show moreLast updated: 29 days ago
    • Promoted
    Senior Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

    Senior Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

    Capital OneWashington D.C., District Of Columbia, US
    Full-time +1
    Senior Software Engineer, Full Stack (Cloud Operations Resilience Engineering) Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Site Reliability Engineer - Cloud (Remote)

    Principal Site Reliability Engineer - Cloud (Remote)

    Donnelley Financial, LLCRockville, MD, United States
    Remote
    Full-time
    Join a dynamic team at the pulse of global markets, where we deliver innovative software and service solutions for essential financial reporting and capital markets transactions.At DFIN, we are a v...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Black Rock GroupsWashington, DC, United States
    Full-time
    Quick Apply
    Randstad is seeking a skilled and proactive Site Reliability Engineer (SRE) to join our client in the Washington D.The ideal candidate will bridge the gap between development and...Show moreLast updated: 5 days ago
    • Promoted
    Sr. Manager - Site Reliability Engineer

    Sr. Manager - Site Reliability Engineer

    VisaAshburn, VA, United States
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Tax AnalystsFalls Church, VA, US
    Full-time
    Quick Apply
    Tax Analysts is seeking a Site Reliability Engineer (SRE) to help establish and shape our reliability engineering practice from the ground up. This is a unique opportunity to join a mission-driven o...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Sr Lead Software Engineer, Back End / SRE - Shopping (Remote-Eligible)

    Sr Lead Software Engineer, Back End / SRE - Shopping (Remote-Eligible)

    Capital OneWashington, DC, US
    Remote
    Full-time +1
    Sr Lead Software Engineer, Back End / SRE - Shopping (Remote-Eligible).Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, col...Show moreLast updated: 4 hours ago
    • Promoted
    Senior Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

    Senior Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

    Information Technology Senior Management ForumMcLean, VA, United States
    Full-time +1
    Senior Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering).Capital One seeks Full Stack Software Engineers who are passionate about marrying data with emerging technologies...Show moreLast updated: 1 day ago
    • Promoted
    Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

    Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

    Capital OneWASHINGTON D.C., District of Columbia, United States
    Full-time +1
    Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering).Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-pa...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CSCI ConsultingQuantico, VA, United States
    Full-time
    CSCI Consulting is looking for a.Site Reliability Engineer (SRE).This role combines deep systems engineering knowledge with DevOps automation, proactive monitoring, and incident response practices....Show moreLast updated: 30+ days ago
    • Promoted
    Lead Software Engineer, Site Reliability

    Lead Software Engineer, Site Reliability

    Capital OneWashington D.C., District Of Columbia, US
    Full-time +1
    Lead Software Engineer, Site Reliability Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive , and ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

    Senior Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering)

    Capital One National AssociationMcLean, VA, United States
    Full-time +1
    Senior Lead Software Engineer, Full Stack (Cloud Operations Resilience Engineering).Capital One seeks Senior Lead Software Engineers who are passionate about marrying data with emerging technologie...Show moreLast updated: 2 days ago
    Network Reliability Engineer - Remote (1772)

    Network Reliability Engineer - Remote (1772)

    CoreSiteReston, VA, US
    Remote
    Full-time
    Quick Apply
    At CoreSite, we empower a more connected future through high-performance data centers and interconnection solutions.Recognized as a trusted partner in digital transformation, our strategically loca...Show moreLast updated: 26 days ago
    • Promoted
    • New!
    Senior Software Engineer, Site Reliability

    Senior Software Engineer, Site Reliability

    Capital OneWashington D.C., District Of Columbia, US
    Full-time +1
    Senior Software Engineer, Site Reliability Do you love building and pioneering in the technology space? Do you enjoy solving complex business problems in a fast-paced, collaborative, inclusive , an...Show moreLast updated: 2 hours ago
    • Promoted
    Sr. Systems Engineer / Operations Site Liaison TS / SCI Poly

    Sr. Systems Engineer / Operations Site Liaison TS / SCI Poly

    Leidos IncAnnapolis Junction, MD, United States
    Full-time
    System Engineer / Operational Site Liaison.National Security Sector's (NSS) Cyber & Analytics Business Area (CABA).Our talented team is at the forefront in Security Engineering, Computer Network Oper...Show moreLast updated: 11 days ago
    Site Reliability Engineer (SRE) / Service Availability Manager in MD

    Site Reliability Engineer (SRE) / Service Availability Manager in MD

    Vertex Elite LLCBethesda, MD, United States
    Full-time
    Quick Apply
    Position Title : Site Reliability Engineer (SRE) / Service Availability Manager Location : 7750 Wisconsin Avenue, Bethesda, MD 20814 Duration : Fullt...Show moreLast updated: 5 days ago