Position Title: DevOps Manager
From Fivetran's founding until now, our mission has remained the same: to make access to data as simple and reliable as electricity. With Fivetran, customer data arrives in their warehouses, canonical and ready to query, with no engineering or maintenance required. We're proud that more organizations continue to leverage our technology every day to become truly data-driven.
About the Role
Fivetran is building data pipelines to power the modern data stack for thousands of companies.
As a Manager of Site Reliability Engineering, you will take on responsibility for the Serbia-based group of Site Reliability Engineers. Together with other SRE managers and engineers in Ireland, India, and the US, you will take ownership of the reliability of Fivetran's service. This includes building and monitoring repeatable infrastructure, ensuring the reliability and robustness of the continuously deployed release pipeline, and providing timely and effective incident response and resolution. You will co-own the responsibility for the scalability and reliability of Fivetran's connector infrastructure on AWS, GCP, and Azure. You will bring together and grow a Serbia-based team that reliably delivers excellent results while maintaining a culture of strong collaboration, engagement, and continuous improvement.
This is a full-time position based out of our Novi Sad office. Our hybrid work model offers a blend of remote flexibility and in-person collaboration, including two days in the office each week to connect and build as a team.
Technologies You'll Use
- AWS, Azure, Google Cloud Platform (GCP)
- EKS, AKS, GKE (managed services)
- Buildkite, ArgoCD
- PostgreSQL, Cloud Datastore
- Go, Java
- Python, Shell
- Terraform, Pulumi
- FastAPI (RESTful APIs)
- PrivateLinks (AWS, Azure), Private Service Connect (GCP), site-to-site VPNs across major cloud providers
- Grafana
What You'll Do
Leadership and Talent Management
- Build, hire, and plan the growth of the Serbia-based SRE organization
- Help engineers advance in their careers; act as a coach
- Set clear expectations and create a positive work environment based on accountability
- Establish strong global and cross-team relationships with product, field, software teams, and the other SRE teams around the world
SRE Subject Matter Expertise
- Drive initiatives that improve service reliability, scalability, and performance through automation, observability, and proactive problem-solving
- Advocate for simple, elegant, and easily scalable system design
- Support new services before they go live through activities such as system design consulting / review, capacity planning, and launch reviews
- Be hands-on and willing to act as player-coach in SRE areas such as IaC, Observability & Alerting, and Release Management
- Demonstrate strong accountability for infrastructure cost management
- Optimize our continuous integration and deployment process, striving for safe, frequent, and automated releases
- Oversee incident management practices, ensuring timely response, effective / blameless postmortems, and systemic improvements
- Stay current with emerging technologies, tools, and industry best practices relevant to reliability engineering
Skills We're Looking For
- Experience in managing or leading an SRE, DevOps, or Infrastructure Engineering team operating in a public cloud at scale
- Demonstrated significant working knowledge of Continuous Integration and Deployment processes and tooling
- Proven experience in cloud-based infrastructure design and IaC
- Strong understanding and experience in security control design, implementation, and operations
- Solid technical working experience on AWS, GCP, or Azure, distributed systems, networking, and container orchestration (Kubernetes)
- Deep understanding of reliability concepts, including monitoring / observability, capacity planning, and disaster recovery
- Experience leading incident response, root cause analysis, and reliability-focused postmortems
- Familiarity with cost optimization strategies in large-scale cloud environments
- Excellent leadership, communication, and stakeholder management skills
- Ability to iterate in the context of an evolving service environment
- Experience in managing changes and getting buy-in from the organization
- A passion for SRE / DevOps and running highly resilient / automated systems
Optional Bonus Skills
- Knowledge of compliance and security practices in production environments (SOC 2, ISO 27001, etc.)
- Experience with multi-cloud support
- Hands-on coding / scripting experience in languages such as Python, Go, or Java
Perks and Benefits
- 100% employer-paid medical insurance
- Generous paid time-off policy (PTO), plus paid sick time, inclusive parental leave policy, holidays, and volunteer days off
- RSU stock grants
- Professional development and training opportunities
- Company virtual happy hours, free food, and fun team-building activities
- Monthly cell phone stipend
- Access to an innovative mental health support platform that offers personalized care and resources in areas such as therapy, coaching, and self-guided mindfulness exercises for all covered employees and their covered dependents
Perks and benefits may vary by country and worker type. Please reach out to your recruiter for more information.
We are committed to ensuring that all candidates have an equal opportunity to participate in our interview process. If you require accommodations at any stage of the process due to a disability, medical condition, or any other circumstance, please don't hesitate to submit your request by filling out a form. We will work with you to provide reasonable accommodations to facilitate your participation and ensure a fair and accessible interview experience. Your request and any information provided will be kept confidential and will not impact your candidacy. We look forward to hearing from you and accommodating your needs to the best of our ability.
Equal Opportunity Employer, including disability / protected veterans.
Equal employment opportunity, including veterans and individuals with disabilities.