Kaseya
is the leading provider of complete IT infrastructure and security management solutions for Managed Service Providers (MSPs) and internal IT organizations worldwide powered by AI. Kaseyas best-in-breed technologies allow organizations to efficiently manage and secure IT to drive sustained business success. Kaseya has achieved sustained strong double-digit growth over the past several years and is backed by Insight Venture Partners ) a leading global private equity firm investing in high-growth technology and software companies that drive transformative change in the industries they serve.
Founded in 2000 Kaseya currently serves customers in over 20 countries across a wide variety of industries and manages over 15 million endpoints worldwide. To learn more about our company and our award-winning solutions go to and for more information on Kaseyas culture.
Kaseya is not your typical company. We are not afraid to tell you exactly who we are and our expectations. The thousands of people that succeed at Kaseya are prepared to go above and beyond for the betterment of our customers.
We are seeking a strategic and technically accomplished Director of Site Reliability Engineering (SRE) to lead our global infrastructure network and public cloud engineer and operations teams. The ideal candidate will have a strong background in site reliability engineering network management infrastructure services and cloud technologies. This role requires a strategic thinker with excellent leadership skills to ensure the reliability scalability and performance of our systems.
Responsibilities
- Architect and manage resilient infrastructure across all global office locations
- Develop and implement strategies to ensure the reliability availability and performance of our systems
- Oversee the design deployment and maintenance of network infrastructure ensuring optimal performance and security
- Lead public cloud deployments (AWS Azure OCI) with a focus on scalability cost-efficiency and compliance
- Collaborate with cross-functional teams to define and implement infrastructure and network standards
- Establish observability and monitoring systems to proactively manage performance and availability
- Develop and maintain disaster recovery and business continuity plans
- Ensure compliance with industry standards and regulations
- Mentor and develop team members fostering a culture of continuous improvement and innovation
- Maintain comprehensive infrastructure diagrams and create processes SOPs and other technical documentation
- Provide technical leadership and training to engineers on the team
- Establish best practices throughout the entire technology lifecycle management framework
- Build and mature relations with business partners to identify areas of improvement to support business growth and agility
Skills
12 years of experience in site reliability engineering network management and infrastructure services with 5 years in the leadership roleExtensive experience with network technologies such as Palo Alto and Meraki firewalls Cisco and Meraki switch devicesExcellent understanding of networking technologies such as BGP OSPF STP (RSTP / MSTP) AAA and layer 2 switchingProven experience with global hybrid-cloud interconnectivity network architectureExpertise in solutions architecture principles working with public cloud service platforms including Azure AWS and OCIFamiliar with network access control principles and enterprise-scale solutions using tools such as CISO ISE and PRISMA AccessProven working experience with cloud service platforms such as Azure AWS and OCI and knowledge of best practices and methods for resolving issues in those settingsWorking knowledge of Infrastructure and Network monitoring systems such as Logicmonitor Solarwinds and ThousandeyesGood knowledge and experience in managing Azure landing zone architectures Server and Storage workloads Entra ID Active Directory DNS and DHCP servicesKnowledge of business continuity and disaster recovery continuity of operations plansExperience with automation and orchestration tools such as Ansible Terraform or KubernetesSkill in assessing security controls based on cybersecurity principles and knowledge of how to use network analysis tools to identify vulnerabilitiesKnowledge of network access identity and access management (e.g. public key infrastructure Oauth OpenID SAML SPML)Proven project management abilities to guide complex projects and the ability to give instructions to a non-technical audience.Proven experience with managing large scale projects across cross-discipline teams including managing vendor resourcesCommunications / Leadership
Strong leadership and team management skillsExcellent oral written and interpersonal skillsExcellent analytical and problem-solving skillsAbility to create work relationships across multiple areas engaging with stakeholders vendors and suppliers their teams and other employeesAbility to motivate guide and develop Team membersEducation / Technology
Bachelors degree in computer science Management Information Systems or a related fieldMasters degree in related field preferredCCNA CCIE CISSP or other IT / security certifications desiredCertifications in cloud platforms (AWS Azure Google Cloud) preferredOther :
Enterprise-sized company experience a plusGlobal experience desiredProven ability to scale teams build and retain right talentSkilled in developing new processes and driving user adoptionA documented history of successfully driving projects to completionProven experience in translating complex requirements to infrastructure teamsExcellent English and great communication skills.Join the Kaseya growth rocket ship and see how we are #ChangingLives !
Additional information
Kaseya provides equal employment opportunity to all employees and applicants without regard to race religion age ancestry gender sex sexual orientation national origin citizenship status physical or mental disability veteran status marital status or any other characteristic protected by applicable law.
Required Experience :
Director
Key Skills
Kubernetes,FMEA,Continuous Improvement,Elasticsearch,Go,Root cause Analysis,Maximo,CMMS,Maintenance,Mechanical Engineering,Manufacturing,Troubleshooting
Employment Type : Full Time
Experience : years
Vacancy : 1