Senior Site Reliability Engineer

Veeva Systems

Columbus, Ohio

$65K-$115K a year

Full-time

The Role

Veeva is seeking a talented and motivated Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you are innately curious, have a penchant for problem-solving, and will play a crucial role in ensuring the reliability, scalability, and performance of our systems.

Our mission is to protect, provide for, and progress the software and systems utilized by our product engineering teams.

Ideal candidates have worked in enterprise software development or for a high-growth technology company.

What You'll Do

Take responsibility for managing production and pre-production environments, security, change management, deployment, architecture, and tools
Perform root cause analysis for complex failures and offer modern solutions and tools
Analyze performance and ensure the applications (GitLab, Jira, Confluence, TestRail, Mattermost), hosted in AWS, meet the scalability and reliability needs of our internal teams
Work closely with Infrastructure, DevOps, Security, and product teams to stabilize, secure, and scale applications for continued growth
Automate deployment, monitoring, and incident response processes to enhance system reliability and performance
Continuously monitor system health, proactively identify issues, and implement solutions to ensure optimal performance
Identify and troubleshoot performance bottlenecks and reliability issues across the stack
Implement best practices for cloud-based infrastructure, ensuring security, scalability, and cost efficiency
You want to make the system better every day and are self-driven to learn all that is necessary to provide full-stack diagnostics and determine the root cause of problems
During an incident, lead the effort to triage and mitigate. You might need to perform periodic on-call duty if issues are escalated
Communicate effectively with engineering and infrastructure teams, and describe problems succinctly with sufficient detail
Engage in real-time communication during outages with both technical and non-technical audiences

Requirements

Bachelor’s degree in Computer Science, Information Technology, or related field (or equivalent work experience)
3+ years of working experience as a DevOps or SRE engineer
Independent learner, curious to learn new technologies
Experience with AWS and container orchestration tools (e.g., Kubernetes)
Familiarity with infrastructure as code tools (e.g., Terraform, Ansible) and version control systems (e.g., Git)
GitLab system administration experience
Experience supporting GitLab including CI / CD processes and GitLab runners
Solid scripting skills; experience with Shell, Bash, Ansible, Python, Go, Ruby, etc.
Excellent problem-solving skills and the ability to troubleshoot complex issues under pressure
3+ years of experience in relational databases with a mastery of SQL
Demonstrated history of incident management and leadership ability
Hands-on operational experience in a high-volume or critical production service environment
Effective communication skills across all levels whether talking to individual contributors or executives
Experience with disaster recovery planning and implementation
Experience with performance tuning of databases and distributed storage systems
Ability to handle the periodic, on-call duty
Fluent in English both written and verbal
We are looking for strong mentors with a proven record of making your team better

Nice to Have

Experience with serverless computing and serverless architectures (e.g., AWS Lambda, Azure Functions)
Knowledge of security best practices

Perks & Benefits

Medical, dental, vision, and basic life insurance
Flexible PTO and company paid holidays
Retirement programs
1% charitable giving program

Compensation

Base pay : $65,000 - $115,000
The salary range listed here has been provided to comply with local regulations and represents a potential base salary range for this role.

Please note that actual salaries may vary within the range above or below, depending on experience and location. We look at compensation for each individual and base our offer on your unique qualifications, experience, and expected contributions.

This position may also be eligible for other types of compensation in addition to base salary, such as variable bonus and / or stock bonus.

30+ days ago

Related jobs

Promoted

Senior Network Reliability Engineer

Vaco

Columbus, Ohio

Network Reliability Engineer will work within a dedicated team of automation and reliability engineers tasked with improving the availability, reliability, and scalability of Wendy's global network infrastructure. Senior Network Site Reliability Engineer. Network Reliability Engineer will implement ...

Promoted

AWS SRE Site Reliability Engineer

Central Point Partners

Columbus, Ohio

Expertise with application observ ability patterns and site reliability practices. The Programmer/Analyst-Senior modifies existing software/application programs, which are typically more complex in nature, or writes new programs to support user and management needs. Improve reliability, quality, and...

Site Reliability Engineer

People Integra LLC

Columbus, Ohio

Big Pluses Ansible, Cloudformation, Packer Database administration skills (AWS Aurora, MySQL, Postgres, Oracle) Have leveraged deployment strategies such as blue-green and canary Experience building RESTful services and/or web applications Experience automating software deployments an...

Senior Network Reliability Engineer

Vaco

Columbus, Ohio

Network Reliability Engineer will work within a dedicated team of automation and reliability engineers tasked with improving the availability, reliability, and scalability of Wendy’s global network infrastructure. Senior Network Site Reliability Engineer. Network Reliability Engineer will implement ...

Site Reliability Engineer

Huntington National Bank

Columbus, Ohio

The Site Reliability Engineer provides technical and consultative support on the most complex technical matters. Apply deep knowledge of SRE principles to ensure the scalability and reliability of systems. Update and Manage Runbooks and Maintenance reference Manuals on Support wiki site. MVC, websit...

Lead Site Reliability Engineer

JPMorgan Chase & Co.

Columbus, Ohio

As a Lead Site Reliability Engineer at JPMorgan Chase within the Enterprise technology, network operations team, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Formal trainin...

Site Reliability Engineer - Manager

Huntington National Bank

Ohio

The Google Cloud Platform (GCP) Site Reliability Engineer (SRE) Manager is responsible for supporting the GCP framework and consumers of the platform. The qualified candidate will collaborate with the CDO, Application, Incident, Security, and Change Management teams to manage the ITIL process, reduc...

Site Reliability Engineer (only W2 candidate) Location: Columbus OH/Hybrid 3 days onsite

Cerebra Consulting Inc

Columbus, Ohio

Position: Site Reliability Engineer Location: Columbus OH/Hybrid 3 days onsite Duration: 3-month CTH Required Skills SRE background for 4+ years, AWS and EC2 and Lambda DynamoDB, pyt...

Site Reliability Engineer

Apex Systems

Columbus, Ohio

Job Title: Site Reliability Engineer . If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at employeeservic...

Site Reliability Engineer

Impact tech Inc

Columbus, Ohio

As a Site Reliability Engineer, you'll be the champion of performance and stability for our impact. Working closely with various engineering squads, you'll ensure features meet strict performance and error rate benchmarks. Ultimately, your mission is to guarantee exceptional platform reliability and...

Senior Site Reliability Engineer

Senior Network Reliability Engineer

AWS SRE Site Reliability Engineer

Site Reliability Engineer

Senior Network Reliability Engineer

Site Reliability Engineer

Lead Site Reliability Engineer

Site Reliability Engineer - Manager

Site Reliability Engineer (only W2 candidate) Location: Columbus OH/Hybrid 3 days onsite

Site Reliability Engineer

Site Reliability Engineer

Related searches