Search jobs > Columbus, OH > Senior site reliability

Senior Site Reliability Engineer

Veeva Systems
Columbus, Ohio
$65K-$115K a year
Full-time

The Role

Veeva is seeking a talented and motivated Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you are innately curious, have a penchant for problem-solving, and will play a crucial role in ensuring the reliability, scalability, and performance of our systems.

Our mission is to protect, provide for, and progress the software and systems utilized by our product engineering teams.

Ideal candidates have worked in enterprise software development or for a high-growth technology company.

What You'll Do

  • Take responsibility for managing production and pre-production environments, security, change management, deployment, architecture, and tools
  • Perform root cause analysis for complex failures and offer modern solutions and tools
  • Analyze performance and ensure the applications (GitLab, Jira, Confluence, TestRail, Mattermost), hosted in AWS, meet the scalability and reliability needs of our internal teams
  • Work closely with Infrastructure, DevOps, Security, and product teams to stabilize, secure, and scale applications for continued growth
  • Automate deployment, monitoring, and incident response processes to enhance system reliability and performance
  • Continuously monitor system health, proactively identify issues, and implement solutions to ensure optimal performance
  • Identify and troubleshoot performance bottlenecks and reliability issues across the stack
  • Implement best practices for cloud-based infrastructure, ensuring security, scalability, and cost efficiency
  • You want to make the system better every day and are self-driven to learn all that is necessary to provide full-stack diagnostics and determine the root cause of problems
  • During an incident, lead the effort to triage and mitigate. You might need to perform periodic on-call duty if issues are escalated
  • Communicate effectively with engineering and infrastructure teams, and describe problems succinctly with sufficient detail
  • Engage in real-time communication during outages with both technical and non-technical audiences

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, or related field (or equivalent work experience)
  • 3+ years of working experience as a DevOps or SRE engineer
  • Independent learner, curious to learn new technologies
  • Experience with AWS and container orchestration tools (e.g., Kubernetes)
  • Familiarity with infrastructure as code tools (e.g., Terraform, Ansible) and version control systems (e.g., Git)
  • GitLab system administration experience
  • Experience supporting GitLab including CI / CD processes and GitLab runners
  • Solid scripting skills; experience with Shell, Bash, Ansible, Python, Go, Ruby, etc.
  • Excellent problem-solving skills and the ability to troubleshoot complex issues under pressure
  • 3+ years of experience in relational databases with a mastery of SQL
  • Demonstrated history of incident management and leadership ability
  • Hands-on operational experience in a high-volume or critical production service environment
  • Effective communication skills across all levels whether talking to individual contributors or executives
  • Experience with disaster recovery planning and implementation
  • Experience with performance tuning of databases and distributed storage systems
  • Ability to handle the periodic, on-call duty
  • Fluent in English both written and verbal
  • We are looking for strong mentors with a proven record of making your team better

Nice to Have

  • Experience with serverless computing and serverless architectures (e.g., AWS Lambda, Azure Functions)
  • Knowledge of security best practices

Perks & Benefits

  • Medical, dental, vision, and basic life insurance
  • Flexible PTO and company paid holidays
  • Retirement programs
  • 1% charitable giving program

Compensation

  • Base pay : $65,000 - $115,000
  • The salary range listed here has been provided to comply with local regulations and represents a potential base salary range for this role.

Please note that actual salaries may vary within the range above or below, depending on experience and location. We look at compensation for each individual and base our offer on your unique qualifications, experience, and expected contributions.

This position may also be eligible for other types of compensation in addition to base salary, such as variable bonus and / or stock bonus.

30+ days ago
Related jobs
Promoted
Vaco
Columbus, Ohio

Network Reliability Engineer will work within a dedicated team of automation and reliability engineers tasked with improving the availability, reliability, and scalability of Wendy's global network infrastructure. Senior Network Site Reliability Engineer. Network Reliability Engineer will implement ...

Promoted
Central Point Partners
Columbus, Ohio

Expertise with application observ ability patterns and site reliability practices. The Programmer/Analyst-Senior modifies existing software/application programs, which are typically more complex in nature, or writes new programs to support user and management needs. Improve reliability, quality, and...

People Integra LLC
Columbus, Ohio

Big Pluses Ansible, Cloudformation, Packer Database administration skills (AWS Aurora, MySQL, Postgres, Oracle) Have leveraged deployment strategies such as blue-green and canary Experience building RESTful services and/or web applications Experience automating software deployments an...

Vaco
Columbus, Ohio

Network Reliability Engineer will work within a dedicated team of automation and reliability engineers tasked with improving the availability, reliability, and scalability of Wendy’s global network infrastructure. Senior Network Site Reliability Engineer. Network Reliability Engineer will implement ...

Huntington National Bank
Columbus, Ohio

The Site Reliability Engineer provides technical and consultative support on the most complex technical matters. Apply deep knowledge of SRE principles to ensure the scalability and reliability of systems. Update and Manage Runbooks and Maintenance reference Manuals on Support wiki site. MVC, websit...

JPMorgan Chase & Co.
Columbus, Ohio

As a Lead Site Reliability Engineer at JPMorgan Chase within the Enterprise technology, network operations team, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Formal trainin...

Huntington National Bank
Ohio

The Google Cloud Platform (GCP) Site Reliability Engineer (SRE) Manager is responsible for supporting the GCP framework and consumers of the platform. The qualified candidate will collaborate with the CDO, Application, Incident, Security, and Change Management teams to manage the ITIL process, reduc...

Cerebra Consulting Inc
Columbus, Ohio

Position: Site Reliability Engineer <br /> Location: Columbus OH/Hybrid 3 days onsite <br /> Duration: 3-month CTH</p> <p> </p> <p> <i>Required Skills</i></p> <p><b>SRE background for 4+ years, AWS and EC2 and Lambda DynamoDB, pyt...

Apex Systems
Columbus, Ohio

Job Title: Site Reliability Engineer . If you have visited our website in search of information on employment opportunities or to apply for a position, and you require an accommodation in using our website for a search or application, please contact our Employee Services Department at employeeservic...

Impact tech Inc
Columbus, Ohio

As a Site Reliability Engineer, you'll be the champion of performance and stability for our impact. Working closely with various engineering squads, you'll ensure features meet strict performance and error rate benchmarks. Ultimately, your mission is to guarantee exceptional platform reliability and...