Entry Level Site Reliability Engineer

IBM, San Jose, CA, United States
23 hours ago
Job type
  • Full-time
Job description

Introduction

A career in IBM Software means you'll be part of a team that transforms our customers' challenges into solutions.

Seeking new possibilities and always staying curious, we are a team dedicated to creating the world's leading AI-powered, cloud-native software solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career.

IBM's product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.

Your role and responsibilities

As a Site Reliability Engineer, you will work in an agile, collaborative environment to build, deploy, configure, and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting to deploying the latest software updates and fixes.

Your primary responsibilities include:

24x7 Observability: Be part of a worldwide team that monitors the health of production systems and services around the clock, ensuring continuous reliability and an optimal customer experience.

Cross-Functional Troubleshooting: Collaborate with engineering teams to provide initial assessments and possible workarounds for production issues. Troubleshoot and resolve production issues effectively.

Deployment and Configuration: Use continuous integration and continuous delivery (CI/CD) tools to deploy services and configuration changes at enterprise scale.

Security and Compliance Implementation: Implement security measures that meet or exceed industry standards and regulations such as GDPR, SOC2, ISO 27001, PCI, HIPAA, and FBA.

Maintenance and Support: Apply security patches and upgrades, and collaborate with Product support to resolve issues.

Required technical and professional expertise

System Monitoring and Troubleshooting: 1-3 years of experience in monitoring/observability, issue response, and troubleshooting for optimal system performance.

Automation Proficiency: 1-3 years of experience in automation for production environment changes, streamlining processes for efficiency, and reducing toil.

Linux: 1-3 years of experience working with Linux operating systems.

Operation and Support Experience: 1-3 years of experience in handling day-to-day operations, alert management, incident support, migration tasks, and break-fix support.

Preferred technical and professional experience

Kubernetes/OpenShift: Knowledge of or experience with Kubernetes/OpenShift environments.

Automation/Scripting: Knowledge of or experience with Ansible, Python, Terraform, and CI/CD tools such as Jenkins, IBM Continuous Delivery, and ArgoCD.

Monitoring/Observability: Knowledge of or experience with crafting alerts and dashboards using tools such as Instana, New Relic, and Grafana/Prometheus.

DBA: Interest in or experience with configuring and maintaining SQL, NoSQL, and data streaming technologies (e.g. PostgreSQL, CouchDB, Redis, Kafka, Spark).

IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
