Senior Site Reliability Engineer

CoStar Group
CA, Orange County
$124K-$210.8K a year
Full-time

RESPONSIBILITIES

  • Develop and provide operational support for our full-stack software applications
  • Collaborate with development operations staff to scale, monitor and troubleshoot the system infrastructure (on-premise & in the cloud) to support our growing ecosystem
  • Increase system resilience and serve large customer volumes with expert-level coding, bulletproof release and change management skills
  • Improve and write automation and increase system’s diagnostic and debugging capabilities
  • Collect operating system data and report performance metrics to stakeholders
  • Troubleshoot and debug production issues as they arise
  • Adhere to industry standard security best practices

BASIC QUALIFICATIONS :

  • Bachelor’s Degree required from an accredited, not for profit university or college
  • A track record of commitment to prior employers
  • 7 or more years of experience with programming languages, such as C#, Python, Golang, Java
  • 3 or more years of experience developing a high-performance web-based system.
  • Ability to debug, profile, optimize code, and to automate routine tasks.
  • The candidate must be self-motivated, systematic problem-solving, a great communicator, have a sense of ownership and drive.

PREFERRED QUALIFICATIONS :

  • Expertise in designing, analyzing, troubleshooting, and capacity planning large-scale distributed systems.
  • Experience with any of the following frontend frameworks : React / Angular / Vue
  • Experience with the following tools : SQL (any RDBS), Bash, PowerShell, TFS / Azure DevOps, & Git.
  • Experience with operating systems (e.g., Windows / Linux Distros / Debian / Kali), administration (e.g., Filesystems, System RPC calls, etc.

and networking (e.g., TCP / IP, routing, network topologies and hardware, etc.).

  • Experience in one or more of the following : AWS, GCP, Python, Serverless, Docker, Container Orchestration (i.e. Kubernetes).
  • System load testing and baseline measurement.
  • Familiar with log management and analytics tools : Open Telemetry, Jaeger, Elasticsearch, Kibana, Logstash, Grafana, Prometheus, DataDog, Splunk

WHATS IN IT FOR YOU :

Working at CoStar Group means you'll enjoy a culture of collaboration and innovation that attracts the best and brightest across a broad range of disciplines.

In addition to generous compensation and performance-based incentives, you'll be supported in both your professional and academic growth with internal training, tuition reimbursement, and an inter-office exchange program.

Our benefits package includes (but is not limited to) :

  • Comprehensive healthcare coverage : Medical / Vision / Dental / Prescription Drug
  • Life, legal, and supplementary insurance
  • Commuter and parking benefits
  • 401(K) retirement plan with matching contributions
  • Employee stock purchase plan
  • Paid time off
  • Paid parental leave (up to 12 weeks)
  • Tuition reimbursement
  • On-site fitness center and / or reimbursed fitness center membership costs (location dependent), with yoga studio, Pelotons, personal training, group exercise classes, as well as Segways and bikes available for use during the day
  • Complimentary gourmet coffee, tea, hot chocolate, prepared foods, fresh fruit, and other healthy snacks
  • 30+ days ago
Related jobs
Promoted
Canonical - Jobs
Fresno, California

As a Senior Site Reliability / Gitops Engineer you will. As an Senior SRE & Gitops engineer you'll be in a unique position to drive operations automation to the next level, both in our own private clouds as well as in the public clouds. This role is an opportunity for a hands-on, but literal...

Promoted
Work Truck Solutions
Chico, California

Site Reliability Engineer or similar role. You’ll work closely with the development and infrastructure teams to enhance the reliability, scalability, and performance of our Azure-based systems. Collaborate with development teams to improve application reliability and cloud integration. Bachelo...

Infused Solutions
San Francisco, California

Our client is looking for a skilled Senior Site Reliability Engineer with an Microsoft Azure background and a good level of software engineering experience. Senior Site Reliability Engineer. Infused Solutions have partnered with a market leader in the San Francisco area, they are looking for a Senio...

Long Term Stock Exchange
San Francisco, California

Full Time] Senior Systems Reliability Engineer at Long Term Stock Exchange (United States). Senior Systems Reliability Engineer. The Role: An opportunity to join the team responsible for the reliability and resiliency of one of the newest US stock exchanges. Capable of and excited by designing capac...

Qcells
Santa Clara, California

Master’s or PhD degree in Reliability Engineering, or ASQ Certified Reliability Engineer is a plus. Apply solid knowledge of reliability methods and power electronics systems to design accelerated test plans for design validation, burn-in testing, environmental stress screening (ESS), ongoing reliab...

Zscaler
San Jose, California

We're looking for an experienced Site Reliability Engineer to join our Site Reliability Engineering team. Reporting to the Manager- Site Reliability Engineering, you'll be responsible for:. Our Engineering team built the world's largest cloud security platform from the ground up, and we keep buildin...

Federal Reserve System
San Francisco, California

Site Reliability Engineer, you will be part of the Data & Analytics Services (DAS) Team and will get an opportunity to broadly apply your engineering skills across various technology solutions, as well as build your skills in other areas by being exposed to various aspects of product delivery from i...

Abbott
Alameda, California

The Senior Reliability Engineer’s job duties consist of three primary aspects: performing failure analysis investigation on returned ADC products, supporting small to medium-sized projects, and performing sustaining activities. Reliability Engineer position works out of our Alameda, CA location in t...

General Motors
Palo Alto, California

Lead Site Reliability engineering effort to improve anomaly detection, platform stability and resilience using modern best practice. Collaborate with engineering teams to analyze and provide inputs in architecture, infrastructure resources, observability to achieve reliability and scalability goals....

GEICO
Palm Springs, California

Our Senior Manager is an engineering leader who works with the engineering staff to innovate and build new engineering solutions, improveand enhance existing solutions as well as leverage engineering solutions to solve critical operational problems. Senior Manager, Site Reliability Engineering – Dat...