Talent.com
Site Reliability Engineer

Site Reliability Engineer

DatSeattle, Washington, United States
30+ days ago
Job type
  • Full-time
Job description

About DAT

DAT  is an award-winning employer of choice and a next-generation SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 45 years. We continue to transform the industry year over year, by deploying a suite of software solutions to millions of customers every day - customers who depend on DAT for the most relevant data and most accurate insights to help them make smarter business decisions and run their companies more profitably. We operate the largest marketplace of its kind in North America, with 400 million freights posted in 2022, and a database of $150 billion of annual global shipment market transaction data. Our headquarters are in Denver, CO, and Beaverton, OR, with additional offices in Seattle, WA; Springfield, MO; and Bangalore, India. For additional information, see   www.DAT.com / company

Job Application Deadline : 09 / 30 / 2025

The Opportunity

DAT is looking for a Site Reliability Engineer to join our SRE platform team. This position will work hybrid or remote in Seattle, WA.

Candidate profile

DAT is seeking an experienced Site Reliability Engineer to help grow our SRE practices. In this role, you will be responsible for contributing to technical initiatives and enhancing your skills. You’ll work closely with development teams and platform architects to achieve critical reliability goals and help scale our platform.DAT is actively seeking a highly skilled and experienced Site Reliability Engineer (SRE) to play a pivotal role in the expansion and maturation of our SRE practices. In this critical position, the successful candidate will be instrumental in driving key technical initiatives, fostering a culture of continuous improvement, and significantly enhancing their own professional expertise.

This role necessitates close collaboration with various stakeholders, including our dedicated development teams and platform architects. The primary objective of these partnerships is to collectively achieve ambitious reliability goals and strategically scale our platform to meet evolving growth of the company. The SRE will be responsible for ensuring the stability, performance, and scalability of our systems, implementing robust monitoring solutions, automating operational tasks, and proactively identifying and resolving potential issues. This will involve a deep understanding of distributed systems, cloud infrastructure, and a commitment to best practices in site reliability engineering.

What You’ll Do

  • Contribute to the design, implementation, and maintenance of scalable and reliable systems.  Collaborate with engineering teams to ensure reliability targets are met.
  • Identify and troubleshoot complex issues across distributed systems, ensuring minimal downtime and optimal performance.
  • Advocate for and implement SRE best practices, including automation, monitoring, and incident response, to enhance system resilience.
  • Participate in capacity planning and performance tuning to proactively address potential bottlenecks and support future growth.
  • Leverage new AI tools to assist with coding and observability tasks.
  • Assist and respond to critical engineering incidents.
  • Improve your engineering skills within the SRE team.
  • Provide technical guidance and best practices for use of cloud infrastructure and tooling. Contribute to Infrastructure-as-Code within the platform.  We strive to automate all the things!
  • Contribute to reliability-focused initiatives and projects.
  • Help optimize our work to be customer-focused. Continually seek feedback from our customers on how we can improve.
  • Assist in migrating legacy systems to modern, scalable cloud environments.
  • Help develop and drive a culture of continuous improvement with the Platform Engineering and Software Engineering groups.
  • Participate in an on-call rotation.

The Skills and Experience You’ll Bring

  • Strong collaboration and problem-solving abilities, especially within SRE or Platform Engineering / Infrastructure teams.
  • Total of 2 to 4+ years industry experience
  • At least 1 year of software engineering experience (JavaScript, Python, Go, Java / Kotlin, C++, etc)
  • Experience with modern observability tools (Datadog preferred).
  • Experience with cloud platforms (preferably AWS).
  • Demonstrated success in contributing to large technical initiatives and acting as a driving force to complete those initiatives.
  • Proven experience assisting in modernizing legacy code and infrastructure.
  • Ability to work closely with peer teams, platform / software architects and management to drive key reliability improvements.
  • Willingness to share your expertise among team members and others within the engineering organization.  We value upleveling our peers however we can.
  • Understanding of cloud infrastructure, automation, and best practices for reliability.
  • Experience with our tools (Kubernetes, ArgoCD, Terraform, Github Actions) a plus.
  • Why DAT?

    DAT is an award winning employer of choice.

    For starters, we have a hybrid work environment, but we also know what makes a great workplace. We have a time-tested and resolute set of operating values predicated on integrity, mutual respect, open communication, and executing with excellence. These values inform our strategic vision as much as any one of our products does. We’ve been an employer of choice in the Portland metropolitan area for four decades, and within one year of opening our Denver office, DAT was #26 on Built In Colorado’s 100 Best Places to Work In Colorado.

  • Medical, Dental, Vision, Life, and AD&D insurance
  • Parental Leave
  • Up to 20 days of paid time off starting in year one
  • An additional 10 holidays of paid time off per calendar year
  • 401k matching (immediately vested)
  • Employee Stock Purchase Plan
  • Short- and Long-term disability sick leave
  • Flexible Spending Accounts
  • Health Savings Accounts
  • Tuition Reimbursement Program
  • Employee Assistance Program
  • Additional programs - Employee Referral, Internal Recognition, and Wellness
  • Free TriMet transit pass (Beaverton Office)
  • Competitive salary and benefits package
  • Work on impactful projects in a cutting-edge environment
  • Collaborative and supportive team culture
  • Opportunity to make a real difference in the trucking industry
  • Employee Resource Groups
  • For Washington-based candidates, in compliance with the Washington States Pay Transparency Law, the  minimum salary for this role is $110,000.00 + benefits + target bonus. The maximum compensation for this role can vary significantly depending on your job-related skills and experience. DAT considers factors such as scope and responsibilities of the position, candidate's work experience, education and training, core skills, internal equity, and market and business elements when extending an offer.

    DAT embraces the value of a diverse workforce, and believes it is a core strength of our company that we encourage those values in every DAT employee, at every level of our organization, regardless of tenure or rank. We provide equal employment opportunities (EEO) to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state, and local laws.

    Equal Opportunity Employer / Protected Veterans / Individuals with Disabilities

    The contractor will not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay or the pay of another employee or applicant. However, employees who have access to the compensation information of other employees or applicants as a part of their essential job functions cannot disclose the pay of other employees or applicants to individuals who do not otherwise have access to compensation information, unless the disclosure is (a) in response to a formal complaint or charge, (b) in furtherance of an investigation, proceeding, hearing, or action, including an investigation conducted by the employer, or (c) consistent with the contractor’s legal duty to furnish information. 41 CFR 60-1.35(c)

    #LI-RF1

    #LI-hybrid

    Create a job alert for this search

    Site Reliability Engineer • Seattle, Washington, United States

    Related jobs
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    SheinBellevue, Washington, United States
    Full-time
    SHEIN is a global online fashion and lifestyle retailer, offering SHEIN branded apparel and products from a global network of vendors, all at affordable prices. Headquartered in Singapore, with more...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer - Networking

    Senior Site Reliability Engineer - Networking

    LambdaSeattle, WA, United States
    Full-time
    Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference.Lambda’s mission is to make compute as ubiquitous as electricity and give every person access to a...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Travel CT Technologist

    Travel CT Technologist

    The Good Life MedStaffPort Townsend, WA, US
    Full-time
    The Good Life MedStaff is seeking a travel CT Technologist for a travel job in Port Townsend, Washington.Job Description & Requirements. The Good Life MedStaff Job ID #34915230.Pay package is ba...Show moreLast updated: 16 hours ago
    • Promoted
    Sr. Kubernetes Platform Site Reliability Engineer (Starlink)

    Sr. Kubernetes Platform Site Reliability Engineer (Starlink)

    SpacexRedmond, Washington, United States
    Full-time +1
    SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolo...Show moreLast updated: 30+ days ago
    • Promoted
    Hardware / Infrastructure Site Reliability Engineer (Starlink)

    Hardware / Infrastructure Site Reliability Engineer (Starlink)

    SpacexRedmond, Washington, United States
    Full-time +1
    SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolo...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Satellite Tooling and GSE Engineer, Amazon Leo Tooling and GSE Engineering

    Satellite Tooling and GSE Engineer, Amazon Leo Tooling and GSE Engineering

    Amazon Kuiper Manufacturing Enterprises LLCRedmond, WA, US
    Permanent
    Amazon Leo is Amazon’s low Earth orbit satellite network.Our mission is to deliver fast, reliable internet connectivity to customers beyond the reach of existing networks.From individual hous...Show moreLast updated: 16 hours ago
    • Promoted
    Site Reliability Engineer, Software Simulation

    Site Reliability Engineer, Software Simulation

    Anduril IndustriesSeattle, Washington, United States
    Full-time
    The Software Integration Environment (SIE) team is responsible for managing fleets of Kubernetes clusters, CI / CD pipelines, test environments and external cloud deployments.If you are an experience...Show moreLast updated: 30+ days ago
    • Promoted
    Kubernetes Platform Site Reliability Engineer (Starlink)

    Kubernetes Platform Site Reliability Engineer (Starlink)

    SpacexRedmond, Washington, United States
    Full-time +1
    SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolo...Show moreLast updated: 30+ days ago
    • Promoted
    Sr. Hardware / Infrastructure Site Reliability Engineer (Starlink)

    Sr. Hardware / Infrastructure Site Reliability Engineer (Starlink)

    SpacexRedmond, Washington, United States
    Full-time +1
    SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolo...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    DatSeattle, Washington, United States
    Full-time
    SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 45 years. We continue to transform the industry year over year, by deploying a su...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    HiveSeattle, Washington, United States
    Full-time
    Hive is the leading provider of cloud-based AI solutions to understand, search, and generate content, and is trusted by hundreds of the world's largest and most innovative organizations.The company...Show moreLast updated: 30+ days ago
    • Promoted
    Associate Site Reliability Engineer

    Associate Site Reliability Engineer

    OktaBellevue, Washington, United States
    Full-time +1
    Okta is The World’s Identity Company.We free everyone to safely use any technology, anywhere, on any device or app.Our flexible and neutral products, Okta Platform and Auth0 Platform, provide secur...Show moreLast updated: 30+ days ago
    • Promoted
    Linux Site Reliability Engineer

    Linux Site Reliability Engineer

    SpacexRedmond, Washington, United States
    Full-time +1
    SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technolo...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer, Networking

    Senior Site Reliability Engineer, Networking

    CartaSeattle, Washington, United States
    Full-time
    Carta develops purpose-built software that transforms traditional accounting into a powerful growth engine.Carta’s world-class fund administration platform supports nearly 7,000 funds and SPVs, and...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer I

    Senior Site Reliability Engineer I

    AxonSeattle, Washington, United States
    Full-time
    Join Axon and be a Force for Good.At Axon, we’re on a mission to Protect Life.We’re explorers, pursuing society’s most critical safety and justice issues with our ecosystem of devices and cloud sof...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer - Infrastructure

    Senior Site Reliability Engineer - Infrastructure

    The Trade DeskSeattle, WA, United States
    Full-time
    The Trade Desk is changing the way global brands and their agencies advertise to audiences around the world.How? With a media buying platform that helps brands deliver a more insightful and relevan...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer, Software Simulation

    Senior Site Reliability Engineer, Software Simulation

    Anduril IndustriesSeattle, Washington, United States
    Full-time
    The Software Integration Environment (SIE) team is responsible for managing fleets of Kubernetes clusters, CI / CD pipelines, test environments and external cloud deployments.If you are an experience...Show moreLast updated: 6 hours ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CognitivBellevue, Washington, United States
    Remote
    Full-time
    Are you ready to revolutionize the advertising industry? At Cognitiv, we are not just another AdTech company—we are industry trailblazers redefining media buying with our Deep Learning Advertising ...Show moreLast updated: 30+ days ago