Talent.com
Software Engineer, IoT Reliability

Software Engineer, IoT Reliability

AirGarageSan Francisco, CA, US
1 day ago
Job type
  • Full-time
Job description

Overview

AirGarage is seeking a Software Engineer to own the reliability, health, and observability of our nationwide IoT device fleet. You will work with embedded systems, backend infrastructure, and site reliability engineering. You’ll design and build the tools, monitoring pipelines, and automation that keep hundreds of devices online and performing reliably across our locations.

What You Will Do

  • Design and implement systems to monitor, diagnose, and improve IoT device health at scale.
  • Build internal tools and scripts for device setup, fleet observability, QA automation, and ongoing monitoring.
  • Contribute to backend services that support device integration, calibration, and reliability improvements.
  • Investigate and resolve fleet-wide issues by analyzing metrics, logs, and telemetry; minimize downtime through remote debugging and fixes.
  • Test and tune hardware products during or post-installation (e.g., camera exposure, detection modes, connectivity parameters) to ensure optimal performance.
  • Conduct periodic fleet-wide health assessments to detect degradation, systemic issues, or underperforming devices, and recommend firmware or deployment improvements.
  • Serve as the primary internal contact for hardware health, providing regular reports to operations on per-site hardware performance, device uptime, and systemic issues affecting service quality.
  • Collaborate with operations and hardware teams to surface recurring pain points and propose architectural or process improvements that drive greater reliability and scalability.
  • Author and maintain troubleshooting guides, repair instructions, and internal playbooks that enable consistency and efficiency across deployments.
  • Travel occasionally (~20%, otherwise fully remote) for QA, deployments, and on-site debugging when remote fixes aren’t possible.

What You Need

  • 5+ years of professional software engineering experience.
  • Experience managing distributed Linux-based hardware appliances or IoT fleets.
  • Familiarity with observability and monitoring tools (e.g., DataDog, OpenTelemetry, Prometheus, Grafana) and building internal tooling for device health and alerting.
  • Strong proficiency in Python and SQL, with experience shipping production-quality code. C++ background is a plus.
  • Track record building internal tooling, monitoring, or reliability platforms.
  • Hands-on experience with Linux systems (dmesg, journalctl, ip, systemd, etc.) and debugging distributed hardware / software environments.
  • Background in cellular (4G LTE, CAT 4, CAT 1bis, 5G RedCap), WiFi, WiFi HaLow, or other wireless connectivity.
  • Excellent written and verbal communication skills; able to translate complex technical findings into clear reports and playbooks.
  • Self-starter who thrives in a fast-paced, ownership-driven environment.
  • Willingness to travel to locations for troubleshooting (roughly 20% travel, otherwise fully remote).
  • The Upside

  • Equity : Have a stake in the business that you’re helping to build and grow.
  • Work remotely : Live and work wherever you like. We currently hire teammates located anywhere within North America.
  • Health insurance : We offer health insurance and currently cover 85% of the cost for the primary employee and 50% for dependents.
  • Home office setup : Laptop and equipment provided to set you up for success.
  • Time to recharge : Unlimited PTO with a minimum requirement of 10 days per year.
  • 401k : 401k retirement savings program.
  • Team off-sites : ~2 times per year for a full-week gathering in places like Tahoe, Puerto Vallarta, San Diego, and Austin.
  • Room to grow : Opportunities to grow with a rapidly expanding team.
  • Transform our cities : Help change how real estate is used in our cities.
  • Work with a diverse team : Our team is ~40% female and 30%+ from underrepresented communities.
  • AirGarage is an Equal Opportunity Employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. Candidates and employees are evaluated based on merit, qualifications, and performance. We will never discriminate on the basis of race, color, gender, national origin, ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, or any other legally protected status.

    Compensation Range : $160K - $190K

    Job Details

  • Seniority level : Mid-Senior level
  • Employment type : Full-time
  • Job function : Engineering and Information Technology
  • Industries : Technology, Information and Internet
  • #J-18808-Ljbffr

    Create a job alert for this search

    Reliability Engineer • San Francisco, CA, US

    Related jobs
    • Promoted
    Mid-Level Software Engineer, Reliability

    Mid-Level Software Engineer, Reliability

    Jobright.aiSan Francisco, CA, US
    Full-time
    Mid-Level Software Engineer, Reliability.Mid-Level Software Engineer, Reliability.Mid-Level Software Engineer, Reliability. Be among the first 25 applicants.Mid-Level Software Engineer, Reliability....Show moreLast updated: 1 day ago
    • Promoted
    Lead Software Engineer- Middleware Reliability Engineering

    Lead Software Engineer- Middleware Reliability Engineering

    ZipRecruiterFoster City, CA, US
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show moreLast updated: 1 day ago
    • Promoted
    Systems Reliability Engineer (SRE) - Edge

    Systems Reliability Engineer (SRE) - Edge

    Cloudflare, Inc.San Francisco, CA, US
    Full-time
    At Cloudflare, we are on a mission to help build a better Internet.Today the company runs one of the world's largest networks that powers millions of websites and other Internet properties for cust...Show moreLast updated: 1 day ago
    • Promoted
    Software Engineer - Reliability

    Software Engineer - Reliability

    xAISan Francisco, CA, US
    Full-time
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...Show moreLast updated: 1 day ago
    • Promoted
    Software Engineer - Reliability

    Software Engineer - Reliability

    RubrikPalo Alto, CA, US
    Full-time
    The Rubrik Engineering team is comprised of people who produce extraordinary results.Our engineers are driven to build efficient, reliable, and cost effective products. We believe in empowering our ...Show moreLast updated: 1 day ago
    • Promoted
    Technical Lead, Site Reliability Engineer, Fleetnet, Vehicle Software

    Technical Lead, Site Reliability Engineer, Fleetnet, Vehicle Software

    Tesla Motors, Inc.Palo Alto, CA, US
    Full-time
    We are a small team of experts focused on creating the next-generation server-side infrastructure for Tesla.We're the invisible link connecting every Tesla product, whether it's vehicles, robots, r...Show moreLast updated: 1 day ago
    • Promoted
    Software Engineer III, Site Reliability Engineering

    Software Engineer III, Site Reliability Engineering

    Google Inc.San Francisco, CA, United States
    Full-time
    Software Engineer III, Site Reliability Engineering.Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. Master's degree in Computer Science or Engineering.Sit...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Software Engineer- Reliability, Global E-Commerce

    Senior Software Engineer- Reliability, Global E-Commerce

    TikTokSan Jose, CA, US
    Full-time
    Senior Software Engineer - Reliability, Global E-Commerce.The role focuses on service reliability, scalable design, and release management in a cloud-native global e-commerce platform.Part of the o...Show moreLast updated: 1 day ago
    • Promoted
    Lead Software Engineer Middleware Reliability Engineering

    Lead Software Engineer Middleware Reliability Engineering

    VisaFoster City, CA, US
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show moreLast updated: 1 day ago
    • Promoted
    Distinguished Software Engineer, Reliability Infra

    Distinguished Software Engineer, Reliability Infra

    Next MatterMountain View, CA, US
    Full-time
    LinkedIn is the world’s largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover exci...Show moreLast updated: 1 day ago
    • Promoted
    Software Engineer, IoT Reliability

    Software Engineer, IoT Reliability

    ZipRecruiterSan Francisco, CA, US
    Full-time
    AirGarage is on a mission to bring real estate online, starting with parking.We replace broken parking machines, fragmented software, and manual, labor-intensive operations with a unified, data-ric...Show moreLast updated: 1 day ago
    • Promoted
    Lead Software Engineer- Middleware Reliability Engineering

    Lead Software Engineer- Middleware Reliability Engineering

    Visa Inc.Foster City, CA, US
    Full-time
    Lead Software Engineer- Middleware Reliability Engineering at Visa.Transform global payment systems through automation and innovation. Join Visa's Middleware Reliability Engineering team to revoluti...Show moreLast updated: 1 day ago
    • Promoted
    Senior Software Engineer, Site Reliability Tooling

    Senior Software Engineer, Site Reliability Tooling

    OhioxSan Mateo, CA, US
    Full-time
    Upstart is the leading AI lending marketplace partnering with banks and credit unions to expand access to affordable credit. By leveraging Upstart's AI marketplace, Upstart-powered banks and credit ...Show moreLast updated: 1 day ago
    • Promoted
    Senior Software Engineer, Site Reliability Engineer (SRE)

    Senior Software Engineer, Site Reliability Engineer (SRE)

    harvey.aiSan Francisco, CA, United States
    Full-time
    At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end-to-end.By combining frontier agentic AI, an enterprise-grade platform, and deep domain experti...Show moreLast updated: 12 days ago
    • Promoted
    Software Engineer (Site Reliability Engineer)

    Software Engineer (Site Reliability Engineer)

    CerebrasSan Francisco, CA, US
    Full-time
    San Francisco or Palo Alto, CA.At Anyscale, we take a market-based approach to compensation.We are data-driven, transparent, and consistent. As the market data changes over time, the target salary f...Show moreLast updated: 1 day ago
    • Promoted
    Lead Software Engineer- Middleware Reliability Engineering

    Lead Software Engineer- Middleware Reliability Engineering

    TinkFoster City, CA, US
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show moreLast updated: 1 day ago
    • Promoted
    Software Engineer (Distributed Systems)

    Software Engineer (Distributed Systems)

    Browserbase, Inc.San Francisco, CA, US
    Full-time
    As a Software Engineer (Distributed Systems) at Browserbase, you’ll be directly responsible for developing our core web automation platform. You’ll ensure it is high performance, scalable, constantl...Show moreLast updated: 1 day ago
    • Promoted
    • New!
    Senior Software Engineer – Distributed Systems (Erlang Preferred)

    Senior Software Engineer – Distributed Systems (Erlang Preferred)

    SourceOwls, LLCRedwood City, CA, US
    Full-time
    Senior Software Engineer – Distributed Systems (Erlang Preferred) Location : Onsite 3–5 days / week Type : Full-Time Visa Sponsorship : Not Available Relocation Assistance : Not Available Benefits Includ...Show moreLast updated: 18 hours ago