Talent.com
Senior Software Engineer, Product Foundations
Senior Software Engineer, Product FoundationsMetropolis • Los Angeles, CA, United States
Senior Software Engineer, Product Foundations

Senior Software Engineer, Product Foundations

Metropolis • Los Angeles, CA, United States
18 days ago
Job type
  • Full-time
Job description

Who we are

Metropolis is an artificial intelligence company that uses computer vision technology to enable frictionless, checkout-free experiences in the real world. Today, we are reimagining parking to enable millions of consumers to just "drive in and drive out." We envision a future where people transact in the real world with a speed, ease and convenience that is unparalleled, even online. Tomorrow, we will power checkout-free experiences anywhere you go to make the everyday experiences of living, working and playing remarkable - giving us back our most valuable asset, time.

Who you are

We are building a hyperscaler company and need someone to own reliability across the entire Metropolis platform. As a Staff or Senior Software Engineer focused on Reliability, you'll establish and drive the comprehensive reliability practices that ensure system availability, resilience, and observability for our mission-critical mobility infrastructure serving millions of transactions.

This is your opportunity to build reliability from first principles - architecting failover systems, implementing chaos engineering practices, and improving the observability foundation that will enable Metropolis to scale to new markets while maintaining 99.9%+ uptime. You'll be the technical owner of our reliability posture, working on everything from multi-region failover architectures to incident response workflows to SLO-based alerting strategies.

Our platform handles real-time payment processing, customer authentication, and parking facility operations - systems that cannot go down. You'll tackle challenges like external service failover, dependency mirroring to prevent upstream outages, database replication and automatic promotion, and building the monitoring and alerting infrastructure that ensures we detect and respond to issues in minutes, not hours.

If you're energized by the challenge of ensuring system reliability at scale, building robust failover mechanisms, implementing comprehensive observability, and establishing the practices that prevent incidents before they occur, this role is for you. You'll work alongside highly technical teams across the organization, influencing architecture decisions and establishing reliability standards that affect every service we build.

What you'll do

  • Reliability Ownership : Own the overall reliability posture for the Metropolis platform, establishing practices, metrics, and systems that ensure 99.9%+ uptime across all services
  • External Service Failover : Design and implement automatic failover mechanisms for critical external dependencies (Twilio for SMS / voice, Stripe for payments) with circuit breakers, retry policies, and degraded mode operations
  • Multi-Region / Cloud Failover : Architect and build active-passive or active-active regional deployment strategies with database replication, automated failover, and DNS-based traffic routing including disaster recovery planning and testing
  • Observability & Monitoring : Establish comprehensive monitoring using Datadog for APM, logs, and metrics correlation; implement synthetic monitoring, SLO-based alerting, on-call rotation, and escalation policies; build service health dashboards that show customer impact
  • I ncident Response : Own the incident management process including workflows, tooling, post-mortem culture, runbook automation, and MTTR reduction initiatives - driving down mean time to recovery from detection to resolution
  • Service Resilience Patterns : Drive adoption of resilience patterns across all services including health checks, graceful degradation, feature flags, rate limiting, backpressure mechanisms, and chaos engineering practices.
  • Dependency Mirroring : Build and maintain local mirrors for critical dependencies (Maven / NPM / Docker registries) with artifact caching, dependency pinning, and vulnerability scanning to prevent build failures from upstream outages.

What we're looking for

  • 8+ years of backend software engineering experience with deep focus on distributed systems and platform infrastructure
  • Expert-level Java proficiency with deep understanding of JVM performance, concurrency, and ecosystem tooling. Scala experience is a big plus
  • Production experience with microservices architecture, container orchestration (Kubernetes), and cloud platforms (AWS)
  • Strong systems thinking with proven ability to design and implement large-scale, high-availability distributed systems that handle significant load
  • Observability expertise including hands-on production experience with metrics, logging, tracing, and alerting systems in high-load environments
  • Database and data systems knowledge including relational databases, event streaming (Kafka, SQS), caching strategies, and data consistency patterns
  • Experience with AI-powered development tools such as Claude Code, GitHub Copilot, or similar agentic coding tools for enhanced productivity - context engineering in particular
  • Excellent technical communication with ability to design and document complex systems, lead technical discussions, and collaborate across multiple teams local to New York City, Seattle, or Los Angeles area
  • While not required, these are a plus :

  • SRE or Reliability Engineering experience at companies known for operational excellence (Google, Amazon, Netflix, etc.) or high-growth startups where you built reliability practices from the ground up
  • Incident response leadership including experience building incident management processes, conducting blameless post-mortems, and driving MTTR reduction initiatives in production environments
  • Chaos engineering experience with tools like Chaos Monkey, Gremlin, or similar, including designing and executing game days and failure injection testing
  • Performance optimization experience with profiling, benchmarking, capacity planning, and system tuning at hyperscale including experience optimizing for high-throughput, low-latency systems
  • Open source contributions or technical blog writing that demonstrates depth of expertise in reliability engineering, distributed systems, or production operations
  • Our Stack

  • Languages + Frameworks : TypeScript, React, Scala (principally), Java (limited)
  • Datastores : MySQL, PostgreSQL, Snowflake
  • Cloud : AWS
  • Version control : Git & GitHub
  • AI Tooling : Copilot on GitHub
  • Observability : Datadog
  • When you join Metropolis, you'll join a team of world-class product leaders and engineers, building an ecosystem of technologies at the intersection of parking, mobility, and real estate. Our goal is to build an inclusive culture where everyone has a voice and the best idea wins. You will play a key role in building and maintaining this culture as our organization grows. The anticipated base salary for this position is $180,000.00 USD to $260,000.00 USD annually. The actual base salary offered is determined by a number of variables, including, as appropriate, the applicant's qualifications for the position, years of relevant experience, distinctive skills, level of education attained, certifications or other professional licenses held, and the location of residence and / or place of employment. Base salary is one component of Metropolis's total compensation package, which may also include access to or eligibility for healthcare benefits, a 401(k) plan, short-term and long-term disability coverage, basic life insurance, a lucrative stock option plan, bonus plans and more. #LI-CM1 #LI-Onsite

    Metropolis values in-person collaboration to drive innovation, strengthen culture, and enhance the Member experience. Our corporate team members hold to our office-first model, which requires employees to be on-site at least four days a week, fostering organic interactions that spark creativity and connection

    Metropolis may utilize an automated employment decision tool (AEDT) to assess or evaluate your candidacy for employment or promotion. AEDTs are used to assist in assessing a candidate's application relative to the required job qualifications and responsibilities listed in the job posting.

    As part of this process, Metropolis retains data relevant to your candidacy, including personal information, for a period that is reasonably necessary for the use of the tool. If you are hired for the position, your data may become part of your employee records.

    Metropolis Technologies is an equal opportunity employer. We make all hiring decisions based on merit, qualifications, and business needs, without regard to race, color, religion, sex (including gender identity, sexual orientation, or pregnancy), national origin, disability, veteran status, or any other protected characteristic under federal, state, or local law.

    Create a job alert for this search

    Software Engineer Product • Los Angeles, CA, United States

    Related jobs
    Senior Software Engineer (Ground Software)

    Senior Software Engineer (Ground Software)

    INVERSION • Playa Vista, CA, United States
    Permanent
    Turning Space into a Transportation Layer for Earth.Eras of humanity can often be defined by a dominant transportation mode - horse drawn chariots, ocean going boats, or aircraft.These were spurred...Show more
    Last updated: 18 days ago • Promoted
    Senior Software Engineer - Core Platform

    Senior Software Engineer - Core Platform

    Picogrid • El Segundo, CA, United States
    Permanent
    Picogrid builds hardware and software infrastructure to connect and control the systems that power critical industries.Our platform unifies sensors, platforms, and operators to power mission planni...Show more
    Last updated: 18 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Warner Bros. Discovery • Burbank, CA, United States
    Full-time
    When we say, "the stuff dreams are made of," we're not just referring to the world of wizards, dragons and superheroes, or even to the wonders of Planet Earth. Behind WBD's vast portfolio of iconic ...Show more
    Last updated: 18 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    CyRAD Solutions • El Segundo, CA, United States
    Full-time
    About the job Senior Software Engineer.Senior Software Engineer : Autonomy & Mission Platforms.Design and build the real-time, scalable software infrastructure that unifies sensors, autonomous syste...Show more
    Last updated: 18 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Akido • Los Angeles, CA, United States
    Full-time
    Akido builds AI-powered doctors Akido is the first AI-native care provider, combining cutting-edge technology with a nationwide medical network to address America's physician shortage and make exce...Show more
    Last updated: 14 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Modern Technology Solutions Inc • El Segundo, CA, United States
    Full-time
    Modern Technology Solutions, Inc.MTSI) is seeking a Senior Software Engineer to join our team in El Segundo, CA.You will join a systems engineering team defining the next generation of space commun...Show more
    Last updated: 18 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Outpost • Los Angeles, CA, United States
    Permanent
    Location : Playa Vista, California (in-person, five days per week).Outpost is pioneering Earth return logistics for space. We're building vehicles that can return payloads from orbit safely and preci...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Sony Pictures Entertainment • Los Angeles, CA, United States
    Full-time
    Sony Pictures Imageworks is located on the traditional, ancestral and unceded territory of the Kizh / Gabrieleño peoples. We are committed to respecting traditional lands, and working with communities...Show more
    Last updated: 15 days ago • Promoted
    Senior Software Engineer, Platform

    Senior Software Engineer, Platform

    Red Cat Holdings • Long Beach, CA, United States
    Permanent
    Senior Software Engineer, Platform.The Senior Software Engineer, Platform at FlightWave Aerospace will own the development and sustainment of the C++ application core across the Edge130 UAS platfor...Show more
    Last updated: 18 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    GumGum • Santa Monica, CA, United States
    Full-time
    GumGum is a contextual-first, global digital advertising platform that uses advanced AI technology to serve captivating creative ads that drive consumer attention, without the use of personal data....Show more
    Last updated: 13 days ago • Promoted
    Senior Software Engineer, Interoperability

    Senior Software Engineer, Interoperability

    Axle Health • Santa Monica, CA, United States
    Full-time
    Axle Health builds scheduling and workforce management software to empower in-home healthcare providers to deliver exceptional, personalized care right where patients feel most comfortable-at home....Show more
    Last updated: 18 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Disney Cruise Line • Santa Monica, CA, United States
    Full-time
    We are not considering remote candidates at this time.Technology is at the heart of Disneys past, present, and future.Disney Entertainment and ESPN Product & Technology is a global organization of ...Show more
    Last updated: 5 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Salient Motion • Torrance, CA, United States
    Full-time
    Permanent Residents (Green Card holders) to meet customer and regulatory requirements • •.We are pioneering modular motion technologies to power the next generation of innovation for Industrial, Aero...Show more
    Last updated: 18 days ago • Promoted
    Senior Enterprise Software Engineer

    Senior Enterprise Software Engineer

    K2 Space • Los Angeles, CA, United States
    Permanent
    K2 Space is building large, high-powered spacecraft for the next generation of space development.Backed by Lightspeed Venture Partners, Altimeter Capital, and many others ($200M raised to date), we...Show more
    Last updated: 5 days ago • Promoted
    Senior Software Engineer, Platform

    Senior Software Engineer, Platform

    FlightWave Aerospace • Carson, CA, United States
    Full-time +1
    Position : Senior Software Engineer, Platform.The Senior Software Engineer, Platform at FlightWave Aerospace will own the development and sustainment of the C++ application core across the Edge130 U...Show more
    Last updated: 7 days ago • Promoted
    Senior Software Engineer - Experimentation Platform

    Senior Software Engineer - Experimentation Platform

    StubHub • Los Angeles, CA, United States
    Full-time
    StubHub is on a mission to redefine the live event experience on a global scale.Whether someone is looking to attend their first event or their hundredth, we're here to delight them all the way fro...Show more
    Last updated: 7 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Eleven Recruiting • El Segundo, CA, United States
    Full-time
    We are a specialized technology staffing agency supporting professional and financial services companies.Why do we stand out in technology staffing? We listen and act as advisors for our candidates...Show more
    Last updated: 18 days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    ServiceTitan • Glendale, CA, United States
    Full-time
    As a Senior Software Engineer will be part of the engineering team at ServiceTitan to help improve our products and build new ones. This is an exciting role for an engineer to come in and lead the m...Show more
    Last updated: 18 days ago • Promoted