Talent.com
Staff Site Reliability Engineer, Platform
Staff Site Reliability Engineer, PlatformGemini Trust Company • San Francisco, CA, United States
No longer accepting applications
Staff Site Reliability Engineer, Platform

Staff Site Reliability Engineer, Platform

Gemini Trust Company • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

About the Company

Gemini is a global crypto and Web3 platform founded by Cameron and Tyler Winklevoss in 2014, offering a wide range of simple, reliable, and secure crypto products and services to individuals and institutions in over 70 countries. Our mission is to unlock the next era of financial, creative, and personal freedom by providing trusted access to the decentralized future. We envision a world where crypto reshapes the global financial system, internet, and money to create greater choice, independence, and opportunity for all — bridging traditional finance with the emerging cryptoeconomy in a way that is more open, fair, and secure.

The Department : Platform

Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other engineering teams to ensure all our systems are architected, engineered and deployed to be resilient, reliable and performant.

The Embedded SRE team is a part of Site Reliability Engineering with a focus on engaging directly with our other engineering teams to onboard them onto our platform systems, reviewing and recommending design and architectural decisions, and guiding our engineering teams on how to implement the tooling provided by the larger Platform organization required to ensure systems can scale and react to changing conditions, with continuous improvement loops.

The Role : Staff Site Reliability Engineer

You will be an integral part of leading Gemini’s engineering teams towards modern DevOps practices, both by developing and providing modern automation and operational tooling, and working cross functionally across Gemini’s engineering teams to influence and shape our development practices and culture.

This role is required to be in person twice a week at either our San Francisco, CA or New York City, NY office.

Responsibilities :

  • Provide primary operational support and engineering for various Gemini services
  • Improve reliability, quality and time-to-market across all Gemini services and offerings
  • Guide engineering teams onto the various supported services provided by Platform
  • Run on-going performance evaluations and improvements for Gemini systems
  • Provide architecture recommendations and engagement as part of SDLC
  • Create “Production-ready Scorecards” to evaluate the health of systems pre-launch
  • Implement and teaching monitoring, alerting and automated resolution best practices
  • Define SLIs, SLOs with Engineering teams
  • Educate and guide Engineering teams on reliability and resiliency best practices, like statelessness, chaos testing, blue / green deployments etc.
  • Build operational tooling and automations

Qualifications :

  • 7+ years using monitoring, alerting, and automation tooling to understand and remediate performance and health issues in systems at scale
  • Good knowledge for various cloud technology providers like AWS, GCP, or Azure
  • Experience in a code-first environment, developing automated solutions to solve support and operational issues
  • Experience as a Technical Leader within a team, helping evaluating and making tech decisions for the team
  • Experience working with containerization such as Nomad, EKS (k8s), Docker, etc.
  • Experience working with Configuration Management such as Ansible, Chef, Puppet
  • Experience writing scripts or cli tools that help increase Developer Productivity in high-level languages like Python, Go, etc.
  • Experience analyzing system and application performance, identifying bottlenecks, and recommending architectural or systemic improvements
  • Experience working with Engineering teams, teaching, training, and mentoring on how to implement best-practice technical solutions
  • Experience working in a code-drive, automation-first public cloud infrastructure (Terraform)
  • It Pays to Work Here The compensation & benefits package for this role includes :

  • Competitive starting salary
  • A discretionary annual bonus
  • Long-term incentive in the form of a new hire equity grant
  • Comprehensive health plans
  • 401K with company matching
  • Paid Parental Leave
  • Flexible time off
  • Salary Range : The base salary range for this role is between $168,000 - $240,000in the State of New York, the State of California and the State of Washington. This range is not inclusive of our discretionary bonus or equity package. When determining a candidate’s compensation, we consider a number of factors including skillset, experience, job scope, and current market data.

    In the United States, employees within the New York, Seattle, San Francisco, and Miami metropolitan areas are expected to work from the designated office on a hybrid cadence, unless there is a job-specific requirement to be in the office every work day. We believe our hybrid approach for those near our NYC, Seattle, San Francisco, and Miami offices increases productivity through more in-person collaboration where possible. Employees outside of these areas are considered part of our remote-first workforce.

    At Gemini, we strive to build diverse teams that reflect the people we want to empower through our products, and we are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. Equal Opportunity is the Law, and Gemini is proud to be an equal opportunity workplace. If you have a specific need that requires accommodation, please let a member of the People Team know.

    #LI-JS3

    #J-18808-Ljbffr

    Create a job alert for this search

    Site Reliability Engineer • San Francisco, CA, United States

    Related jobs
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    Altana • San Francisco, California, United States
    Full-time
    AI can be a powerful tool for good in the world – at Altana we apply AI to the world’s largest organized body of supply chain data to power a more resilient, more secure, and more sustainable model...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer, Frontier Systems Infrastructure

    Site Reliability Engineer, Frontier Systems Infrastructure

    OpenAI • San Francisco, California, United States
    Full-time
    The Frontier Systems team at OpenAI builds, launches, and supports the largest supercomputers in the world that OpenAI uses for its most cutting edge model training. We take data center designs, tur...Show more
    Last updated: 29 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Conductorone • San Francisco, California, United States
    Full-time
    ConductorOne is the modern identity governance platform that makes it possible to move beyond the limitations of legacy IGA and reduce the identity attack surface with confidence.Designed for flexi...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Baseten • San Francisco, California, United States
    Full-time
    We’re a growing team of builders backed by top-tier investors, including.ML teams at enterprises and category-defining AI-native companies like. Baseten to power their core production workloads with...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer - Inference

    Site Reliability Engineer - Inference

    Lambda • San Francisco, California, United States
    Full-time
    In 2012, Lambda started with a crew of AI engineers publishing research at top machine-learning conferences.We began as an AI company built by AI engineers. Today, we're on a mission to be the world...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Workos • San Francisco, California, United States
    Remote
    Full-time
    WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. We’re a fully distributed team with employees across...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Zoox • Foster City, California, United States
    Full-time
    Zoox is looking for a platform / site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous veh...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Redwood Materials • San Francisco, California, United States
    Full-time
    Redwood is localizing a global battery supply chain that seamlessly integrates recovery, reuse, and recycling .Responsibilities will include : . Collect business & technical requirements and work wit...Show more
    Last updated: 30+ days ago • Promoted
    Staff Software Engineer, Site Reliability Engineer (SRE)

    Staff Software Engineer, Site Reliability Engineer (SRE)

    Harvey • San Francisco, California, United States
    Full-time
    Harvey is a secure AI platform for legal and professional services that augments productivity and automates complex workflows. Harvey uses algorithms with reasoning-adept LLMs that have been customi...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Loft Orbital Solutions • San Francisco, California, United States
    Full-time
    Loft Orbital builds a space infrastructure providing a fast & simple path to orbit.We operate satellites, fly customer payloads onboard and handle the entire mission from initial concept through in...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer, Supply

    Senior Site Reliability Engineer, Supply

    Mithril • San Francisco Bay, California, United States
    Full-time
    Mithril is actively seeking talented candidates at the Senior to Principal level, with leveling determined based on experience and demonstrated expertise. We welcome individuals who bring deep techn...Show more
    Last updated: 17 days ago • Promoted
    Staff Site Reliability Engineer (SRE)

    Staff Site Reliability Engineer (SRE)

    Heartflow • San Francisco, California, United States
    Full-time
    Heartflow is a medical technology company advancing the diagnosis and management of coronary artery disease, the #1 cause of death worldwide, using cutting-edge technology.The flagship product—an A...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Checkr • San Francisco, California, United States
    Full-time
    Checkr is building the data platform to power safe and fair decisions.Established in 2014, Checkr’s innovative technology and robust data platform help customers assess risk and ensure safety and c...Show more
    Last updated: 30+ days ago • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Visa • Foster City, California, United States
    Full-time
    Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more t...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Latent • San Francisco, California, United States
    Full-time
    San Francisco, CA (5 Days In-Office).You are the infrastructure expert who enables our rapid product development and guarantees. AI platform for major health systems.Your focus on operational excell...Show more
    Last updated: 30+ days ago • Promoted
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    Replit • Foster City, California, United States
    Full-time
    Replit is the agentic software creation platform that enables anyone to build applications using natural language.With millions of users worldwide and over 500,000 business users, Replit is democra...Show more
    Last updated: 30+ days ago • Promoted
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    Crusoe • San Francisco, California, United States
    Full-time
    Crusoe is building the World’s Favorite AI-first Cloud infrastructure company.We’re pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to ...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Alembic • San Francisco, California, United States
    Full-time
    We’re looking for an experienced.Site Reliability Engineer (SRE).You’ll partner with engineers and data scientists to build, automate, and maintain the infrastructure that powers our core platform—...Show more
    Last updated: 19 days ago • Promoted