Talent.com
Senior Site Reliability Engineer

Senior Site Reliability Engineer

GreenLiteNew York, New York, United States
30+ days ago
Job type
  • Full-time
Job description

Our Company

Founded in 2022, GreenLite is revolutionizing development in America by streamlining the collaboration between developers, builders, and local regulatory authorities. GreenLite’s software powers its Private Plan Review offering, serving many of the nation’s largest public retailers, developers, and production home builders. By leveraging GreenLite’s technology, its customers save months on each project, significantly accelerating their timelines and staying within budget.

GreenLite is founded by experts in technology, development, and within the AEC (Architecture, Engineering, and Construction) industry, and backed by leading venture capital firms. GreenLite is at the forefront of the privatization of construction permitting and plan review, reshaping a multi-hundred billion dollar industry.

GreenLite has raised nearly $40M from the country’s leading venture capital investors, including Craft Ventures, who led GreenLite’s $28.5M Series A. We’re well capitalized to achieve our mission of revolutionizing the plan review and construction permitting process across the country.

The role and why it matters

Reliability is a product for our customers : city officials rely on us to be available whenever a submission deadline looms, and builders stake millions of dollars on predictable turnaround. As our first dedicated SRE you will establish the patterns, tooling and culture that keep our systems fast, observable and resilient while we 10x traffic over the next 18 months.

Our operating principles— Winning Mentality, Speed & Urgency, Disagree & Commit, Ownership & Integrity, Customer Centricity —are not wall art; they guide hiring, architecture and on‑call decisions.

What you’ll do

Design & harden production infrastructure AWS ECS / Fargate via AWS Copilot (migrating to Terraform), RDS / Postgres, S3, EventBridge, Bedrock.

Lead reliability engineering : SLO / SLA definition, error‑budget policies, capacity planning and load testing ahead of major launches.

Own CI / CD : advance our GitHub Actions pipeline, introduce progressive delivery and automated rollbacks to steadily maintain & improve deployment frequency and lead time for changes.

Instrument & Observe : deploy metrics, tracing and logging (Datadog) and drive an on‑call culture focused on MTTR and learning reviews, not blame.

Security & compliance : partner with the engineers to automate patching, secrets management & rotation, least‑privilege IAM and SOC 2 controls.

Coach & collaborate : mentor engineers on SRE best practices, work closely with ML and product squads, and influence architecture decisions through strong opinions loosely held.

Continuously improve : identify systemic bottlenecks, build tooling that eliminates toil and scale our platform without scaling pager fatigue.

What you’ll bring

Must‑have :

6+ yrs building and operating production systems in AWS, GCP or Azure (AWS preferred).

Demonstrated ownership of SLOs, incident response and post‑incident analysis.

Expert in IaC (Terraform, CDK, Pulumi) and container orchestration (ECS, EKS or K8s).

Proficient with at least one modern language (Python, Rust, Go) and strong bash skills.

Deep familiarity with observability stacks (Datadog, Grafana, Prometheus, OTEL).

Track record of raising the bar for security, compliance and cost optimisation.

Nice-to-have :

Experience with infrastructure for ML workflows (model training, feature stores).

Prior work in construction‑tech, gov‑tech or other regulated domains.

Certification : AWS Solutions Architect or DevOps Pro.

Experience introducing chaos engineering or game‑days.

Public track record (blog posts, OSS) advancing the SRE discipline.

Leadership in defining hiring / on‑call processes at a high‑growth startup.

In your first 180 days you will

30 days – Stand up staging / production dashboards, own the on‑call rotation and deliver a gap‑analysis of our reliability posture. Take ownership of our migration into AWS Control Tower, and contribute to architecture for hosting our production applications, including AI engineering.

60 days – Roll out error‑budget policies, automated canary deploys and service‑level telemetry across all micro‑services. Complete migration off of AWS CoPilot. Plan migration from RDS Postgres to Aurora Postgres, including metrics. Establish production infrastructure for AI engineering.

90 days – Reduce p95 latency by ≥20 %, cut mean time‑to‑recovery (MTTR) to

180 days – Mentor two mid‑level engineers into effective first responders and established infrastructure for ML products. Train team on disaster recovery plan, and do a dry run of restoration from backups.

What success looks like

99.95 % customer‑visible uptime with clearly defined SLAs.

Engineering velocity accelerates because infrastructure just works and developers ship confidently.

Post‑incident reviews focus on learning; recurring classes of incidents drop each quarter.

Stakeholders (product, customer success, city reviewers) describe reliability as a core differentiator.

Our team thrives on collaboration, so we’re in the office 4 days per week. In the summer, from Memorial Day to Labor Day, we switch to a 3-day in-office schedule to give everyone extra flexibility.

Our hiring process

Intro with Talent Partner

values & architecture deep‑dive with Head of Engineering

On-site : Practical systems‑design exercise (real scenarios we face)

On-site : On‑call simulation & retrospective with two engineers

On-site : Cross functional Panel interview

Final exec conversation and offer discussion

Thrive With GreenLite

Competitive Compensation - Generous base salary & access to our Employee Equity Program, so you can grow with us.

Performance-Based Annual Bonuses - Rewards for high-impact results and contributions that move the needle.

Premium Health Coverage - Comprehensive medical, dental, and vision insurance for full-time team members : 100% of premiums covered under our HDHP plan & 98% coverage for employees and their spouses.

401(k) Retirement Plan - Helping you invest in your future with smart saving options.

Parental Leave - Generous parental leave for all parents to support your growing family.

Wellness Support - Monthly Wellness Stipend and full access to Wellhub, Talkspace, & Teladoc for your physical and mental well-being.

Weekly Team Lunches - Enjoy catered lunches every week in our NYC office. Great food, better company.

Company-Wide Team All Hands - Held twice a year, fostering transparency, alignment, and inspiration.

Team-Building Events - Regular opportunities to connect, collaborate, and celebrate as a team.

Unlimited PTO - Flexible time off so you can recharge, travel, or take care of life as needed.

Hybrid Work Environment – Our team thrives on collaboration, so we’re in the office 4 days per week. In the summer, from Memorial Day to Labor Day, we switch to a 3-day in-office schedule to give everyone extra flexibility.

Equal Opportunity Statement

GreenLite values people from all walks of life and professional backgrounds. We understand not everyone will meet all the above qualifications on day one. That's okay. If you’re passionate about the construction industry or solving the housing crisis in America, and want the opportunity to grow in your career, we encourage you to apply.

GreenLite is an equal employment opportunity employer, committed to an inclusive workplace where we do not discriminate on the basis of race, sex, gender, national origin, religion, sexual orientation, gender identity, marital or familial status, age, ancestry, disability, genetic information, or any other characteristic protected by applicable laws. We believe in diversity and encourage any qualified individual to apply.

Create a job alert for this search

Senior Site Reliability Engineer • New York, New York, United States

Related jobs
  • Promoted
Senior Site Reliability Engineer

Senior Site Reliability Engineer

JustworksNew York, New York, United States
Full-time
At Justworks, you’ll enjoy a welcoming and casual environment, great benefits, wellness program offerings, company retreats, and the ability to interact with and learn from leaders in the startup c...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

WriterNew York, New York, United States
Full-time
Writer is the full-stack generative AI platform delivering transformative ROI for the world’s leading enterprises.Named one of the top 50 companies in AI by Forbes and one of the best places to wor...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer 3

Site Reliability Engineer 3

MongodbNew York, New York, United States
Full-time
MongoDB’s mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer III

Site Reliability Engineer III

VimeoNew York, New York, United States
Full-time
Do you love working with cloud infrastructure at scale? Optimizing the last bit of performance and efficiency out of applications that get hundreds of thousands of requests per second? Digging deep...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer - Cloud

Site Reliability Engineer - Cloud

DataikuNew York, New York, United States
Full-time
At Dataiku, we're not just adapting to the AI revolution, we're leading it.Since our beginning in Paris in 2013, we've been pioneering the future of AI with a platform that makes data actionable an...Show moreLast updated: 30+ days ago
Site Reliability Engineer Charlotte, NC / Chandler, AZ / , NJ

Site Reliability Engineer Charlotte, NC / Chandler, AZ / , NJ

Career Mentors, LLCJersey City, NJ, US
Full-time
Pay Rate : upto $75 pr hr on W2.Jersey City, NJ - Near by candidates.Previously functioned in an SRE role within a large production environment, with a focus on automation testing experience.Hands-o...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

AlchemyNew York, New York, United States
Full-time
Our mission is to bring web3 to a billion people, by providing builders with the tools they need to build exceptional onchain products. Alchemy is the only complete developer platform that offers th...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

HebbiaNew York, New York, United States
Full-time
The user interface for AGI – Hebbia is AI that works the way you work.Designed to be generally capable– it can tackle even the most complex tasks, citing answers over any amount of sources.By showi...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Site Reliability Engineer

Sr. Site Reliability Engineer

VimeoNew York, New York, United States
Full-time
Do you love working with cloud infrastructure at scale? Optimizing the last bit of performance and efficiency out of applications that get hundreds of thousands of requests per second? Digging deep...Show moreLast updated: 30+ days ago
  • Promoted
Senior Site Reliability Engineer - Infrastructure

Senior Site Reliability Engineer - Infrastructure

The Trade DeskNew York, NY, United States
Full-time
The Trade Desk is changing the way global brands and their agencies advertise to audiences around the world.How? With a media buying platform that helps brands deliver a more insightful and relevan...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer, Distribution Engineering

Site Reliability Engineer, Distribution Engineering

NbcuniversalStamford, Connecticut, United States
Full-time
NBCUniversal is one of the world's leading media and entertainment companies.We create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to...Show moreLast updated: 30+ days ago
  • Promoted
DevOps Site Reliability Engineer- Visa Independent

DevOps Site Reliability Engineer- Visa Independent

Shrive Technologies LLCEnglewood Cliffs, New Jersey, USA
Full-time
DevOps / Site Reliability Engineer.Englewood Cliffs NJ (Onsite from day one).CI / CD AWS and / or GCP Python or Bash or Groovy monitoring tools like Datadog Ansible JMeter. Support and enhance observabi...Show moreLast updated: 1 day ago
  • Promoted
Senior Site Reliability Engineer (SRE)

Senior Site Reliability Engineer (SRE)

StubhubNew York, New York, United States
Full-time
StubHub is on a mission to redefine the live event experience on a global scale.Whether someone is looking to attend their first event or their hundredth, we’re here to delight them all the way fro...Show moreLast updated: 30+ days ago
  • Promoted
Senior Infrastructure / Site Reliability Engineer

Senior Infrastructure / Site Reliability Engineer

Particle HealthNew York, New York, United States
Full-time
Particle Health is revolutionizing healthcare data analytics and interoperability.Our mission is to unlock the power of medical records in an intelligent platform that focuses health back on the pa...Show moreLast updated: 30+ days ago
  • Promoted
Staff Site Reliability Engineer

Staff Site Reliability Engineer

Altana AINew York, NY, United States
Full-time
AI can be a powerful tool for good in the world – at Altana we apply AI to the world’s largest organized body of supply chain data to power a more resilient, more secure, and more sustainable model...Show moreLast updated: 30+ days ago
Senior Site Reliability Engineer (SRE

Senior Site Reliability Engineer (SRE

GovServicesHubNew York, NY, us
Full-time
Quick Apply
Senior Site Reliability Engineer (SRE).At January, we’re transforming the lives of borrowers by bringing humanity to consumer finance. Our data-driven products empower financial institutions to stre...Show moreLast updated: 11 days ago
Site Engineer

Site Engineer

ReconUSA, New Jersey, Newark
Full-time
RECON has successfully completed over 6,000 environmental remediation, geotechnical and mine reclamation projects across North America since 1989. Our team of dedicated employees, financial strength...Show moreLast updated: 18 days ago
  • Promoted
Manager, Site Safety-Production Center

Manager, Site Safety-Production Center

libertycokeElmsford, NY, USA
Full-time
Working at Liberty Coca-Cola Beverages LLC is all about pursuing a career not just a job.Discover what it means to be energized by a multitude of possibilities and a dynamic team.At Liberty Coca-Co...Show moreLast updated: 30+ days ago