Senior Site Reliability Engineer (SRE)

People Data Labs • San Francisco, CA, US

18 days ago

Job type

Permanent

Job description

Note for all engineering roles: With the rise of fake applicants and AI-enabled candidate fraud, we have built in additional measures throughout the process to identify such candidates and remove them.

About Us

People Data Labs (PDL) is the provider of people and company data. We do the heavy lifting of data collection and standardization so our customers can focus on building and scaling innovative, compliant data solutions. Our sole focus is on building the best data available by integrating thousands of compliantly sourced datasets into a single, developer-friendly source of truth. Leading companies across the world use PDL's workforce data to enrich recruiting platforms, power AI models, create custom audiences, and more.

We are looking for individuals who can balance extreme ownership with a "one-team, one-dream" mindset. Our customers are trying to solve complex problems, and we can only help them achieve their goals by working as a team. Our Platform Engineering Team oversees the foundational work that the rest of our engineering teams build their success upon.

You will be crucial in accelerating our efforts to build standalone data products that enable data teams and independent developers to create innovative solutions at massive scale. In this role, you will be working with a team, defining tools and infrastructure that facilitate big data processing, primarily within AWS.

If you are looking to be part of a team discovering the next frontier of data-as-a-service (DaaS) with a high level of autonomy and opportunity for direct contributions, this might be the role for you. We like our engineers to be thoughtful, quirky, and willing to fearlessly try new things. Failure is embraced at PDL as long as we continue to learn and grow from it.

What You Get To Do

Manage and improve our growing AWS and data center infrastructures
Design, implement, and maintain a CI/CD pipeline to improve developer workflows
Utilize centralized monitoring and logging to improve visibility across the team
Assist development teams in solving issues around scaling and bottlenecks
Work with teammates to develop high-quality software, balancing security, reliability, and operational concerns

The Technical Chops You'll Need

5-7+ years of software development experience with a background in platform or cloud infrastructure engineering and clear examples of strategic technical problem-solving and implementation

3+ years of experience with Python in a production environment

Strong software development fundamentals and system design experience

Strong experience with our core technologies (AWS, Elasticsearch/OpenSearch, Python, Docker, scaled data processing technologies)

AWS, including EC2, Lambda, OpenSearch, API Gateway, ALB, and others

Data stores, including Postgres/MySQL, Dynamo, Redis, S3

Experience with infrastructure-as-code (IaC) frameworks (e.g., Pulumi, Terraform, CloudFormation, or similar)

Experience with network design, including public/private availability, routing, firewalls/security groups, and VPN

Experience with Identity and Access Management

Experience with configuration management tools (e.g., Chef, Puppet, Ansible)

Experience with observability tools such as Datadog for metrics, logging, etc.

Experience with build and deploy systems, architecting and developing CI/CD infrastructure, repo management, and integrating with tools like GitHub Actions (or similar)

People Thrive Here Who Can

Balance high ownership and autonomy with a strong ability to collaborate

Work effectively remotely (proactively managing blockers, reaching out to ask questions, and participating in team activities)

Communicate clearly in writing, both on Slack/chat and in documents

Write data design docs (pipeline design, dataflow, schema design)

Scope and break down projects, and communicate progress and blockers effectively with your manager, team, and stakeholders

Some Nice To Haves

Degree in a quantitative discipline such as computer science, mathematics, statistics, or engineering

Expertise with Apache Spark (Java, Scala, and/or Python-based)

Experience with SQL data pipeline development

Experience supporting developer-oriented data pipeline and workflow orchestration (e.g., Airflow (preferred), dbt, Dagster, or similar)

Experience with managing, deploying, and ensuring the reliability of streaming platforms (e.g., Kafka)

Experience evaluating data quality and maintaining consistently high standards across new feature releases (e.g., consistency, accuracy, validity, completeness)

Experience using Databricks or similar data-development platforms

Experience managing hybrid environments split between local datacenters and AWS; experience managing bare-metal/co-location infrastructure

Our Benefits

Great people make great teams. We believe in building highly functional, energetic, and engaging teams to serve our customers. Prioritizing people, customers, and shareholders, in that order, sets us up to succeed and deliver on our promises.

Stock

Competitive Salaries

Unlimited paid time off

Medical, dental, & vision insurance

Health, fitness, and office stipends

The permanent ability to work wherever and however you want

Comp: $160K - $180K

No C2C, 1099, or Contract-to-Hire. Recruiters need not apply.

People Data Labs does not discriminate on the basis of race, sex, color, religion, age, national origin, marital status, disability, veteran status, genetic information, sexual orientation, gender identity or any other reason prohibited by law in provision of employment opportunities and benefits.

Qualified Applicants with arrest or conviction records will be considered for Employment in accordance with the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act.

Personal Privacy Policy For California Residents

https://privacy.peopledatalabs.com/policies?name=personnel-privacy-policy

Note: This is a duplicate post of our Senior Software Engineer, Platform role.
