Talent.com
SRE, Data Management and Applications - USDS
SRE, Data Management and Applications - USDSTik Tok • San Jose, CA, United States
SRE, Data Management and Applications - USDS

SRE, Data Management and Applications - USDS

Tik Tok • San Jose, CA, United States
2 days ago
Job type
  • Full-time
Job description

Responsibilities

Team Intro : The Data Engineering team in Data Platform USDS is focused on ensuring the stability, reliability, scalability and risk management of TikTok's US data processing ecosystem. We maintain and operate both batch and streaming pipelines for multiple vertical businesses such as TikTok and TikTok Shop while simultaneously adhering to strict data compliance standards. We collaborate with a host of internal teams to better enable user experiences by delivering high quality at scale. Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data platforms in the world that directly supports the TikTok app. You'll need to ensure the data, services and infrastructures are reliable, fault-tolerant, efficiently scalable and cost-effective. In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that requires employees to work in the office 3 days a week, or as directed by their manager / department. We regularly review our hybrid work model, and the specific requirements may change at any time. Responsibilities : - End-to-End Service Lifecycle Management : Participate in and continuously improve the full lifecycle of services, from initial design and development to deployment, ongoing operation, and iterative optimization. - Ensure Reliability and Scalability : Maintain highly reliable, fault-tolerant, and scalable systems that are both cost-effective and efficient, ensuring data, services, and infrastructure meet business needs. - Performance Troubleshooting : Diagnose and resolve performance issues, including slow queries, resource contention, and bottlenecks across distributed storage engines and services. - Cluster Scaling and Data Growth : Plan and implement strategies for scaling clusters effectively to accommodate increasing data volume while optimizing performance and cost-efficiency. - Documentation and Incident Response : Develop and maintain clear runbooks, Standard Operating Procedures (SOPs), and lead sustainable, blameless incident response practices with post-incident analysis to drive continuous improvement. - Big Data System Design : Architect and implement robust, scalable, and extensible big data systems that support the core business and products, ensuring seamless data flow and system integration. - On-Call Rotation : Participate in on-call rotations for production incidents, ensuring critical issues are addressed swiftly, with availability to troubleshoot and resolve problems outside of regular business hours as needed. - Incident Ownership and Analysis : Take ownership of incidents during on-call hours, coordinate escalations as necessary, and conduct thorough post-incident analyses to identify root causes and implement preventive measures.

Qualifications

Minimum Qualifications : - Bachelor's degree in Computer Science, a related technical field involving software or systems engineering, or equivalent practical experience. - Experience writing code in Java, Scala, Go, Python, or a similar language. Strong scripting skills (e.g., Bash and Shell) for automation tasks. - Experience with algorithms, data structures, complexity analysis, and software design : Solid understanding of how to build scalable and efficient systems. - Basic SQL (MySQL, PostgreSQL, or similar) : Strong understanding of traditional relational databases like MySQL or PostgreSQL. Ability to write queries, perform joins, use aggregate functions, and optimize basic SQL queries. - Systems and Infrastructure : Knowledge of Linux / Unix systems, as most infrastructure is based on Linux. Familiarity with system internals, networking, and resource management (memory, CPU, storage). - Hands-on experience with observability tools such as Prometheus, Grafana, & OpenTSDB : For monitoring, logging, and real-time performance tracking. - CI / CD Tools : Familiarity with Continuous Integration / Continuous Deployment pipelines and tools (e.g., Jenkins, GitLab CI, Aegis , Bits). Preferred Qualifications : - Strong troubleshooting and debugging skills to work in a fast paced oncall environment. - Familiarity with containerized deployments such as Docker / Kubernetes. - Experience running production-grade services at scale : Understanding of cloud-native technologies, networking, and storage management to support high-availability and large-scale environments. - Experience developing tools and APIs : To reduce human intervention in systems administration and improve automation and operational efficiency. - Expertise in designing, analyzing, and troubleshooting large-scale systems : Experience with Hadoop, Spark, Hive, Presto, Kafka, Flink, or comparable solutions is a strong plus. As a condition of employment, all successful candidates must be able to establish authorization to work in the United States. For this position, the Company does not provide sponsorship or any immigration-related benefits.

Job Information

[For Pay Transparency]Compensation Description (Annually)

The base salary range for this position in the selected city is $118657 - $259200 annually.

Compensation may vary outside of this range depending on a number of factors, including a candidate's qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses / incentives, and restricted stock units.

Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short-term and long-term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).

The Company reserves the right to modify or change these benefits programs at any time, with or without notice.

For Los Angeles County (unincorporated) Candidates :

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment :

  • 1. Interacting and occasionally having unsupervised contact with internal / external clients and / or colleagues;

2. Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and

3. Exercising sound judgment.

About USDS

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security ("USDS") is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok to learn something new, earn a living, express themselves creatively, or be entertained. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.

Why Join Us

Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy - a mission we work towards every day.

We strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.

Diversity & Inclusion

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

USDS Reasonable Accommodation

USDS is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at

Create a job alert for this search

Data Management • San Jose, CA, United States

Related jobs
SDR

SDR

Paystand • Santa Cruz, CA, US
Full-time
At Paystand, we're not just another fintech company—we're trailblazers in decentralized finance (DeFi), transforming how businesses manage their finances. With thriving hubs in Santa C...Show more
Last updated: 30+ days ago • Promoted
County Meter Reader

County Meter Reader

Meter Reader • Gilroy, CA
Full-time
Responsibilities The primary responsibility of this position is to read meters and record consumption of the water used, cleaning of meter boxes, and removal of vegetation impeding access to meters...Show more
Last updated: 1 day ago • Promoted
Senior Manager, REMS Data Programmer (Remote)

Senior Manager, REMS Data Programmer (Remote)

Jazz Pharmaceuticals • Mountain View, California, USA
Remote
Full-time
If you are a current Jazz employee please apply via the Internal Career site.Jazz Pharmaceuticals is a global biopharma company whose purpose is to innovate to transform the lives of patients and ...Show more
Last updated: 30+ days ago • Promoted
Manager, Revenue Cycle Management

Manager, Revenue Cycle Management

Accordance Search Group • Livermore, CA, US
Full-time
Our Fortune 500 client is seeking a Manager, Revenue Cycle Management, who will lead and manage the company’s healthcare cash posting process within the Revenue Cycle management team on EMR, ...Show more
Last updated: 30+ days ago • Promoted
Ocean Data Assimilation Postdoctoral Scholar

Ocean Data Assimilation Postdoctoral Scholar

University of California - Santa Cruz • Santa Cruz, CA, United States
Full-time
Ocean Data Assimilation Postdoctoral Scholar .Commensurate with qualifications and experience.Postdoctoral Scholar-Employee / Postdoctoral Scholar-Fellow / Postdoctoral Scholar-Paid Direct -Fiscal...Show more
Last updated: 30+ days ago • Promoted
Technology and Information Management Program : Adjunct Professor Pool

Technology and Information Management Program : Adjunct Professor Pool

University of California - Santa Cruz • Santa Cruz, CA, United States
Full-time
Assistant Adjunct Professor, Associate Adjunct Professor, and Adjunct Professor .See the scale titled, Faculty Ladder Rank Business / Economics / Engineering Fiscal Year. A reasonable estimate for a f...Show more
Last updated: 30+ days ago • Promoted
Fisheries Collaborative Program Specialist Pool

Fisheries Collaborative Program Specialist Pool

University of California - Santa Cruz • Santa Cruz, CA, United States
Full-time
FCP Specialists (Junior, Assistant, Associate and Specialist ranks) .Commensurate with qualifications and experience (see section. See the scale titled, •Represented Specialist Series Fiscal Year....Show more
Last updated: 30+ days ago • Promoted
Sr. Enterprise Applications Analyst (26613)

Sr. Enterprise Applications Analyst (26613)

Supermicro • San Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show more
Last updated: 5 days ago • Promoted
Associate Director Program Management

Associate Director Program Management

Avails Medical, Inc. • Menlo Park, CA, US
Full-time
We are seeking a proven leader to join our team as an.Associate Director / Director of Program Management.In this role, you won’t just manage projects—you’ll lead multidisciplinary ...Show more
Last updated: 1 day ago • Promoted
Technology and Information Management Program : Lecturer Pool

Technology and Information Management Program : Lecturer Pool

University of California - Santa Cruz • Santa Cruz, CA, United States
Full-time
A reasonable estimate for an appointment to teach a standard five-credit course is $9,986-$20,908 (based on salary points 5-30). Compensation for Summer Session courses may vary from courses taught ...Show more
Last updated: 30+ days ago • Promoted
Sr. Manager, Project Management / Data Center Rack Solution (27501)

Sr. Manager, Project Management / Data Center Rack Solution (27501)

Supermicro • San Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show more
Last updated: 5 days ago • Promoted
AWS Integration Lead

AWS Integration Lead

Reveille Technologies,Inc • Fremont, CA, US
Full-time
This is to support the migration activities between.Medical Super Search portal and to deliver the below with good collaboration with the customer stakeholders. Provide technical expertise in softwa...Show more
Last updated: 11 hours ago • Promoted • New!
Senior Director, Data and Analytics Platforms

Senior Director, Data and Analytics Platforms

Exelixis • Alameda, CA, United States
Full-time
Senior Director, Data and Analytics Platforms.Exelixis's success and ambition to launch innovative medicines for patients. Operating within a product-centric model, the Senior Director, IT Product M...Show more
Last updated: 12 days ago • Promoted
Technical Field Specialist

Technical Field Specialist

Cognizant • Redwood Estates, CA, US
Full-time
Cognizant is one of the world’s leading professional services companies, we help our clients modernize technology, reinvent processes and transform experiences, so they can stay ahead in our consta...Show more
Last updated: 1 day ago • Promoted
Director, Solution Management (27489)

Director, Solution Management (27489)

Supermicro • San Jose, CA, United States
Full-time
Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Show more
Last updated: 5 days ago • Promoted
Senior Solution Architect

Senior Solution Architect

Medasource • Palo Alto, CA, US
Temporary
Palo Alto, CA (Remote with occasional travel).The ideal candidate will bring deep expertise in healthcare data architecture, particularly around physician credentialing, Epic systems, and regulator...Show more
Last updated: 20 days ago • Promoted
Assistant Director - IDD Programs

Assistant Director - IDD Programs

Hope Services • Santa Cruz, CA, US
Full-time
Are you a person who enjoys helping others? Are you currently seeking fulfillment in your professional life?.Hope Services is Silicon Valley’s leading provider of services to people with deve...Show more
Last updated: 19 days ago • Promoted
Reserve Entomologist

Reserve Entomologist

United States Army • San Juan Bautista, CA, US
Full-time
THE ARMY HEALTH CARE ADVANTAGE As a member of the Army health care team, you'll receive benefits that you won't be able to get in a civilian career. Challenging Work Feel inspired with great case di...Show more
Last updated: 16 days ago • Promoted