Senior Platform Engineer II, Compute Services

CoreWeave, Livingston, NJ, US
6 hours ago
Job type
  • Permanent
Job description

CoreWeave is the AI Hyperscaler™, delivering a cloud platform of cutting-edge services powering the next wave of AI. Our technology provides enterprises and leading AI labs with the most performant, efficient, and resilient solutions for accelerated computing. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe. CoreWeave was ranked as one of the TIME100 most influential companies of 2024.

As the leader in the industry, we thrive in an environment where adaptability and resilience are key. Our culture offers career-defining opportunities for those who excel amid change and challenge. If you're someone who thrives in a dynamic environment, enjoys solving complex problems, and is eager to make a significant impact, CoreWeave is the place for you. Join us, and be part of a team solving some of the most exciting challenges in the industry.

CoreWeave powers the creation and delivery of the intelligence that drives innovation.

What You'll Do :

We are seeking a Senior Platform Engineer to join our Kubernetes Infrastructure team. This role involves administering our critical multi-tenant Kubernetes platforms and collaborating with development teams to establish proper deployment architectures. The ideal candidate will have a strong background in resilient Kubernetes application architecture and deployment.

About the role :

  • Champion reliability initiatives for Kubernetes application deployments : Advocate for best practices to ensure high availability, scalability, and resilience of applications in Kubernetes, focusing on robust testing, secure pipelines, and efficient resource use.
  • Administer multi-tenant Kubernetes platforms : Manage complex multi-tenant Kubernetes clusters, configuring access, quotas, and security for isolation and optimal resource allocation while upholding SLAs.
  • Perform lifecycle and Day 2 operations on clusters : Manage the Kubernetes cluster lifecycle, including provisioning, patching, monitoring, backup, disaster recovery, and troubleshooting.
  • Deep dive into reliability issues : Conduct in-depth analysis and root cause identification for complex reliability incidents in Kubernetes, utilizing advanced debugging and monitoring tools to propose preventative measures.
  • Perform on-call duties : Respond to critical alerts and incidents outside business hours, providing timely resolution to minimize disruptions, collaborating with teams, and communicating clearly.

Who You Are :

  • Bachelor's in CS, Engineering, or related field, or equivalent experience preferred.
  • CKA or similar certification is highly desired.
  • 2+ years administering multi-tenant SaaS Kubernetes (EKS, AKS, GKE).
  • Strong GitOps / DevOps experience with Argo CD or similar Helm chart management.
  • Proven Docker and containerization experience.
  • Strong Linux OS experience.
  • Proficient in Go.
  • Excellent problem-solving, debugging, and analytical skills.
  • Strong communication and collaboration.
Preferred :

  • Master's degree in Computer Science, Engineering, or a related field.
  • Experience with performance profiling and optimization of distributed systems.
  • Knowledge of network protocols and distributed consensus algorithms.
Wondering if you're a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match.

    Why CoreWeave?

    At CoreWeave, we work hard, have fun, and move fast! We're in an exciting stage of hyper-growth that you will not want to miss out on. We're not afraid of a little chaos, and we're constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values :

  • Be Curious at Your Core
  • Act Like an Owner
  • Empower Employees
  • Deliver Best-in-Class Client Experiences
  • Achieve More Together
    We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for takeoff, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!

    The base salary range for this role is $165,000 to $242,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).

    What We Offer

    The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate, which can depend on a variety of factors, including qualifications, experience, interview performance, and location.

    In addition to a competitive salary, we offer a variety of benefits to support your needs, including :

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance
  • Voluntary supplemental life insurance
  • Short and long-term disability insurance
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health
  • Family-Forming support provided by Carrot
  • Paid Parental Leave
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption
    Our Workplace

    While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration.

    California Consumer Privacy Act - California applicants only

    CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.

    As part of this commitment and consistent with the Americans with Disabilities Act (ADA), CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact : careers@coreweave.com.

    Export Control Compliance

    This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.
