Talent.com
Software Engineer - Supercomputing Platform & Infrastructure
Software Engineer - Supercomputing Platform & InfrastructureMagic AI Corp. • New York, NY, United States
Software Engineer - Supercomputing Platform & Infrastructure

Software Engineer - Supercomputing Platform & Infrastructure

Magic AI Corp. • New York, NY, United States
2 days ago
Job type
  • Full-time
Job description

Magic's mission is to build safe AGI that accelerates humanity's progress on the world's most important problems. We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than humans can alone. Our approach combines frontier-scale pre-training, domain-specific RL, ultra-long context, and inference-time compute to achieve this goal.

About the role :

As a Software Engineer on our Supercomputing Platform & Infrastructure team, you will design and build resilient and optimized solutions for AI workloads on massive Computing Clusters.

What you might work on :

  • Work closely with the training and inference teams to deliver high performance and reliability across storage, networking, and distributed computing designs.
  • Build the software stack to run massive-scale (thousands of GPUs), highly available supercomputing infrastructure
  • Troubleshoot and resolve complex issues across hardware accelerated devices, networking, storage subsystems (local NVMe / Block Storage / NFS), OS, drivers and cloud environments, and automate detection and recovery processes
  • Operate data-intensive workloads at petabyte-scale
  • Increase the ease-of-use and self-serviceability of the compute platforms at Magic through top-notch documentation and developer workflow design
  • Investigate and resolve incidents across security and availability

What we're looking for :

  • Experience working with production GPU deployments, data-intensive applications, large-scale model training and HPC
  • Strong understanding of networking-, storage- and data-related technologies
  • Experience with GCP, AWS, Azure, OCI or similar cloud platforms
  • Strong software engineering skills
  • Strong IaC knowledge with extensive experience in Terraform, Pulumi, AWS CDK / CloudFormation or similar
  • Magic strives to be the place where high-potential individuals can do their best work. We value quick learning and grit just as much as skill and experience.

    Our culture :

  • Integrity. Words and actions should be aligned
  • Hands-on. At Magic, everyone is building
  • Teamwork. We move as one team, not N individuals
  • Focus. Safely deploy AGI. Everything else is noise
  • Quality. Magic should feel like magic
  • Compensation, benefits and perks (US) :

  • Annual salary range : $225K - $550K
  • Equity is a significant part of total compensation, in addition to salary
  • 401(k) plan with 6% salary matching
  • Generous health, dental and vision insurance for you and your dependents
  • Unlimited paid time off
  • Visa sponsorship and relocation stipend to bring you to SF, if possible
  • A small, fast-paced, highly focused team
  • Create a job alert for this search

    Software Engineer Infrastructure • New York, NY, United States

    Related jobs
    Infrastructure Software Engineer, Public Sector

    Infrastructure Software Engineer, Public Sector

    Scale AI, Inc. • New York, NY, United States
    Full-time
    Scale AI is seeking a highly skilled and motivated.Software Engineer, AI Infrastructure & Security.Public Sector Engineering team. As a part of this team, you will play a critical role in delivering...Show more
    Last updated: 30+ days ago • Promoted
    Staff Infrastructure Software Engineer, Enterprise AI

    Staff Infrastructure Software Engineer, Enterprise AI

    Scale AI, Inc. • New York, NY, United States
    Full-time
    Scale GP is building the next generation of enterprise-grade Generative AI products.Our platform provides APIs for knowledge retrieval, inference, and evaluation, enabling customers to build and de...Show more
    Last updated: 17 days ago • Promoted
    Senior Software Engineer, Infrastructure

    Senior Software Engineer, Infrastructure

    Kiddom • New York, NY, United States
    Full-time +1
    Kiddom is a groundbreaking educational platform that promotes student equity and growth by uniting high-quality instructional materials with dynamic digital learning. Through unparalleled curriculum...Show more
    Last updated: 2 days ago • Promoted
    Principal Software Engineer, Infrastructure & Operations

    Principal Software Engineer, Infrastructure & Operations

    Jobgether • New York, NY, US
    Remote
    Full-time
    Quick Apply
    This position is posted by Jobgether on behalf of a partner company.We are currently looking for a Principal Software Engineer, Infrastructure & Operations in New York (USA).As a Principal Soft...Show more
    Last updated: 14 days ago
    Senior Software Engineer (Infrastructure)

    Senior Software Engineer (Infrastructure)

    Owl • New York, NY, United States
    Full-time
    AI systems for high‑stakes, real‑world decisions.Our platform ingests and reasons over large, messy data to surface evidence with hard constraints around fairness, auditability, and low bias.The si...Show more
    Last updated: 30+ days ago • Promoted
    Staff+ Software Engineer - Infrastructure

    Staff+ Software Engineer - Infrastructure

    Anthropic • New York, NY, United States
    Full-time
    Anthropic’s mission is to create reliable, interpretable, and steerable AI systems.We want AI to be safe and beneficial for users and society. Our team includes researchers, engineers, policy expert...Show more
    Last updated: 16 days ago • Promoted
    Software Engineer - AI Infrastructure

    Software Engineer - AI Infrastructure

    AssembledHQ, Inc • New York, NY, United States
    Full-time
    Great customer support requires human agents and AI in perfect balance, and Assembled is the only unified platform that orchestrates both at scale. Companies like Canva, Etsy, and Robinhood use Asse...Show more
    Last updated: 30+ days ago • Promoted
    Critical Infrastructure Engineer

    Critical Infrastructure Engineer

    DataBank Holdings, Ltd. • Orangeburg, NY, United States
    Full-time
    DataBank's managed data center services are anchored in world-class facilities.Our customized technology solutions are designed to help customers effectively manage risk, improve technology perform...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Internal Infrastructure (North America)

    Software Engineer, Internal Infrastructure (North America)

    Cohere • New York, NY, United States
    Full-time
    Our mission is to scale intelligence to serve humanity.We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like cont...Show more
    Last updated: 2 days ago • Promoted
    Senior Software Engineer, Infrastructure

    Senior Software Engineer, Infrastructure

    S&P Global • New York, NY, United States
    Full-time
    Kensho is S&P Global's hub for AI innovation and transformation.With expertise in machine learning, natural language processing, and data discovery, we develop and deploy novel solutions to innovat...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer - Infrastructure

    Senior Software Engineer - Infrastructure

    Epirus • Brooklyn, NY, United States
    Permanent
    Senior Software Engineer - Infrastructure.Epirus is a high-growth technology company dedicated to overcoming the asymmetric challenges inherent to the future of national security.Epirus' flagship p...Show more
    Last updated: 1 day ago • Promoted
    Software Engineer - Infrastructure, Foundry Platform

    Software Engineer - Infrastructure, Foundry Platform

    Palantir Technologies • New York, NY, United States
    Full-time
    Palantir builds the world's leading software for data-driven decisions and operations.By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving ...Show more
    Last updated: 1 day ago • Promoted
    Senior Software Engineer (Storage Infrastructure)

    Senior Software Engineer (Storage Infrastructure)

    ROKT • New York, NY, United States
    Full-time
    Senior Software Engineer (Storage Infrastructure).We are Rokt, a hyper-growth ecommerce leader.Rokt is the global leader in ecommerce, unlocking real-time relevance in the moment that matters most....Show more
    Last updated: 30+ days ago • Promoted
    Lead Platform Engineer (Network Infrastructure)

    Lead Platform Engineer (Network Infrastructure)

    Capital One • New York, NY, United States
    Full-time +1
    Lead Platform Engineer (Network Infrastructure) Do you love building and pioneering in the technology space? Do you enjoy solving complex technical problems in a fast-paced, collaborative, inclusiv...Show more
    Last updated: 30+ days ago • Promoted
    Infrastructure Software Engineer, Public Sector

    Infrastructure Software Engineer, Public Sector

    Scale AI • New York, NY, United States
    Full-time
    Scale AI is seeking a highly skilled and motivated.Software Engineer, AI Infrastructure & Security.Public Sector Engineering team. As a part of this team, you will play a critical role in delivering...Show more
    Last updated: 9 days ago • Promoted
    Senior Software Engineer - Infrastructure & Tools

    Senior Software Engineer - Infrastructure & Tools

    Beacon • New York, NY, United States
    Full-time
    Our client is a fast-scaling digital platform focused on transforming the way physical and digital operations connect across industries such as design, logistics, and fulfillment.With a strong engi...Show more
    Last updated: 2 days ago • Promoted
    Senior Software Engineer - Infrastructure

    Senior Software Engineer - Infrastructure

    Baseten • New York, NY, United States
    Full-time
    Baseten powers inference for the world's most dynamic AI companies, like OpenEvidence, Clay, Mirage, Gamma, Sourcegraph, Writer, Abridge, Bland, and Zed. By uniting applied AI research, flexible inf...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Infrastructure

    Senior Software Engineer, Infrastructure

    Current • New York, NY, United States
    Full-time
    SENIOR SOFTWARE ENGINEER, INFRASTRUCTURE.Current is a leading consumer fintech platform transforming financial access for everyday Americans with over 5 million members. We provide access to financi...Show more
    Last updated: 2 days ago • Promoted