Talent.com
Software Engineer, AI Training and Infrastructure

Skild AI • San Francisco, CA, United States
30+ days ago
Job type
  • Full-time
Job description

Company Overview

At Skild AI, we are building the world's first general-purpose robotic intelligence that is robust and adapts to unseen scenarios without failing. We believe massive scale through data-driven machine learning is the key to unlocking these capabilities for the widespread deployment of robots within society. Our team consists of individuals with varying backgrounds and levels of experience, from new graduates to domain experts. Relevant industry experience is important, but ultimately less so than your demonstrated abilities and attitude. We are looking for passionate individuals who are eager to explore uncharted waters and contribute to our innovative projects.

Position Overview

We are looking for a Software Engineer to work at the forefront of developing and optimizing the software infrastructure and tools necessary for training cutting-edge AI models. You will focus on building robust, scalable, and efficient training pipelines and frameworks that support the entire machine learning lifecycle, from data preparation to model deployment. You will collaborate with researchers and machine learning engineers to ensure seamless integration and operation of training systems, pushing the boundaries of what AI can achieve in real-world robotics applications. You will explore new ways to efficiently make use of many types of data in our training pipeline.

Responsibilities

  • Develop and maintain robust, scalable, and distributed training pipelines (data preprocessing, training orchestration, and model evaluation) and frameworks for large-scale AI models.
  • Optimize training processes for performance and resource utilization, ensuring scalability and reliability.
  • Collaborate with researchers and machine learning engineers to integrate state-of-the-art algorithms and techniques into training pipelines.
  • Monitor and analyze training, identifying bottlenecks and proposing solutions to improve efficiency and performance.
  • Ensure the robustness and reliability of the training infrastructure, including automated testing and continuous integration.

Preferred Qualifications

  • BS, MS or higher degree in Computer Science, Robotics, Engineering or a related field, or equivalent practical experience.
  • Proficiency in Python, C++, or a similar language, and in at least one deep learning library such as PyTorch, TensorFlow, or JAX.
  • Strong background in distributed computing, parallel processing techniques, large-scale dataset handling, and data preprocessing.
  • Deep understanding of state-of-the-art machine learning techniques and models.
  • Experience with cloud-based training environments (AWS, Google Cloud, Azure).
  • Experience in developing and maintaining software tooling and infrastructure for machine learning.
  • Deep understanding of, and practical experience with, software engineering principles, including algorithms, data structures, and system design.
  • Experience with continuous integration and automated testing frameworks.

Base Salary Range

$100,000 to $300,000 USD

