Talent.com
Senior Software Engineer, AI Inference Platform
Senior Software Engineer, AI Inference PlatformCerebras • Sunnyvale, CA, United States
Senior Software Engineer, AI Inference Platform

Senior Software Engineer, AI Inference Platform

Cerebras • Sunnyvale, CA, United States
1 day ago
Job type
  • Full-time
Job description

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include global corporations across multiple industries, national labs, and top-tier healthcare systems. In January, we announced a multi-year, multi-million-dollar partnership with Mayo Clinic, underscoring our commitment to transforming AI applications across various fields. In August, we launched Cerebras Inference, the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services.

About The Role

We are seeking a talented Platform Software Engineer to join the team building the Cerebras Inference Platform. You will be instrumental in designing, developing, and operating the core backend services and APIs that power the Inference platform. You\'ll build the software that allows customers to seamlessly deploy, manage, and serve inference workloads on dedicated Cerebras hardware.

Responsibilities :

  • Design, build, and maintain the core APIs for the Inference Platform, handling model catalog management, deployment of ML workloads, scaling, and status monitoring.
  • Focus on building platform capabilities that optimize for ease-of-use, robustness, and self-service access to inference models and serving.
  • Collaborate with infrastructure and ML engineering teams to ensure high reliability, uptime, and smooth user interactions with the inference service.
  • Design and implement features like multi-tenant support, deployment automation, priority queuing, and caching strategies for user requests.
  • Build robust observability features by integrating with monitoring and telemetry tools (e.g., Prometheus, Grafana) to track system health, performance metrics, and request analytics.

Skills & Qualifications :

  • Bachelor’s or Master\'s degree in computer science or related field, or equivalent practical experience.
  • 5+ years of experience in backend software development, with a focus on service APIs, orchestration platforms, or user-facing infrastructure.
  • Strong proficiency in Python (C++ is good to have).
  • Experience designing, building, and integrating with RESTful APIs and gRPC services.
  • Solid understanding of distributed systems concepts such as concurrency, scalability, and fault tolerance.
  • Hands-on experience with containerization (Docker) and orchestration frameworks (Kubernetes).
  • Experience with databases and caching systems (e.g., Postgres, Redis).
  • Experience with observability, telemetry pipelines, and system monitoring best practices.
  • Strong problem-solving and debugging abilities.
  • Excellent communication and cross-functional collaboration skills.
  • Why Join Cerebras

  • Build a breakthrough AI platform beyond the constraints of the GPU.
  • Publish and open source their cutting-edge AI research.
  • Work on one of the fastest AI supercomputers in the world.
  • Enjoy job stability with startup vitality.
  • Our simple, non-corporate work culture that respects individual beliefs.
  • Read our blog : Five Reasons to Join Cerebras in 2025.

    Apply today

    Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.

    This website or its third-party tools process personal data. For more details, the privacy notice is available on our site.

    #J-18808-Ljbffr

    Create a job alert for this search

    Senior Software Engineer • Sunnyvale, CA, United States

    Related jobs
    AI Solutions Engineer

    AI Solutions Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for an AI Solutions Engineer to lead innovation around AI-assisted development tools and cloud transformation. Key Responsibilities Design and prototype use cases leveraging G...Show more
    Last updated: 30+ days ago • Promoted
    Technical Lead AI Engineer

    Technical Lead AI Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Technical Lead AI Software Engineer.Key Responsibilities Design, build, and maintain AI-augmented developer tools, services, and workflows Lead initiatives to improve ...Show more
    Last updated: 16 hours ago • Promoted • New!
    Senior Staff AI Engineer

    Senior Staff AI Engineer

    Palo Alto Networks • Santa Clara, CA, US
    Full-time
    At Palo Alto Networks® everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer a...Show more
    Last updated: 19 days ago • Promoted
    Senior AI / ML Engineer

    Senior AI / ML Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Senior AI / ML Engineer to support data and AI initiatives for government clients.Key Responsibilities Design, develop, and deploy scalable AI / ML solutions using AWS clou...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI / ML Storage Engineer

    Senior AI / ML Storage Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Senior AI and ML Storage Engineer.Key Responsibilities Design, develop, and operate distributed systems for managing data, compute, and networking for large-scale AI wo...Show more
    Last updated: 1 day ago • Promoted
    AI Architect

    AI Architect

    VirtualVocations • Hayward, California, United States
    Full-time
    A company is looking for an AI Architect & Strategist to shape its enterprise AI strategy and drive responsible AI adoption. Key Responsibilities Develop and evolve the company's AI strategy in al...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer, Enterprise AI

    Software Engineer, Enterprise AI

    Scale AI, Inc. • San Francisco, CA, United States
    Full-time
    Scale GP (Scale Generative AI Platform) is an enterprise-grade Generative AI platform that provides APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a strong enginee...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer

    Senior Software Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Senior Software Engineer, DGXC Data Services.Key Responsibilities Design and build software code and cloud services for Data Management, including cataloging and managi...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI Engineer

    Senior AI Engineer

    VirtualVocations • Hayward, California, United States
    Full-time
    A company is looking for a Senior AI Engineer to develop AI-powered tools aimed at transforming democracy.Key Responsibilities Architect and lead the development of human-in-the-loop AI applicati...Show more
    Last updated: 30+ days ago • Promoted
    AI Inference Engineer

    AI Inference Engineer

    Perplexity AI • San Francisco, CA, US
    Full-time
    Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world's leading AI platforms. Perplexity has raised over $1B in venture investment from some ...Show more
    Last updated: 30+ days ago • Promoted
    AI Systems Engineer

    AI Systems Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for an Engineering & AI Systems Engineer to design and implement internal tools that enhance operational efficiency. Key Responsibilities Build and deploy internal tools to ad...Show more
    Last updated: 30+ days ago • Promoted
    Senior Engineer - AI Security

    Senior Engineer - AI Security

    VirtualVocations • Hayward, California, United States
    Full-time
    A company is looking for a Senior Engineer - AI Security (Remote).Key Responsibilities Implement and administer responsible AI-centric APIs and integrate Cybersecurity software with business syst...Show more
    Last updated: 1 day ago • Promoted
    Senior Data and AI Engineer

    Senior Data and AI Engineer

    VirtualVocations • Hayward, California, United States
    Full-time
    A company is looking for a Senior Data and AI Systems Engineer to lead the integration of data and AI systems for a digital assistant in the K-12 education sector. Key Responsibilities Design and ...Show more
    Last updated: 16 hours ago • Promoted • New!
    Distinguished AI Engineer

    Distinguished AI Engineer

    VirtualVocations • Hayward, California, United States
    Full-time
    A company is looking for a Director, Distinguished AI Engineer (Remote-Eligible).Key Responsibilities Collaborate with cross-functional teams to deliver AI-powered products Design, develop, test...Show more
    Last updated: 1 day ago • Promoted
    Senior Staff AI Engineer

    Senior Staff AI Engineer

    VirtualVocations • Hayward, California, United States
    Full-time
    A company is looking for a Senior Staff AI Engineer to design and deploy advanced AI systems.Key Responsibilities Architect foundational components for production-grade AI systems Collaborate wi...Show more
    Last updated: 5 days ago • Promoted
    AI Engineer

    AI Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for an AI Engineer.Key Responsibilities Architect, develop, and deploy scalable ML and LLM-based systems Build and maintain microservices architectures and APIs for AI appli...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI Software Engineer

    Senior AI Software Engineer

    VirtualVocations • Santa Clara, California, United States
    Full-time
    A company is looking for a Senior AI Software Engineer to enhance developer experience through AI-powered tools and workflows. Key Responsibilities Design, build, and maintain AI-augmented develop...Show more
    Last updated: 30+ days ago • Promoted
    Senior Software Engineer, Full-Stack - Enterprise Gen AI

    Senior Software Engineer, Full-Stack - Enterprise Gen AI

    Scale AI, Inc. • San Francisco, CA, United States
    Full-time
    Senior Software Engineer, Full-Stack - Enterprise Gen AI.Scale GP (Scale Generative AI Platform) is an enterprise-grade AI platform providing APIs for knowledge retrieval, inference, evaluation, an...Show more
    Last updated: 30+ days ago • Promoted
    Software Engineer - AI

    Software Engineer - AI

    VirtualVocations • San Jose, California, United States
    Full-time
    A company is looking for a Software Engineer - AI Engineering (Remote).Key Responsibilities Build and integrate AI-powered backend features that adapt and learn in production Extend APIs and dat...Show more
    Last updated: 4 days ago • Promoted
    AI Data Integration Architect

    AI Data Integration Architect

    VirtualVocations • Hayward, California, United States
    Full-time
    A company is looking for an AI & Data Integration Architect to design and implement enterprise-grade integration solutions. Key Responsibilities Design and implement integration strategies for AI ...Show more
    Last updated: 4 days ago • Promoted