Talent.com
Cantina Labs is hiring: Inference Engineer, Video AI in San Francisco
Cantina Labs is hiring: Inference Engineer, Video AI in San FranciscoMediabistro • San Francisco, CA, United States
Cantina Labs is hiring : Inference Engineer, Video AI in San Francisco

Cantina Labs is hiring : Inference Engineer, Video AI in San Francisco

Mediabistro • San Francisco, CA, United States
22 hours ago
Job type
  • Full-time
Job description

A Bit About Cantina

Cantina, founded by Sean Parker, is a new social platform with the most advanced AI character creator. Build, share, and interact with AI bots and your friends directly in the Cantina or across the internet. Cantina bots are lifelike, social creatures, capable of interacting wherever humans go on the internet. Recreate yourself using powerful AI, imagine someone new, or choose from thousands of existing characters. Bots are a new media type that offer a way for creators to share infinitely scalable and personalized content experiences combined with seamless group chat across voice, video, and text. If you're excited about the potential AI has to shape human creativity and social interactions, join us in building the future!

About the role

We're looking for an Inference Engineer who specializes in productionizing and hosting video AI models at scale. You'll be responsible for taking cutting-edge neural networks from research to production, building robust inference infrastructure, and optimizing model performance for real-time applications. This role focuses on the deployment and serving of large video models.

  • Deploy video AI models to production – Take research models and build production-ready inference endpoints with APIs, ensuring efficient operation across cloud infrastructure.
  • Maintain and optimize inference systems – Debug complex model serving issues, optimize latency performance, monitor system health, and ensure 99.9% uptime for AI-powered features.
  • Implement model optimizations – Work with neural network architectures including diffusion networks, VAEs, and transformers. Apply streaming optimizations and understand video model architectures to implement effective performance improvements.
  • Manage inference infrastructure – Leverage containerization with Docker, cloud storage solutions like S3, and cluster computing to build scalable model serving infrastructure.
  • Collaborate with research teams – Work closely with AI researchers to understand model requirements, architectural constraints, and optimization opportunities for new video generation models.

Qualifications

  • 2+ years of ML engineering experience with focus on model inference and deployment
  • Strong understanding of neural network architectures, particularly diffusion networks, VAEs, and transformer models
  • Experience with video and image models – Understanding of how video / image generation models work, their architectures, and optimization strategies specific to video processing
  • Multi-GPU inference expertise – Experience running model components across multiple GPUs, implementing parallel processing strategies for large models
  • Production model hosting experience – Track record of deploying and maintaining ML models in production environments, including streaming and real-time inference
  • Experience with containerization (Docker), AWS, and cluster computing environments
  • Familiarity with machine learning frameworks (PyTorch, TensorFlow)
  • Experience with inference platforms and model serving solutions
  • Technical Stack You'll Work With

  • Cloud : AWS (S3, DynamoDB), Kubernetes clusters
  • ML Infrastructure : Model serving platforms, Docker
  • Languages : Python
  • Frameworks : PyTorch, TensorFlow
  • Models : Video generation models, diffusion networks, VAEs, transformers
  • Optimization

    Multi-GPU inference, real-time processing techniques

    Pay Equity

    In compliance with Pay Transparency Laws, the base salary range for this role is between $175,000-$225,000 for those located in the San Francisco Bay Area, New York City and Seattle, WA. When determining compensation, a number of factors will be considered, including skills, experience, job scope, location, and competitive compensation market data.

    Benefits

  • Health Care — 99% of premiums for medical, vision, dental are fully paid for by Cantina, plus One Medical membership.
  • Monthly Wellness Stipend — $500 / month to use on whatever you’d like!
  • Rest and Recharge — 15 PTO days per year, 10 sick days, all Federal holidays, and 2 floating holidays.
  • 401(K) — Eligible to participate on day one of employment.
  • Parental Leave & Fertility Support
  • Competitive Salary & Equity
  • Lunch and snacks provided for in-office employees.
  • WFH equipment provided for full-time hybrid / remote employees.
  • #J-18808-Ljbffr

    Create a job alert for this search

    Labs Hiring Engineer • San Francisco, CA, United States

    Related jobs
    Technical Lead AI Engineer

    Technical Lead AI Engineer

    VirtualVocations • Fremont, California, United States
    Full-time
    A company is looking for a Technical Lead AI Software Engineer.Key Responsibilities Design, build, and maintain AI-augmented developer tools, services, and workflows Lead initiatives to improve ...Show more
    Last updated: 9 hours ago • Promoted • New!
    Senior AI Research Engineer, Model Inference (Remote)

    Senior AI Research Engineer, Model Inference (Remote)

    Tether Operations Limited • San Francisco, CA, United States
    Remote
    Full-time
    Join Tether and Shape the Future of Digital Finance.At Tether, we’re building solutions that empower businesses to integrate reserve-backed tokens across blockchains with transparency and trust in ...Show more
    Last updated: 25 days ago • Promoted
    AI Engineer

    AI Engineer

    VirtualVocations • Hayward, California, United States
    Full-time
    A company is looking for an AI Engineer.Key Responsibilities Architect, develop, and deploy scalable ML and LLM-based systems Build and maintain microservices architectures and APIs for AI appli...Show more
    Last updated: 30+ days ago • Promoted
    Lead AI Engineer

    Lead AI Engineer

    Recruiting from Scratch • San Francisco, CA, United States
    Full-time
    Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients.Our team is 100% remote and we work with teams across North America, South America, and Europe to...Show more
    Last updated: 30+ days ago • Promoted
    AI Engineer, Medical Imaging

    AI Engineer, Medical Imaging

    OSI Engineering • Pleasanton, CA, US
    Full-time
    AI Engineer, Medical Imaging We are seeking a Senior level AI Engineer to join our algorithm team as a contractor, focused on developing and optimizing AI models for medical imaging applications.Th...Show more
    Last updated: 30+ days ago • Promoted
    AIVideo is hiring : AIVideo.com seeks pioneering AI Engineer to reinvent video pr

    AIVideo is hiring : AIVideo.com seeks pioneering AI Engineer to reinvent video pr

    Mediabistro • San Francisco, CA, United States
    Full-time
    We have collected the best dataset in the world for video editing.Our popular web-based video editor produces thousands of user actions per minute. The goal is to use that user data to power a human...Show more
    Last updated: 30+ days ago
    AI Inference Engineer

    AI Inference Engineer

    Perplexity AI • San Francisco, CA, US
    Full-time
    Perplexity is an AI-powered answer engine founded in December 2022 and growing rapidly as one of the world's leading AI platforms. Perplexity has raised over $1B in venture investment from some ...Show more
    Last updated: 30+ days ago • Promoted
    Distinguished AI Engineer

    Distinguished AI Engineer

    VirtualVocations • Hayward, California, United States
    Full-time
    A company is looking for a Director, Distinguished AI Engineer (Remote-Eligible).Key Responsibilities Collaborate with cross-functional teams to deliver AI-powered products Design, develop, test...Show more
    Last updated: 23 hours ago • Promoted
    AI Engineer

    AI Engineer

    MLabs • San Francisco, CA, United States
    Full-time
    Our client is an innovative AI company building the future of healthcare with AI doctors and nurses, starting in Latin America. Their mission is to reinvent healthcare delivery in the region with AI...Show more
    Last updated: 4 days ago • Promoted
    Adobe is hiring : Applied ML Research Engineer - Video AI in San Jose

    Adobe is hiring : Applied ML Research Engineer - Video AI in San Jose

    Mediabistro • San Jose, CA, United States
    Full-time
    Applied ML Research Engineer - Video AI.Adobe is a market leader in creative and marketing software, and we are expanding our GenAI capabilities in video AI to empower creators and marketers at sca...Show more
    Last updated: 2 days ago
    Principal AI Engineer, Intelligent Sensors

    Principal AI Engineer, Intelligent Sensors

    1010 Analog Devices Inc. • Rio Robles, CA, United States
    Full-time +1
    NASDAQ : ADI ) is a global semiconductor leader that bridges the physical and digital worlds to enable breakthroughs at the Intelligent Edge. ADI combines analog, digital, and software technologie...Show more
    Last updated: 30+ days ago • Promoted
    Inference Engineer, Video AI Job at Cantina Labs in San Francisco

    Inference Engineer, Video AI Job at Cantina Labs in San Francisco

    Mediabistro • San Francisco, CA, United States
    Full-time
    Cantina, founded by Sean Parker, is a new social platform with the most advanced AI character creator.Build, share, and interact with AI bots and your friends directly in the Cantina or across the ...Show more
    Last updated: 8 hours ago • New!
    Applied AI Inference Engineer

    Applied AI Inference Engineer

    Baseten • San Francisco, CA, United States
    Full-time
    Baseten provides the infrastructure, tooling, and expertise needed to bring great AI products to market - fast.Backed by top investors including IVP, Spark Capital, Greylock, and Conviction, we’re ...Show more
    Last updated: 30+ days ago • Promoted
    Applied AI Engineer

    Applied AI Engineer

    Parafin Inc. • San Francisco, CA, United States
    Full-time
    At Parafin, we’re on a mission to grow small businesses.Small businesses are the backbone of our economy, but traditional banks often don’t have their backs. We build tech that makes it simple for s...Show more
    Last updated: 26 days ago • Promoted
    AI Solutions Engineer

    AI Solutions Engineer

    ButterflyMX, Inc. • San Francisco, CA, United States
    Full-time
    ButterflyMX is on a mission to empower people to open and manage doors & gates from a smartphone.Our products are installed in more than 20,000+ multifamily, commercial, gated communities, and stud...Show more
    Last updated: 30+ days ago • Promoted
    Senior AI Research Engineer, On-Device Language Intelligence

    Senior AI Research Engineer, On-Device Language Intelligence

    Samsung Research America • Mountain View, CA, United States
    Full-time
    Lab Summary : Samsung AI Research Center (AIC) located in Mountain View, California, is currently recruiting outstanding scientists for the Language and Personal Intelligence lab.Our goal is to perf...Show more
    Last updated: 30+ days ago • Promoted
    Lead AI Engineer

    Lead AI Engineer

    OutSystems Inc. • San Francisco, CA, United States
    Full-time
    For more information, please read ourLead AI Engineer page is loaded## Lead AI Engineerlocations : US - Remote : US - San Francisco Bay Areatime type : Full timeposted on : Posted Yesterdayjob ...Show more
    Last updated: 29 days ago • Promoted
    Inference Engineer, Video AI Job at Cantina in San Francisco

    Inference Engineer, Video AI Job at Cantina in San Francisco

    Mediabistro • San Francisco, CA, United States
    Full-time
    Cantina, founded by Sean Parker, is a new social platform with the most advanced AI character creator.Build, share, and interact with AI bots and your friends directly in the Cantina or across the ...Show more
    Last updated: 8 hours ago • New!