Talent.com
LLM Engineer (Fremont)
LLM Engineer (Fremont)PeopleCaddie • Fremont, CA, United States
LLM Engineer (Fremont)

LLM Engineer (Fremont)

PeopleCaddie • Fremont, CA, United States
Hace 21 horas
Tipo de contrato
  • A tiempo completo
Descripción del trabajo

Job Title : LLM Engineer (Contract)

Company : Big Four Client

Location (hybrid) : San Jose, CA (Bay Area) - 2x / wk at client site

Work Authorization : U.S. Citizen or Green Card Holder

Pay Rate : Up to $90 per hour (W2), depending on experience

Duration : 1 Month (with possible extension)

Overview :

We are seeking a highly experienced LLM Engineer Contractor to support short-term, high-impact work focused on building, optimizing, and deploying large language models at one of our Big Four clients. The ideal candidate has deep technical expertise across modern LLM architectures, inference systems, and applied machine-learning workflows. This role is fast-paced and hands-on, requiring strong research skills, engineering excellence, and the ability to collaborate across product, data, and ML operations teams. The contractor will contribute directly to model development, performance optimization, and the deployment of LLM-backed features into production environments.

Key Responsibilities :

  • Model Development & Optimization :
  • Design, train, fine-tune, and evaluate LLMs for performance, efficiency, safety, and reliability.
  • Optimize models through techniques such as transfer learning, RLHF, low-rank adaptation (LoRA), quantization-aware training, and distillation.
  • Conduct rigorous benchmarking and ensure alignment with product or research objectives.
  • Systems Integration & Deployment :
  • Build scalable inference pipelines that support high-volume, low-latency LLM serving.
  • Implement infrastructure optimizations including quantization, caching, sharding, and model distillation.
  • Integrate models into applications, APIs, or microservices and collaborate with ML Ops to ensure robust deployment.
  • Research & Cross-Functional Collaboration :
  • Lead experimentation on new model architectures, prompting strategies, retrieval-augmented generation (RAG), and hybrid search pipelines.
  • Work closely with product managers, data engineers, ML ops, and research teams to convert experimental insights into production features.
  • Document findings, communicate results, and contribute to technical roadmaps.

Requirement Skills & Qualifications :

  • Bachelors degree in Computer Science, Machine Learning, Engineering, or related field.
  • Minimum of 6 years of experience in machine learning engineering, deep learning, or related fields.
  • Proven hands-on experience training, fine-tuning, and deploying LLMs (e.g., GPT-style, LLaMA-based, or transformer architectures).
  • Strong proficiency in Python, deep learning frameworks (e.g., PyTorch, TensorFlow), and distributed training systems.
  • Experience building scalable ML pipelines and working with modern inference stacks (e.g., Triton, Ray Serve, Hugging Face, ONNX Runtime).
  • Strong understanding of GPU acceleration, optimization, and cloud-native deployment workflows.
  • Excellent communication skills with the ability to work autonomously in a fast-moving environment.
  • Preferred Skills & Qualifications :

  • Experience with RAG systems, vector databases, and search frameworks (e.g., FAISS, Milvus, Pinecone).
  • Familiarity with model evaluation for alignment, safety, hallucination reduction, and adversarial testing.
  • Prior experience in a research-oriented or applied AI lab environment.
  • Masters degree or Ph.D. in a relevant technical field.
  • Crear una alerta de empleo para esta búsqueda

    Llm Engineer • Fremont, CA, United States

    Ofertas relacionadas
    Sr. Software Engineer - AI / LLM Applications (26456)

    Sr. Software Engineer - AI / LLM Applications (26456)

    Supermicro • San Jose, CA, United States
    A tiempo completo
    Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop / Big Data, Hyperscale, HPC and IoT / Embedded customers...Mostrar más
    Última actualización: hace 21 días • Oferta promocionada
    Sr. Staff ML Platform Engineer (TLM)

    Sr. Staff ML Platform Engineer (TLM)

    Earnin • Mountain View, California, United States
    A tiempo completo
    As one of the first pioneers of earned wage access, our passion at EarnIn is building products that deliver real-time financial flexibility for those with the unique needs of living paycheck to pay...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Commercial Lines Account Manager (HYBRID-Must have real estate exp.)

    Commercial Lines Account Manager (HYBRID-Must have real estate exp.)

    Jobot • Walnut Creek, CA, US
    A tiempo completo
    This Jobot Job is hosted by : Dana Stark.Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. Salary : $125,000 - $150,000 per year.A division of a very large i...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    LLM Engineer (Fremont)

    LLM Engineer (Fremont)

    PeopleCaddie • Fremont, CA, US
    A tiempo parcial
    San Jose, CA (Bay Area) - 2x / wk at client site.Up to $90 per hour (W2), depending on experience.Month (with possible extension). We are seeking a highly experienced.The ideal candidate has deep tech...Mostrar más
    Última actualización: hace 22 horas • Oferta promocionada • Nueva oferta
    Senior ML Engineer

    Senior ML Engineer

    Typeface • Palo Alto, California, United States
    A tiempo completo
    Typeface is on a mission to help everyone express their unique imagination.We believe technology is a creative partner that empowers any company to tell their unique stories faster and easier than ...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior / Staff Site Reliability Engineer

    Senior / Staff Site Reliability Engineer

    Gatik Ai • Mountain View, California, United States
    A tiempo completo
    Gatik, the leader in autonomous middle-mile logistics, is revolutionizing the B2B supply chain with its autonomous transportation-as-a-service (ATaaS) solution and prioritizing safe, consistent del...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Software Engineer, LLM Infrastructure

    Software Engineer, LLM Infrastructure

    OpenReq • Cupertino, CA, United States
    A tiempo completo
    Etched is building AI chips that are hard-coded for individual model architectures.Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower laten...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    PsiQuantum • Palo Alto, CA, United States
    A tiempo completo
    Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Applied ML Engineer

    Applied ML Engineer

    Kognitos • San Jose, California, United States
    A tiempo completo
    Kognitos is at the forefront of revolutionizing the trillion-dollar hyper-automation market.Our mission is to redefine how software is built and maintained by leveraging cutting-edge multi-agent au...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Psiquantum • Palo Alto, California, United States
    A tiempo completo
    Quantum computing holds the promise of humanity’s mastery over the natural world, but only if we can build a.PsiQuantum is on a mission to build the first real, useful quantum computers, capable of...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    AI / ML Engineer

    AI / ML Engineer

    Powerline • Palo Alto, California, United States
    A tiempo completo
    Join Powerline and Shape the Future of the Electricity Grid!.Powerline is a fast-growing, VC-backed climate-tech company based in Silicon Valley, dedicated to transforming the electricity grid with...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Tarana Wireless • Milpitas, California, United States
    A tiempo completo
    Join the Team That's Redefining Wireless Technology.Our groundbreaking Fixed Wireless Access technology is delivering .As a Site Reliability Engineer, you will help us manage software that runs on ...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Multi-Modal LLM Engineer for Robotics & Autonomous Driving

    Multi-Modal LLM Engineer for Robotics & Autonomous Driving

    XPENG • Santa Clara, CA, United States
    A tiempo completo
    A leading smart technology company in Santa Clara seeks a qualified LLM Engineer to fine-tune language models for Humanoid Robots and Autonomous Driving Cars. D in Computer Science and strong expert...Mostrar más
    Última actualización: hace 2 días • Oferta promocionada
    Customer Support Engineer

    Customer Support Engineer

    VirtualVocations • Fremont, California, United States
    A tiempo completo
    A company is looking for a Customer Support Specialist to enhance customer experience by providing technical support and engineering solutions. Key Responsibilities Triaging customer issues and de...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer - Supercomputing

    Site Reliability Engineer - Supercomputing

    Xai • Palo Alto, California, United States
    A tiempo completo
    AI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excelle...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior Machine Learning Engineer II - LLM

    Senior Machine Learning Engineer II - LLM

    Moveworks • Mountain View, California, United States
    A tiempo completo
    We are looking for a Machine Learning Engineer to help build cutting edge ML infrastructure for building and serving LLM’s at Moveworks. This role will be critical in building, optimizing and scali...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Senior AI Engineer LLM, RAG

    Senior AI Engineer LLM, RAG

    A • Sunnyvale, California, United States
    A tiempo completo
    Our Wayfinder team is building scalable, certifiable autonomy systems to power the next generation of commercial aircraft. Our team of experts is driving the maturation of machine learning and other...Mostrar más
    Última actualización: hace más de 30 días • Oferta promocionada
    Site Reliability Engineer

    Site Reliability Engineer

    Paynearme • Cupertino, California, United States
    Teletrabajo
    A tiempo completo
    At PayNearMe, we’re on a mission to make paying and getting paid as simple as possible.We build innovative technology that transforms the way businesses and their customers experience payments.Our ...Mostrar más
    Última actualización: hace 7 días • Oferta promocionada