LLM Engineer
Location : San Jose, CA (Hybrid 2 days / week onsite)
Work Authorization : Only GC or USC.
Looking only for W2
Key Responsibilities :
Model Development & Optimization
Design, train, fine-tune, and evaluate large language models (LLMs).
Ensure high performance, efficiency, and alignment with product or research goals.
Systems Integration & Deployment
Implement scalable inference pipelines.
Optimize serving infrastructure (quantization, caching, distillation, etc.).
Integrate models into applications, platforms, or APIs.
Research & Cross-Functional Collaboration
Drive experimentation with new architectures, retrieval systems, and prompt-engineering techniques.
Collaborate with product, data, and MLOps teams to transition research into production-ready features.
Engineer • United States