Talent.com
AI Systems Engineer
AI Systems EngineerOpenkyber • CA, United States
AI Systems Engineer

AI Systems Engineer

Openkyber • CA, United States
4 hours ago
Job type
  • Full-time
  • Quick Apply
Job description

Staff Machine Learning Engineer, LLM Fine Tuning (Verilog / RTL Applications) :

HIGHLIGHTS :

  • Location : San Jose, CA (Onsite / Hybrid)
  • Schedule : Full Time
  • Position Type : Contract
  • Hourly : BOE

Overview :

Our client is building privacy preserving LLM capabilities that help hardware design teams reason over Verilog / SystemVerilog and RTL artifacts-code generation, refactoring, lint explanation, constraint translation, and spec to RTL assistance. Our client is looking for a Staff level engineer to technically lead a small, high leverage team that fine tunes and productizes LLMs for these workflows in a strict enterprise data privacy environment.

You don't need to be a Verilog / RTL expert to start;curiosity, drive, and deep LLM craftsmanship matter most. Any HDL / EDA fluency is a strong plus.

What you'll do (Responsibilities) :

  • Own the technical roadmap for Verilog / RTL focused LLM capabilities-from model selection and adaptation to evaluation, deployment, and continuous improvement.
  • Lead a hands on team of applied scientists / engineers : set direction, unblock technically, review designs / code, and raise the bar on experimentation velocity and reliability.
  • Fine tune and customize models using state of the art techniques (LoRA / QLoRA, PEFT, instruction tuning, preference optimization / RLAIF) with robust HDL specific evals :
  • Compile / lint / simulate based pass rates, pass@k for code generation, constrained decoding to enforce syntax, and "does it synthesize"checks.
  • Design privacy first ML pipelines on AWS :
  • Training / customization and hosting using Amazon Bedrock (including Anthropic models) where appropriate;SageMaker (or EKS + KServe / Triton / DJL) for bespoke training needs.
  • Artifacts in S3 with KMS CMKs;isolated VPC subnets & PrivateLink (including Bedrock VPC endpoints), IAM least privilege, CloudTrail auditing, and Secrets Manager for credentials.
  • Enforce encryption in transit / at rest, data minimization, no public egress for customer / RTL corpora.
  • Stand up dependable model serving : Bedrock model invocation where it fits, and / or low latency self hosted inference (vLLM / TensorRT LLM), autoscaling, and canary / blue green rollouts.
  • Build an evaluation culture : automatic regression suites that run HDL compilers / simulators, measure behavioral fidelity, and detect hallucinations / constraint violations;model cards and experiment tracking (MLflow / Weights & Biases).
  • Partner deeply with hardware design, CAD / EDA, Security, and Legal to source / prepare datasets (anonymization, redaction, licensing), define acceptance gates, and meet compliance requirements.
  • Drive productization : integrate LLMs with internal developer tools (IDEs / plug ins, code review bots, CI), retrieval (RAG) over internal HDL repos / specs, and safe tool use / function calling.
  • Mentor & uplevel : coach ICs on LLM best practices, reproducible training, critical paper reading, and building secure by default systems.
  • What you'll bring (Minimum qualifications) :

  • 10+ years total engineering experience with 5+ years in ML / AI or large scale distributed systems;3+ years working directly with transformers / LLMs.
  • Proven track record shipping LLM powered features in production and leading ambiguous, cross functional initiatives at Staff level.
  • Deep hands on skill with PyTorch, Hugging Face Transformers / PEFT / TRL, distributed training (DeepSpeed / FSDP), quantization aware fine tuning (LoRA / QLoRA), and constrained / grammar guided decoding.
  • AWS expertise to design and defend secure enterprise deployments, including :
  • Amazon Bedrock (model selection, Anthropic model usage, model customization, Guardrails, Knowledge Bases, Bedrock runtime APIs, VPC endpoints)
  • SageMaker (Training, Inference, Pipelines), S3, EC2 / EKS / ECR, VPC / Subnets / Security Groups, IAM, KMS, PrivateLink, CloudWatch / CloudTrail, Step Functions, Batch, Secrets Manager.
  • Strong software engineering fundamentals : testing, CI / CD, observability, performance tuning;Python a must (bonus for Go / Java / C++).
  • Demonstrated ability to set technical vision and influence across teams;excellent written and verbal communication for execs and engineers.
  • Nice to have (Preferred qualifications) :

  • Familiarity with Verilog / SystemVerilog / RTL workflows : lint, synthesis, timing closure, simulation, formal, test benches, and EDA tools (Synopsys / Cadence / Mentor).
  • Experience integrating static analysis / AST aware tokenization for code models or grammar constrained decoding.
  • RAG at scale over code / specs (vector stores, chunking strategies), tool use / function calling for code transformation.
  • Inference optimization : TensorRT LLM, KV cache optimization, speculative decoding;throughput / latency trade offs at batch and token levels.
  • Model governance / safety in the enterprise : model cards, red teaming, secure eval data handling;exposure to SOC2 / ISO 27001 / NIST frameworks.
  • Data anonymization, DLP scanning, and code de identification to protect IP.
  • What success looks like :

  • 90 days
  • Baseline an HDL aware eval harness that compiles / simulates;establish secure AWS training & serving environments (VPC only, KMS backed, no public egress).

  • Ship an initial fine tuned / customized model with measurable gains vs. Base (e.G., +X% compile pass rate, Y% lint findings per K LOC generated).
  • 180 days
  • Expand customization / training coverage (Bedrock for managed FMs including Anthropic;SageMaker / EKS for bespoke / open models).

  • Add constrained decoding + retrieval over internal design specs;productionize inference with SLOs (p95 latency, availability) and audited rollout to pilot hardware teams.
  • 12 months
  • Demonstrably reduce review / iteration cycles for RTL tasks with clear metrics (defect reduction, time to lint clean, % auto fix suggestions accepted), and a stable MLOps path for continuous improvement.

  • (Security & privacy by design)
  • Customer and internal design data remain within private AWS VPCs;access via IAM roles and audited by CloudTrail;all artifacts encrypted with KMS.

  • No public internet calls for sensitive workloads;Bedrock access via VPC interface endpoints / PrivateLink with endpoint policies;SageMaker and / or EKS run in private subnets.
  • Data pipelines enforce minimization, tagging, retention windows, and reproducibility;DLP scanning and redaction are first class steps.
  • We produce model cards, data lineage, and evaluation artifacts for every release.
  • Tech you'll touch :
  • Modeling : PyTorch, HF Transformers / PEFT / TRL, DeepSpeed / FSDP, vLLM, TensorRT LLM

  • AWS & MLOps : Amazon Bedrock (Anthropic and other FMs, Guardrails, Knowledge Bases, Runtime APIs), SageMaker (Training / Inference / Pipelines), MLflow / W&B, ECR, EKS / KServe / Triton, Step Functions
  • Platform / Security : S3 + KMS, IAM, VPC / PrivateLink (incl. Bedrock), CloudWatch / CloudTrail, Secrets Manager
  • Tooling (nice to have) :

  • HDL toolchains for compile / simulate / lint, vector stores (pgvector / OpenSearch), GitHub / GitLab CI
  • "We are GTN The Go To Network"

    Create a job alert for this search

    System Engineer • CA, United States

    Related jobs
    System Engineer

    System Engineer

    TradeJobsWorkForce • 93755 Fresno, CA, US
    Full-time
    System Engineer Job Duties : Manages and monitors all installed systems and infrastructure for ...Show more
    Last updated: 30+ days ago • Promoted
    Head of ML Systems

    Head of ML Systems

    Openkyber • CA, United States
    Full-time
    Quick Apply
    Hi , Hope you are doing well.We are looking for AI ML Lead / Architect - Santa Clara, CA (Onsite Hybrid 3 Days) - Contract with one of our clients. If you are available and interest...Show more
    Last updated: 8 days ago
    AI Platform Engineer

    AI Platform Engineer

    Openkyber • CA, United States
    Full-time
    Quick Apply
    Position : Senior AI Program Manager Location : Denver, CO (5 Days Onsite) Duration : Long Term In Person Interview Must after Video call interviews About R Systems : R Systems is a leading digital pro...Show more
    Last updated: 4 hours ago • New!
    Ethical AI Architect

    Ethical AI Architect

    Openkyber • CA, United States
    Full-time
    Quick Apply
    Principal AI Application Engineer Industry : Pharma | Biotech | Pharma Manufacturing | CDMO Duration : - 6 Months+ ...Show more
    Last updated: 2 days ago
    Director of Machine Learning Engineering

    Director of Machine Learning Engineering

    Openkyber • CA, United States
    Full-time
    Quick Apply
    Location : Hybrid, Walnut Creek , CA- Weekly 3 days in office and 2 days remote.Position : Senior AI / ML Engineer Type : C2H-Contract TO Hire- This is not a C2C position - it's...Show more
    Last updated: 8 days ago
    Robotics Architect

    Robotics Architect

    Openkyber • CA, United States
    Full-time
    Quick Apply
    Please carefully review the position requirements before submitting a potential candidate for consideration.The Senior Robotics Analyst develops business requirements for Robotic Process Automation...Show more
    Last updated: 1 day ago
    AI Automation Engineer

    AI Automation Engineer

    atqleads • Visalia, CA, us
    Full-time
    Quick Apply
    ATQLeads helps B2B small and mid-sized companies transform outdated go-to-market operations into scalable, intelligent, revenue-producing systems. We believe the future of growth belongs to organiza...Show more
    Last updated: 2 days ago
    AI Architect

    AI Architect

    Openkyber • CA, United States
    Full-time
    Quick Apply
    Position : AI Ops Architect Location : Denver, CO Duration : Long term Description : Enterprise Cloud and AI architect with hands on experience on Goog...Show more
    Last updated: 1 day ago
    GenAI / Agentic AI Architect (Google Cloud - Vertex AI) _ Los Angeles, CA

    GenAI / Agentic AI Architect (Google Cloud - Vertex AI) _ Los Angeles, CA

    OQ Point LLC • CA, United States
    Full-time
    Quick Apply
    Role : GenAI / Agentic AI Architect (Google Cloud Vertex AI) Location : Los Angeles, CA Job Description : ...Show more
    Last updated: 4 days ago
    AI Cloud Solutions Architect

    AI Cloud Solutions Architect

    Openkyber • CA, United States
    Full-time
    Quick Apply
    Role : Gen AI Architect Location : Pleasanton , California We are seeking an experienced Generative AI Architect to lead the design, development, and deployment of cutting-edge generative AI systems....Show more
    Last updated: 12 days ago
    Senior AI / ML Engineer (Remote)

    Senior AI / ML Engineer (Remote)

    vidIQ • California, CA, US
    Remote
    Full-time
    IQ builds software to help YouTube creators achieve their goals.Our mission is to advance the creator's journey with actionable data-driven insights and AI-powered tools. We are dedicated to our val...Show more
    Last updated: 30+ days ago
    Banking Systems Architect

    Banking Systems Architect

    Openkyber • CA, United States
    Full-time
    Quick Apply
    Greetings From Photon, We hope you are staying safe.We are hiring Product Manager to join our Digital Engineering team.Who are we? For the past 20 years, we have powered many Di...Show more
    Last updated: 3 days ago
    Staff Research Associate 2 Non-exempt - Exeter, CA, Job ID 81187

    Staff Research Associate 2 Non-exempt - Exeter, CA, Job ID 81187

    UofC • Exeter, CA, United States
    Full-time
    Staff Research Associate 2 Non-exempt - Exeter, CA, Job ID 81187.University of California Agriculture and Natural Resources. Under supervision, incumbent performs a wide variety of standard repetiti...Show more
    Last updated: 24 days ago • Promoted
    Trading Systems Architect

    Trading Systems Architect

    Openkyber • CA, United States
    Full-time
    Quick Apply
    Position : Okta SME (IAM) Location : Culver City, CA (onsite) Duration : Long Term Client : Media Client Job Description We are seeking a skilled Customer Identity and Access Managem...Show more
    Last updated: 3 days ago
    IaaS Automation Engineer

    IaaS Automation Engineer

    MetroSys • CA, US
    Full-time
    Quick Apply
    Overview MetroSys is seeking a highly skilled Infrastructure-as-a-Service (IaaS) Automation Engineer to support customer delivery of automated infrastructure solutions. This role focuses on building...Show more
    Last updated: 3 days ago
    Senior AI / ML Data Engineer

    Senior AI / ML Data Engineer

    Simarn Solutions • California, California, United States
    Full-time
    Job Title : Senior AI / ML Data Engineer .We are seeking a highly skilled Senior AI / ML Data Engineer with expertise in multimodal AI models, vector databases, and Azure-based data engineerin...Show more
    Last updated: 30+ days ago
    Senior AI Integration Engineer

    Senior AI Integration Engineer

    Openkyber • CA, United States
    Full-time
    Quick Apply
    This is Frank from Tek vivid, and I have a new job opening for you.Please have a look at the below job description, if interested please share your updated resume or feel free to reach me at < / p...Show more
    Last updated: 10 days ago
    Mammography Technologist

    Mammography Technologist

    Adventist Health • Armona, CA, US
    Full-time
    Adventist Health is seeking a Mammography Technologist for a job in Armona, California.Job Description & Requirements.Located in a tight-knit community in Kings County, Adventist Health Hanford...Show more
    Last updated: 30+ days ago • Promoted