A company is looking for a Senior AI Research Engineer, Model Inference (Remote). Key Responsibilities Implement and optimize custom inference and fine-tuning kernels for language models across multiple hardware backends Design and optimize Vulkan compute shaders for quantized operators and fine-tuning workflows Collaborate with cross-functional teams to integrate optimized serving and inference frameworks into production pipelines Required Qualifications Proficiency in C++ and GPU kernel programming Proven expertise in GPU acceleration with Vulkan framework Strong background in quantization and mixed-precision model optimization Experience with mobile GPU acceleration and model inference Familiarity with large language model architectures
Senior Engineer Ai • Vancouver, Washington, United States