A company is looking for a Senior AI Research Engineer, Model Inference (Remote). Key Responsibilities Implement and optimize custom inference and fine-tuning kernels for language models across multiple hardware backends Design and customize Vulkan compute shaders for quantized operators and fine-tuning workflows Collaborate with research and engineering teams to prototype and scale new model optimization methods Required Qualifications Proficiency in C++ and GPU kernel programming Proven expertise in GPU acceleration with Vulkan framework Strong background in quantization and mixed-precision model optimization Experience in Vulkan compute shader development and customization Hands-on experience with mobile GPU acceleration and model inference
Senior Ai Engineer • Gary, Indiana, United States