A company is looking for a Machine Learning Engineer focused on Inference and Serving.
Key Responsibilities
Design, optimize, and operate systems for real-time deployment of Behavioral AI models
Package, version, roll out, and monitor machine learning models across environments
Ensure predictions are fast, accurate, and accountable while maintaining operational maturity
Required Qualifications
Deep expertise in model deployment and production ML serving systems
Experience with low-latency inference techniques such as model graph optimization and caching
Proficiency in high-performance coding languages like Go, Rust, C++, or Java
Understanding of operational systems monitoring and model performance tracking
Applied ML knowledge to collaborate with researchers on model deployability
Machine Learning Engineer • New York, New York, United States