Overview
Tonic.ai is looking for a hands-on Machine Learning Engineer to help build production-grade NLP systems that power our data privacy and information extraction products. You’ll join a small, experienced team working at the intersection of LLMs, data privacy, and applied AI — developing and fine-tuning models that detect and redact sensitive information across diverse datasets.
Base pay range
$125,000.00 / yr - $175,000.00 / yr
Responsibilities
- Build and ship models. Fine-tune and evaluate transformer-based models (e.g., RoBERTa, Gemma, LLaMA) to support PII redaction, entity extraction, and synthetic data generation.
- Own the ML lifecycle. From dataset curation and experiment tracking to model deployment and monitoring — you’ll own the full path from prototype to production.
- Collaborate cross-functionally. Partner with Product and Design to shape how ML models drive user-facing features, and work with the broader engineering team to integrate them into scalable systems.
- Experiment responsibly. Document your experiments, evaluate results rigorously, and help push the frontier of safe and explainable AI for data privacy.
Qualifications
3+ years of professional experience in applied ML or data science with a focus on NLPProficiency in Python and deep learning frameworks such as PyTorch and Hugging Face TransformersHands-on experience with experiment tracking (e.g., Weights & Biases), distributed training (e.g., Accelerate), and model serving (e.g., vLLM)Comfort working independently and iterating quickly — you enjoy the mix of research, engineering, and product thinkingStrong communication and collaboration skillsBonus Points
Experience with supervised and reinforcement learning fine-tuning (e.g., TRL)Familiarity with data privacy, PII redaction, or healthcare dataA public portfolio, blog, or open-source contributions that demonstrate your technical depth and curiosityWhy You’ll Love It Here
High autonomy and meaningful ownership — your models will ship to production, not sit in a notebookSmall, collaborative team with deep expertise in NLP and privacyOpportunity to work with real-world, high-impact data in domains like healthcare and financial servicesBenefits
Competitive salary and company equityUnlimited PTO and generous parental leaveMedical, dental, and vision insurance401(k) with employer contributionRemote-friendly work environmentAbout Tonic.ai
Tonic.ai creates safe, high-quality synthetic data that helps developers move fast while protecting sensitive information. Thousands of engineers rely on Tonic-generated data daily to power development, testing, and CI / CD pipelines across industries including healthcare, financial services, logistics, and education. We’re growing fast and looking for builders who want to make privacy practical.
#J-18808-Ljbffr