Talent.com
AI/LLM Evaluation & Alignment Software Engineer

Leo Tech Services • Austin, TX, United States
12 hours ago
Job type
  • Full-time
Job description

At LeoTech, we are passionate about building software that solves real-world problems in the Public Safety sector. Our software has been used to help fight continuing criminal enterprises and drug trafficking organizations, identify financial fraud, disrupt sex and human trafficking rings, and address mental health matters, to name a few.

Role

  • This is a remote, WFH role.
  • As an AI / LLM Evaluation & Alignment Engineer on our Data Science team, you will play a critical role in ensuring that our Large Language Model (LLM) and Agentic AI solutions are accurate, safe, and aligned with the unique requirements of public safety and law enforcement workflows. You will design and implement evaluation frameworks, guardrails, and bias-mitigation strategies that give our customers confidence in the reliability and ethical use of our AI systems. This is an individual contributor (IC) role that combines hands-on technical engineering with a focus on responsible AI deployment. You will work closely with AI engineers, product managers, and DevOps teams to establish standards for evaluation, design test harnesses for generative models, and operationalize quality assurance processes across our AI stack.

Core Responsibilities

  • Build and maintain evaluation frameworks for LLMs and generative AI systems tailored to public safety and intelligence use cases.
  • Design guardrails and alignment strategies to minimize bias, toxicity, hallucinations, and other ethical risks in production workflows.
  • Partner with AI engineers and data scientists to define online and offline evaluation metrics (e.g., model drift, data drift, factual accuracy, consistency, safety, interpretability).
  • Implement continuous evaluation pipelines for AI models, integrated into CI/CD and production monitoring systems.
  • Collaborate with stakeholders to stress test models against edge cases, adversarial prompts, and sensitive data scenarios.
  • Research and integrate third-party evaluation frameworks and solutions; adapt them to our regulated, high-stakes environment.
  • Work with product and customer-facing teams to ensure explainability, transparency, and auditability of AI outputs.
  • Provide technical leadership in responsible AI practices, influencing standards across the organization.
  • Contribute to DevOps / MLOps workflows for deployment, monitoring, and scaling of AI evaluation and guardrail systems (experience with Kubernetes is a plus).
  • Document best practices and findings, and share knowledge across teams to foster a culture of responsible AI innovation.

What We Value

  • Bachelor's or Master's in Computer Science, Artificial Intelligence, Data Science, or related field.
  • 3-5+ years of hands-on experience in ML / AI engineering, with at least 2 years working directly on LLM evaluation, QA, or safety.
  • Strong familiarity with evaluation techniques for generative AI: human-in-the-loop evaluation, automated metrics, adversarial testing, red-teaming.
  • Experience with bias detection, fairness approaches, and responsible AI design.
  • Knowledge of LLM observability, monitoring, and guardrail frameworks (e.g., Langfuse, LangSmith).
  • Proficiency with Python and modern AI / ML / LLM / Agentic AI libraries (LangGraph, Strands Agents, Pydantic AI, LangChain, HuggingFace, PyTorch, LlamaIndex).
  • Experience integrating evaluations into DevOps / MLOps pipelines, preferably with Kubernetes, Terraform, ArgoCD, or GitHub Actions.
  • Understanding of cloud AI platforms (AWS, Azure) and deployment best practices.
  • Strong problem-solving skills, with the ability to design practical evaluation systems for real-world, high-stakes scenarios.
  • Excellent communication skills to translate technical risks and evaluation results into insights for both technical and non-technical stakeholders.

Technologies We Use

  • Cloud & Infrastructure: AWS (Bedrock, SageMaker, Lambda), Azure AI, Kubernetes (EKS), Terraform, ArgoCD.
  • LLMs & Evaluation: HuggingFace, OpenAI API, Anthropic, LangChain, LlamaIndex, Ragas, DeepEval, OpenAI Evals.
  • Observability & Guardrails: Langfuse, Guardrails AI.
  • Backend & Data: Python (primary), Elasticsearch, Kafka, Airflow.
  • DevOps & Automation: GitHub Actions, CodePipeline.

What You Can Expect

  • Work-from-home opportunity.
  • Enjoy great team camaraderie.
  • Thrive on the fast pace and challenging problems to solve.
  • Modern technologies and tools.
  • Continuous learning environment.
  • Opportunity to communicate and work with people of all technical levels in a team environment.
  • Grow as you are given feedback and incorporate it into your work.
  • Be part of a self-managing team that enjoys support and direction when required.
  • 3 weeks of paid vacation, right out of the gate!
  • Competitive Salary.
  • Generous medical, dental, and vision plans.
  • Sick leave and paid holidays are offered.
  • $135,000 - $160,000 a year

Please note the national salary range listed in the job posting reflects the new hire salary range across levels and U.S. locations that would be applicable to the position. The final salary will be commensurate with the candidate's accepted hiring level and work location. Also, this range represents base salary only and does not include equity or benefits, if applicable.

LeoTech is an equal opportunity employer and does not discriminate on the basis of any legally protected status.
