Senior Agentic AI Test and Evaluation Engineer

Leidos Inc • Reston, VA, United States

30+ days ago

Job type

Full-time

Job description

At Leidos, you'll contribute to AI solutions that serve critical national and global missions, ranging from defense and intelligence to healthcare, energy, and space exploration. Our work emphasizes Trusted Mission AI: systems that are transparent, ethical, resilient, and accountable. You'll collaborate with multidisciplinary teams to transition AI research into operational environments where accuracy, security, and reliability are non-negotiable. Joining Leidos means applying your expertise to solve some of the most complex and meaningful challenges of our time.

We are looking for a motivated Agentic AI Test and Evaluation Engineer who wants to work on challenging problems in a variety of domains - including enterprise IT, health, defense, intelligence, and energy - to get results that apply and go beyond the state of the art for measurably better outcomes. We apply our knowledge, capabilities, and experience to develop and deploy Trusted Mission AI - AI that deserves to be trusted by system owners, end users, and the public - to be accurate, ethical, reliable, and adaptable.

You will work with a team of AI scientists, agentic AI scientists, data scientists, and data engineers to operationalize new approaches for test and evaluation of Agentic AI models that produce measurable advances over state-of-the-art solutions.

Primary Responsibilities

Develops AI Models Test and Evaluation CONOPS
Creates scalable Test and Model Evaluation plans for Agentic AI systems including process, techniques and tools.
Works with AI scientists, agentic AI scientists, data scientists, and data engineers to understand the AI system under test and develop test procedures for both positive and negative testing and evaluation
Collects performance metrics as part of evaluation results documentation
Works with MLOps engineers to integrate testing tools and procedures with the CI/CD pipeline
Analyzes existing processes and resultant metrics to recommend potential improvements
Collaborates with AI Governance team to maintain visibility and explainability through testing
Implements testing process in the AI system design, development and deployment life cycle
Identifies the risk in testing of projects, particularly for assessing the limitations of planned tests on complex AI systems
Works within teams of AI/ML researchers and engineers using Agile development processes

Basic Qualifications

Bachelor's degree in Computer Science, Data Science, or a related field and over 8 years of relevant experience, or a Master's degree with 6 years of experience. Additional experience may be considered in lieu of a degree.

Strong Python programming fundamentals

Experience with system and subsystem level test process and automation

Experience with creating user acceptance test scenarios

Experience with SecDevOps tooling and MLOps pipeline development

Experience with software test automation techniques

Experience with AI performance and vulnerability assessment

Experience with AI model assurance evaluation

Experience applying and automating AI interpretability & explainability tools and methods

Experience with developing CONOPS and presentations

Good understanding of machine learning algorithms, tools and platforms

Self-starter with high intellectual curiosity

Great communication skills, able to explain model and test results to a non-technical audience

Proficient in data exploration techniques and tools

Ability to obtain a Secret clearance

Preferred Qualifications

Experience with data visualization libraries such as Plotly, Streamlit, and matplotlib.

Experience with AI/ML tools, such as common Python packages (e.g., scikit-learn, NumPy, Pandas) and Jupyter notebooks

Experience with database administration and data repositories

Experience in data exploration techniques and tools

Experience with building LLM and other Generative AI applications.

Willing to learn new skills and platforms to support data analytics.

Ability and willingness to obtain a Top Secret security clearance

LeidosAI

If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo - because the mission demands it. We're not hiring followers. We're recruiting the ones who disrupt, provoke, and refuse to fail. Step 10 is ancient history. We're already at step 30 - and moving faster than anyone else dares.

Original Posting: August 29, 2025

For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.

Pay Range: $104,650.00 - $189,175.00

The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.

About Leidos

Leidos is an industry and technology leader serving government and commercial customers with smarter, more efficient digital and mission innovations. Headquartered in Reston, Virginia, with 47,000 global employees, Leidos reported annual revenues of approximately $16.7 billion for the fiscal year ended January 3, 2025. For more information, visit www.Leidos.com.

Pay and Benefits

Pay and benefits are fundamental to any career decision. That's why we craft compensation packages that reflect the importance of the work we do for our customers. Employment benefits include competitive compensation, Health and Wellness programs, Income Protection, Paid Leave and Retirement. More details are available at www.leidos.com/careers/pay-benefits.

Securing Your Data

Beware of fake employment opportunities using Leidos' name. Leidos will never ask you to provide payment-related information during any part of the employment application process (i.e., ask you for money), nor will Leidos ever advance money as part of the hiring process (i.e., send you a check or money order before doing any work). Further, Leidos will only communicate with you through emails that are generated by the Leidos.com automated system - never from free commercial services (e.g., Gmail, Yahoo, Hotmail) or via WhatsApp, Telegram, etc. If you received an email purporting to be from Leidos that asks for payment-related information or any other personal information (e.g., about you or your previous employer), and you are concerned about its legitimacy, please make us aware immediately by emailing us at LeidosCareersFraud@leidos.com.

If you believe you are the victim of a scam, contact your local law enforcement and report the incident to the U.S. Federal Trade Commission.

Commitment to Non-Discrimination

All qualified applicants will receive consideration for employment without regard to sex, race, ethnicity, age, national origin, citizenship, religion, physical or mental disability, medical condition, genetic information, pregnancy, family structure, marital status, ancestry, domestic partner status, sexual orientation, gender identity or expression, veteran or military status, or any other basis prohibited by law. Leidos will also consider for employment qualified applicants with criminal histories consistent with relevant laws.

#Remote
