AIRC Perspectives

Best Practices for Addressing New Challenges in Testing and Evaluating Artificial Intelligence Enabled Systems

PUBLISHED
September 2022

AUTHORS: Laura Freeman, Justin Kauffman, Daniel Sobien, Tyler Cody, and Erin Lanus 
VIRGINIA TECH 

The integration of artificial intelligence (AI) and statistical machine learning (ML) into complex systems exposes a variety of challenges in traditional test and evaluation (T&E) practices. As more decisions at varying levels are handled by AI-enabled systems (AIES), we need T&E processes that provide a basis for ensuring system effectiveness, suitability, and survivability. This involves methods for assessing the component ML models and AI algorithms, including the ability to show how they result in repeatable and explainable decisions, as well as an understanding of any failure modes and failure mitigation techniques. Moreover, there is a need for AI assurance to certify that AI algorithms operate as intended and are free of vulnerabilities arising either from faulty design or from adversarially inserted data or algorithm code.