A Framework For Validation Of Rule-Based Systems
Expert system validation; Rule-based systems; Test case validation
This paper describes a complete methodology for the validation of rule-based expert systems. This methodology is presented as a five-step process that has two central themes: 1) to create a minimal set of test inputs that adequately cover the domain represented in the knowledge base and 2) a Turing Test-like methodology that evaluates the system's responses to the test inputs and compares them to the responses of human experts. The development of minimal set of test inputs takes into consideration various criteria, both user-defined, and domain-specific, These criteria are used to reduce the potentially very large set of test inputs to one that is practical, keeping in mind the nature and purpose of the developed system. The Turing Test-like evaluation methodology makes use of only one panel of experts to both evaluate each set of test cases and compare the results with those of the expert system, as well as with those of the other experts. The hypothesis being presented here is that much can be learned about the experts themselves by having them anonymously evaluate each other's responses to the same test inputs. Thus, we are better able to determine the validity of an expert system. Depending on its purpose, we introduce various ways to express validity as well as a technique to use the validity assessment for the refinement of the rule base. Lastly, the paper describes a partial implementation of the test input minimalization process on a small but nontrivial expert system. The effectiveness of the technique was evaluated by seeding errors into the expert system, generating the appropriate set of test inputs and determining whether the errors could be detected by the suggested methodology.
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Number of Pages
Source API URL
Knauf, Rainer; Gonzalez, Avelino J.; and Abel, Thomas, "A Framework For Validation Of Rule-Based Systems" (2002). Scopus Export 2000s. 2549.