Contingency Table
A table built to arrange test results
into groups (categories) so that, when performing a comparison evaluation, one
can determine whether the grouping differs at a given confidence level.
Baseline measurements should be made first to establish a benchmark
against which the comparison at that confidence level is judged.
Also see the U.S. National Institute of Standards and Technology (NIST)
page titled "How can we compare the results of classifying according to
several categories?"
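
As an illustration of the comparison described above, the following is a minimal sketch of one common way to evaluate a contingency table: Pearson's chi-square test. The entry itself does not name a specific test, so this choice is an assumption. The counts, the function name chi_square_statistic, and the row and column meanings are hypothetical. The critical value 3.841 is the standard chi-square cutoff for 1 degree of freedom at the 95% confidence level, which applies to a 2x2 table.

```python
def chi_square_statistic(table):
    """Return Pearson's chi-square statistic for a table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(table):
        for j, observed_count in enumerate(row):
            # Count expected in this cell if rows and columns were independent
            expected = row_totals[i] * col_totals[j] / grand_total
            stat += (observed_count - expected) ** 2 / expected
    return stat


# Hypothetical counts: rows = baseline run, new run; columns = pass, fail
observed = [[90, 10],
            [75, 25]]

# Chi-square critical value for 1 degree of freedom at 95% confidence
CRITICAL_95_DF1 = 3.841

stat = chi_square_statistic(observed)
groups_differ = stat > CRITICAL_95_DF1
print(f"chi-square = {stat:.3f}, groups differ at 95%: {groups_differ}")
```

Here the first row plays the role of the baseline measurements, and the second row is the result being compared against it. Larger tables have more degrees of freedom, (rows - 1) x (columns - 1), and need a different critical value.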