Key Issue 2 – Data Integrity
Data Integrity the correctness of the information stored in the
database, and selection has to be made based on the clean data
Current Status of Database Quality – a large-scale numeric
database without critical evaluation may have an error rate of 2-5%.
Major Error Types –
 Typographical errors
 Unit-conversion errors
 Report interpretation errors
 Metadata compilation errors
 Original report errors
Systematic and iterative approach