I'm not sure exactly how to ask this, or whether such a thing exists. I'm new to statistics and have just studied confidence intervals and confidence levels for survey data, such as the confidence that the population mean lies within a certain interval around the sample mean.
I'd like to measure some confidence in data accuracy.
I'm doing some data munging at work, basically copying some data from a source, manipulating it a bit, and then storing it in a database. I'd like to be able to tell my manager that I'm confident that the data is accurate to a certain level.
I plan to verify the data with another tool by sampling the calculated data against the original data (say, randomly picking 10,000 of the 62,000,000 entries and recalculating them). Assuming every entry in the sample matches, I would like to report that I am x% confident that the entire set of calculated data is correct.
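To make the plan concrete, here is a rough sketch of the check I have in mind. Everything in it is a stand-in: `fetch_original`, `fetch_stored`, and `recalculate` are hypothetical callables representing my source, my database, and the second tool, and the toy population is much smaller than the real one.

```python
import random


def verify_sample(population_size, sample_size,
                  fetch_original, fetch_stored, recalculate, seed=None):
    """Recalculate a simple random sample of entries and list the mismatches.

    All three callables are hypothetical placeholders for the real
    source lookup, database lookup, and independent recalculation.
    """
    rng = random.Random(seed)
    # Sample indices without replacement; range() avoids building a full list.
    indices = rng.sample(range(population_size), sample_size)
    mismatches = [
        i for i in indices
        if recalculate(fetch_original(i)) != fetch_stored(i)
    ]
    return len(indices), mismatches


# Toy stand-ins (assumed for illustration only).
population_size = 100_000
original = lambda i: i            # "source" value for entry i
stored = lambda i: i * 2          # value written to the "database"
recalc = lambda value: value * 2  # independent recalculation

checked, mismatches = verify_sample(
    population_size, 1_000, original, stored, recalc, seed=0
)
print(checked, len(mismatches))
```

In the real case the call would use a population size of 62,000,000 and a sample size of 10,000. What I don't know is how to turn "0 mismatches out of 10,000" into a confidence statement.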
However, I'm not sure what I'm measuring. I don't want to estimate a mean; instead, I want to verify the accuracy of n% of the population and infer from that whether m% of the whole population is accurate.
Or, if I verify that n% of the population is accurate, can I only be confident that n% is accurate?