DE-DUPLICATION


Definition
The process of preventing or removing duplicate people, computers, or accounts from a sample or sample source so that none appears in the final data set more than once. Duplication standards can vary with the study objectives; in general, however, only one response is permitted per individual or household. Front-end de-duplication is often conducted using technology solutions such as digital fingerprinting, identity validation, or address verification and comparison, and serves to prevent duplicates from ever entering the survey. When pre-survey technology is not used, or as a secondary check, post-survey data reviews are employed to remove apparent duplicates from the data file using fields such as birthdate or email address, or by comparing data or answer patterns. Duplication between multiple sample sources or methodologies (e.g., between a panel and a river source) is often referred to as overlap, to distinguish it from duplication within a single sample source. See also Digital Fingerprinting, Panel Overlap.
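
As an illustration only, the post-survey review described above might be sketched as follows. The field names (`id`, `email`, `birthdate`), the choice to match on email plus birthdate, and the rule of keeping the first response received are all assumptions made for this example; a real study would set its own matching rules according to its objectives.

```python
def normalize_email(email):
    """Trim whitespace and lowercase so trivially different spellings match."""
    return email.strip().lower()


def deduplicate(responses):
    """Split responses into (kept, removed) using a hypothetical match key.

    Assumption: two responses are apparent duplicates when their normalized
    email address and birthdate are identical. The first response seen for
    each key is kept; later ones are flagged for removal.
    """
    seen = set()
    kept, removed = [], []
    for response in responses:
        key = (normalize_email(response["email"]), response["birthdate"])
        if key in seen:
            removed.append(response)
        else:
            seen.add(key)
            kept.append(response)
    return kept, removed


# Made-up sample data: responses 1 and 3 share an email address and birthdate.
responses = [
    {"id": 1, "email": "Ann@example.com", "birthdate": "1980-02-01"},
    {"id": 2, "email": "bob@example.com", "birthdate": "1975-07-15"},
    {"id": 3, "email": " ann@example.com", "birthdate": "1980-02-01"},
]

kept, removed = deduplicate(responses)
print([r["id"] for r in kept])     # ids retained in the data file
print([r["id"] for r in removed])  # ids flagged as apparent duplicates
```

Flagged records are returned rather than silently dropped so that a reviewer can inspect them before removal, since a match on these fields indicates only an apparent duplicate.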