Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

Background: There are large numbers of schemes that collect and aggregate data from primary care computer systems into large databases. These data are then used for market and academic research. How the data is aggregated, cleaned and processed is usually opaque. Making the method transparent allows researchers to compare methods, and users of the output to better understand the strengths and weaknesses of the data. Objectives To define the stages of the process of aggregating, processing and cleaning clinical data from multiple data sources. Methods: Identify errors in design, collection, staging, integration and analysis. Results: An eight step process defined: (1) Design (2) Data: entry, (3) Extraction, (4) Migration, (5) Integration, (6) Cleaning, (7) Processing, and (8) Analysis. Conclusions: This eight step method provides a taxonomy to enable researchers to compare their methods of data process and aggregation.


Conference paper

Publication Date





1010 - 1015