Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

© 2019 The European Federation for Medical Informatics (EFMI) and IOS Press. The analysis of primary care data plays an important role in understanding health at an individual and population level. Currently the utilization of computerized medical records is low due to the complexities, heterogeneities and veracity associated with these data. We present a deep learning methodology that clusters 11,000 records in an unsupervised manner identifying non-linear patterns in the data. This provides a useful tool for visualization as well as identify features driving the formation of clusters. Further analysis reveal the features that differentiate sub-groups that can aid clinical decision making. Our results uncover subsets that contain the highest proportion of missing data, specifically Episode type, as well as the sources that provide the most complete data.

Original publication




Journal article


Studies in Health Technology and Informatics

Publication Date





249 - 250