Data mining of tuberculosis patient data using multiple correspondence analysis

T. W. Rennie, W. Roberts

Research output: Contribution to journalArticlepeer-review

8 Citations (SciVal)


The aim of this study was to demonstrate the epidemiological use of multiple correspondence analysis (MCA), as applied to tuberculosis (TB) data from North East London. Data for TB notifications in North East London primary care trusts (PCTs) between the years 2002 and 2007 were used. TB notification data were entered for MCA allowing display of graphical data output (n=4947); MCA analyses were performed on the whole dataset, by PCT, and by year of notification. Graphical MCA output displayed variance of data categories; clustering of variable categories in MCA output signified association. Clustering patterns in MCA output demonstrated different associations by year of notification, within PCTs and between PCTs. MCA is a useful technique for displaying association of variable categories used in TB epidemiology. Results suggest that MCA could be a useful tool in informing commissioning of TB services.

Original languageEnglish
Pages (from-to)1699-1704
Number of pages6
JournalEpidemiology and Infection
Issue number12
Early online date19 May 2009
Publication statusPublished - 31 Dec 2009


  • Epidemiology
  • Infectious disease epidemiology
  • Notifications
  • Surveillance
  • Tuberculosis (TB)

ASJC Scopus subject areas

  • Epidemiology
  • Infectious Diseases


Dive into the research topics of 'Data mining of tuberculosis patient data using multiple correspondence analysis'. Together they form a unique fingerprint.

Cite this