Overall Cohort Characteristics

Cohort Characteristics
Pan-Cancer (N=930) HB-Cancer (N=1151) Pan-disease (N=4409) HB-disease (N=21002) Control (N=5895) Total (N=33387) p value
Gender < 0.001 [1]
   Female 439 (47.2%) 430 (37.4%) 2043 (46.3%) 11186 (53.3%) 1681 (28.5%) 15779 (47.3%)
   Male 491 (52.8%) 721 (62.6%) 2366 (53.7%) 9816 (46.7%) 4214 (71.5%) 17608 (52.7%)
Ethnicity < 0.001 [1]
   White 519 (66.6%) 549 (53.7%) 2370 (57.6%) 10396 (51.5%) 3280 (57.7%) 17114 (53.9%)
   South Asian 79 (10.1%) 230 (22.5%) 874 (21.2%) 5207 (25.8%) 942 (16.6%) 7332 (23.1%)
   Black 98 (12.6%) 121 (11.8%) 408 (9.9%) 2046 (10.1%) 806 (14.2%) 3479 (10.9%)
   Far/South-east Asian 4 (0.5%) 24 (2.3%) 19 (0.5%) 234 (1.2%) 31 (0.5%) 312 (1.0%)
   Arab/North African 0 (0.0%) 0 (0.0%) 1 (0.0%) 15 (0.1%) 5 (0.1%) 21 (0.1%)
   Asian (Other) 17 (2.2%) 46 (4.5%) 178 (4.3%) 970 (4.8%) 234 (4.1%) 1445 (4.5%)
   Mixed 9 (1.2%) 8 (0.8%) 49 (1.2%) 241 (1.2%) 78 (1.4%) 385 (1.2%)
   Other 53 (6.8%) 45 (4.4%) 217 (5.3%) 1065 (5.3%) 306 (5.4%) 1686 (5.3%)
Diagnosis age (years) < 0.001 [2]
   Mean (SD) 66.91 (12.72) 65.24 (13.22) 52.90 (18.51) 53.64 (17.67) 54.77 (17.59) 54.51 (17.77)
   Median 67.21 66.06 51.28 52.42 54.09 53.62
   Q1,Q3 58.88, 76.07 56.79, 75.15 38.36, 67.20 39.90, 66.40 41.02, 68.00 40.66, 67.85
Diagnosis age group < 0.001 [1]
   18-40 27 (2.9%) 54 (4.7%) 1318 (29.9%) 5663 (27.0%) 1470 (24.9%) 8532 (25.6%)
   41-50 66 (7.1%) 100 (8.7%) 862 (19.6%) 4149 (19.8%) 1097 (18.6%) 6274 (18.8%)
   51-60 189 (20.3%) 266 (23.1%) 773 (17.5%) 4186 (19.9%) 1163 (19.7%) 6577 (19.7%)
   61-70 290 (31.2%) 325 (28.2%) 574 (13.0%) 3022 (14.4%) 961 (16.3%) 5172 (15.5%)
   71-80 234 (25.2%) 273 (23.7%) 519 (11.8%) 2311 (11.0%) 733 (12.4%) 4070 (12.2%)
   >80 124 (13.3%) 133 (11.6%) 363 (8.2%) 1671 (8.0%) 471 (8.0%) 2762 (8.3%)
Mortality < 0.001 [1]
   Deceased 754 (81.1%) 795 (69.1%) 840 (19.1%) 3634 (17.3%) 614 (10.4%) 6637 (19.9%)
   Survived 176 (18.9%) 356 (30.9%) 3569 (80.9%) 17368 (82.7%) 5281 (89.6%) 26750 (80.1%)
Survival (months) < 0.001 [2]
   Mean (SD) 15.88 (24.91) 22.37 (29.54) 49.69 (64.39) 58.03 (77.33) 57.50 (71.17) 54.43 (73.08)
   Median 8.05 12.47 28.93 34.87 36.57 32.20
   Q1,Q3 2.58, 18.18 3.52, 29.40 9.60, 66.17 12.50, 73.46 14.57, 71.95 11.07, 69.57
  1. Pearson’s Chi-squared test
  2. Kruskal-Wallis rank sum test
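
Note on the percentages: within each cohort, the ethnicity percentages appear to be computed over patients with a recorded ethnicity rather than over the cohort N. For Pan-Cancer, the eight ethnicity counts sum to 779 rather than 930, and 519/779 gives the 66.6% shown for White. The sketch below checks this using only the Pan-Cancer counts from the table; the variable names are illustrative, and reading the 151 unaccounted patients as "ethnicity not recorded" is an assumption.

```python
# Pan-Cancer ethnicity counts copied from the table above (cohort N = 930).
cohort_n = 930
ethnicity_counts = {
    "White": 519,
    "South Asian": 79,
    "Black": 98,
    "Far/South-east Asian": 4,
    "Arab/North African": 0,
    "Asian (Other)": 17,
    "Mixed": 9,
    "Other": 53,
}

# Denominator that the reported percentages appear to use.
recorded = sum(ethnicity_counts.values())

# Patients not accounted for by any ethnicity row
# (assumption: ethnicity was not recorded for them).
unaccounted = cohort_n - recorded

# Percentages over the recorded total, rounded as in the table.
percentages = {
    group: round(100 * count / recorded, 1)
    for group, count in ethnicity_counts.items()
}

for group, pct in percentages.items():
    print(f"{group}: {ethnicity_counts[group]} ({pct}%)")
print(f"Recorded: {recorded}, unaccounted: {unaccounted}")
```

The same check can be repeated for the other cohorts by swapping in their counts and N.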

Descriptive statistics - Pan-disease Vs Control

Demographic

Descriptive statistics - table

Cohort Characteristics
Pan-disease (N=4409) Control (N=5895) Total (N=10304) p value
Gender < 0.001 [1]
   Female 2043 (46.3%) 1681 (28.5%) 3724 (36.1%)
   Male 2366 (53.7%) 4214 (71.5%) 6580 (63.9%)
Ethnicity < 0.001 [1]
   White 2370 (57.6%) 3280 (57.7%) 5650 (57.7%)
   South Asian 874 (21.2%) 942 (16.6%) 1816 (18.5%)
   Black 408 (9.9%) 806 (14.2%) 1214 (12.4%)
   Far/South-east Asian 19 (0.5%) 31 (0.5%) 50 (0.5%)
   Arab/North African 1 (0.0%) 5 (0.1%) 6 (0.1%)
   Asian (Other) 178 (4.3%) 234 (4.1%) 412 (4.2%)
   Mixed 49 (1.2%) 78 (1.4%) 127 (1.3%)
   Other 217 (5.3%) 306 (5.4%) 523 (5.3%)
Diagnosis age (years) < 0.001 [2]
   Mean (SD) 52.90 (18.51) 54.77 (17.59) 53.97 (18.01)
   Median 51.28 54.09 53.02
   Q1,Q3 38.36, 67.20 41.02, 68.00 39.78, 67.70
Diagnosis age group < 0.001 [1]
   18-40 1318 (29.9%) 1470 (24.9%) 2788 (27.1%)
   41-50 862 (19.6%) 1097 (18.6%) 1959 (19.0%)
   51-60 773 (17.5%) 1163 (19.7%) 1936 (18.8%)
   61-70 574 (13.0%) 961 (16.3%) 1535 (14.9%)
   71-80 519 (11.8%) 733 (12.4%) 1252 (12.2%)
   >80 363 (8.2%) 471 (8.0%) 834 (8.1%)
Mortality < 0.001 [1]
   Deceased 840 (19.1%) 614 (10.4%) 1454 (14.1%)
   Survived 3569 (80.9%) 5281 (89.6%) 8850 (85.9%)
Survival (months) < 0.001 [2]
   Mean (SD) 49.69 (64.39) 57.50 (71.17) 54.16 (68.46)
   Median 28.93 36.57 33.20
   Q1,Q3 9.60, 66.17 14.57, 71.95 12.47, 69.70
  1. Pearson’s Chi-squared test
  2. Kruskal-Wallis rank sum test

Medical History

Descriptive statistics - table

Comparative statistics: Medical History
Pan-disease (N=4409) Control (N=5895) Total (N=10304) p value
Comorbidity < 0.001 [1]
   Mean (SD) 2.73 (2.24) 1.92 (1.87) 2.27 (2.07)
   Median 2.00 1.00 2.00
   Q1,Q3 1.00, 4.00 0.00, 3.00 1.00, 4.00
Diabetes 1354 (30.7%) 1010 (17.1%) 2364 (22.9%) < 0.001 [2]
Hypertension 2168 (49.2%) 2545 (43.2%) 4713 (45.7%) < 0.001 [2]
Cholesterol 1831 (41.5%) 2226 (37.8%) 4057 (39.4%) < 0.001 [2]
Asthma 730 (16.6%) 874 (14.8%) 1604 (15.6%) 0.016 [2]
COPD 315 (7.1%) 306 (5.2%) 621 (6.0%) < 0.001 [2]
TB 79 (1.8%) 67 (1.1%) 146 (1.4%) 0.005 [2]
Kidney 844 (19.1%) 821 (13.9%) 1665 (16.2%) < 0.001 [2]
Cardiovascular 1091 (24.7%) 1362 (23.1%) 2453 (23.8%) 0.053 [2]
Upper GI 1459 (33.1%) 1287 (21.8%) 2746 (26.6%) < 0.001 [2]
Lower GI 541 (12.3%) 682 (11.6%) 1223 (11.9%) 0.276 [2]
F/H: Cancer 1017 (23.1%) 1141 (19.4%) 2158 (20.9%) < 0.001 [2]
F/H: Digestive 117 (2.7%) 120 (2.0%) 237 (2.3%) 0.038 [2]
F/H: Diabetes 1176 (26.7%) 1484 (25.2%) 2660 (25.8%) 0.085 [2]
  1. Kruskal-Wallis rank sum test
  2. Pearson’s Chi-squared test
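
For the yes/no conditions in this table, the Pearson chi-squared p value can be recomputed from the printed counts alone. The sketch below is a minimal standard-library Python version, not the code that produced the report. It assumes no continuity correction. With one degree of freedom, the upper tail of the chi-squared distribution equals erfc(sqrt(x/2)), so no statistics package is needed. The function name is hypothetical.

```python
import math


def pearson_chi2_2x2(a, b, c, d):
    """Uncorrected Pearson chi-squared test for the 2x2 table [[a, b], [c, d]].

    Returns (statistic, p_value); the test has 1 degree of freedom.
    """
    n = a + b + c + d
    # Shortcut formula for a 2x2 table: n * (ad - bc)^2 / product of margins.
    stat = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # Chi-squared survival function for 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Lower GI row: 541 of 4409 Pan-disease patients vs 682 of 5895 controls.
pan_yes, pan_n = 541, 4409
ctl_yes, ctl_n = 682, 5895

stat, p_value = pearson_chi2_2x2(
    pan_yes, pan_n - pan_yes,  # Pan-disease: with / without the condition
    ctl_yes, ctl_n - ctl_yes,  # Control: with / without the condition
)
print(f"chi-squared = {stat:.3f}, p = {p_value:.3f}")
```

For the Lower GI row this gives a p value close to the 0.276 reported in the table, which suggests (but does not prove) that the report used the uncorrected test.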

Descriptive statistics - plot

Lifestyle

Descriptive statistics - table

Comparative statistics: Lifestyle
Pan-disease (N=4409) Control (N=5895) Total (N=10304) p value
Smoker < 0.001 [1]
   Not available 679 (15.4%) 481 (8.2%) 1160 (11.3%)
   Never 1419 (32.2%) 2278 (38.6%) 3697 (35.9%)
   Past 997 (22.6%) 1767 (30.0%) 2764 (26.8%)
   Current 1314 (29.8%) 1369 (23.2%) 2683 (26.0%)
Drinker < 0.001 [1]
   Not available 1114 (25.3%) 1228 (20.8%) 2342 (22.7%)
   Never 868 (19.7%) 1340 (22.7%) 2208 (21.4%)
   Past 229 (5.2%) 291 (4.9%) 520 (5.0%)
   Current 2198 (49.9%) 3036 (51.5%) 5234 (50.8%)
Substance user < 0.001 [1]
   Not available 2551 (57.9%) 2739 (46.5%) 5290 (51.3%)
   Never 712 (16.1%) 2117 (35.9%) 2829 (27.5%)
   Past 31 (0.7%) 37 (0.6%) 68 (0.7%)
   Current 1115 (25.3%) 1002 (17.0%) 2117 (20.5%)
Obese < 0.001 [1]
   Not available 634 (14.4%) 540 (9.2%) 1174 (11.4%)
   Never 1106 (25.1%) 1269 (21.5%) 2375 (23.0%)
   Past 1531 (34.7%) 2539 (43.1%) 4070 (39.5%)
   Current 1138 (25.8%) 1547 (26.2%) 2685 (26.1%)
  1. Pearson’s Chi-squared test

Descriptive statistics - plot

Risk analysis - Pan-disease Vs Control

Descriptive statistics (GP data available) - Pan-disease Vs Control

Demographic

Descriptive statistics - table

Cohort Characteristics
Pan-disease (N=3439) Control (N=5074) Total (N=8513) p value
Gender < 0.001 [1]
   Female 1583 (46.0%) 1457 (28.7%) 3040 (35.7%)
   Male 1856 (54.0%) 3617 (71.3%) 5473 (64.3%)
Ethnicity < 0.001 [1]
   White 1838 (53.7%) 2788 (55.5%) 4626 (54.8%)
   South Asian 842 (24.6%) 898 (17.9%) 1740 (20.6%)
   Black 374 (10.9%) 755 (15.0%) 1129 (13.4%)
   Far/South-east Asian 17 (0.5%) 29 (0.6%) 46 (0.5%)
   Arab/North African 1 (0.0%) 5 (0.1%) 6 (0.1%)
   Asian (Other) 162 (4.7%) 219 (4.4%) 381 (4.5%)
   Mixed 41 (1.2%) 71 (1.4%) 112 (1.3%)
   Other 147 (4.3%) 260 (5.2%) 407 (4.8%)
Diagnosis age (years) < 0.001 [2]
   Mean (SD) 52.54 (18.34) 53.84 (17.18) 53.32 (17.67)
   Median 50.69 53.14 52.11
   Q1,Q3 38.21, 66.39 40.29, 66.68 39.48, 66.54
Diagnosis age group < 0.001 [1]
   18-40 1051 (30.6%) 1326 (26.1%) 2377 (27.9%)
   41-50 690 (20.1%) 991 (19.5%) 1681 (19.7%)
   51-60 610 (17.7%) 1010 (19.9%) 1620 (19.0%)
   61-70 432 (12.6%) 820 (16.2%) 1252 (14.7%)
   71-80 370 (10.8%) 581 (11.5%) 951 (11.2%)
   >80 286 (8.3%) 346 (6.8%) 632 (7.4%)
Mortality < 0.001 [1]
   Deceased 618 (18.0%) 483 (9.5%) 1101 (12.9%)
   Survived 2821 (82.0%) 4591 (90.5%) 7412 (87.1%)
Survival (months) < 0.001 [2]
   Mean (SD) 57.69 (69.11) 64.21 (73.97) 61.57 (72.11)
   Median 35.47 42.40 39.87
   Q1,Q3 15.12, 74.20 19.81, 78.11 17.83, 77.03
  1. Pearson’s Chi-squared test
  2. Kruskal-Wallis rank sum test

Medical History

Descriptive statistics - table

Comparative statistics: Medical History
Pan-disease (N=3439) Control (N=5074) Total (N=8513) p value
Comorbidity < 0.001 [1]
   Mean (SD) 3.09 (2.27) 1.99 (1.92) 2.43 (2.14)
   Median 3.00 2.00 2.00
   Q1,Q3 1.00, 5.00 0.00, 3.00 1.00, 4.00
Diabetes 1124 (32.7%) 884 (17.4%) 2008 (23.6%) < 0.001 [2]
Hypertension 1866 (54.3%) 2183 (43.0%) 4049 (47.6%) < 0.001 [2]
Cholesterol 1692 (49.2%) 2065 (40.7%) 3757 (44.1%) < 0.001 [2]
Asthma 644 (18.7%) 783 (15.4%) 1427 (16.8%) < 0.001 [2]
COPD 259 (7.5%) 237 (4.7%) 496 (5.8%) < 0.001 [2]
TB 78 (2.3%) 66 (1.3%) 144 (1.7%) < 0.001 [2]
Kidney 727 (21.1%) 679 (13.4%) 1406 (16.5%) < 0.001 [2]
Cardiovascular 927 (27.0%) 1131 (22.3%) 2058 (24.2%) < 0.001 [2]
Upper GI 1405 (40.9%) 1266 (25.0%) 2671 (31.4%) < 0.001 [2]
Lower GI 530 (15.4%) 653 (12.9%) 1183 (13.9%) < 0.001 [2]
F/H: Cancer 1017 (29.6%) 1141 (22.5%) 2158 (25.3%) < 0.001 [2]
F/H: Digestive 108 (3.1%) 116 (2.3%) 224 (2.6%) 0.016 [2]
F/H: Diabetes 1171 (34.1%) 1482 (29.2%) 2653 (31.2%) < 0.001 [2]
  1. Kruskal-Wallis rank sum test
  2. Pearson’s Chi-squared test

Descriptive statistics - plot

Lifestyle

Descriptive statistics - table

Comparative statistics: Lifestyle
Pan-disease (N=3439) Control (N=5074) Total (N=8513) p value
Smoker < 0.001 [1]
   Not available 12 (0.3%) 16 (0.3%) 28 (0.3%)
   Never 1312 (38.2%) 2130 (42.0%) 3442 (40.4%)
   Past 955 (27.8%) 1685 (33.2%) 2640 (31.0%)
   Current 1160 (33.7%) 1243 (24.5%) 2403 (28.2%)
Drinker 0.256 [1]
   Not available 449 (13.1%) 674 (13.3%) 1123 (13.2%)
   Never 779 (22.7%) 1204 (23.7%) 1983 (23.3%)
   Past 227 (6.6%) 288 (5.7%) 515 (6.0%)
   Current 1984 (57.7%) 2908 (57.3%) 4892 (57.5%)
Substance user < 0.001 [1]
   Not available 1828 (53.2%) 2237 (44.1%) 4065 (47.8%)
   Never 630 (18.3%) 1965 (38.7%) 2595 (30.5%)
   Past 28 (0.8%) 33 (0.7%) 61 (0.7%)
   Current 953 (27.7%) 839 (16.5%) 1792 (21.1%)
Obese < 0.001 [1]
   Not available 65 (1.9%) 101 (2.0%) 166 (1.9%)
   Never 933 (27.1%) 1174 (23.1%) 2107 (24.8%)
   Past 1420 (41.3%) 2400 (47.3%) 3820 (44.9%)
   Current 1021 (29.7%) 1399 (27.6%) 2420 (28.4%)
  1. Pearson’s Chi-squared test
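
The lifestyle variables have four levels each, so the chi-squared test here compares a 4x2 table with 3 degrees of freedom. The sketch below is a standard-library Python illustration of that calculation applied to the Drinker rows of this table. It is not the report's own code, and both function names are hypothetical. The survival function uses the closed-form series for integer degrees of freedom.

```python
import math


def chi2_sf(x, dof):
    """Upper-tail probability of the chi-squared distribution (integer dof >= 1)."""
    h = x / 2.0
    if dof % 2 == 0:
        # Even dof: finite Poisson-type sum.
        return math.exp(-h) * sum(h ** k / math.factorial(k) for k in range(dof // 2))
    # Odd dof: erfc term plus a finite correction sum.
    tail = sum(h ** (k - 0.5) / math.gamma(k + 0.5) for k in range(1, (dof + 1) // 2))
    return math.erfc(math.sqrt(h)) + math.exp(-h) * tail


def pearson_chi2(table):
    """Pearson chi-squared test of independence for a list of equal-length rows.

    Returns (statistic, dof, p_value). Assumes every expected count is positive.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(col_totals) - 1)
    return stat, dof, chi2_sf(stat, dof)


# Drinker counts from the table above: [Pan-disease, Control] per level.
drinker = [
    [449, 674],    # Not available
    [779, 1204],   # Never
    [227, 288],    # Past
    [1984, 2908],  # Current
]
stat, dof, p_value = pearson_chi2(drinker)
print(f"chi-squared = {stat:.2f}, dof = {dof}, p = {p_value:.3f}")
```

Run on the Drinker counts, this should land near the reported p value of 0.256. Note that "Not available" is treated as a level of its own in this comparison, as it is in the table.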

Descriptive statistics - plot

Risk analysis (GP data available) - Pan-disease Vs Control

Survival analysis

Baseline plots

Kaplan-Meier plots - Pan-disease

Demographics

Medical History

Lifestyle conditions

Hazard Ratio analysis - Pan-disease