Ton J. Cleophas and Aeilko H. ZwindermanStatistical Analysis of Clinical Data on a Pocket CalculatorStatistics on a Pocket Calculator10.1007/978-94-007-1211-9_17© Springer Science+Business Media B.V. 2011

17. Confounding

Ton J. Cleophas1, 2   and Aeilko H. Zwinderman2, 3  
(1)
Department of Medicine, Albert Schweitzer Hospital, Dordrecht, The Netherlands
(2)
European College of Pharmaceutical Medicine, Lyon, France
(3)
Department of Epidemiology and Biostatistics, Academic Medical Center, Amsterdam, The Netherlands
 
 
Ton J. Cleophas (Corresponding author)
 
Aeilko H. Zwinderman
Abstract
In the above study the treatment effects are better in the males than they are in the females. This difference in efficacy does not influence the overall assessment as long as the numbers of males and females in the treatment comparison are equally distributed. If, however, many females received the new treatment, and many males received the control treatment, a peculiar effect on the overall data analysis is observed as demonstrated by the difference in magnitudes of the circles in the above figure: the overall regression line will become close to horizontal, giving rise to the erroneous conclusion that no difference in efficacy exists between treatment and control. This phenomenon is called confounding, and may have a profound effect on the outcome of the study.
A216868_1_En_17_Figa_HTML.gif
In the above study the treatment effects are better in the males than they are in the females. This difference in efficacy does not influence the overall assessment as long as the numbers of males and females in the treatment comparison are equally distributed. If, however, many females received the new treatment, and many males received the control treatment, a peculiar effect on the overall data analysis is observed as demonstrated by the difference in magnitudes of the circles in the above figure: the overall regression line will become close to horizontal, giving rise to the erroneous conclusion that no difference in efficacy exists between treatment and control. This phenomenon is called confounding, and may have a profound effect on the outcome of the study.
Confounding can be assessed by the method of subclassification. In the above example an overall mean difference between the two treatment modalities is calculated.
For treatment zero
 $$ \text{Mean effect}\pm \text{standard error}(\text{SE})=1.\text{5 units}\pm 0.\text{5 units}$$
For treatment one
 $$ \text{Mean effect}\pm \text{SE}=2.\text{5 units}\pm 0.\text{6 units}$$
The mean difference of the two treatments
 $$ =1.0\text{units}\pm \text{pooled standard error}$$
 $$ =1.0\pm \surd ({0.5}^{2}+{0.6}^{2})$$
With 100  −  2 (100 patients, 2 groups) = 98 degrees of freedom the p-value of this difference is calculated to be
= p  >  0.10 (according to t-table page 21).
In order to assess the possibility of confounding, a weighted mean has to be calculated. The underneath equation is adequate for the purpose.
 $$ =1.0\pm 0.61$$
For the males we find means of 2.0 and 3.0 units, for the females 1.0 and 2.0 units. The mean difference for the males and females separately are 1.0 and 1.0 as expected from the above figure. However, the pooled standard errors are different, for the males 0.4, and for the females 0.3 units.
According to the above equation a weighted t-value is calculated
 $$ \text{The t-value as calculated}=1.0/0.61=1.639$$
The weighted mean is equal to the unweighted mean. However, its SE is much smaller. It means that after adjustment for confounding a very significant difference is observed.
Other methods for assessing confounding include multiple regression analysis and propensity score assessments. Particularly, with more than a single confounder these two methods are unavoidable, and they can not be carried out on a pocket calculator.