`
Instructions for use

The Simpson's Paradox





Simpson's paradox is a phenomenon in probability and statistics, in which a trend appears in several different groups of data but disappears or reverses when these groups are combined.

This result is often encountered in social-science and medical-science statistics and is particularly problematic when frequency data is unduly given causal interpretations. The paradox can be resolved when causal relations are appropriately taken care of in the statistical modeling. It has been used to try to inform the non-specialist or public audience about the kind of misleading results mis-applied statistics can generate. Martin Gardner wrote a popular account of Simpson's paradox in his March 1976 Mathematical Games column in Scientific American.


You can alter the number of datasets used for the plot

Linear Regression must be found after identifying each individual group.
However, it is not always clear to separate data points into groups.

Here Support Vector Machine(SVM) enables us to do this automatically.