`
This is the guidance for the Simpson's Paradox.
Use the slider to alter the number of data sets to be used in the plot.
Click "INDIVIDUAL CORRELATION" to see individual correlation of chosen number of datasets and Click "OVERALL CORRELATION" to see the overall correlation.
Click "CHALLENGE" to see a further question.
Simpson's paradox is a phenomenon in probability and statistics, in which a trend appears in several different groups of data but disappears or reverses when these groups are combined.
This result is often encountered in social-science and medical-science statistics and is particularly problematic when frequency data is unduly given causal interpretations. The paradox can be resolved when causal relations are appropriately taken care of in the statistical modeling. It has been used to try to inform the non-specialist or public audience about the kind of misleading results mis-applied statistics can generate. Martin Gardner wrote a popular account of Simpson's paradox in his March 1976 Mathematical Games column in Scientific American.
You can alter the number of datasets used for the plot
Here Support Vector Machine(SVM) enables us to do this automatically.