Tags

  • Published on

    Parametric vs. non-parametric statistics

    Parametric statistics are more powerful statistics

    Non-parametric statistics are used with categorical and ordinal outcomes

    As we continue our journey to break through the barriers associated with statistical lexicons, here is another dichotomy of popular statistical terms that are spoken commonly but not always understood by everyone.  

    Parametric statistics are used to assess differences and effects for continuous outcomes. These statistical tests include one-sample t-tests, independent samples t-tests, one-way ANOVA, repeated-measures ANOVA, ANCOVA, factorial ANOVA, multiple regression, MANOVA, and MANCOVA. 

    Non-parametric statistics are used to assess differences and effects for:

    1. Ordinal outcomes - One-sample median tests, Mann-Whitney U, Wilcoxon, Kruskal-Wallis, Friedman's ANOVA, Proportional odds regression

    2. Categorical outcomes - Chi-square, Chi-square Goodness-of-fit, odds ratio, relative risk, McNemar's, Cochran's Q, Kaplan-Meier, log-rank test, Cochran-Mantel-Haenszel, Cox regression, logistic regression, multinomial logistic regression

    3. Small sample sizes (n < 30) - Smaller sample sizes make it harder to meet the statistical assumptions associated with parametric statistics.  Non-parametric statistics can generate valid statistical inferences in these situations.

    4. Violations of statistical assumptions for parametric tests - Normality, Homogeneity of variance, Normality of difference scores
  • Published on

    95% confidence intervals

    Precision and consistency of treatment effects

    95% confidence intervals are dependent upon sample size

    If there is ANY statistical calculation that holds true value for researchers and clinicians on a day-to-day basis, it is the 95% confidence interval wrapped around the findings of inferential analyses. Statistics is not an exact mathematical science as far as other exact mathematical sciences go, measurement error is inherent when attempting to measure for anything related to human beings, and FEW tried and true causal effects have been proven scientifically. Statistics' strength as a mathematical science is in its ability to build confidence intervals around findings to put them into a relative context.  

    Also, 95% confidence intervals act as the primary inference associated with unadjusted odds ratios, relative risk, hazard ratios, and adjusted odds ratios. If the confidence interval crosses over 1.0, there is a non-significant effect. Wide 95% confidence intervals are indicative of small sample sizes and lead to decreased precision of the effect. Constricted or narrow 95% confidence intervals reflect increased precision and consistency of a treatment effect.

    In essence, p-values should not be what people get excited about when it comes to statistical analyses. The interpretation of your findings within the context of the subsequent population means, odds, risk, hazard, and 95% confidence intervals IS the real "meat" of applied statistics.
  • Published on

    Chi-square p-values are not enough

    Chi-square p-value

    Odds ratio with 95% confidence interval should be reported and interpreted

    Most people that need statistics are focused only on the almighty p-value of less than .05. When running Chi-square analyses between a dichotomous categorical predictor and a dichotomous categorical outcome, p-values are not the primary inference that should be interpreted for practical purposes. The lack of precision and accuracy in categorical measures coupled with sampling error makes the p-values yielded from Chi-square analyses virtually worthless in the applied sense.

    The correct statistic to run is an unadjusted odds ratio with 95% confidence interval. This is the best measure for interpreting the magnitude of the association between two dichotomous categorical variables collected in a retrospective fashionRelative risk can be calculated when the association is assessed in a prospective fashion.

    The width of the 95% confidence interval and it crossing over 1.0 dictate the significance and precision of the association between the variables.  With smaller sample sizes, the 95% confidence interval will be wider and less precise. Larger sample sizes will yield more precise effects.
  • Published on

    Statistical Designs

    Research questions lead to choice of statistical design

    Differences between-subjects and within-subjects designs

    There are terms in statistics that many people do not understand from a practical standpoint. I'm a biostatistical scientist and it took me YEARS to wrap my head around some fundamental aspects of statistical reasoning, much less the lexicon. I would hypothesize that 90% of the statistics reported in the empirical literature as a whole fall between two different categories of statistics, between-subjects and within-subjects. Here is a basic breakdown of the differences in these types of statistical tests:

    1. Between-subjects - When you are comparing independent groups on a categorical, ordinal, or continuous outcome variable, you are conducting between-subjects analyses. The "between-" denotes the differences between mutually exclusive groups or levels of a categorical predictor variable. Chi-square, Mann-Whitney U, independent-samples t-tests, odds ratio, Kruskal-Wallis, and one-way ANOVA are all considered between-subjects analyses because of the comparison of independent groups.  

    2. Within-subjects - When you are comparing THE SAME GROUP on a categorical, ordinal, or continuous outcome ACROSS TIME OR WITHIN THE SAME OBJECT OF MEASUREMENT MULTIPLE TIMES, then you are conducting within-subjects analyses. The "within-" relates to the differences within the same object of measurement across multiple observations, time, or literally, "within-subjects." Chi-square Goodness-of-fit, Wilcoxon, repeated-measures t-tests, relative risk, Friedman's ANOVA, and repeated-measures ANOVA are within-subjects analyses because the same group or cohort of individuals is measured at several different time-points or observations.
  • Published on

    Prevalence vs. Incidence

    Prevalence and incidence used correctly

    Difference in important epidemiological measures

    The terms prevalence and incidence are often used interchangeably. However, they are extremely different in their utility and interpretability within epidemiology.

    Prevalence is the proportion of cases or disease states that exist in a population at any given time.  Prevalence is established using cross-sectional research designs.  Measures of prevalence can be used to generate odds ratios for outcomes occurring given an exposure or non-exposure.  It is calculated when data is collected in a retrospective fashion

    Incidence is the number of new cases or disease states that occur in a population.  Incidence is established in cohort designs.  Measures of incidence are used to establish the relative risk of an outcome given treatment or no treatment.  It is calculated when data is collected in a prospective fashion.

    Click on the Epidemiology button below to continue.
  • Published on

    Retrospective cohort designs are useful to many researchers

    Retrospective cohort designs are very feasible

    The "go-to" research design for busy clinicians

    In my experience working as a biostatistician at a graduate school of medicine, I have learned that there are three precious commodities for busy clinicians and researchers: Time, competency, and accessibility to data.

    Systematic reviews constitute the most prodigious contribution that scientific-practitioners can make to a given body of clinical knowledge. When conducted in a rigorous and objective fashion, the pooled treatment effects yielded from this design are considered the highest level of applied clinical evidence that exists. It is much more of an academic/empirical task versus applied experimental and observational designs. Yet, the time and experience needed to conduct a systematic review often impede these pursuits by researchers. (However, they are greatly needed and should be undertaken if at all possible!  I'm going to start my first one soon.) 

    True experiments such as randomized controlled trials are not feasible for most researchers due to lack of funding, logistical support, and available resources. Also, researchers should first show observational evidence of a treatment effect before conducting a randomized controlled trial.  

    Prospective cohort studies can generate important measures of incidence and relative risk, as well as longitudinal data. However, this type of design means you are moving forward in time and are dependent upon enough observations being generated from you methodology to have adequate statistical power. The logistics and time associated with this design also tend to hinder its application in busy clinical environments.

    The next highest level of evidence is the retrospective cohort design and it is easily applied in a busy clinical environment. This is a retrospective design so the data already exists. One defines a cohort with inclusion and exclusion criteria. Then, members of the cohort are separated into independent groups according to some sort of exposure or non-exposure to a treatment, intervention, or risk. They are then followed up to a certain point in time to see if they did or did not have the outcome. There are obvious selection and observation biases associated with this design but it yields important measures of relative risk and many years of data can be mined for longitudinal or large-scale cohort analyses. Survival analyses are perfect for this type of design when establishing the 1-year, 3-year, or 5-year survival or "time-to-event" rates of an outcome.  They are also relatively inexpensive to conduct and time-friendly. This research design is much more preferable to case-control designs.