|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
March 4, 2010SkewnessSkewness quantifies the asymmetry of a distribution of a set of values. GraphPad Prism can compute the skewness as part of the Column Statistics analysis. How skewness is computed Understanding how skewness is computed can help you understand what it means. These steps compute the skewness of a distribution of values:
Interpreting skewness The basics:
How useful is it to assess skewness? Not very, I think. The numerical value of the skewness does not really answer any of these questions:
The skewness doesn't directly answer any of those questions. Note that the D' Agostino and Pearson omnibus normality test (a choice within Prism's column statistics analysis) is a normality test that combines the skewness with the kurtosis (a measure of how far the shape of the distribution deviates from the bell shape of a Gaussian distribution), and so tries to answer the first question. The definition of the skewness is part of a mathematical progression. The standard deviation is computed by first summing the squares of he differences each value and the mean. The skewness is computed by first summing the cube of those distances. And the kurtosis is computed by first summing the fourth power of those distances. While there are good reasons for computing the standard deviation by squaring the deviations, there doesn't appear to be a deeper meaning to summing the cube of the differences between each value and the mean. Since the skewness is computed based on cubes, a value that is twice as far from the mean as another value increases the skewness eight times as much as that other value (because 23=8). I don't see why alternative definitions of skewness where that factor is some other value (4, or 7 or 10 or any other value greater than 1) wouldn't be just as informative and useful. Multiple definitions of skewness Skewness has been defined in multiple ways. The method used by Prism (and described above) is the most common method. It is identical to the skew() function in Excel. This value of skewness is often abbreviated g1.
Whenever a value is computed from a sample, it helps to compute a confidence interval. In most cases, the confidence interval is computed from a standard error. The standard error of skewness (SES) depends on sample size. Prism does not calculate it, but it can be computed easily by hand using this formula: The margin of error equals 1.96 times that value, and the confidence interval for the skewness equals the computed skewness plus or minus the margin of error. This table gives the standard error and margin of error for various sample sizes.
March 3, 2010The unequal variance (Welch) t testTwo unpaired t tests When you choose to compare the means of two nonpaired groups with a t test, you have two choices:
These choices are offered by GraphPad InStat, GraphPad Prism, the GraphPad free web t test QuickCalc, as well as many other programs. The usefulness of the unequal variance t test To interpret any P value, it is essential that the null hypothesis be carefully defined. For the unequal variance t test, the null hypothesis is that the two population means are the same but the two population variances may differ. If the P value is large, you don't reject that null hypothesis, so conclude that the evidence does not persuade you that the two population means are different, even though you assume the two populations have (or may have) different standard deviations. What a strange set of assumptions. What would it mean for two populations to have the same mean but different standard deviations? Why would you want to test for that? Swailowsky points out that this situation simply doesn't often come up in science (1). I think the unequal variance t test is more useful when you think about it as a way to create a confidence interval. Your prime goal is not to ask whether two populations differ, but to quantify how far apart the two means are. The unequal variance t test reports a confidence interval for the difference between two means that is usable even if the standard deviations differ. How the unequal variance t test is computed Both t tests report both a P value and confidence interval. The calculations differ in two ways:
When to chose the unequal variance (Welch) t test Deciding when to use the unequal variance t test is not straightforward. It seems sensible to first test whether the variances are different, and then choose the ordinary or Welch t test accordingly. In fact, this is not a good plan. You should decide to use this test as part of the experimental planning. References 1. S.S. Sawilowsky. Fermat, Schubert, Einstein, and Behrens-Fisher: The Probable Difference Between Two Means With Different Variances. J. Modern Applied Statistical Methods (2002) vol. 1 pp. 461-472
The term nonparametric is used inconsistently. Nonparametric method or nonparametric data? The term nonparametric should only refers to an analysis method. A statistical test can be nonparametric or not, although the distinction is not as crisp as you'd guess. It makes no sense to describe data as being nonparametric, and the phrase "nonparametric data" should never ever be used. The term nonparametric simply does not describe data, or distributions of data. That term should only be used to describe the method used to analyze data. Which methods are nonparametric? Methods that analyze ranks are uniformly called nonparametric. These tests are all named after their inventors, including: Mann-Whitney, Wilcoxon, Kruskal-Wallis, Friedman, and Spearman. Beyond that, the definition gets slippery. What about modern statistical methods including randomization, resampling and bootstrapping? These methods do not necessarily assume any assumption about the population. They do not assume sampling from a Gaussian distribution. They analyze the actual data, and not the ranks. Are these methods nonparametric? Wilcox and Manly have each written texts about modern methods, but they do not refer to these methods as "nonparametric". Four texts of nonparametric statistics (by Conover, Gibbons, Lehman, and Daniel) don't mention randomization, resampling or bootstrapping at all, but the texts by Hollander and Wasserman do. What about chi-square test, and Fisher's exact test? Are they nonparametric? Daniel and Gibbons include a chapter on these tests their texts of nonparametric statistics, but Lehman and Hollander do not. What about survival data? Are the methods used to create a survival curve (Kaplan-Meier) and to compare survival curves (log-rank or Mantel-Haenszel) nonparametric? Hollander includes survival data in his text of nonparametric statistics, but the other texts of nonparametric statistics don't mention survival data at all. I think everyone would agree that fancier methods of analyzing survival curves (which involve fitting the data to a model) are not nonparametric. Two purposes for analyzes ranks rather than data I think the confusion arises because there are two distinct reasons to choose rank-based tests (like the Mann-Whitney test):
Once you get beyond the rank-based tests, these two goals do not always go together. Modern methods can be distribution free, but not robust to outliers. And some robust methods are not distribution free.The term 'nonparametric' can be confusing because it can be used as a synonym for three different phrases:
Because of these ambiguities, I would suggest avoiding the term nonparametric when possible. Instead, write up your analyses with the name of the test used.
January 20, 201050% of what? How exactly are IC50 and EC50 defined?The definition of EC50 and IC50 The concepts of IC50 and EC50 are fundamental to pharmacology. The EC50 is the concentration of a drug that gives half-maximal response. The IC50 is the concentration of an inhibitor where the response (or binding) is reduced by half. Seems simple enough. But when you actually go to fit data to determine these values, there are several complexities and ambiguities. The rest of this article is about IC50 (I for inhibition, for downward sloping dose-response curves). All the ideas can be applied to stimulatory curves and EC50 (E for effective) as well. Just stand on your head when you view the figures. The ideal situation This figure shows an ideal situation:
The green symbols show measurements made with controls. The ones on the left (Blank) have no inhibitor, so define "100%". The ones on the right are in the presence of a maximal concentration of a standard inhibitor, so define "0%". The data of the experimental dose-response curve (red dots) extend all the way between the two control values. When fitting this curve, you need to decide how to fit the top plateau of the curve. You have three choices:
The results will be very similar with any of these methods, because the data form a complete dose-response curve with a clear top plateau that is indistinguishable from the blank. I prefer the third method, as it analyzes all the data, but that is not a strong preference. Similarly, there are three ways to deal with the bottom plateau: Fit the data only, set Bottom to be a constant equal to the average of the NS controls, and put the NS controls into the fit as if they were a very high concentration of inhibitor. That is the ideal situation. There is no ambiguity about what IC50 means. A situation where IC50 can be defined in two ways This figure shows an unusual situation where the inhibition curve plateaus well above the control values (NS) defined by a high concentration of a standard drug. This leads to alternative definitions of IC50.
Clearly, a single value cannot summarize such a curve. You'd need at least two values, one to quantify the middle of the curve (the drug's potency) and one to quantify how low it gets (the drug's maximum effect). The graph above shows two definitions of the IC50. The relative IC50 is by far the most common definition, and the adjective relative is usually omitted. It is the concentration required to bring the curve down to point half way between the top and bottom plateaus of the curve. The NS values are totally ignored with this definition of IC50. This definition is the one upon which classical pharmacological analysis of agonist and antagonist interactions is based. With appropriate consideration of the biological system and concentrations of interacting ligands, estimated Kd values can often be derived from the IC50 value defined this way (not so for the "so-called absolute IC50" mentioned below). The concentration that provokes a response halfway between the Blank and the NS value is sometimes called the absolute IC50, The horizontal dotted lines show how 100% and 0% are defined, which then defines 50%. This term is not very standard, and is a bit misleading as there is nothing absolute about an "absolute IC50". Since this value does not quantify the potency of a drug, I think it is more miselading than helpful. Authors of the International Union of Pharmacology Committee on Receptor Nomenclature (1) agree that the concept of absolute IC50 (and that term) is not useful (R. Neubig, personal communication). If you really want to use the absolute IC50, here are instructions for fitting a curve to find it.
Incomplete dose-response curves
Any attempt to determine an IC50 by fitting a curve to the data in the graph above will be useless. A curve fitting program might, or might not, be able to fit a dose-response curve to the data. But if the curve fits, the value of the IC50 is likely to be meaningless and have a very wide confidence interval. The data simply don't form a top plateau (which would define 100) or a bottom plateau (which would define 0). If data haven't defined 100 or 0, then 50 is undefined too, as is the IC50.
If you also have control values that define 100 and 0, then the curve can be easily fit. The curve below was created by fitting a dose response curve, but constraining the Top plateau to be a constant value equal to the mean of the Blanks values, and the Bottom plateau equal to the mean of the NS values.
The value of the IC50 fit this way only makes sense if you assume that higher concentrations of the inhibitor would eventually inhibit down to the NS values. That is an assumption that can't be tested with the data at hand. The distinction between relative and absolute IC50 doesn't really apply to these data. Because the data don't define a bottom plateau, the IC50 must be defined relative to the NS control values.
Fitting normalized data As you can see from all the examples above, it is not necessary to normalize the data to run from100% down to 0%. You can fit curves using data in their natural units. A common mistake is to assume that fitting dose-response curves requires that data first be normalized. If you choose to normalize your data, it is essential that you think through carefully (and document in methods sections of papers) how 100% and 0% are defined. There are three strategies you can use:
If you fit normalized data, you probably want Prism to force the curve to go from 100 down to 0. It won't know to do this, unless you tell it. Don't make the common mistake of normalizing your data, but not constraining the curve to go from 100 down to 0. You can constrain the curve in two ways:
Summary The concept of IC50 (or EC50) is a bit ambiguous unless you clearly specify which values define 100% and 0%. Reference 1. R. R. Neubig et al. International Union of Pharmacology Committee on Receptor Nomenclature and Drug Classification. XXXVIII. Update on terms and symbols in quantitative pharmacology. Pharmacol Rev (2003) vol. 55 (4) pp. 597-606
Download Download the Prism file used to create all the graph in this article.
January 15, 2010Pooled SD in ANOVAANOVA (one- and two-way) assumes that all the groups are sampled from populations that follow a Gaussian distribution, and that all these populations have the same standard deviation, even if the means differ. Based on this assumption, ANOVA computes a pooled standard deviation. This value is used in post tests. The ANOVA results in Prism (and most programs) don't report this pooled standard deviation. But it is easy to calculate. As part of the ANOVA table, Prism reports several Mean Square values. One of these is the residual Mean Square (some programs use the term error rather than residual). The mean square values are essentially variances. The square root of the residual Mean Square is the pooled SD. How is this a pooled SD? First, review how a SD of one group is computed: Calculate the difference between each value and the group mean, square those differences, add them up, and divide by the number of degrees of freedom (df), which equals n-1. That value is the variance. Its square root is the SD. To compute the pooled SD from several groups, calculate the difference between each value and its group mean, square those differences, add them all up (for all groups), and divide by the number of df, which equals the total sample size minus the number of groups. That value is the residual mean square of ANOVA. Its square root is the pooled SD. This case study uses the concept of pooled SD.
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||