Drawing Causal Inferences

Whether it is inductive theorizing and research (trying to determine if there are general relationships by looking at samples, or specifics) or deductive theorizing and research (testing hypotheses; determining if proposed general relationships are found in specifics), we are likely to assert causal/functional propositions (i.e., to make causal inferences). However, as David Hume shows in A treatise of human nature, we cannot see causation directly. We only infer causal connection. Somehow, we feel more comfortable with the proposition "X causes Y" when the inference is drawn under certain evidentiary conditions, which are as follows:
1.  Evidence that the alleged cause preceded the alleged effect ("temporal priority").

2.  Empirical evidence that the alleged cause and effect occur together ("contiguity").

3.  Logical evidence that ties them together ("constant conjunction”).

4.  Evidence that alternative explanations (from extraneous variables) are implausible.

Let us examine each of these criteria.

Alleged Cause Precedes Alleged Effect.
         
Consider the following assertion. "An increase in teachers' authority to make curricular decisions (independent variable) fosters an increase in teachers' attachment to their school." This proposition (it could be an empirical generalization from research) seems plausible only if there is evidence that teachers' authority to make curricular decisions preceded an increase in teachers' attachment to their school. Evidence of temporal priority might be supplied by observation, experimental control, and/or commonsense reasoning (e.g., it is not likely that a house burned down and then someone smoked in bed).

Empirical Evidence of Association.
         
The inference that an increase in teachers' authority to make curricular decisions fosters an increase in teachers' attachment to their school, is more compelling if we have data showing that these two variables changed in close succession (V->Y: a proximal relationship) or in a sequence of variables that changed in close succession (V->W->X->Y: a distal relationship), and in the order asserted. Similarly, we can conclude that a family training program produced beneficial effects only if we have evidence of change in families and evidence that family members attended meetings, understood what was presented during meetings, and read and understood materials.

Evidence Provided by Inductive Logic.
         
Logical evidence is obtained by designing research, analyzing data, and interpreting findings such that we can apply one or more of John Stuart Mill's methods of inductive inference, as described in his A system of logic. These methods include: concomitant variation, agreement, difference, joint agreement and difference, and residues.

          1. The method of concomitant variation. If two variables are changing with respect to one another (e.g., both are increasing, both are decreasing, or one is increasing and the other is decreasing) while everything else remains at about the same level, then we have logical evidence that one variable is a cause or an effect of the other (or they are both being changed by a third variable.)

For instance, an experiment was conducted in a class of 20 elementary school children to identify what affects the rate of children's aggression. During the first experimental period (A1 or Baseline), the teacher was asked to go about her business and handle the children's aggression (operationally defined as hitting, kicking, insulting, etc.) in her usual way. Prior observation showed that her usual way involved staring at the "offender," reminding the offender of the rules, telling the offender to stop, or even giving the offender an enjoyable activity to "distract him" or "settle him down."

In the next experimental period (B1), the teacher was coached to ignore aggression and, instead, to comfort and reinforce other children who were engaging in nonaggressive behavior at that time.

In the third experimental period (A2), called a "reversal," the teacher was asked to do what she used to do during A1 (which, again, meant that she tried to stop aggresssion). And during the final period (B2), she was asked to go back to ignoring aggression and reinforcing nonaggression.

Let's say that we graph the number of aggressive acts per day. Suppose we find that when the children received a lot of teacher contact following aggression (A1 and A2), the rate of aggression was high, and when the amount of teacher contact following aggression decreased (B1 and B2), the rate of aggression decreased. Since nothing else in the classroom was changing along with changes in the teacher's responses to aggression, it is plausible to infer that changes in the teacher's responses somehow caused changes in the children's rate of aggression. [Note that it would be important to try to determine how the teacher and the children made sense of what was happening--the subjective and intersubjective sides of the social system.]

          2. The method of agreement. Imagine that we study twenty failed school reform efforts. Each school and each reform effort was a different configuration of variables (e.g., school size, socioeconomic status of school, location, teacher-student ratio, speed of reform). Despite these differences, however, all of the schools and failed reform efforts had one thing in common--staff did not fully understand and were not fully committed to the mission or the reform plans. Since nothing else in the schools and plans was common across the schools, it is reasonable to infer that the way in which they "agreed" (i.e., were the same) was the cause of the failed reform efforts.

          3. The method of difference. Mill's method of difference is the form of inductive logic used in the typical pre-test, post-test, experimental-group, control-group study. Let us say that we have a pool of 50 families whom we randomly assign to two comparison groups. One group receives written materials, ten weekly group meetings, and weekly home visits aimed to improve family interaction and home teaching. The second group receives written materials only. We compare pre-test and post-test scores on the quality of family interaction and home teaching. Families in the first group have significantly larger pre-post-test differences. What can we infer? Since we randomly assigned families to the two groups, any personal and family differences that might have accounted for improvement or lack of improvement (e.g., religion, support network, expectations of success, initial teaching skill) had an equal chance of being in each group. Therefore, we can assume that the groups were fairly similar on these extraneous factors. (Of course, we could also measure those factors that we think are important and see how similar the two groups actually are.) Since the only other systematic difference between the two groups (which we know about) was group meetings and home visits, it seems likely that these two features of the training made the difference in the amounts of improvement.

          4. The joint method of agreement and difference. This method combines the methods of agreement and difference. Let us take the above research on family training one step farther. We compared pre-post-test scores of families in the two groups which systematically differed only on whether they received written materials or received materials, meetings, and home visits. We used the method of difference to infer that the meetings and home visits accounted for the difference in improvement. Now imagine that, in addition, we obtain a large sample of families who differ in many ways (income, ethnicity, education, etc.). In each family we examine the quality of family interaction and teaching (dependent variables). We also examine whether each family reads materials on interaction and teaching (e.g., books, magazines), is part of some kind of group in which family interaction and teaching are discussed, or receives any in-home assistance or support (e.g., from relatives or other families) (independent variables). If we find that families who attend family-oriented meetings and receive home assistance also have higher quality family interaction and teaching, then we have logical evidence through the method of agreement that these variables make a difference. In summary, the combined use of the methods of agreement and difference provides compelling evidence.

          5. The method of residues. Imagine a situation in which some phenomenon (Y) might be explained by four factors. We may be able plausibly to infer the one that is the cause through a process of elimination. If we know that factor 1 is a cause of Q, factor 2 is a cause of R, and factor 3 is a cause of S, then factor 4, the only one left, is likely to be the cause of Y. As Sherlock Holmes used to tell Dr. Watson, when you eliminate all of the other possible explanations, the one that remains, improbable though it may seem, must be the correct explanation.

Alternative Explanations (from extraneous variables) are Implausible.
          This is done via research design
.  For example, using equivalent comparison groups (so that differences in outcome can’t logically be attributed to differences in groups); using validated instruments; checking reliability of measurements before and during the research; replicating the research with similar groups (to determine whether the findings are repeatable and likely to be “the way things really are”); replicating the research with different groups (to determine whether the findings are general or apply only to groups with certain characteristics)