The Inferential View of
Scientific Explanation
At the onset: Explanandum
and Explanans (Explicandum and Explicans)
That
which needs to be explained (explanandum) and that which contains the
explanation (explanans)—either as a cause, antecedent event, or necessary
condition.
What distinguishes
philosophy of science from other studies of science is that it
1.
Takes
a critical, evaluative approach recommending (or criticizing) certain methods
of analyzing data, drawing inferences, and offering explanations. (not merely
descriptive)
2.
There
is also an emphasis on conceptual analysis
--e.g.,
explaining what a scientific explanation is (should be), or, in other words,
what it means when we say that one thing "explains" another.
Philosophers often
discuss the meaning of many terms whose meaning other people take for granted.
This is the project of conceptual analysis.
Philosophers examine closely concepts crucial to the practice of some
discipline, etc. and the philosophical issues arising from these concepts. (e.g. Philosophy of Art, Philosophy of
Religion, Philosophy of Science, - Philosophy of Plumbing?)
The Difference Between
Explanation And Description
It is commonly agreed
that science aims at not only describing regularities in the
things that we observe around us, but also at explaining those
phenomena. But these are distinct projects. The regularities between phases of the moon
and the level of tides was noted (described) long before scientific explanations
were offered for this observed regularity.
Example 1: The Redshift
Phenomenon
We observe a regularity:
the redshift in the spectra of stars and distant galaxies.
Science seeks to explain why we
see this.
The physical principles
offered to explain this (analogous to the Doppler effect in sound), is a law
which holds that observed wavelengths are increased if the object is moving
away from the observer and shortened if the object is moving towards the
observer.
(Also, there is a
derivation in general relativity theory of the redshift phenomenon, due to
gravitation.)
Willem de Sitter (1872 –
1934) was a Dutch mathematician, physicist and astronomer.
In 1917, de Sitter predicted
that there would be a relationship between distance and redshift.
in 1927, Georges Lemaître, a Belgian Catholic priest and physicist,
published a paper in an obscure Belgian journal, Annales de la Société Scientifique de Bruxelles. In that paper, he showed that the data
collected by Edwin Hubble and two other astronomers up to that time was enough
to derive a linear velocity-distance relation between the galaxies, and that
this supported a model of an expanding universe based on Einstein's equations
for General Relativity.
This discovery was the
first observational support for the Big Bang theory which had been proposed by Lemaître in 1927. The observed velocities of distant
galaxies, taken together with the cosmological principle, appeared to show that
the universe was expanding in a manner consistent with the Friedmann-Lemaître model of general relativity.
In 1931 Hubble wrote a
letter to the Dutch cosmologist Willem de Sitter expressing his opinion on the
theoretical interpretation of the redshift-distance relation:
Mr.
Humason and I are both deeply sensible of your
gracious appreciation of the papers on velocities and distances of nebulae. We
use the term 'apparent' velocities to emphasize the empirical features of the
correlation. The interpretation, we feel, should be left to you and the very
few others who are competent to discuss the matter with authority.
Example 2: Mars “backing up” in the night sky
Retrograde refers to the
observed periodic reversal in paths of the outer planets in the night sky. We observed that there are such reversals
in the apparent paths the planets. And
we could even predict these mathematically based on formulae derived from those
observations. But observation and
prediction is not the same thing as explaining. What is needed for that is a
theory of the solar system which details how the planets' real motions produce
the apparent motions that we observe. As
a matter of fact, that did not come until long after the observations and the
predictions.
It was well known at least
since the days of Ptolemy (100-178) that motion of the planets could be equally
well accounted for by various mathematical models. The question then becomes whether to view a mathematical
description as revealing the underlying nature of the phenomenon or as merely
providing a convenient description of the observable phenomena (saving the
appearances). If these are merely models,
they describe and predict, but they explain nothing.
Example 3: Hempel’s Thermometer
We observe that when we
put a thermometer into a hot liquid the level drops initially and then rises.
But why?
“A
mercury thermometer is rapidly immersed in hot water; there occurs a temporary
drop of the mercury column, which is then followed by a swift rise. How is this
phenomenon to be explained? The increase in temperature affects at first only
the glass tube of the thermometer; it expands and thus provides a larger space
for the mercury inside, whose surface therefore drops. As soon as by heat
conduction the rise in temperature reaches the mercury, however, the latter
expands, and as its coefficient of expansion is considerably larger than that
of glass, a rise of the mercury level results.”[1]
Hempel goes on to suggest
the requirements of an adequate explanation in science:
This
account consists of statements of two kinds. Those of the first kind indicate
certain conditions which are realized prior to, or at the same time as, the
phenomenon to be explained; we shall refer to them briefly as antecedent
conditions. In our illustration, the antecedent conditions include, among
others, the fact that the thermometer consists of a glass tube which is partly
filled with mercury, and that it is immersed into hot water. The statements of
the second kind express certain general laws; in our case, these include the
law of the thermic expansion of mercury and of glass, and a statement about the
small thermic conductivity of glass. The two sets of statements, if
adequately and completely formulated explain the phenomenon under consideration:
they entail the consequence that the mercury will first drop, then rise. Thus,
the event under discussion is explained by subsuming it under general laws,
i.e., by showing that it occurred in accordance with those laws, in virtue of
the realization of certain specified antecedent conditions.
Three Approaches To
Explanation
A philosopher of science
asks:
1.
What
is the difference between describing a phenomenon and explaining it?
2.
What
makes something an adequate explanation?
3.
What
form(s) do explanations come in?
Initially we will look at
three basic accounts:
1.
Inferential
View (Hempel, Oppenheim)
–
An
explanation is a type of argument included in the premises sentences describing
antecedent conditions and sentences expressing laws
of nature.
Within
this type there are three varieties, differing from one another with respect to
the degree of confidence we can have in the conclusion of the argument given
the truth of the premises. (i.e. The "Deductive-Nomological" model of
explanation (D-N), The "Deductive-Statistical" model (D-S) and The
"Inductive-Statistical" model (I-S)
2.
Causal
View (Salmon, Lewis) –
An
explanation is a description of the various causes of the phenomenon:
to explain is to give information about the causal history that led to the
phenomenon.
3.
Pragmatic
View (van Fraassen) –
An
explanation is a body of information that implies that the phenomenon is more
likely than its alternatives, where the information is of the sort deemed
"relevant" in that context, and the class of alternatives to the
phenomenon are also fixed by the context.
Van Frassen’s view is very similar to the
inferential view in some respects, but his unique contribution is that he
points out and explains why the same “why” question can have multiple,
distinct, correct answers.
E.g.
Why did Adam
eat the apple? As opposed to…?
The Inferential Theory of
Explanation
Let’s start with Hempel's
inferential view.
The
"Deductive-Nomological" model of explanation. (D-N)
But also the two
probabilistic “siblings”
The
"deductive-statistical" model (D-S)
The
"inductive-statistical" model (I-S)
All three views of scientific
explanation are inferential. That is,
all believe the observed event can be inferred from an existing law and known
field conditions. This is the "received" or standard view of
explanation, still widely held and was generally agreed to by most philosophers
or science and scientists until the early 1960s. It, perhaps, reflects that fact that early
work in Philosophy of Science was done with “Physics” in mind. Physics was paradigmatic science and thus
explanations from physics were the sort of explanations these theories of
explanation were modeled on.
·
The
Hempel-Oppenheim paper appeared in 1948[2]
·
They
view explanation as a certain sort of argument: (premise sentences which
collectively imply a conclusion sentence).
·
The
arguments they considered were deductively valid arguments: if
the premises are true the conclusion has to be true.
(Something of a holdover
from Aristotle)
P:
All human beings are mortal.
P:
Socrates is a human being.
Therefore:
CS:
Socrates is mortal.
However, even if being a
deductively valid argument is necessary to being an explanation, it is clearly
not sufficient. Not every argument that
is deductive is an explanation. More
needs to be said. What further
conditions need to be added to narrow this account sufficiently?
To this end, Hempel and
Oppenheim offer their:
"General
Conditions of Adequacy"
These are meant to
specify when a deductive argument counts as an adequate explanation.
An explanation must:
(a)
be a valid deductive argument (hence "deductive")
(b)
contain essentially at least one general law of nature as a premise (hence
"nomological")
(c)
have empirical content
(i.e.,
it must be empirically falsifiable/ logically possible for an
observation-statement to contradict it)
These conditions are
formal, structural, semantic features.
The presence of these features can all be determined simply by looking at
the language before us.
But to complete the conditions of adequacy, Hempel and Oppenheim add a
fourth, "empirical" condition.
(d)
The premises (statements in the explanans) must all be true.
This further condition
cannot be known by simply examining the sentences themselves. One must actually investigate the world to
know whether the sentences are true or not.
Thus, on the inferential
view, explanations all have the following structure (where the antecedent
conditions and laws of nature make up the "explanans").
C1,
C2, ... , Cn (antecedent conditions)
L1,
... , Ln (law(s) of nature]
Therefore,
E
[explanandum]
We can look at the
statistical variants of this pattern later.
(An Aside) Laws of Nature
Central to their account
is the notion of a “Law of Nature.” They take a law of nature to be a true
"law-like" sentence.
(It
seems that they are not saying that a Law of Nature is expressed in/by a
sentence; it is the sentence. Perhaps
this is not crucial here.)
This means that a law is
a linguistic entity and they suggest that it is distinguished by its peculiar, linguistic
features. So we have a big basket of
sentences, only some of which are laws of nature. They attempt to specify the criteria by which
we can pick out all and only the laws of nature.
According to Hempel and
Oppenheim, laws are to be distinguished from other non-law sentences in a
language in that law sentences are
1.
universal
2.
have
unlimited scope
3.
contain
no designation of particular objects
4.
contain
only "purely qualitative" predicates.
EX:
"All gases that are heated under constant pressure expand in volume."
In short, Laws express
true universal claims. But even so, how
are we to distinguish a “laws of nature” sentence from other true sentences
expressing accidental universal generalizations (i.e., general universal truths
that happen to be true, though they are not true as a matter of
physical law). Where laws are held to
necessitate their explanandum, in cases of accidental generalizations, there is
no “necessity” to them. For example,
suppose that all the apples that ever get into my refrigerator are yellow. Then
the following is a true generalization:
“All
the apples in my refrigerator are yellow."
This sentence seems to be
true, universal, unlimited in scope, and containing only purely qualitative
predicates. But we hardly want to say it
is a “Law of Nature” that only yellow apples get into my fridge. One might object that the statement does not
meet one of their criteria: #3 contain no designation of particular
objects. Laws of nature are supposed to
refer to universal classes of objects (or phenomena).
Consider the sentence
which is a Law of Nature:
"All gases that are heated
under constant pressure expand in volume."
This is the very reason
that Hempel and Oppenheim include the requirement that to be a law of nature a
sentence must NOT designate any particular objects.
But we can easily get
around that with a bit of creative linguistic trickery.
“All
the apples in any and all refrigerators owned by Kenton Harris are yellow."
Or, alternatively, use
some physical description in terms of "purely qualitative" predicates
that constitute a definite description for my refrigerator from the class of
all refrigerators in the universe (a definite description which would reference
the -entire- unit class of my refrigerator such as its serial number). Such a description would pick out all the
members of a (unit) class and thus be universal and not explicitly
naming a particular object. Would the
existence of such a sentence convince you that you have discovered a new law of
nature?
(No.)
Moreover, consider the
following two true sentences.
G:
No gold sphere has a mass greater than 100,000 kilograms.
U:
No enriched uranium sphere has a mass greater than 100,000 kilograms.
Since the critical mass
of enriched uranium is just a few kilograms, U is rightly considered lawful
while G is not. Granted that U may be a
relatively “low-level” variety of law, given its particularity and that it
follows from more general, and therefore more properly considered, laws. Nevertheless, there is a necessity to U,
whereas G expresses a contingently true generalization.
This returns us to the question
of what, precisely, is a law of nature. Our
modern understanding of the term “law of nature” is an issue philosophers continue
to argue at length. For example,
philosopher John W. Carroll[3] compares similar
statements.
“All
gold spheres are less than a mile in diameter.”
And
“All
uranium-235 spheres are less than a mile in diameter.”
Our observations of the
world tell us that there are no gold spheres larger than a mile wide, and we
can be pretty confident there never will be. Still, we have no reason to
believe that there couldn’t be one, and so the statement is not considered a law. On the other hand, the statement “All
uranium-235 spheres are less than a mile in diameter.” could be thought of as a
law of nature because, according to what we know about nuclear physics, once a
sphere of uranium-235 grew to a diameter greater than about 6 inches (because
critical mass for uranium-235 is about 15 kilograms), it would demolish itself
in a nuclear explosion. (Boom!)
Hence we can be sure that
such spheres do not exist. The
distinction matters because it illustrates two important features about our
concept of a “Law of Nature.” First, contra
Hempel-Oppenheim, not all true generalizations we observe can be thought of as
laws of nature, but second, most laws of nature are thought to exist as part of
a larger, interconnected system of laws.
Laws of Nature and
Counterfactuals
A counterfactual claim is
a subjunctive, condition-counter-to-(actual)-fact. Nevertheless, we think some counterfactuals
are
true and others false.
1.
If
I had not been offered a job at FIU in 2003, I would have pursued a career in private
industry.
2.
If
Hitler had won WWII, Israel would not have been founded in 1948.
3.
If
you had just been a few inches taller, you would have a 4.0 GPA.
4.
If
the sun were made of pudding, I would be a famous soccer player.
All of these four claims
are counterfactuals; their antecedents express conditions counter to fact. But while #1 is true, and #2 is likely true,
#3 is likely false and #4 is nonsense, or quite near it. Counterfactuals are tricky; what precisely
are the truth conditions of a counterfactual or how might we otherwise know
them to me true?[4]
One reason that the true
universal statements about apples in my refrigerator and gold might not be laws
of nature, a reason NOT captured by the Hempel-Oppenheim analysis
of laws, is that they do not support inferences to counterfactual statements. On the basis of the true universal alone, we
do not suppose counterfactuals. For
instance from the (apparently) true universal “All planets that support
intelligent life have one and only one moon.” We do not suppose that, if the
Earth had had two moons, it would not have supported intelligent life. Nor from my “apple” universal do we suppose
that, if a red apple were to be placed into my refrigerator, it would turn
yellow. And again, we do not suppose
that if we were to collect 100,000 kilograms of gold into a sphere, it would
explode.
Genuine laws of nature,
by contrast, are supposed to support legitimate counterfactual inferences. From the law “All gases that are heated under
constant pressure expand.” we can infer that if a sample of gas in a particular
container were heated under constant pressure, it would expand. This seems true even if we never
actually heat that sample of gas under constant pressure. Similarly, we can infer that if were to
successfully gather 100,000 kilograms of uranium and try to fashion it into a
sphere, we would fail.
(Boom!)
The difference between
statements G and U does not seem to be something that can be captured purely linguistically.
Thus, Hempel and Oppenheim's view that
laws of nature are sentences of a certain sort must be fundamentally mistaken.
What laws of nature are
is still a matter of dispute. But, as we
are primarily concerned with scientific explanation here, let’s assume some
satisfactory account can be given of “Law of Nature” to alleviate this issue
and look at some more specific problems with the inferential account of
explanation.
Counterexamples To The
Inferential View Of Scientific Explanation: Asymmetry and Irrelevance
The Hempel-Oppenheim
analysis of scientific explanation has the following four key features.
(a) Inferential -
Explanations are arguments: to explain why E occurred is to provide
information that would have been sufficient to predict beforehand that E would
occur.
(b) Covering Law -
Explanations explain by showing that E could have
been predicted from the laws of nature, along with a complete
specification of the initial conditions.
(c) Explanation-Prediction
Symmetry - The information (i.e., laws, antecedent conditions)
appearing in an adequate explanation of E could have been used to predict E;
conversely, any information that can be used to predict E can be used after the
fact to explain why E occurred. Thus
there is a perfect symmetry between predicting X and explaining X. A prediction of X is just as appropriately
regarded as an explanation of X and vise versa.
(d) No Essential
Role for Causality - Laws of nature do not have to describe causal
processes. Thus “causes” seem to play no
role in these scientific explanations.
Counterexamples target
one or more of these features.
First Problem: Asymmetry: Hempel-Oppenheim assert that
explanation and prediction are symmetric, whereas that does not seem to be the
case, as the following examples show.
(1) Eclipse - You can predict when and
where a solar eclipse will occur using the laws governing the orbit of the
Earth around the Sun, and the orbit of the Moon around the Earth, as well as
the initial configuration these three bodies were in at an earlier time. Given that the Sun, Moon and Earth are in
certain position relative to each other now (field conditions) and given
certain known laws governing their motions (laws of nature) the eclipse will
occur at time T.
But you can also make the
same prediction by extrapolating backwards in time from the
subsequent positions of these three bodies.
Given
that the Sun, Moon and Earth are in certain position relative to each other now
(field conditions) and given certain known laws governing their motions (laws
of nature) the eclipse did occur at time T.
In other words, we can retrodict
(infer) that the eclipse had occurred by running a simulation
backward. However, only the first which references
law and conditions prior to the eclipse would count as an
explanation of why the eclipse occurs when and where it did. But that the sun and moon are in the positions
they are in now taken together with certain laws of motion does NOT explain why
an eclipse did occur. Thus, inference and explanation are NOT symmetric here.
(2) Flagpole - Using the laws of
trigonometry and the law that light travels in straight lines, you can predict
the length of the shadow that a flagpole of a certain height will cast when the
Sun is at a certain elevation. The known
height of the flagpole and the law-governed behavior of light thus explain the what
the shadow is the length that it is.
But
note, one can also determine what the height of the flagpole is by measuring
the length of its shadow and the elevation of the Sun. Nevertheless, the length of the shadow does
not explain the height of the flagpole. Only the first of these two derivations/inferences
would count as an explanation.
(3) Barometer - Using the laws governing
weather patterns, storm formation, and the effect of air pressure on the
behavior of barometers, you can predict that when a barometer falls that a
storm will soon follow.
You
can also predict that when a storm is approaching, the barometer will fall.
However,
in this case, neither of these inferences are explanatory, since
both are explained by antecedent atmospheric conditions.
In each of the cases the
inferential relations between the sets of facts is symmetric, but the
explanatory relationship is not. This
suggests that the nature of explanation is not (fully) captured by the current
inferential account.
Second Problem: Irrelevance: the Hempel-Oppenheim analysis would
in some cases endorse information as explanatory when it is irrelevant to the
explanandum.
(1) Birth Control Pills - All men who take birth
control pills never get pregnant. Thus, from the fact that John is taking birth
control pills we can infer logically that he won't get pregnant.
All
men who take birth control pills fail to get pregnant.
John
is a man who takes birth control pills
Therefore
John
will fail to get pregnant.
By their criteria, the true
universal together with the field conditions explains why John does not get
pregnant. However, this would hardly be
an explanation of John's failing to get pregnant since he couldn't have gotten
pregnant whether or not he took birth control pills.
(2) Hexed Salt - All salt that has had a
hex placed on it by a witch will dissolve in unsaturated water. Hence, we can
logically infer from the fact that a sample of salt that had a hex placed on it
by a witch that it will dissolve in water.
However,
this wouldn't give us an explanation of why the salt dissolved in the unsaturated
water since the salt would have dissolved even if it hadn't been hexed.
Note
in both these cases the failure to support counterfactual reasoning.
The Causal Theory of Explanation
A third set of problems
arise from the contention that to explain a phenomenon is merely to provide
information sufficient to predict that it will occur. Some explanations for events do not in fact
provide us with the ability to have predicted the event would have occurred. This is more easily seen in the
inductive-statistical model of explanation.
The I-S differs from D-N
explanation only in that the laws that are cited in the explanation can be
statistical.
Example,
it is a law of nature that 90% of electrons in a 90-10 superposition of spin-up
and spin-down will go up if passed through a vertically oriented Stern-Gerlach
magnet. (Or so I have been told. 😊)
With this information we
can construct a statistical syllogism similar to the D-N inferences.
P1.
Ninety percent of electrons in a 90-10 superposition of spin-up and spin-down
will go up if passed through a vertically oriented Stern-Gerlach magnet. (Law
of Nature)
P2.
This electron is in a 90-10 superposition of spin-up and spin-down and is
passed through a vertically oriented Stern-Gerlach magnet. (Statement of
Initial Conditions)
Therefore
CS:
This electron goes up. (Explanandum) [90% chance]
This argument pattern is
obviously similar to that exhibited by D-N explanation, the only difference
being that the law in the inductive argument stated above is statistical rather
than a universal generalization. On the
inferential view, this argument constitutes an explanation since the
initial conditions and laws confer a high probability (though not
a 100% probability) on the explanandum. (i.e. If you knew that these laws and
initial conditions held of a particular electron, you could predict with high
confidence that the electron would go up.)
First, this seems
odd. How does the fact that an event
frequently occurs explain the fact that the event frequently
occurs? It strikes me as a case of an unsatisfying
“dispositional” explanation. This is something
like saying that my boastfulness explains my pompous and boorish
behavior. “And what does it mean to say
I am boastful?” you ask. Well I mean
that I am such as to frequently behave pompously and boorishly. Similarly, I might “explain” that the window
shattered when struck with a hammer because it is fragile. And what does it mean to say a think is
fragile? It means it is such as to
shatter when it is hit with a hammer.
Thus one is saying that the reason the window shattered when struck with
a hammer is because the window is such as to shatter when struck with a
hammer. I think non-statistical laws may
suffer from that same problem. Why do
unsupported objects close to the surface of the Earth fall at a rate of 32 feet
per second/ per second? Oh, because they
always
do that!
Second, you can't always
use explanatory information as the basis for a prediction. That is because we frequently offer
explanations of events with low probability.
For instance two heterogeneous brown-eyed individuals having a
brown-eyed child is highly probable. Two
heterogeneous brown-eyed individuals having a blue-eyed child is
improbable. Yet the corresponding probability
of both events seem to be explained by exactly the same processes
and “laws.”
(Numbers in the examples
below are for purposes of illustration only.)
Atomic Blasts &
Leukemia.
We might explain why a person
contracted leukemia by pointing out the person was once only two miles away
from an atomic blast, and that exposure to an atomic blast from that distance
increases one's chances of contracting leukemia in later life, this despite the
fact that only 1 in 1,000 persons exposed to an atomic blast eventually
contract leukemia. Nevertheless,
exposure to an atomic blast explains the leukemia since people who haven't been
exposed to an atomic blast have a much lower probability (say, 1 in 10,000) of
contracting leukemia. But notice that, while
once being only two miles away from an atomic blast can be said to explain the
fact that an individual contracted leukemia, it does not make it so probable
that an individual will contract leukemia that you would predict this result (e.g.
greater that 50%).
Smoking & Lung
Cancer.
We can explain why
someone contracted lung cancer by pointing out that the person had smoked two
packs of cigarettes a day for forty years.
Actually this is offered and accepted as a pretty good explanation of
why this person got lung cancer. This is
an explanation since people who smoke that much have a much higher probability
(say, 1 in 100) of contracting lung cancer than non-smokers (say, 1 in 10,000).
Still, the vast majority of smokers (99
percent) will never contract lung cancer.
Thus, it would be false to say this makes it probable that the person
will get lung cancer.
The point here is that, In
each of these cases, you can't predict that the result will occur
since the information does not confer a high probability on the result. Nevertheless, the information offered
constitutes an explanation of that result, since it increases the probability
that that result will occur.
Wesley Salmon’s Statistical
Explanation (which he later rejects)
In the 1960s and 1970s,
Wesley Salmon developed a view of statistical explanation that postulated that,
contrary to what Hempel claimed earlier, high probability was NOT
necessary for an explanation, but only positive statistical relevance.
Definition. A hypothesis h is
positively relevant (correlated) to e if h makes e more
likely, i.e., pr(h|e) > pr(~h|e).
The problem Salmon faced
was distinguishing cases where the information could provide a substantive
explanation from cases where the information reported a mere correlation
and so could not. This often occurs when what explains one of the correlates
also explains the other (common cause).
They are positively correlated because they have a common explanation. For example, having nicotine stains on one's
fingers is positively correlated with contracting lung cancer, but you could
not explain why a person contracted lung cancer by pointing out that the person
had nicotine stains on their fingers.
Distinguishing these
cases proved to be impossible using purely formal (statistical) relations.
Obviously, some other type of information was needed to make the distinction. Ultimately, Salmon rejected the received view
of explanation. Eventually Salmon came
to believe that to explain a phenomenon is NOT to offer information sufficient
for a person to predict that the phenomenon will occur, but to give information
about the causes of that phenomenon. On this view, an explanation is NOT
a type of argument containing laws of nature as premises but an assembly of
statistically relevant information about an event's causal history.
Two Reasons in Support of Causal
Explanations
1.
The Explanans must precede the explanadum
The
initial conditions given in the explanatory information have to precede the
explanandum temporally to constitute an explanation of the explanandum.
Hempel's theory has no restriction of this sort.
The
eclipse example illustrates this fact: you can just as well use information
about the subsequent positions of the Sun and Moon to derive that an eclipse
occurred at an earlier time as use information about the current positions of
the Sun and Moon to derive that an eclipse will occur later.
The
former is a case of retrodiction, whereas the latter is a (familiar) case of
prediction. This is an example of the prediction-explanation symmetry
postulated by Hempel. However, as we saw earlier when discussing the problem of
asymmetry, only the forward-looking derivation counts as an explanation.
Interestingly,
Salmon points out that the temporal direction of explanation matches the
temporal direction of causation, which is forward-looking (i.e., causes must
precede their effects in time).
2.
Not all derivations from laws
count as explanations.
Salmon
argues that some D-N "explanations" (e.g., a derivation from the
ideal gas law and a description of initial conditions) are not explanations at
all. The ideal gas law simply describes a set of constraints on how various
parameters (pressure, volume, and temperature) are related; it does not explain
why these parameters are related in that way.
Why
these constraints exist (in the sense of the sufficient reason) is a substantive
question that is answered by the kinetic theory of gases.
Another
example: People knew for centuries how the phases of the Moon were related to
the height of tides, but simply describing how these two things are related did
not constitute an explanation. An explanation was not provided until Newton
developed his theory of gravitation.
Salmon
argues that the difference between explanatory and non-explanatory laws is that
the former describe causal processes, whereas non-explanatory laws (such as the
ideal gas law) only describe empirical regularities.
Salmon’s Causal Account
of Explanation
An
explanation is a body of information about the causes of a particular event.
Salmon's theory of causal
explanation has three elements.
(1) Statistical
Relevance - the explanans (C) increases the probability of the
explanandum (E), i.e., pr(E|C) > pr(E).
(2) Causal
Processes - the explanans and the explanandum are both parts of
different causal processes
(3) Causal
Interaction - these causal processes interact in such a way as to
bring about the event (E) in question
Ah, but then, what
precisely is a “causal process?”
Salmon's view is that
causal processes are characterized by two features.
1.
A
causal process is a sequence of events in a continuous region of spacetime.
2.
A
causal process can transmit information (a "mark")
I will not elaborate
here.
Now we turn to Explanation
Common Cause
According to Salmon, a
powerful explanatory principle is that whenever there is a coincidence
(correlation) between the features of two processes, the explanation is an
event common to the two processes that accounts for the correlation. This is a
"common cause." To cite an example discussed earlier, there is a
correlation between lung cancer (C) and nicotine stains on a person's fingers
(N). That is,
Pr (C|N) > Pr(C).
The probability of C
given N is greater than the probability of C alone.
The common cause of these
two events is a lifetime habit of smoking two packs of cigarettes each day. Let
this fact be (S).
Relative to S, C and N
are independent of one another and thus neither is the cause of the other.,
i.e.,
Pr (C|N&S) = Pr(C|S).
The probability of C
given N&S is no different than the probability of C given S alone. Therefore, N has nothing to do with the
probability of C.
Thus N adds nothing to
the probability of C. The same can be
said about C in relation to N.
This is sometimes
expressed as "S screens C off from N" That is, once S is brought into the picture N becomes
irrelevant. This is part of a precise
definition of "common cause," which is constrained by the formal
probabilistic conditions.
Salmon's attempt here is
to analyze a type of explanation that is commonly used in science, but the
notion of causal explanation can be considered more broadly than he does. For
example, David Lewis points out that the notion of a causal explanation is
quite fluid. In his essay on causal
explanation[5],
he points out that there is an extremely rich causal history behind every
event. Lewis too argues that to explain
an event is to provide some information about its causal history.
The question arises, what
kind of information? Where to “start”
and where to “end?” There might be many
situations in which we might only want a partial description of
the causal history. Consider trying to
assign blame for an automobile accident in a court trial. We need not cite all the
physical principles at work to account for the accident.
We might consider:
1.
whether
gas was available for him to drive his car.
2.
whether
he was old enough to have received a driver's license by the time of the
accident.
3.
whether
he was likely to have lived to the age that he did.
These features all figure
into the causal history of the event, but they are not needed for the kind
of explanation we seek. While all
of these are part of the "causal history" leading up to the person
having an accident while drunk, but we would not want to cite any of these as
causing or explaining the accident. ("Explaining Well vs. Badly.") One might object that the availability of gas
was not "the" cause of the person's accident. But it is a mistake to think we can really
single out “the cause.” We can (must) focus
on a given chunk of the causal history leading up to the event to get at the explanation
we are looking for.
Lewis separates the
causal history--any portion of which can in principle be cited in a given
explanation--from the portion of that causal history that we are interested in
or find most salient at a given time. We
might not be interested in some of the information about any portion of the
causal history, Lewis says, but it remains the case that to explain and event
is to give information about the causal history leading up to that event. Or we may just want to know something about
the type of causal history that leads to events of that sort, and so on.
To explain then is to
give information about a causal history, but giving information about a causal
history is not limited to citing one or more causes of the event in
question. Lewis allows negative
information about the causal history to count as an explanation:
·
There
was nothing to prevent it from happening.
·
There
was no state for the collapsing star to get into.
·
There
was no connection between the CIA agent being in the room and the Shah's death. It was just a coincidence.
Problems With the Causal Account
of Explanation
If this account is
correct (exhaustive), then there should be no explanations that fail to cite information
about the causal history of the explanandum.
Is this so? Remember the D-N
explanation was long thought to model legitimate explanations and it does not
cite causal histories. Do all purported
D-N explanations fail to be explanations?
Salmon seems to say as much. He
argues that
1.
Non-causal
laws allow for "backwards" explanations.
2.
Cry
out to be explained themselves.
3.
Are
in fact simply descriptions of empirical regularities that themselves need to be
explained.
The same might occur in
the redshift case, if the law connecting the redshift with the velocity was
simply an empirical generalization. (Also, consider Newton's explanation of the
tides.)
Van Fraassen's
Pragmatic View Of Explanation
There are two basic challenges that
can be given to the causal view:
1. Sometimes non-causal generalizations can
explain (see above)
2. Laws can be explained by other laws. But such a relationship between laws does not
seem to be causal. Laws do not cause
other laws, since neither is an event.
The Basic Elements Of The Pragmatic
View Of Explanation
Van Fraassen
gives a somewhat pragmatic, epistemic view of explanation. An explanation is a particular type of answer
to a “why-question.” A satisfying
explanation is an answer that provides relevant information that
"favors" the event to be explained over its alternatives.
These features are determined by the
context in which the why-question is asked.
According to van Fraassen,
a why-question consists of:
(1)
A
presupposition (Why X)
(2)
S contrast
class (Why X rather than Y, Z, and so on)
(3)
An implicitly
understood criterion of relevance.
Information given in response to a
particular why-question constitutes an explanation of the presupposition if the
information is relevant and "favors" the presupposition over the
alternatives in its contrast class.
E.g.
·
Why
did Adam
eat the apple? (as opposed to Charlie?)
·
Why
did Adam eat the apple? (as
opposed to baking it into a pie?)
·
Why
did Adam eat the apple? (as opposed to a pear?)
Context fixes relevant
alternatives and thus, what it is that needs to be explained.
Both the contrast class and the
criterion of relevance are contextually determined, based on interests of those
involved. Therefore, According to van Fraasen, subjective interests define what count as an
explanation in that context, but then it's an objective matter whether that
information really favors the presupposition over the alternatives in its contrast
class.
Contrasts Between The Pragmatic And
Causal Views Of Explanation
Any type of information can be
counted as relevant. Of course, it's a scientific
explanation if only information provided by science counts. However, even so, there might be different kinds
of scientific explanation; not any old information will do. Again, context (interests) determines when
something counts as an explanation, that is, when we would find an explanation
interesting or salient.
According to Lewis, what makes it an explanation
is that it gives information about the causal history leading up to a given
event; whether we find that explanatory information interesting or salient is
another matter. By contrast, on the
pragmatic view a mere description of the causal history leading up to an event --even
a God-like complete one - -is not an explanation of any sort. According to the pragmatic view, even God
could never have an explanation of an event, unless he had interests. This is because it must serve some actual
interest to be an explanation.
Asymmetries Between Inference and Explanations
Not Really a Problem
VanFraassen suggests that asymmetries only exist because
of the context; thus, they can be reversed with a change in context. To illustrate this point, he offers a story when
the height of a tower is explained by the length of its shadow, rather than the
other way around. In van Fraassen's story, a character offers the following
explanation of the height of a tower:
“That tower
marks the spot where the Chevalier killed the maid with whom he had been in
love to the point of madness. And the height of the tower? He vowed that shadow
would cover the terrace where he first proclaimed his love, with every setting
sun-that is why the tower had to be so high.
Van Fraassen
here suggests that explanations are arguments, but only relative to context. We assess the explanatory merits of the
derivations by tacitly supposing contexts that occur in everyday life. With a
little imagination, we can see that there are alternative contexts in which the
argument we normally would dismiss would count as explanatory.
Lewis' Objection to the Tower example:
Van Fraassen's
story supposedly describes a context in which the length of the shadow explains
the height of the tower. But this will
solve the traditional problem of the asymmetries of explanation only if one can
claim that the argument underlying the quoted passage is the argument that the
unimaginative have dismissed as nonexplanatory. But the explanation van Fraassen
relates does not in fact take the form of deducing the height of the tower from
the length of the shadow (with the
elevation of the sun and the principles of optics as the only other premises).
What is really doing the explaining
is the intentions and beliefs of the Chevalier.
This is what causes the tower to be of a certain height. We must begin with some initial conditions
about the psychological characteristics of the Chevalier (He wanted to build a
tower with certain properties. He knew
certain physical facts.) Using general
principles of rationality, we infer a statement to the effect that the
Chevalier came to believe that, if he built a tower of the appropriate height
on the appropriate spot, it would meet his desiderata.
Thus Lewis claims that van Fraassen is mistaken.
His story does not provide a context in which the argument wrongly
dismissed as explanatory shows its explanatory worth (mere length explains height)
by a shift in context alone.
“Moreover,
since van Fraassen points out, quite explicitly, the
dependency on desires, we take him to appreciate that his story does not solve
the traditional problem of the asymmetries of explanation.
Further, can you think of a story in
which the redshift would explain the galaxies moving away? Where human intention is not possible, it
seems difficult; this would seem to confirm Lewis' diagnosis of the Tower
story.
[1] Hempel, Carl,” Studies in the Logic of Explanation” http://people.loyno.edu/~folse/hempelstudies.htm
[2] Carl G. Hempel and Paul Oppenheim “Studies in the Logic of Explanation” Philosophy of Science Vol. 15, No. 2 (Apr., 1948), pp. 135-175 (41 pages)
[3] Laws of Nature, John W. Carroll Cambridge University Press (1994)
[4] One popular way of understanding the truth conditions of counterfactuals employs the use of possible worlds discourse. To say that a counterfactual is true is to say that in the nearest possible world where the antecedent obtains the consequent obtains.
[5] Lewis, David [1973]: `Causation’ Journal of Philosophy 70, pp. 556-567.