Structure of Explanation

When we explain a thing, we give a cause; we assign the thing an origin that explains it.

We can go into a little more detail here. When we ask “why” something is the case, there is always an implication of possible alternatives. At the very least, the question implies, “Why is this the case rather than not being the case?” Thus “being the case” and “not being the case” are two possible alternatives.

The alternatives can be seen as possibilities in the sense explained in an earlier post. There may or may not be any actual matter involved, but again, the idea is that reality (or more specifically some part of reality) seems like something that would be open to being formed in one way or another, and we are asking why it is formed in one particular way rather than the other way. “Why is it raining?” In principle, the sky is open to being clear, or being filled with clouds and a thunderstorm, and to many other possibilities.

A successful explanation will be a complete explanation when it says “once you take the origin into account, the apparent alternatives were only apparent, and not really possible.” It will be a partial explanation when it says, “once you take the origin into account, the other alternatives were less sensible (i.e. made less sense as possibilities) than the actual thing.”

Let’s consider some examples in the form of “why” questions and answers.

Q1. Why do rocks fall? (e.g. instead of the alternatives of hovering in the air, going upwards, or anything else.)

A1. Gravity pulls things downwards, and rocks are heavier than air.

The answer gives an efficient cause, and once this cause is taken into account, it can be seen that hovering in the air or going upwards were not possibilities relative to that cause.

Obviously there is not meant to be a deep explanation here; the point here is to discuss the structure of explanation. The given answer is in fact basically Newton’s answer (although he provided more mathematical detail), while with general relativity Einstein provided a better explanation.

The explanation is incomplete in several ways. It is not a first cause; someone can now ask, “Why does gravity pull things downwards, instead of upwards or to the side?” Similarly, while it is in fact the cause of falling rocks, someone can still ask, “Why didn’t anything else prevent gravity from making the rocks fall?” This is a different question, and would require a different answer, but it seems to reopen the possibility of the rocks hovering or moving upwards, from a more general point of view. David Hume was in part appealing to the possibility of such additional questions when he said that we can see no necessary connection between cause and effect.

Q2. Why is 7 prime? (i.e. instead of the alternative of not being prime.)

A2. 7/2 = 3.5, so 7 is not divisible by 2. 7/3 = 2.333…, so 7 is not divisible by 3. In a similar way, it is not divisible by 4, 5, or 6. Thus in general it is not divisible by any number except 1 and itself, which is what it means to be prime.

If we assumed that the questioner did not know what being prime means, we could have given a purely formal response simply by noting that it is not divisible by numbers between 1 and itself, and explaining that this is what it is to be prime. As it is, the response gives a sufficient material disposition. Relative to this explanation, “not being prime,” was never a real possibility for 7 in the first place. The explanation is complete in that it completely excludes the apparent alternative.

Q3. Why did Peter go to the store? (e.g. instead of going to the park or the museum, or instead of staying home.)

A3. He went to the store in order to buy groceries.

The answer gives a final cause. In view of this cause the alternatives were merely apparent. Going to the park or the museum, or even staying home, were not possible since there were no groceries there.

As in the case of the rock, the explanation is partial in several ways. Someone can still ask, “Why did he want groceries?” And again someone can ask why he didn’t go to some other store, or why something didn’t hinder him, and so on. Such questions seem to reopen various possibilities, and thus the explanation is not an ultimately complete one.

Suppose, however, that someone brings up the possibility that instead of going to the store, he could have gone to his neighbor and offered money for groceries in his neighbor’s refrigerator. This possibility is not excluded simply by the purpose of buying groceries. Nonetheless, the possibility seems less sensible than getting them from the store, for multiple reasons. Again, the implication is that our explanation is only partial: it does not completely exclude alternatives, but it makes them less sensible.

Let’s consider a weirder question: Why is there something rather than nothing?

Now the alternatives are explicit, namely there being something, and there being nothing.

It can be seen that in one sense, as I said in the linked post, the question cannot have an answer, since there cannot be a cause or origin for “there is something” which would itself not be something. Nonetheless, if we consider the idea of possible alternatives, it is possible to see that the question does not need an answer; one of the alternatives was only an apparent alternative all along.

In other words, the sky can be open to being clear or cloudy. But there cannot be something which is open both to “there is something” and “there is nothing”, since any possibility of that kind would be “something which is open…”, which would already be something rather than nothing. The “nothing” alternative was merely apparent. Nothing was ever open to there being nothing.

Let’s consider another weird question. Suppose we throw a ball, and in the middle of the path we ask, Why is the ball in the middle of the path instead of at the end of the path?

We could respond in terms of a sufficient material disposition: it is in the middle of the path because you are asking your question at the middle, instead of waiting until the end.

Suppose the questioner responds: Look, I asked my question at the middle of the path. But that was just chance. I could have asked at any moment, including at the end. So I want to know why it was in the middle without considering when I am asking the question.

If we look at the question in this way, it can be seen in one way that no cause or origin can be given. Asked in this way, being at the end cannot be excluded, since they could have asked their question at the end. But like the question about something rather than nothing, the question does not need an answer. In this case, this is not because the alternatives were merely apparent in the sense that one was possible and the other not. But they were merely apparent in the sense that they were not alternatives. The ball goes both goes through the middle, and reaches the end. With the stipulation that we not consider the time of the question, the two possibilities are not mutually exclusive.

Additional Considerations

The above considerations about the nature of “explanation” lead to various conclusions, but also to various new questions. For example, one commenter suggested that “explanation” is merely subjective. Now as I said there, all experience is subjective experience (what would “objective experience” even mean, except that someone truly had a subjective experience?), including the experience of having an explanation. Nonetheless, the thing experienced is not subjective: the origins that we call explanations objectively exclude the apparent possibilities, or objectively make them less intelligible. The explanation of explanation here, however, provides an answer to what was perhaps the implicit question. Namely, why are we so interested in explanations in the first place, so that the experience of understanding something becomes a particularly special type of experience? Why, as Aristotle puts it, do “all men desire to know,” and why is that desire particularly satisfied by explanations?

In one sense it is sufficient simply to say that understanding is good in itself. Nonetheless, there is something particular about the structure of a human being that makes knowledge good for us, and which makes explanation a particularly desirable form of knowledge. In my employer and employee model of human psychology, I said that “the whole company is functioning well overall when the CEO’s goal of accurate prediction is regularly being achieved.” This very obviously requires knowledge, and explanation is especially beneficial because it excludes alternatives, which reduces uncertainty and therefore tends to make prediction more accurate.

However, my account also raises new questions. If explanation eliminates alternatives, what would happen if everything was explained? We could respond that “explaining everything” is not possible in the first place, but this is probably an inadequate response, because (from the linked argument) we only know that we cannot explain everything all at once, the way the person in the room cannot draw everything at once; we do not know that there is any particular thing that cannot be explained, just as there is no particular aspect of the room that cannot be drawn. So there can still be a question about what would happen if every particular thing in fact has an explanation, even if we cannot know all the explanations at once. In particular, since explanation eliminates alternatives, does the existence of explanations imply that there are not really any alternatives? This would suggest something like Leibniz’s argument that the actual world is the best possible world. It is easy to see that such an idea implies that there was only one “possibility” in the first place: Leibniz’s “best possible world” would be rather “the only possible world,” since the apparent alternatives, given that they would have been worse, were not real alternatives in the first place.

On the other hand, if we suppose that this is not the case, and there are ultimately many possibilities, does this imply the existence of “brute facts,” things that could have been otherwise, but which simply have no explanation? Or at least things that have no complete explanation?

Let the reader understand. I have already implicitly answered these questions. However, I will not link here to the implicit answers because if one finds it unclear when and where this was done, one would probably also find those answers unclear and inconclusive. Of course it is also possible that the reader does see when this was done, but still believes those responses inadequate. In any case, it is possible to provide the answers in a form which is much clearer and more conclusive, but this will likely not be a short or simple project.

Perfectly Random

Suppose you have a string of random binary digits such as the following:

00111100010101001100011011001100110110010010100111

This string is 50 digits long, and was the result of a single attempt using the linked generator.

However, something seems distinctly non-random about it: there are exactly 25 zeros and exactly 25 ones. Naturally, this will not always happen, but most of the time the proportion of zeros will be fairly close to half. And evidently this is necessary, since if the proportion was usually much different from half, then the selection could not have been random in the first place.

There are other things about this string that are definitely not random. It contains only zeros and ones, and no other digits, much less items like letters from the alphabet, or items like ‘%’ and ‘$’.

Why do we have these apparently non-random characteristics? Both sorts of characteristics, the approximate and typical proportion, and the more rigid characteristics, are necessary consequences of the way we obtained or defined this number.

It is easy to see that such characteristics are inevitable. Suppose someone wants to choose something random without any non-random characteristics. Let’s suppose they want to avoid the first sort of characteristic, which is perhaps the “easier” task. They can certainly make the proportion of zeros approximately 75% or anything else that they please. But this will still be a non-random characteristic.

They try again. Suppose they succeed in preventing the series of digits from converging to any specific probability. If they do, there is one and only one way to do this. Much as in our discussion of the mathematical laws of nature, the only way to accomplish this will be to go back and forth between longer and longer strings of zeros and ones. But this is an extremely non-random characteristic. So they may have succeeded in avoiding one particular type of non-randomness, but only at the cost of adding something else very non-random.

Again, consider the second kind of characteristic. Here things are even clearer: the only way to avoid the second kind of characteristic is not to attempt any task in the first place. The only way to win is not to play. Once we have said “your task is to do such and such,” we have already specified some non-random characteristics of the second kind; to avoid such characteristics is to avoid the task completely.

“Completely random,” in fact, is an incoherent idea. No such thing can exist anywhere, in the same way that “formless matter” cannot actually exist, but all matter is formed in one way or another.

The same thing applies to David Hume’s supposed problem of induction. I ended that post with the remark that for his argument to work, he must be “absolutely certain that the future will resemble the past in no way.” But this of course is impossible in the first place; the past and the future are both defined as periods of time, and so there is some resemblance in their very definition, in the same way that any material thing must have some form in its definition, and any “random” thing must have something non-random in its definition.

 

Violations of Bell’s Inequality: Drawing Conclusions

In the post on violations of Bell’s inequality, represented there by Mark Alford’s twin analogy, I pointed out that things did not seem to go very well for Einstein’s hope for physics, I did not draw any specific conclusions. Here I will consider the likely consequences, first by looking at the relationship of the experiments to Einstein’s position on causality and determinism, and second on their relationship to Einstein’s position on locality and action at a distance.

Einstein on Determinism

Einstein hoped for “facts” instead of probabilities. Everything should be utterly fixed by the laws, much like the position recently argued by Marvin Edwards in the comments here.

On the face of it, violations of Bell’s inequality rule this out, represented by the argument that if the twins had pre-existing determinate plans, it would be impossible for them to give the same answer less than 1/3 of the time when they are asked different questions. Bell however pointed out that it is possible to formulate a deterministic theory which would give similar probabilities at the cost of positing action at a distance (quoted here):

Moreover, a hidden variable interpretation of elementary quantum theory has been explicitly constructed. That particular interpretation has indeed a grossly non-local structure. This is characteristic, according to the result to be proved here, of any such theory which reproduces exactly the quantum mechanical predictions.

Nonetheless, I have set aside action at a distance to be discussed separately, and I would argue that we should accept the above surface appearance: the outcomes of quantum mechanical experiments are actually indeterministic. These probabilities represent something in the world, not merely something in our knowledge.

Why? In the first place, note that “reproduces exactly the quantum mechanical predictions” can be understood in two ways. A deterministic theory of that kind would say that because the details are unknown to us, we cannot know what is going to happen. But the details are there, and they in fact determine what is going to happen. There is still a difference on the object level between a world where the present fixes the future to a single possibility, and one in which the future is left open, as Aristotle supposed.

Of course there is no definitive proof here that we are actually in the situation with the open future, although the need for action at a distance in the alternative theory suggests that we are. Even apart from this, however, the general phenomena of quantum mechanics directly suggest that this is the situation. Even apart from violations of Bell’s inequality, quantum mechanics in general already looked exactly as we should have expected a world with an indeterminate future to look.

If this is the case, then Einstein was mistaken on this point, at least to this extent. But what about the deterministic aspect, which I mentioned at the end of this post, and which Schrödinger describes:

At all events it is an imagined entity that images the blurring of all variables at every moment just as clearly and faithfully as does the classical model its sharp numerical values. Its equation of motion too, the law of its time variation, so long as the system is left undisturbed, lags not one iota, in clarity and determinacy, behind the equations of motion of the classical model.

The answer is that this is deterministic not because the future, as we know it, is deterministic, but because it describes all of the possibilities at once. Thus in the case of the cat it includes both the cat living and the cat dying, which are two possible outcomes. It is “deterministic” only because once you have stated all of the alternatives, there is nothing left to say.

Why did Einstein want a deterministic theory? He openly admits that he does not have a convincing argument for it. It seems likely, however, that the fundamental motivation is the conviction that reality is intelligible. And an indeterministic world seems significantly less intelligible than a deterministic one. But this desire can in fact be satisfied by this second kind of “determinism”; thus Schrödinger calls it “one perfectly clear concept.”

In this respect, Einstein’s intuition was not mistaken. It is possible to give an intelligible account of the world, even a “deterministic” one, in this sense.

Einstein on Locality

Einstein also wanted to avoid “spooky action at a distance.” Admitting that the future is indeterminate, however, is not enough to avoid this conclusion. In Mark Alford’s twin analogy, it is not only pre-determined plans that fail, but also plans that involve randomness. Thus it first appears that the violations of Bell’s inequality absolutely require action at a distance.

If we follow my suggestion here, however, and consequently adopt Hugh Everett’s interpretation of quantum mechanics, then saying that there are multiple future possibilities implies the existence of multiple timelines. And if there are multiple timelines, violations of Bell’s inequality no longer necessarily imply action at a distance.

Why not? Consider the twin experiment with the assumption of indeterminacy and multiple timelines. Suppose that from the very beginning, there are two copies of each twin. The first copy of the first twin has the plan of responding to the three questions with “yes/yes/yes.” Likewise, the first copy of the second twin has the plan of responding to the three questions with, “yes/yes/yes.” In contrast, the second copy of each twin has the plan of responding with “no/no/no.”

Now we have four twins but the experimenter only sees two. So which ones does he see? There is nothing impossible about the following “rule”: if the twins are asked different questions, the experimenter sees the first copy of one of the twins, and the second copy of the other twin. Meanwhile, if the twins are asked the same question, the experimenter sees either the first copy of each twin, or the second copy of each twin. It is easy to see that if this is the case, the experimenter will see the twins agree, when they are asked the same question, and will see them disagree when they are asked different questions (thus agreeing less than 1/3 of the time in that situation.)

“Wait,” you will say. “If multiple timelines is just a way of describing a situation with indeterminism, and indeterminism is not enough to avoid action at a distance, how is it possible for multiple timelines to give a way out?”

From the beginning, the apparent “impossibility” of the outcome was a statistical impossibility, not a logical impossibility. Naturally this had to be the case, since if it were a logical impossibility, we could not have coherently described the actual outcomes. Thus we might imagine that David Hume would give this answer:

The twins are responding randomly to each question. By pure chance, they happened to agree the times they were asked the same question, and by pure chance they violated Bell’s inequality when they were asked different questions.

Since this was all a matter of pure chance, of course, if you do the experiment again tomorrow, it will turn out that all of the answers are random and they will agree and disagree 50% of the time on all questions.

And this answer is logically possible, but false. This account does not explain the correlation, but simply ignores it. In a similar way, the reason why indeterministic theories without action at a distance, but described as having a single timeline, cannot explain the results is that in order to explain the correlation, the outcomes of both sides need to be selected together, so to speak. But “without action at a distance” in this context simply means that they are not selected together. This makes the outcome statistically impossible.

In our multiple timelines version, in contrast, our “rule” above in effect selected the outcomes together. In other words, the guideline we gave regarding which pairs of twins the experimenter would meet, had the same effect as action at a distance.

How is all this an explanation? The point is that the particular way that timelines spread out when they come into contact with other things, in the version with multiple timelines, exactly corresponds to action at a distance, in the version without them. An indeterministic theory represented as having a single timeline and no action at a distance could be directly translated into a version with multiple timelines; but if we did that, this particular multiple timeline version would not have the rule that produces the correct outcomes. And on the other hand, if we start with the multiple timeline version that does have the rule, and translate it into a single timeline account, it will have action at a distance.

What does all this say about Einstein’s opinion about locality? Was he right, or was he wrong?

We might simply say that he was wrong, insofar as the actual situation can in fact be described as including action at a distance, even if it is not necessary to describe it in this way, since we can describe it with multiple timelines and without action at a distance. But to the degree that this suggests that Einstein made two mistakes, one about determinism and one about action at a distance, I think this is wrong. There was only one mistake, and it was the one about determinism. The fact is that as soon you speak of indeterminism at all, it becomes possible to speak of the world as having multiple timelines. So the question at that point is whether this is the “natural” description of the situation, where the natural description more or less means the best way to understand things, in which case the possibility of “action at a distance” is not an additional mistake on Einstein’s part, but rather it is an artifact of describing the situation as though there were only a single timeline.

You might say that there cannot be a better or worse way to understand things if two accounts are objectively equivalent. But this is wrong. Thus for example in general relativity it is probably possible to give an account where the earth has no daily rotation, and the universe is spinning around it every 24 hours. And this account is objectively equivalent to the usual account where the earth is spinning; exactly the same situation is being described, and nothing different is being asserted. And yet this account is weird in many ways, and makes it very hard to understand the universe. The far better and “natural” description is that the earth is spinning. Note, however, that this is an overall result; just looking out the window, you might have thought that saying that the universe is spinning is more natural. (Notice, however, that an even more natural account would be that neither the earth nor the universe is moving; it is only later in the day that you begin to figure out that one of them is moving.)

In a similar way, a single timeline account is originally more natural in the way a Ptolemaic account is more natural when you look out the window. But I would argue that in a similar way, the multiple timeline account, without action at a distance, is ultimately the more natural one. The basic reason for this is that there is no Newtonian Absolute Time. The consequence is that if we speak of “future possibilities,” they cannot be future possibilities for the entire universe at once. They will be fairly localized future possibilities: e.g. there might be more than one possible text for the ending to this blog post, which has not yet been written, and those possibilities are originally possibilities for what happens here in this room, not for the rest of the universe. These future alternatives will naturally result in future possibilities for other parts of the world, but this will happen “slowly,” so to speak (namely if one wishes to speak of the speed of light as slow!) This fits well with the idea of multiple timelines, since there will have to be some process where these multiple timelines come into contact with the rest of the world, much as with our “rule” in the twin experiment. On the other hand, it does not fit so well with a single timeline account of future possibilities, since one is forced (by the terms of the account) to imagine that when a choice among possibilities is made, it is made for the entire universe at once, which appears to require Newton’s Absolute Time.

This suggests that Einstein was basically right about action at a distance, and wrong about determinism. But the intuition that motivated him to embrace both positions, namely that the universe should be intelligible, was sound.

Employer and Employee Model of Human Psychology

This post builds on the ideas in the series of posts on predictive processing and the followup posts, and also on those relating truth and expectation. Consequently the current post will likely not make much sense to those who have not read the earlier content, or to those that read it but mainly disagreed.

We set out the model by positing three members of the “company” that constitutes a human being:

The CEO. This is the predictive engine in the predictive processing model.

The Vice President. In the same model, this is the force of the historical element in the human being, which we used to respond to the “darkened room” problem. Thus for example the Vice President is responsible for the fact that someone is likely to eat soon, regardless of what they believe about this. Likewise, it is responsible for the pursuit of sex, the desire for respect and friendship, and so on. In general it is responsible for behaviors that would have been historically chosen and preserved by natural selection.

The Employee. This is the conscious person who has beliefs and goals and free will and is reflectively aware of these things. In other words, this is you, at least in a fairly ordinary way of thinking of yourself. Obviously, in another way you are composed from all of them.

Why have we arranged things in this way? Descartes, for example, would almost certainly disagree violently with this model. The conscious person, according to him, would surely be the CEO, and not an employee. And what is responsible for the relationship between the CEO and the Vice President? Let us start with this point first, before we discuss the Employee. We make the predictive engine the CEO because in some sense this engine is responsible for everything that a human being does, including the behaviors preserved by natural selection. On the other hand, the instinctive behaviors of natural selection are not responsible for everything, but they can affect the course of things enough that it is useful for the predictive engine to take them into account. Thus for example in the post on sex and minimizing uncertainty, we explained why the predictive engine will aim for situations that include having sex and why this will make its predictions more confident. Thus, the Vice President advises certain behaviors, the CEO talks to the Vice President, and the CEO ends up deciding on a course of action, which ultimately may or may not be the one advised by the Vice President.

While neither the CEO nor the Vice President is a rational being, since in our model we place the rationality in the Employee, that does not mean they are stupid. In particular, the CEO is very good at what it does. Consider a role playing video game where you have a character that can die and then resume. When someone first starts to play the game, they may die frequently. After they are good at the game, they may die only rarely, perhaps once in many days or many weeks. Our CEO is in a similar situation, but it frequently goes 80 years or more without dying, on its very first attempt. It is extremely good at its game.

What are their goals? The CEO basically wants accurate predictions. In this sense, it has one unified goal. What exactly counts as more or less accurate here would be a scientific question that we probably cannot resolve by philosophical discussion. In fact, it is very possible that this would differ in different circumstances: in this sense, even though it has a unified goal, it might not be describable by a consistent utility function. And even if it can be described in that way, since the CEO is not rational, it does not (in itself) make plans to bring about correct predictions. Making good predictions is just what it does, as falling is what a rock does. There will be some qualifications on this, however, when we discuss how the members of the company relate to one another.

The Vice President has many goals: eating regularly, having sex, having and raising children, being respected and liked by others, and so on. And even more than in the case of the CEO, there is no reason for these desires to form a coherent set of preferences. Thus the Vice President might advise the pursuit of one goal, but then change its mind in the middle, for no apparent reason, because it is suddenly attracted by one of the other goals.

Overall, before the Employee is involved, human action is determined by a kind of negotiation between the CEO and the Vice President. The CEO, which wants good predictions, has no special interest in the goals of the Vice President, but it cooperates with them because when it cooperates its predictions tend to be better.

What about the Employee? This is the rational being, and it has abstract concepts which it uses as a formal copy of the world. Before I go on, let me insist clearly on one point. If the world is represented in a certain way in the Employee’s conceptual structure, that is the way the Employee thinks the world is. And since you are the Employee, that is the way you think the world actually is. The point is that once we start thinking this way, it is easy to say, “oh, this is just a model, it’s not meant to be the real thing.” But as I said here, it is not possible to separate the truth of statements from the way the world actually is: your thoughts are formulated in concepts, but they are thoughts about the way things are. Again, all statements are maps, and all statements are about the territory.

The CEO and the Vice President exist as soon a human being has a brain; in fact some aspects of the Vice President would exist even before that. But the Employee, insofar as it refers to something with rational and self-reflective knowledge, takes some time to develop. Conceptual knowledge of the world grows from experience: it doesn’t exist from the beginning. And the Employee represents goals in terms of its conceptual structure. This is just a way of saying that as a rational being, if you say you are pursuing a goal, you have to be able to describe that goal with the concepts that you have. Consequently you cannot do this until you have some concepts.

We are ready to address the question raised earlier. Why are you the Employee, and not the CEO? In the first place, the CEO got to the company first, as we saw above. Second, consider what the conscious person does when they decide to pursue a goal. There seems to be something incoherent about “choosing a goal” in the first place: you need a goal in order to decide which means will be a good means to choose. And yet, as I said here, people make such choices anyway. And the fact that you are the Employee, and not the CEO, is the explanation for this. If you were the CEO, there would indeed be no way to choose an end. That is why the actual CEO makes no such choice: its end is already determinate, namely good predictions. And you are hired to help out with this goal. Furthermore, as a rational being, you are smarter than the CEO and the Vice President, so to speak. So you are allowed to make complicated plans that they do not really understand, and they will often go along with these plans. Notably, this can happen in real life situations of employers and employees as well.

But take an example where you are choosing an end: suppose you ask, “What should I do with my life?” The same basic thing will happen if you ask, “What should I do today,” but the second question may be easier to answer if you have some answer to the first. What sorts of goals do you propose in answer to the first question, and what sort do you actually end up pursuing?

Note that there are constraints on the goals that you can propose. In the first place, you have to be able to describe the goal with the concepts you currently have: you cannot propose to seek a goal that you cannot describe. Second, the conceptual structure itself may rule out some goals, even if they can be described. For example, the idea of good is part of the structure, and if something is thought to be absolutely bad, the Employee will (generally) not consider proposing this as a goal. Likewise, the Employee may suppose that some things are impossible, and it will generally not propose these as goals.

What happens then is this: the Employee proposes some goal, and the CEO, after consultation with the Vice President, decides to accept or reject it, based on the CEO’s own goal of getting good predictions. This is why the Employee is an Employee: it is not the one ultimately in charge. Likewise, as was said, this is why the Employee seems to be doing something impossible, namely choosing goals. Steven Kaas makes a similar point,

You are not the king of your brain. You are the creepy guy standing next to the king going “a most judicious choice, sire”.

This is not quite the same thing, since in our model you do in fact make real decisions, including decisions about the end to be pursued. Nonetheless, the point about not being the one ultimately in charge is correct. David Hume also says something similar when he says, “Reason is, and ought only to be the slave of the passions, and can never pretend to any other office than to serve and obey them.” Hume’s position is not exactly right, and in fact seems an especially bad way of describing the situation, but the basic point that there is something, other than yourself in the ordinary sense, judging your proposed means and ends and deciding whether to accept them, is one that stands.

Sometimes the CEO will veto a proposal precisely because it very obviously leaves things vague and uncertain, which is contrary to its goal of having good predictions. I once spoke of the example that a person cannot directly choose to “write a paper.” In our present model, the Employee proposes “we’re going to write a paper now,” and the CEO responds, “That’s not a viable plan as it stands: we need more detail.”

While neither the CEO nor the Vice President is a rational being, the Vice President is especially irrational, because of the lack of unity among its goals. Both the CEO and the Employee would like to have a unified plan for one’s whole life: the CEO because this makes for good predictions, and the Employee because this is the way final causes work, because it helps to make sense of one’s life, and because “objectively good” seems to imply something which is at least consistent, which will never prefer A to B, B to C, and C to A. But the lack of unity among the Vice President’s goals means that it will always come to the CEO and object, if the person attempts to coherently pursue any goal. This will happen even if it originally accepts the proposal to seek a particular goal.

Consider this real life example from a relationship between an employer and employee:

 

Employer: Please construct a schedule for paying these bills.

Employee: [Constructs schedule.] Here it is.

Employer: Fine.

[Time passes, and the first bill comes due, according to the schedule.]

Employer: Why do we have to pay this bill now instead of later?

 

In a similar way, this sort of scenario is common in our model:

 

Vice President: Being fat makes us look bad. We need to stop being fat.

CEO: Ok, fine. Employee, please formulate a plan to stop us from being fat.

Employee: [Formulates a diet.] Here it is.

[Time passes, and the plan requires skipping a meal.]

Vice President: What is this crazy plan of not eating!?!

CEO: Fine, cancel the plan for now and we’ll get back to it tomorrow.

 

In the real life example, the behavior of the employer is frustrating and irritating to the employee because there is literally nothing they could have proposed that the employer would have found acceptable. In the same way, this sort of scenario in our model is frustrating to the Employee, the conscious person, because there is no consistent plan they could have proposed that would have been acceptable to the Vice President: either they would have objected to being fat, or they would have objected to not eating.

In later posts, we will fill in some details and continue to show how this model explains various aspects of human psychology. We will also answer various objections.

Necessity, Possibility, and Impossibility

I spoke here about various kinds of necessity, but did not explain the nature of necessity in general. And in the recent post on Hume’s idea of causality, it was not necessary to explain the nature of necessity, because the actual idea of causality does not include necessity. Thus for example a ball can break a window even if it would have been possible for someone to catch the ball, but the person did not do so.

Sometimes it is asked whether necessity implies possibility: if it is necessary that Tuesday follow Monday, it is possible for Tuesday to follow Monday? I am inclined (and I think most are inclined) to say yes, on the grounds that to say that something is not possible is normally understood to imply that the thing is impossible; thus if it is not possible for Tuesday to follow Monday, it is impossible. But this is largely a verbal question: regardless of how we answer this, the real point is that the necessary is the same kind of thing as the possible, except that possibilities are many while the necessary is one. And likewise, a count of zero for the same things implies impossibility. Thus there is something that we are counting: if we find none of them, we speak of an impossibility. If we find only one, we speak of one necessity. And if we find many, we speak of many possibilities.

What are we counting here? Let’s take an example. Horses can be white, or red, or brown, among other possibilities. So there are many possible colors for a horse. And on the other hand snow is always white (or so let us pretend.) So there is only one possible color for snow, and so snow is “necessarily” white. Meanwhile, air is always colorless (or so let us pretend.) So it is impossible for air to have a color. Based on this example, we propose that what we are counting is the number of forms that are suitable for a given matter. Someone might object that if we analyze the word “suitable” here it might involve some sort of circularity. This may well be the case; this is a common occurrence, as with desire and the good, and with virtue and happiness. Nonetheless, I think we will find it worthwhile to work with this definition, just as in those earlier cases.

 

Miracles and Anomalies: Or, Your Religion is False

In 2011 there was an apparent observation of neutrinos traveling faster than light. Wikipedia says of this, “Even before the mistake was discovered, the result was considered anomalous because speeds higher than that of light in a vacuum are generally thought to violate special relativity, a cornerstone of the modern understanding of physics for over a century.” In other words, most scientists did not take the result very seriously, even before any specific explanation was found. As I stated here, it is possible to push unreasonably far in this direction, in such a way that one will be reluctant to ever modify one’s current theories. But there is also something reasonable about this attitude.

Alexander Pruss explains why scientists tend to be skeptical of such anomalous results in this post on Bayesianism and anomaly:

One part of the problem of anomaly is this. If a well-established scientific theory seems to predict something contrary to what we observe, we tend to stick to the theory, with barely a change in credence, while being dubious of the auxiliary hypotheses. What, if anything, justifies this procedure?

Here’s my setup. We have a well-established scientific theory T and (conjoined) auxiliary hypotheses A, and T together with A uncontroversially entails the denial of some piece of observational evidence E which we uncontroversially have (“the anomaly”). The auxiliary hypotheses will typically include claims about the experimental setup, the calibration of equipment, the lack of further causal influences, mathematical claims about the derivation of not-E from T and the above, and maybe some final catch-all thesis like the material conditional that if T and all the other auxiliary hypotheses obtain, then E does not obtain.

For simplicity I will suppose that A and T are independent, though of course that simplifying assumption is rarely true.

Here’s a quick and intuitive thought. There is a region of probability space where the conjunction of T and A is false. That area is divided into three sub-regions:

  1. T is true and A is false
  2. T is false and A is true
  3. both are false.

The initial probabilities of the three regions are, respectively, 0.0999, 0.0009999 and 0.0001. We know we are in one of these three regions, and that’s all we now know. Most likely we are in the first one, and the probability that we are in that one given that we are in one of the three is around 0.99. So our credence in T has gone down from three nines (0.999) to two nines (0.99), but it’s still high, so we get to hold on to T.

Still, this answer isn’t optimistic. A move from 0.999 to 0.99 is actually an enormous decrease in confidence.

“This answer isn’t optimistic,” because in the case of the neutrinos, this analysis would imply that scientists should have instantly become ten times more willing to consider the possibility that the theory of special relativity is false. This is surely not what happened.

Pruss therefore presents an alternative calculation:

But there is a much more optimistic thought. Note that the above wasn’t a real Bayesian calculation, just a rough informal intuition. The tip-off is that I said nothing about the conditional probabilities of E on the relevant hypotheses, i.e., the “likelihoods”.

Now setup ensures:

  1. P(E|A ∧ T)=0.

What can we say about the other relevant likelihoods? Well, if some auxiliary hypothesis is false, then E is up for grabs. So, conservatively:

  1. P(E|∼A ∧ T)=0.5
  2. P(E|∼A ∧ ∼T)=0.5

But here is something that I think is really, really interesting. I think that in typical cases where T is a well-established scientific theory and A ∧ T entails the negation of E, the probability P(E|A ∧ ∼T) is still low.

The reason is that all the evidence that we have gathered for T even better confirms the hypothesis that T holds to a high degree of approximation in most cases. Thus, even if T is false, the typical predictions of T, assuming they have conservative error bounds, are likely to still be true. Newtonian physics is false, but even conditionally on its being false we take individual predictions of Newtonian physics to have a high probability. Thus, conservatively:

  1. P(E|A ∧ ∼T)=0.1

Very well, let’s put all our assumptions together, including the ones about A and T being independent and the values of P(A) and P(T). Here’s what we get:

  1. P(E|T)=P(E|A ∧ T)P(A|T)+P(E|∼A ∧ T)P(∼A|T)=0.05
  2. P(E|∼T)=P(E|A ∧ ∼T)P(A|∼T)+P(E|∼A ∧ ∼T)P(∼A|∼T) = 0.14.

Plugging this into Bayes’ theorem, we get P(T|E)=0.997. So our credence has crept down, but only a little: from 0.999 to 0.997. This is much more optimistic (and conservative) than the big move from 0.999 to 0.99 that the intuitive calculation predicted.

So, if I am right, at least one of the reasons why anomalies don’t do much damage to scientific theories is that when the scientific theory T is well-confirmed, the anomaly is not only surprising on the theory, but it is surprising on the denial of the theory—because the background includes the data that makes T “well-confirmed” and would make E surprising even if we knew that T was false.

To make the point without the mathematics (which in any case is only used to illustrate the point, since Pruss is choosing the specific values himself), if you have a theory which would make the anomaly probable, that theory would be strongly supported by the anomaly. But we already know that theories like that are false, because otherwise the anomaly would not be an anomaly. It would be normal and common. Thus all of the actually plausible theories still make the anomaly an improbable observation, and therefore these theories are only weakly supported by the observation of the anomaly. The result is that the new observation makes at most a minor difference to your previous opinion.

We can apply this analysis to the discussion of miracles. David Hume, in his discussion of miracles, seems to desire a conclusive proof against them which is unobtainable, and in this respect he is mistaken. But near the end of his discussion, he brings up the specific topic of religion and says that his argument applies to it in a special way:

Upon the whole, then, it appears, that no testimony for any kind of miracle has ever amounted to a probability, much less to a proof; and that, even supposing it amounted to a proof, it would be opposed by another proof; derived from the very nature of the fact, which it would endeavour to establish. It is experience only, which gives authority to human testimony; and it is the same experience, which assures us of the laws of nature. When, therefore, these two kinds of experience are contrary, we have nothing to do but subtract the one from the other, and embrace an opinion, either on one side or the other, with that assurance which arises from the remainder. But according to the principle here explained, this subtraction, with regard to all popular religions, amounts to an entire annihilation; and therefore we may establish it as a maxim, that no human testimony can have such force as to prove a miracle, and make it a just foundation for any such system of religion.

The idea seems to be something like this: contrary systems of religion put forth miracles in their support, so the supporting evidence for one religion is more or less balanced by the supporting evidence for the other. Likewise, the evidence is weakened even in itself by people’s propensity to lies and delusion in such matters (some of this discussion was quoted in the earlier post on Hume and miracles). But in addition to the fairly balanced evidence we have experience basically supporting the general idea that the miracles do not happen. This is not outweighed by anything in particular, and so it is the only thing that remains after the other evidence balances itself out of the equation. Hume goes on:

I beg the limitations here made may be remarked, when I say, that a miracle can never be proved, so as to be the foundation of a system of religion. For I own, that otherwise, there may possibly be miracles, or violations of the usual course of nature, of such a kind as to admit of proof from human testimony; though, perhaps, it will be impossible to find any such in all the records of history. Thus, suppose, all authors, in all languages, agree, that, from the first of January, 1600, there was a total darkness over the whole earth for eight days: suppose that the tradition of this extraordinary event is still strong and lively among the people: that all travellers, who return from foreign countries, bring us accounts of the same tradition, without the least variation or contradiction: it is evident, that our present philosophers, instead of doubting the fact, ought to receive it as certain, and ought to search for the causes whence it might be derived. The decay, corruption, and dissolution of nature, is an event rendered probable by so many analogies, that any phenomenon, which seems to have a tendency towards that catastrophe, comes within the reach of human testimony, if that testimony be very extensive and uniform.

But suppose, that all the historians who treat of England, should agree, that, on the first of January, 1600, Queen Elizabeth died; that both before and after her death she was seen by her physicians and the whole court, as is usual with persons of her rank; that her successor was acknowledged and proclaimed by the parliament; and that, after being interred a month, she again appeared, resumed the throne, and governed England for three years: I must confess that I should be surprised at the concurrence of so many odd circumstances, but should not have the least inclination to believe so miraculous an event. I should not doubt of her pretended death, and of those other public circumstances that followed it: I should only assert it to have been pretended, and that it neither was, nor possibly could be real. You would in vain object to me the difficulty, and almost impossibility of deceiving the world in an affair of such consequence; the wisdom and solid judgment of that renowned queen; with the little or no advantage which she could reap from so poor an artifice: all this might astonish me; but I would still reply, that the knavery and folly of men are such common phenomena, that I should rather believe the most extraordinary events to arise from their concurrence, than admit of so signal a violation of the laws of nature.

But should this miracle be ascribed to any new system of religion; men, in all ages, have been so much imposed on by ridiculous stories of that kind, that this very circumstance would be a full proof of a cheat, and sufficient, with all men of sense, not only to make them reject the fact, but even reject it without farther examination. Though the Being to whom the miracle is ascribed, be, in this case, Almighty, it does not, upon that account, become a whit more probable; since it is impossible for us to know the attributes or actions of such a Being, otherwise than from the experience which we have of his productions, in the usual course of nature. This still reduces us to past observation, and obliges us to compare the instances of the violation of truth in the testimony of men, with those of the violation of the laws of nature by miracles, in order to judge which of them is most likely and probable. As the violations of truth are more common in the testimony concerning religious miracles, than in that concerning any other matter of fact; this must diminish very much the authority of the former testimony, and make us form a general resolution, never to lend any attention to it, with whatever specious pretence it may be covered.

Notice how “unfair” this seems to religion, so to speak. What is the difference between the eight days of darkness, which Hume would accept, under those conditions, and the resurrection of the queen of England, which he would not? Hume’s reaction to the two situations is more consistent than first appears. Hume would accept the historical accounts about England in the same way that he would accept the accounts about the eight days of darkness. The difference is in how he would explain the accounts. He says of the darkness, “It is evident, that our present philosophers, instead of doubting the fact, ought to receive it as certain, and ought to search for the causes whence it might be derived.” Likewise, he would accept the historical accounts as certain insofar as they say the a burial ceremony took place, the queen was absent from public life, and so on. But he would not accept that the queen was dead and came back to life. Why? The “search for the causes” seems to explain this. It is plausible to Hume that causes of eight days of darkness might be found, but not plausible to him that causes of a resurrection might be found. He hints at this in the words, “The decay, corruption, and dissolution of nature, is an event rendered probable by so many analogies,” while in contrast a resurrection would be “so signal a violation of the laws of nature.”

It is clear that Hume excludes certain miracles, such as resurrection, from the possibility of being established by the evidence of testimony. But he makes the additional point that even if he did not exclude them, he would not find it reasonable to establish a “system of religion” on such testimony, given that “violations of truth are more common in the testimony concerning religious miracles, than in that concerning any other matter of fact.”

It is hard to argue with the claim that “violations of truth” are especially common in testimony about miracles. But does any of this justify Hume’s negative attitude to miracles as establishing “systems of religion,” or is this all just prejudice?  There might well be a good deal of prejudice involved here in his opinions. Nonetheless, Alexander Pruss’s discussion of anomaly allows one to formalize Hume’s idea here as actual insight as well.

One way to look at truth in religion is to look at it as a way of life or as membership in a community. And in this way, asking whether miracles can establish a system of religion is just asking whether a person can be moved to a way of life or to join a community through such things. And clearly this is possible, and often happens. But another way to consider truth in religion is to look at a doctrinal system as a set of claims about how the world is. Looked at in this way, we should look at a doctrinal system as presenting a proposed larger context of our place in the world, one that we would be unaware of without the religion. This implies that one should have a prior probability (namely prior to consideration of arguments in its favor) strongly against the system considered as such, for reasons very much like the reasons we should have a prior probability strongly against Ron Conte’s predictions.

We can thus apply Alexander Pruss’s framework. Let us take Mormonism as the “system of religion” in question. Then taken as a set of claims about the world, our initial probability would be that it is very unlikely that the world is set up this way. Then let us take a purported miracle establishing this system: Joseph Smith finds his golden plates. In principle, if this cashed out in a certain way, it could actually establish his system. But it doesn’t cash out that way. We know very little about the plates, the circumstances of their discovery (if there was any), and their actual content. Instead, what we are left with is an anomaly: something unusual happened, and it might be able to be described as “finding golden plates,” but that’s pretty much all we know.

Then we have the theory, T, which has a high prior probability: Mormonism is almost certainly false. We have the observation : Joseph Smith discovered his golden plates (in one sense or another.) And we have the auxiliary hypotheses which imply that he could not have discovered the plates if Mormonism is false. The Bayesian updates in Pruss’s scheme imply that our conclusion is this: Mormonism is almost certainly false, and there is almost certainly an error in the auxiliary hypotheses that imply he could not have discovered them if it were false.

Thus Hume’s attitude is roughly justified: he should not change his opinion about religious systems in any significant way based on testimony about miracles.

To make you feel better, this does not prove that your religion is false. It just nearly proves that. In particular, this does not take into an account an update based on the fact that “many people accept this set of claims.” This is a different fact, and it is not an anomaly. If you update on this fact and end up with a non-trivial probability that your set of claims is true, testimony about miracles might well strengthen this into conviction.

I will respond to one particular objection, however. Some will take this argument to be stubborn and wicked, because it seems to imply that people shouldn’t be “convinced even if someone rises from the dead.” And this does in fact follow, more or less. An anomalous occurrence in most cases will have a perfectly ordinary explanation in terms of things that are already a part of our ordinary understanding of the world, without having to add some larger context. For example, suppose you heard your fan (as a piece of furniture, not as a person) talking to you. You might suppose that you were hallucinating. But suppose it turns out that you are definitely not hallucinating. Should you conclude that there is some special source from outside the normal world that is communicating with you? No: the fan scenario can happen, and it turns out to have a perfectly everyday explanation. We might agree with Hume that it would be much more implausible that a resurrection would have an everyday explanation. Nonetheless, even if we end up concluding to the existence of some larger context, and that the miracle has no such everyday explanation, there is no good reason for it to be such and such a specific system of doctrine. Consider again Ron Conte’s predictions for the future. Most likely the things that happen between now and 2040, and even the things that happen in the 2400s, are likely to be perfectly ordinary (although the things in the 2400s might differ from current events in fairly radical ways). But even if they are not, and even if apocalyptic, miraculous occurrences are common in those days, this does not raise the probability of Conte’s specific predictions above any trivial level. In the same way, the anomalous occurrences involved in the accounts of miracles will not lend any significant probability to a religious system.

The objection here is that this seems unfair to God, so to speak. What if God wanted to reveal something to the world? What could he do, besides work miracles? I won’t propose a specific answer to this, because I am not God. But I will illustrate the situation with a little story to show that there is nothing unfair to God about it.

Suppose human beings created an artificial intelligence and raised it in a simulated environment. Wanting things to work themselves out “naturally,” so to speak, because it would be less work, and because it would probably be necessary to the learning process, they institute “natural laws” in the simulated world which are followed in an exceptionless way. Once the AI is “grown up”, so to speak, they decide to start communicating with it. In the AI’s world, this will surely show up as some kind of miracle: something will happen that was utterly unpredictable to it, and which is completely inconsistent with the natural laws as it knew them.

Will the AI be forced by the reasoning of this post to ignore the communication? Well, that depends on what exactly occurs and how. At the end of his post, Pruss discusses situations where anomalous occurrences should change your mind:

Note that this argument works less well if the anomalous case is significantly different from the cases that went into the confirmation of T. In such a case, there might be much less reason to think E won’t occur if T is false. And that means that anomalies are more powerful as evidence against a theory the more distant they are from the situations we explored before when we were confirming T. This, I think, matches our intuitions: We would put almost no weight in someone finding an anomaly in the course of an undergraduate physics lab—not just because an undergraduate student is likely doing it (it could be the professor testing the equipment, though), but because this is ground well-gone over, where we expect the theory’s predictions to hold even if the theory is false. But if new observations of the center of our galaxy don’t fit our theory, that is much more compelling—in a regime so different from many of our previous observations, we might well expect that things would be different if our theory were false.

And this helps with the second half of the problem of anomaly: How do we keep from holding on to T too long in the light of contrary evidence, how do we allow anomalies to have a rightful place in undermining theories? The answer is: To undermine a theory effectively, we need anomalies that occur in situations significantly different from those that have already been explored.

If the AI finds itself in an entirely new situation, e.g. rather than hearing an obscure voice from a fan, it is consistently able to talk to the newly discovered occupant of the world on a regular basis, it will have no trouble realizing that its situation has changed, and no difficulty concluding that it is receiving communication from its author. This does, sort of, give one particular method that could be used to communicate a revelation. But there might well be many others.

Our objector will continue. This is still not fair. Now you are saying that God could give a revelation but that if he did, the world would be very different from the actual world. But what if he wanted to give a revelation in the actual world, without it being any different from the way it is? How could he convince you in that case?

Let me respond with an analogy. What if the sky were actually red like the sky of Mars, but looked blue like it is? What would convince you that it was red? The fact that there is no way to convince you that it is red in our actual situation means you are unfairly prejudiced against the redness of the sky.

In other words, indeed, I am unwilling to be convinced that the sky is red except in situations where it is actually red, and those situations are quite different from our actual situation. And indeed, I am unwilling to be convinced of a revelation except in situations where there is actually a revelation, and those are quite different from our actual situation.

Explaining Causality

A reader asks about a previous post:

a) Per Hume and his defenders, we can’t really observe causation. All we can see is event A in spacetime, then event B in spacetime. We have no reason to posit that event A and event B are, say, chairs or dogs; we can stick with a sea of observed events, and claim that the world is “nothing more” but a huge set of random 4D events. While I can see that giving such an account restores formal causation, it doesn’t salvage efficient causation, and doesn’t even help final causation. How could you move there from our “normal” view?

b) You mention that the opinion “laws are observed patterns” is not a dominant view; though, even though I’d like to sit with the majority, I can’t go further than a). I can’t build an argument for this, and fail to see how Aristotle put his four causes correctly. I always end up gnawing on an objection, like “causation is only in the mind” or similar. Help?

It is not my view that the world is a huge set of random 4D events. This is perhaps the view of Atheism and the City, but it is a mistaken one. The blogger is not mistaken in thinking that there are problems with presentism, but they cannot be solved by adopting an eternalist view. Rather, these two positions constitute a Kantian dichotomy, and as usual, both positions are false. For now, however, I will leave this to the consideration of the reader. It is not necessary to establish this to respond to the questions above.

Consider the idea that “we can’t really observe causation.” As I noted here, it does not make sense to say that we cannot observe causation unless we already understand what causation is. If the word were meaningless to us, we would have no argument that we don’t observe it; it is only because we do understand the idea of causation that we can even suggest that it might be difficult to observe. And if we do have the idea, we got the idea from somewhere, and that could only have been… from observation, of course, since we don’t have anything else to get ideas from.

Let us untie the knot. I explained causality in general in this way:

“Cause” and “effect” simply signify that the cause is the origin of the effect, and that the effect is from the cause, together with the idea that when we understand the cause, we understand the explanation for the effect. Thus “cause” adds to “origin” a certain relationship with the understanding; this is why Aristotle says that we do not think we understand a thing until we know its cause, or “why” it is. We do not understand a thing until we know its explanation.

Note that there is something “in the mind” about causality. Saying to oneself, “Aha! So that’s why that happened!” is a mental event. And we can also see how it is possible to observe causality: we can observe that one thing is from another, i.e. that a ball breaks a window, and we can also observe that knowing this provides us a somewhat satisfactory answer to the question, “Why is the window broken?”, namely, “Because it was hit by a ball.”

Someone (e.g. Atheism and the City) might object that we also cannot observe one thing coming from another. We just observe the two things, and they are, as Hume says, “loose and separate.” Once again, however, we would have no idea of “from” unless we got it from observing things. In the same early post quoted above, I explained the idea of origin, i.e. that one thing is from another:

Something first is said to be the beginning, principle, or origin of the second, and the second is said to be from the first. This simply signifies the relationship already described in the last post, together with an emphasis on the fact that the first comes before the second by “consequence of being”, in the way described.

“The relationship already described in the last post” is that of before and after. In other words, wherever we have any kind of order at all, we have one thing from another. And we observe order, even when we simply see one thing after another, and thus we also observe things coming from other things.

What about efficient causality? If we adopt the explanation above, asserting the existence of efficient causality is nothing more or less than asserting that things sometimes make other things happen, like balls breaking windows, and that knowing about this is a way for us to understand the effects (e.g. broken windows.)

Similarly, denying the existence of efficient causality means either denying that anything ever makes anything else happen, or denying that knowing about this makes us understand anything, even in a minor way. Atheism and the City seems to want to deny that anything ever makes anything else happen:

Most importantly, my view technically is not that causality doesn’t exist, it’s that causality doesn’t exist in the way we typically think it does. That is, my view of causality is completely different from the general every day notion of causality most people have. The naive assumption one often gets when hearing my view is that I’m saying cause and effect relationships don’t exist at all, such that if you threw a brick at glass window it wouldn’t shatter, or if you jumped in front of a speeding train you wouldn’t get smashed to death by it. That’s not what my view says at all.

On my view of causality, if you threw a brick at a glass window it would shatter, if you jumped in front of a speeding train you’d be smashed to death by it. The difference between my view of causality vs the typical view is that on my view causes do not bring their effects into existence in the sense of true ontological becoming.

I am going to leave aside the discussion of “true ontological becoming,” because it is a distraction from the real issue. Does Atheism and the City deny that things ever make other things happen? It appears so, but consider that “things sometimes make other things happen” is just a more general description of the very same situations as descriptions like, “Balls sometimes break windows.” So if you want to deny that things make other things happen, you should also deny that balls break windows. Now our blogger perhaps wants to say, “I don’t deny that balls break windows in the everyday sense, but they don’t break them in a true ontological sense.” Again, I will simply point in the right direction here. Asserting the existence of efficient causes does not describe a supposedly “truly true” ontology; it is simply a more general description of a situation where balls sometimes break windows.

We can make a useful comparison here between understanding causality, and understanding desire and the good. The knowledge of desire begins with a fairly direct experience, that of feeling the desire, often even as physical sensation. In the same way, we have a direct experience of “understanding something,” namely the feeling of going, “Ah, got it! That’s why this is, this is how it is.” And just as we explain the fact of our desire by saying that the good is responsible for it, we explain the fact of our understanding by saying that the apprehension of causes is responsible. And just as being and good are convertible, so that goodness is not some extra “ontological” thing, so also cause and origin are convertible. But something has to have a certain relationship with us to be good for us; eating food is good for us while eating rocks is not. In a similar way, origins need to have a specific relationship with us in order to provide an understanding of causality, as I said in the post where these questions came up.

Does this mean that “causation is only in the mind”? Not really, any more than the analogous account implies that goodness is only in the mind. An aspect of goodness is in the mind, namely insofar as we distinguish it from being in general, but the thing itself is real, namely the very being of things. And likewise an aspect of causality is in the mind, namely the fact that it explains something to us, but the thing itself is real, namely the relationships of origin in things.