Technical Discussion and Philosophical Progress

In The Structure of Scientific Revolutions (p. 19-21), Thomas Kuhn remarks on the tendency of sciences to acquire a technical vocabulary and manner of discussion:

We shall be examining the nature of this highly directed or paradigm-based research in the next section, but must first note briefly how the emergence of a paradigm affects the structure of the group that practices the field. When, in the development of a natural science, an individual or group first produces a synthesis able to attract most of the next generation’s practitioners, the older schools gradually disappear. In part their disappearance is caused by their members’ conversion to the new paradigm. But there are always some men who cling to one or another of the older views, and they are simply read out of the profession, which thereafter ignores their work. The new paradigm implies a new and more rigid definition of the field. Those unwilling or unable to accommodate their work to it must proceed in isolation or attach themselves to some other group. Historically, they have often simply stayed in the departments of philosophy from which so many of the special sciences have been spawned. As these indications hint, it is sometimes just its reception of a paradigm that transforms a group previously interested merely in the study of nature into a profession or, at least, a discipline. In the sciences (though not in fields like medicine, technology, and law, of which the principal raison d’être is an external social need), the formation of specialized journals, the foundation of specialists’ societies, and the claim for a special place in the curriculum have usually been associated with a group’s first reception of a single paradigm. At least this was the case between the time, a century and a half ago, when the institutional pattern of scientific specialization first developed and the very recent time when the paraphernalia of specialization acquired a prestige of their own.

The more rigid definition of the scientific group has other consequences. When the individual scientist can take a paradigm for granted, he need no longer, in his major works, attempt to build his field anew, starting from first principles and justifying the use of each concept introduced. That can be left to the writer of textbooks. Given a textbook, however, the creative scientist can begin his research where it leaves off and thus concentrate exclusively upon the subtlest and most esoteric aspects of the natural phenomena that concern his group. And as he does this, his research communiqués will begin to change in ways whose evolution has been too little studied but whose modern end products are obvious to all and oppressive to many. No longer will his researches usually be embodied in books addressed, like Franklin’s Experiments . . . on Electricity or Darwin’s Origin of Species, to anyone who might be interested in the subject matter of the field. Instead they will usually appear as brief articles addressed only to professional colleagues, the men whose knowledge of a shared paradigm can be assumed and who prove to be the only ones able to read the papers addressed to them.

Today in the sciences, books are usually either texts or retrospective reflections upon one aspect or another of the scientific life. The scientist who writes one is more likely to find his professional reputation impaired than enhanced. Only in the earlier, pre-paradigm, stages of the development of the various sciences did the book ordinarily possess the same relation to professional achievement that it still retains in other creative fields. And only in those fields that still retain the book, with or without the article, as a vehicle for research communication are the lines of professionalization still so loosely drawn that the layman may hope to follow progress by reading the practitioners’ original reports. Both in mathematics and astronomy, research reports had ceased already in antiquity to be intelligible to a generally educated audience. In dynamics, research became similarly esoteric in the later Middle Ages, and it recaptured general intelligibility only briefly during the early seventeenth century when a new paradigm replaced the one that had guided medieval research. Electrical research began to require translation for the layman before the end of the eighteenth century, and most other fields of physical science ceased to be generally accessible in the nineteenth. During the same two centuries similar transitions can be isolated in the various parts of the biological sciences. In parts of the social sciences they may well be occurring today. Although it has become customary, and is surely proper, to deplore the widening gulf that separates the professional scientist from his colleagues in other fields, too little attention is paid to the essential relationship between that gulf and the mechanisms intrinsic to scientific advance.

As Kuhn says, this tendency has very well known results. Consider the papers constantly being published at arxiv.org, for example. If you are not familiar with the science in question, you will likely not be able to understand even the title, let alone the summary or the content. Many or most of the words will be meaningless to you, and even if they are not, their combinations will be.

It is also not difficult to see why this happens, and why it must happen. Everything we understand, we understand through form, which is a network of relationships. Thus if particular investigators wish to go into something in greater detail, these relationships will become more and more remote from the ordinary knowledge accessible to everyone. “Just say it in simple words” will become literally impossible, in the sense that explaining the “simple” statement will involve explaining a huge number of relationships that by default a person would have no knowledge of. That is the purpose, as Kuhn notes, of textbooks, namely to form connections between everyday knowledge and the more complex relationships studied in particular fields.

In Chapter XIII, Kuhn relates this sort of development with the word “science” and progress:

The preceding pages have carried my schematic description of scientific development as far as it can go in this essay. Nevertheless, they cannot quite provide a conclusion. If this description has at all caught the essential structure of a science’s continuing evolution, it will simultaneously have posed a special problem: Why should the enterprise sketched above move steadily ahead in ways that, say, art, political theory, or philosophy does not? Why is progress a perquisite reserved almost exclusively for the activities we call science? The most usual answers to that question have been denied in the body of this essay. We must conclude it by asking whether substitutes can be found.

Notice immediately that part of the question is entirely semantic. To a very great extent the term ‘science’ is reserved for fields that do progress in obvious ways. Nowhere does this show more clearly than in the recurrent debates about whether one or another of the contemporary social sciences is really a science. These debates have parallels in the pre-paradigm periods of fields that are today unhesitatingly labeled science. Their ostensible issue throughout is a definition of that vexing term. Men argue that psychology, for example, is a science because it possesses such and such characteristics. Others counter that those characteristics are either unnecessary or not sufficient to make a field a science. Often great energy is invested, great passion aroused, and the outsider is at a loss to know why. Can very much depend upon a definition of ‘science’? Can a definition tell a man whether he is a scientist or not? If so, why do not natural scientists or artists worry about the definition of the term? Inevitably one suspects that the issue is more fundamental. Probably questions like the following are really being asked: Why does my field fail to move ahead in the way that, say, physics does? What changes in technique or method or ideology would enable it to do so? These are not, however, questions that could respond to an agreement on definition. Furthermore, if precedent from the natural sciences serves, they will cease to be a source of concern not when a definition is found, but when the groups that now doubt their own status achieve consensus about their past and present accomplishments. It may, for example, be significant that economists argue less about whether their field is a science than do practitioners of some other fields of social science. Is that because economists know what science is? Or is it rather economics about which they agree?

The last point is telling. There is significantly more consensus among economists than among other sorts of social science, and consequently less worry about whether their field is scientific or not. The difference, then, is a difference of how much agreement is found. There is not necessarily any difference with respect to the kind of increasingly detailed thought that results in increasingly technical discussion. Kuhn remarks:

The theologian who articulates dogma or the philosopher who refines the Kantian imperatives contributes to progress, if only to that of the group that shares his premises. No creative school recognizes a category of work that is, on the one hand, a creative success, but is not, on the other, an addition to the collective achievement of the group. If we doubt, as many do, that nonscientific fields make progress, that cannot be because individual schools make none. Rather, it must be because there are always competing schools, each of which constantly questions the very foundations of the others. The man who argues that philosophy, for example, has made no progress emphasizes that there are still Aristotelians, not that Aristotelianism has failed to progress.

In this sense, if a particular school believes they possess the general truth about some matter (here theology or philosophy), they will quite naturally begin to discuss it in greater detail and in ways which are mainly intelligible to students of that school, just as happens in other technical fields. The field is only failing to progress in the sense that there are other large communities making contrasting claims, while we begin to use the term “science” and to speak of progress when one school completely dominates the field, and to a first approximation even people who know nothing about it assume that the particular school has things basically right.

What does this imply about progress in philosophy?

1. There is progress in the knowledge of topics that were once considered “philosophy,” but when we get to this point, we usually begin to use the name of a particular science, and with good reason, since technical specialization arises in the manner discussed above. Tyler Cowen discusses this sort of thing here.

2. Areas in which there doesn’t seem to be such progress, are probably most often areas where human knowledge remains at an early stage of development; it is precisely at such early stages that discussion does not have a technical character and when it can generally be understood by ordinary people without a specialized education. I pointed out that Aristotle was mistaken to assume that the sciences in general were fully developed. We would be equally mistaken to make such an assumption at the present times. As Kuhn notes, astronomy and mathematics achieved a “scientific” stage centuries before geology and biology did the same, and these long before economics and the like. The conclusion that one should draw is that metaphysics is hard, not that it is impossible or meaningless.

3. Even now, particular philosophical schools or individuals can make progress even without such consensus. This is evidently true if their overall position is correct or more correct than that of others, but it remains true even if their overall position is more wrong than that of other schools. Naturally, in the latter situation, they will not advance beyond the better position of other schools, but they will advance.

4. One who wishes to progress philosophically cannot avoid the tendency to technical specialization, even as an individual. This can be rather problematic for bloggers and people engaging in similar projects. John Nerst describes this problem:

The more I think about this issue the more unsolvable it seems to become. Loyal readers of a publication won’t be satisfied by having the same points reiterated again and again. News media get around this by focusing on, well, news. News are events, you can describe them and react to them for a while until they’re no longer news. Publications that aim to be more analytical and focus on discussing ideas, frameworks, slow processes and large-scale narratives instead of events have a more difficult task because their subject matter doesn’t change quickly enough for it to be possible to churn out new material every day without repeating yourself[2].

Unless you start building upwards. Instead of laying out stone after stone on the ground you put one on top of another, and then one on top of two others laying next to each other, and then one on top of all that, making a single three-level structure. In practice this means writing new material that builds on what came before, taking ideas further and further towards greater complexity, nuance and sophistication. This is what academia does when working correctly.

Mass media (including the more analytical outlets) do it very little and it’s obvious why: it’s too demanding[3]. If an article references six other things you need to have read to fully understand it you’re going to have a lot of difficulty attracting new readers.

Some of his conclusions:

I think that’s the real reason I don’t try to pitch more writing to various online publications. In my summary of 2018 I said it was because I thought my writing was to “too idiosyncratic, abstract and personal to fit in anywhere but my own blog”. Now I think the main reason is that I don’t so much want to take part in public debate or make myself a career. I want to explore ideas that lie at the edge of my own thinking. To do that I must assume that a reader knows broadly the same things I know and I’m just not that interested in writing about things where I can’t do that[9]. I want to follow my thoughts to for me new and unknown places — and import whatever packages I need to do it. This style isn’t compatible with the expectation that a piece will be able to stand on its own and deliver a single recognizable (and defensible) point[10].

The downside is of course obscurity. To achieve both relevance in the wider world and to build on other ideas enough to reach for the sky you need extraordinary success — so extraordinary that you’re essentially pulling the rest of the world along with you.

Obscurity is certainly one result. Another (relevant at least from the VP’s point of view) is disrespect. Scientists are generally respected despite the general incomprehensibility of their writing, on account of the absence of opposing schools. This lack leads people to assume that their arguments must be mostly right, even though they cannot understand them themselves. This can actually lead to an “Emperor has No Clothes” situation, where a scientist publishes something basically crazy, but others, even in his field, are reluctant to say so because they might appear to be the ones who are ignorant. As an example, consider Joy Christian’s “Disproof of Bell’s Theorem.” After reading this text, Scott Aaronson comments:

In response to my post criticizing his “disproof” of Bell’s Theorem, Joy Christian taunted me that “all I knew was words.”  By this, he meant that my criticisms were entirely based on circumstantial evidence, for example that (1) Joy clearly didn’t understand what the word “theorem” even meant, (2) every other sentence he uttered contained howling misconceptions, (3) his papers were written in an obscure, “crackpot” way, and (4) several people had written very clear papers pointing out mathematical errors in his work, to which Joy had responded only with bluster.  But I hadn’t actually studied Joy’s “work” at a technical level.  Well, yesterday I finally did, and I confess that I was astonished by what I found.  Before, I’d actually given Joy some tiny benefit of the doubt—possibly misled by the length and semi-respectful tone of the papers refuting his claims.  I had assumed that Joy’s errors, though ultimately trivial (how could they not be, when he’s claiming to contradict such a well-understood fact provable with a few lines of arithmetic?), would nevertheless be artfully concealed, and would require some expertise in geometric algebra to spot.  I’d also assumed that of course Joy would have some well-defined hidden-variable model that reproduced the quantum-mechanical predictions for the Bell/CHSH experiment (how could he not?), and that the “only” problem would be that, due to cleverly-hidden mistakes, his model would be subtly nonlocal.

What I actually found was a thousand times worse: closer to the stuff freshmen scrawl on an exam when they have no clue what they’re talking about but are hoping for a few pity points.  It’s so bad that I don’t understand how even Joy’s fellow crackpots haven’t laughed this off the stage.  Look, Joy has a hidden variable λ, which is either 1 or -1 uniformly at random.  He also has a measurement choice a of Alice, and a measurement choice b of Bob.  He then defines Alice and Bob’s measurement outcomes A and B via the following functions:

A(a,λ) = something complicated = (as Joy correctly observes) λ

B(b,λ) = something complicated = (as Joy correctly observes) -λ

I shit you not.  A(a,λ) = λ, and B(b,λ) = -λ.  Neither A nor B has any dependence on the choices of measurement a and b, and the complicated definitions that he gives for them turn out to be completely superfluous.  No matter what measurements are made, A and B are always perfectly anticorrelated with each other.

You might wonder: what could lead anyone—no matter how deluded—even to think such a thing could violate the Bell/CHSH inequalities?

“Give opposite answers in all cases” is in fact entirely irrelevant to Bell’s inequality. Thus the rest of Joy’s paper has no bearing whatsoever on the issue: it is essentially meaningless nonsense. Aaronson says he was possibly “misled by the length and semi-respectful tone of the papers refuting his claims.” But it is not difficult to see why people would be cautious in this way: the fear that they would turn out to be the ones missing something important.

The individual blogger in philosophy, however, is in a different position. If they wish to develop their thought it must become more technical, and there is no similar community backing that would cause others to assume that the writing basically makes sense. Thus, one’s writing is not only likely to become more and more obscure, but others will become more and more likely to assume that it is more or less meaningless word salad. This will happen even more to the degree that there is cultural opposition to one’s vocabulary, concepts, and topics.

Words, Meaning, and Formal Copies

There is quick way to respond to the implicit questions at the end of the last post. I noted in an earlier discussion of form that form is not only copied into the mind; it is also copied into language itself. Any time you describe something in words, you are to some degree copying its form into your description.

This implies that Aristotle’s objection that a mind using an organ would not be able to know all things could equally be made against the possibility of describing all things in words. There simply are not enough combinations of words to relate them to all possible combinations of things; thus, just as a black and white image cannot imitate every aspect of a colored scene, so words cannot possibly describe every aspect of reality.

Two things are evident from this comparison:

First, the objection fails overall. There is nothing that cannot be described in words because words are flexible. If we don’t have a word for something, then we can make up a name. Similarly, the meaning of a single word depends on context.  The word “this” can refer to pretty much anything, depending on the context in which it is used. Likewise meaning can be affected by the particular situation of the person using the word, or by broader cultural contexts, and so on.

Second, there is some truth in the objection. It is indeed impossible to describe every aspect of reality at the same time and in complete detail, and the objection gives a very good reason for this: there are simply not enough linguistic combinations to represent all possible combinations of things. The fact that language is not prime matter does mean that language cannot express every detail of reality at once: the determination that is already there does exclude this possibility. But the flexibility of language prevents there from being any particular aspect of things that cannot be described.

My claim about the mind is the same. There is nothing that cannot be understood by the mind, despite the fact that the mind uses the brain, because the relationship between the brain, mind, and world is a flexible one. Just as the word “this” can refer to pretty much anything, so also the corresponding thought. But on the other hand, the limitations of the brain do mean that a perfectly detailed knowledge of everything is excluded.

Our Interlocutor Insists

In a sense, the above account is sufficient to respond to the objection. There does not seem to be a reason to hold Aristotle’s account of the immateriality of the mind, unless there is also a reason to hold that language cannot be used to describe some things, and this does not seem like a reasonable position. Nonetheless, this response will give rise to a new and more detailed objection.

A black and white scene, it will be said, really and truly copies some aspects of a colored scene, and fails to copy others. Thus right angles in the black and white scene may be identical to right angles in the colored scene. The angles are really copied, and the angles are not. But language seems different: since it is conventional, it does not really copy anything. We just pretend, as it were, that we are copying the thing. “Let the word ‘cat’ stand for a cat,” we say, but there is nothing catlike about the word in reality. The form of the cat is not really copied into the word, or so it will be argued. And since we are not really copying anything, this is why language has the flexibility to be able to describe all things. The meaning of thoughts, however, is presumably not conventional. So it seems that we need to copy things in a real way into the mind, the way we copy aspects of a colored scene into a black and white image. And thus, meaning in the mind should not be flexible in this way, and a particular material medium (such as the brain) would still impede knowing all things, the way the black and white image excludes color.

Formal Copies

The above objection is similar to Hilary Lawson’s argument that words cannot really refer to things. In the post linked above on form and reality, we quoted his argument that cause and effect do not have anything in common. I will reproduce that argument here; for the purpose of the following discussion it might be useful to the reader to refer to the remainder of that post.

For a system of closure to provide a means of intervention in openness and thus to function as a closure machine, it requires a means of converting the flux of openness into an array of particularities. This initial layer of closure will be identified as ‘preliminary closure’. As with closure generally, preliminary closure consists in the realisation of particularity as a consequence of holding that which is different as the same. This is achieved through the realisation of material in response to openness. The most minimal example of a system of closure consists of a single preliminary closure. Such a system requires two discrete states, or at least states that can be held as if they were discrete. It is not difficult to provide mechanical examples of such systems which allow for a single preliminary closure. A mousetrap for example, can be regarded as having two discrete states: it is either set, it is ready, or it has sprung, it has gone off. Many different causes may have led to it being in one state or another: it may have been sprung by a mouse, but it could also have been knocked by someone or something, or someone could have deliberately set it off. In the context of the mechanism all of these variations are of no consequence, it is either set or it has sprung. The diversity of the immediate environment is thereby reduced to single state and its absence: it is either set or it is not set. Any mechanical arrangement that enables a system to alternate between two or more discrete states is thereby capable of providing the basis for preliminary closure. For example, a bell or a gate could function as the basis for preliminary closure. The bell can either ring or not ring, the gate can be closed or not closed. The bell may ring as the result of the wind, or a person or animal shaking it, but the cause of the response is in the context of system of no consequence. The bell either rings or it doesn’t. Similarly, the gate may be in one state or another because it has been deliberately moved, or because something or someone has dislodged it accidentally, but these variations are not relevant in the context of the state of system, which in this case is the position of the gate. In either case the cause of the bell ringing or the gate closing is infinitely varied, but in the context of the system the variety of inputs is not accessible to the system and thus of no consequence.

Lawson’s basic argument is that any particular effect could result from any of an infinite number of different causes, and the cause and effect might be entirely different: the effect might be ringing of a bell, but the cause was not bell-like at all, and did not have a ringing sound. So the effect, he says, tells you nothing at all about the cause. In a similar way, he claims, our thoughts cause our words, but our words and our thoughts have nothing in common, and thus our words tell us nothing about our thoughts; and in that sense they do not refer to anything, not even to our thoughts. Likewise, he says, the world causes our thoughts, but since the cause and effect have nothing in common, our thoughts tell us nothing about the world, and do not even refer to it.

As I responded at the time, this account is mistaken from the very first step. Cause and effect always have something in common, namely the cause-effect relationship, although they each have different ends of that relationship. They will also have other things in common depending on the particular nature of the cause and effect in question. Similarly, the causes that are supposedly utterly diverse, in Lawson’s account, have something in common themselves: every situation that rings the bell has “aptness to ring the bell” in common. And when the bell is rung, it “refers” to these situations by the implication that we are in a situation that has aptness to ring the bell, rather than in one of the other situations.

It is not accidental here that “refer” and “relate” are taken from forms of the same verb. Lawson’s claim that words do not “refer” to things is basically the same as the claim that they are not really related to things. And the real problem is that he is looking at matter (in this case the bell) without considering form (in this case the bell’s relationship with the world.)

In a similar way, to say that the word “cat” is not catlike is to look at the sound or at the text as matter, without considering its form, namely the relationship it has with the surrounding context which causes that word to be used. But that relationship is real; the fact that the word is conventional does not prevent it from being true that human experience of cats is the cause of thoughts of cats, and that thoughts of cats are concretely the cause of the usage of the word “cat,” even if they could in some other situation have caused some other word to be used.

I argued in the post on the nature of form (following the one with the discussion of Lawson) that form is a network of relationships apt to make something one. Insofar as an effect really receives form from a cause in the above way, words really receive meaning from the context that gives rise to their use. And in this way, it is not true that form in language is unlike form in a black and white scene, such that one could say that form in the scene is “real” and form in language is not. Both are real.

Thus the objection fails. Nonetheless, it is true that it is easier to see why it is possible to describe anything in words, than it is to see why anything can be known. And this happens simply because “anything is describable in words” precisely because “anything can be known.” So the fact that anything can be known is the more remote cause, and thus harder to know.

 

And Fire by Fire

Superstitious Nonsense asks about the last post:

So the answer here is that -some- of the form is present in the mind, but always an insufficient amount or accuracy that the knowledge will not be “physical”? You seem to be implying the part of the form that involves us in the self-reference paradox is precisely the part of the form that gives objects their separate, “physical” character. Is this fair? Certainly, knowing progressively more about an object does not imply the mental copy is becoming closer and closer to having a discrete physicality.

I’m not sure this is the best way to think about it. The self-reference paradox arises because we are trying to copy ourselves into ourselves, and thus we are adding something into ourselves, making the copy incomplete. The problem is not that there is some particular “part of the form” that we cannot copy, but that it is in principle impossible to copy it perfectly. This is different from saying that there is some specific “part” that cannot be copied.

Consider what happens when we make “non-physical” copies of something without involving a mind. Consider the image of a gold coin. There are certain relationships common to the image and to a gold coin in the physical world. So you could say we have a physical gold coin, and a non-physical one.

But wait. If the image of the coin is on paper, isn’t that a physical object? Or if the image is on your computer screen, isn’t your screen a physical object? And the image is just the colors on the screen, which are apparently just as “physical” (or non-physical) as the color of the actual coin. So why we would say that “this is not a physical coin?”

Again, as in the last post, the obvious answer is that the image is not made out of gold, while the physical coin is. But why not? Is it that the image is not accurate enough? If we made it more accurate, would it be made out of gold, or become closer to being made out of gold? Obviously not. This is like noting that a mental copy does not become closer and closer to being a physical one.

In a sense it is true that the reason the image of the coin is not physical is that it is not accurate enough. But that is because it cannot be accurate enough: the fact that it is an image positively excludes the copying of certain relationships. Some aspects can be copied, but others cannot be copied at all, as long as it is an image. On the other hand, you can look at this from the opposite direction: if you did copy those aspects, the image would no longer be an image, but a physical coin.

As a similar example, consider the copying of a colored scene into black and white. We can copy some aspects of the scene by using various shades of gray, but we cannot copy every aspect of the scene. There are simply not enough differences in a black and white image to reflect every aspect of a colored scene. The black and white image, as you make it more accurate, does not become closer to being colored, but this is simply because there are aspects of the colored scene that you never copy. If you do insist on copying those aspects, you will indeed make the black and white image into a colored image, and thus it will no longer be black and white.

The situation becomes significantly more complicated when we talk about a mind. In one way, there is an important similarity. When we say that the copy in the mind is “not physical,” that simply means that it is a copy in the mind, just as when we say that the image of the coin is not physical, it means that it is an image, made out of the stuff that images are made of. But just as the image is physical anyway, in another sense, so it is perfectly possible that the mind is physical in a similar sense. However, this is where things begin to become confusing.

Elsewhere, I discussed Aristotle’s argument that the mind is immaterial. Considering the cases above, we could put his argument in this way: the human brain is a limited physical object. So as long as the brain remains a brain, there are simply not enough potential differences in it to model all possible differences in the world, just as you cannot completely model a colored scene using black and white. But anything at all can be understood. Therefore we cannot be understanding by using the brain.

I have claimed myself that anything that can be, can be understood. But this needs to be understood generically, rather than as claiming that it is possible to understand reality in every detail simultaneously. The self-reference paradox shows that it is impossible in principle for a knower that copies forms into itself to understand itself in every aspect at once. But even apart from this, it is very obvious that we as human beings cannot understand every aspect of reality at once. This does not even need to be argued: you cannot even keep everything in mind at once, let alone understand every detail of everything. This directly suggests a problem with Aristotle’s argument: if being able to know all things suggests that the mind is immaterial, the obvious fact that we cannot know all things suggests that it is not.

Nonetheless, let us see what happens if we advance the argument on Aristotle’s behalf. Admittedly, we cannot understand everything at once. But in the case of the colored scene, there are aspects that cannot be copied at all into the black and white copy. And in the case of the physical coin, there are aspects that cannot be copied at all into the image. So if we are copying things into the brain, doesn’t that mean that there should be aspects of reality that cannot be copied at all into the mind? But this is false, since it would not only mean that we can’t understand everything, but it would also mean that there would be things that we cannot think about at all, and if it is so, then it is not so, because in that case we are right now talking about things that we supposedly cannot talk about.

Copying into the mind is certainly different from copying into a black and white scene or copying into a picture, and this does get at one of the differences. But the difference here is that the method of copying in the case of the mind is flexible, while the method of copying in the case of the pictures is rigid. In other words, we have a pre-defined method of copying in the case of the pictures that, from the beginning, only allows certain aspects to be copied. In the case of the mind, we determine the method differently from case to case, depending on our particular situation and the thing being copied. The result is that there is no particular aspect of things that cannot be copied, but you cannot copy every aspect at once.

In answer to the original question, then, the reason that the “mental copy” always remains mental is that you never violate the constraints of the mind, just as a black and white copy never violates the constraints of being black and white. But if you did violate the constraints of the black and white copy by copying every aspect of the scene, the image would become colored. And similarly, if you did violate the constraints of the mind in order to copy every aspect of reality, your mind would cease to be, and it would instead become the thing itself. But there is no particular aspect of “physicality” that you fail to copy: rather, you just ensure that one way or another you do not violate the constraints of the mind that you have.

Unfortunately, the explanation here for why the mind can copy any particular aspect of reality, although not every aspect at once, is rather vague. Perhaps a clearer explanation is possible? In fact, someone could use the vagueness to argue for Aristotle’s position and against mine. Perhaps my account is vague because it is wrong, and there is actually no way for a physical object to receive copied forms in this way.

Earth By Earth

In an earlier post I quoted Empedocles:

For ’tis by Earth we see Earth, by Water Water,

By Ether Ether divine, by Fire destructive Fire,

By Love Love, and Hate by cruel Hate.

I argued in that post that the mind does have something in common with what is known, and that this common thing is the form of the thing known. However, I took for granted that Empedocles is mistaken in assuming that the thing itself must be in common in order to be known.

I did not directly say why he is mistaken. If form makes a thing what it is, and the form of a thing known is in the mind, why does the mind not become that thing? If the form of earth is in your mind, then why is your mind not literally earth?

We will naturally be inclined to say that the form in your mind is apart from its proper matter, and that you need both form and matter to make a thing. And there is nothing wrong with this answer, as far as it goes, but it seems insufficient. Suppose you have a gold coin: what is its matter? The gold coin is presumably made out of atoms of gold, and since these atoms are not in your head, you do not see gold by gold. The problem is that atoms of gold also have some form, since this is just to say there is an answer when we ask, “What is this?”, and this will be true of anything whatever that you call matter. And there is nothing to prevent you from knowing that thing as well. There is nothing to prevent you from knowing the nature of gold atoms. And thus it seems that the matter will be present, and thus there should be actual gold in your mind.

Perhaps an Aristotelian will suggest that it is prime matter that is missing. But this answer will not work, because humans have this sort of matter in common with other things. And in any case, nothing is meant by “matter” in this sense except the ability to have the form. And since the knower can have the form, they have the ability to have the form, and thus matter. So nothing is missing, and the thing known should be literally in the knower.

Thus it appears that we have a reductio. Either my account of knowledge is mistaken, or earth should actually by known by earth, which it obviously is not.

The conclusion is only apparent, however. We can resolve it by going back to what I said about form in that post and the following one. Form is a network of relationships apt to make something one. But being one not only includes internal unity, but also separation from other things. For example, suppose we now have three gold coins, instead of one: each coin is one coin, and this depends on its parts being together, rather than in a loose heap of gold dust. But the fact that the coins are three depends on their separation from one another, and thus also the fact that each coin is “one” depends on that separation.

In other words, the form of a thing includes not only internal relationships, but also external relationships. This implies that to know the nature of a thing, one must know its external relationships. And to know a thing perfectly would require knowing both its internal and external relationships perfectly.

Now one of the things to which it is related is the very one who knows it. Thus, if the knower is to know the thing perfectly, they must perfectly understand the relationships between themselves and the thing. But this is not possible, for reasons explained in the post on self-reference. The person who attempts to know something perfectly is in the situation of someone attempting to draw a picture of themselves drawing a picture: to make a perfect copy of the gold coin, it is necessary to copy its context, which includes the knower. But this cannot be done; therefore perfect knowledge of the coin is impossible.

A different way to state the same analysis: “perfect copy” is a contradiction in terms, because such perfection would imply identity with the original, and thus not being a copy at all. In other words, perfect knowledge of a thing is impossible because perfect knowledge would imply, as in the argument of Empedocles, that one’s knowledge would literally be the thing known, and thus not knowledge at all.

Place, Time, and Universals

Consider the following three statements:

1. The chair and keyboard that I am currently using are both here in this room.

2. The chair and keyboard that I am currently using both exist in January 2019.

3. The chair and keyboard that I am currently using both came in the color black.

All three claims, considered as everyday statements, happen to be true. They also have a common subject, and something common about the predicate, namely the “in.” We have “in this room,” “in January,” and “in the color black.” Now someone might object that this is a mere artifact of my awkward phrasing: obviously, I deliberately chose these formulations with this idea in mind. So this seems to be a mere verbal similarity, and a meaningless one at that.

The objection seems pretty reasonable, but I will argue that it is mistaken. The verbal similarity is not accidental, despite the fact that I did indeed choose the formulations deliberately with this idea in mind. As I intend to argue, there is indeed something common to the three cases, namely that they represent various ways of existing together.

The three statements are true in their ordinary everyday sense. But consider the following three questions:

1. Are the chair and keyboard really in the same room, or is this commonality a mere appearance?

2. Do the chair and keyboard really exist in the same month, or is this commonality a mere appearance?

3. Did the chair and keyboard really come in the same color, or is this commonality a mere appearance?

These questions are like other questions which ask whether something is “really” the case. There is no such thing as being “really” on the right apart from the ordinary understanding of being on the right, and there is no such thing as being really in the same room apart from the ordinary everyday understanding of being in the same room. The same thing applies to the third question about color.

The dispute between realism and nominalism about universals starts in the following way, roughly speaking:

Nominalist: We say that two things are black. But obviously, there are two things here, and no third thing, and the two are not the same thing. So the two do not really have anything in common. Therefore “two things are black” is nothing but a way of speaking.

Platonic Realist: Obviously, the two things really are black. But what is really the case is not just a way of speaking. So the two really do have something in common. Therefore there are three things here: the two ordinary things, and the color black.

Since the Platonic Realist here goes more against common speech in asserting the existence of “three things” where normally one would say there are “two things,” the nominalist has the apparent advantage at this point, and this leads to more qualified forms of realism. In reality, however, one should have stopped the whole argument at this point. The two positions above form a Kantian dichotomy, and as in all such cases, both positions affirm something true, and both positions affirm something false. In this particular case, the nominalist acts as the Kantian, noting that universality is a mode of knowing, and therefore concludes that it is a mere appearance. The Platonic Realist acts as the anti-Kantian, noting that we can know that several things are in fact black, and concluding that universality is a mode of being as such.

But while universality is a way of knowing, existing together is a way of being, and is responsible for the way of knowing. In a similar way, seeing both my chair and keyboard at the same time is a way of seeing things, but this way of seeing is possible because they are here together in the room. Likewise, I can know that both are black, but this knowledge is only possible because they exist together “in” the color black. What does this mean, exactly? Since we are discussing sensible qualities, things are both in the room and black by having certain relationships with my senses. They exist together in those relationships with my senses.

There is no big difference when I ask about ideas. If we ask what two dogs have in common in virtue of both being dogs, what they have in common is a similar relationship to my understanding. They exist together in that relationship with my understanding.

It might be objected that this is circular. Even if what is in common is a relationship, there is still something in common, and that seems to remain unexplained. Two red objects have a certain relationship of “appearing red” to my eyes, but then do we have two things, or three? The two red things, or the two red things and the relationship of “appearing red”? Or is it four things: two red things, and their two relationships of appearing red? So which is it?

Again, there is no difference between these questions and asking whether a table is really on the left or really on the right. It is both, relative to different things, and likewise all three of these methods of counting are valid, depending on what you want to count. As I have said elsewhere, there are no hidden essences, no “true” count, no “how many things are really there?

“Existing together,” however, is a reality, and is not merely a mode of knowing. This provides another way to analyze the problem with the nominalist / Platonic realist opposition. Both arguments falsely assume that existing together is either logically derivative or non-existent. As I said in the post on existential relativity,  it is impossible to deduce the conclusion that many things exist from a list of premises each affirming that a single thing exists, if only because “many things” does not occur as a term in that list. The nominalist position cannot explain the evident fact that both things are black. Likewise, even if there are three things, the two objects and “black,” this would not explain why the two objects are black. The two objects are not the third, since there are three. So there must be yet another object, perhaps called “participation”, which connects the two objects and blackness. And since they both have participation, there must be yet another object, participation in general, in which both objects are also participating. Obviously none of this is helping: the problem was the assumption from the start that togetherness (whether in place, time, or color) could be something logically derivative.

(Postscript: the reader might notice that in the linked post on “in,” I said that a thing is considered to be in something as form in matter. This seems odd in the context of this post, since we are talking about being “in a color,” and a color would not normally be thought of as material, but as formal. But this simply corresponds with the fact that it would be more usual to say that the color black is in the chair, rather than the chair in the black. This is because it is actually more correct: the color black is formal with respect to the chair, not material. But when we ask, “what things can come in the color black,” we do think of black as though it were a kind of formless matter that could take various determinate forms.)

Fair and Unfair Logic

St. Thomas discusses cases in which one should not follow the law:

As stated above (Article 4), every law is directed to the common weal of men, and derives the force and nature of law accordingly. Hence the jurist says [Pandect. Justin. lib. i, ff., tit. 3, De Leg. et Senat.]: “By no reason of law, or favor of equity, is it allowable for us to interpret harshly, and render burdensome, those useful measures which have been enacted for the welfare of man.” Now it happens often that the observance of some point of law conduces to the common weal in the majority of instances, and yet, in some cases, is very hurtful. Since then the lawgiver cannot have in view every single case, he shapes the law according to what happens most frequently, by directing his attention to the common good. Wherefore if a case arise wherein the observance of that law would be hurtful to the general welfare, it should not be observed. For instance, suppose that in a besieged city it be an established law that the gates of the city are to be kept closed, this is good for public welfare as a general rule: but, it were to happen that the enemy are in pursuit of certain citizens, who are defenders of the city, it would be a great loss to the city, if the gates were not opened to them: and so in that case the gates ought to be opened, contrary to the letter of the law, in order to maintain the common weal, which the lawgiver had in view.

He calls the attitude that leads one to set aside the law in such cases “epikeia,” or “equity,” which in this context means something like fairness or moderation:

As stated above (I-II:96:6), when we were treating of laws, since human actions, with which laws are concerned, are composed of contingent singulars and are innumerable in their diversity, it was not possible to lay down rules of law that would apply to every single case. Legislators in framing laws attend to what commonly happens: although if the law be applied to certain cases it will frustrate the equality of justice and be injurious to the common good, which the law has in view. Thus the law requires deposits to be restored, because in the majority of cases this is just. Yet it happens sometimes to be injurious—for instance, if a madman were to put his sword in deposit, and demand its delivery while in a state of madness, or if a man were to seek the return of his deposit in order to fight against his country. On these and like cases it is bad to follow the law, and it is good to set aside the letter of the law and to follow the dictates of justice and the common good. This is the object of “epikeia” which we call equity. Therefore it is evident that “epikeia” is a virtue.

“Fairness” is probably a good translation here, since someone who rigidly demands the application of the law in such a situation would often be called unfair in relation to the people involved.

Someone might object that much of the benefit of having a law directly depends on following it consistently, without making exceptions based on minute analysis of particular situations, as we saw in the last post. This is correct as far as it goes, but St. Thomas is not talking about analyzing each situation in detail and making an exception whenever there appears to be a benefit, but rather talking about situations which are extremely different from the situations considered by the law. Thus he says in the reply to the second objection:

He who follows the intention of the lawgiver, does not interpret the law simply; but in a case in which it is evident, by reason of the manifest harm, that the lawgiver intended otherwise. For if it be a matter of doubt, he must either act according to the letter of the law, or consult those in power.

To the degree that “laws of logic” can be analogously interpreted as rules for sensible thought and speech, telling one to behave in some ways and not in others, similar principles will apply. Thus, for example, an atheist confronted with the argument of Alexander Pruss for the existence of God based on the indeterminacy of language might not only be inclined to call it sophistical, but to add that it is an unfair way to argue. And indeed it is, precisely in the sense that it applies the rule “either say that A is B or say that A is not B” to situations for which it was not intended, namely situations where B is simply too vague to say. The rule is intended to make people think and speak sensibly, but Pruss is abusing the rule with the opposite result: that he does not speak and think sensibly.

Someone might agree that this is reasonable insofar as we are considering these laws as rules of behavior, but another issue comes up. Human laws are really intended to exclude some kinds of behavior that are really possible. And likewise, rules of logic are really intended to exclude some kinds of behavior that are really possible, e.g. making arguments like:

A: You always say I am wrong.

B: I said you were right about X.

A: See, you just said I was wrong again. You always say I am wrong!

I know from experience that this behavior is possible, and it does violate the laws of logic considered as rules of behavior. But someone might add that the laws of logic are also based on the nature of reality itself, and for this very reason we said that they are not conventions, but could not have been otherwise. So it seems to follow that it should be possible to expound the laws of logic in a form in which they are truly exceptionless, by expressing reality as it truly is.

There is some truth here, but there is also a problem analogous to a similar objection about human law. Consider the third objection and reply in the above article from St. Thomas:

Objection 3. Further, every wise man knows how to explain his intention by words. But those who framed the laws should be reckoned wise: for Wisdom says (Proverbs 8:15): “By Me kings reign, and lawgivers decree just things.” Therefore we should not judge of the intention of the lawgiver otherwise than by the words of the law.

Reply to Objection 3. No man is so wise as to be able to take account of every single case; wherefore he is not able sufficiently to express in words all those things that are suitable for the end he has in view. And even if a lawgiver were able to take all the cases into consideration, he ought not to mention them all, in order to avoid confusion: but should frame the law according to that which is of most common occurrence.

The objection here is similar. If there are cases where it wouldn’t be good to apply the law, the lawgiver ought to have enumerated those cases. St. Thomas replies that in reality you will not foresee every case, and that even if you could, enumerating them would simply cause confusion.

A similar thing applies if we consider the laws of logic. You can say, “If you say that A is B in an infinitely precise sense, and that B is C in an infinitely precise sense, you should also say that A is C,” and your claim might be exceptionless. The problem is that your claim has no cases: no one ever says anything in an infinitely precise sense.

And on the other hand, if you try to make your claim include some actual cases, you will not be able to avoid the possibility of exceptions, just as the human lawgiver does not foresee all cases. And as in the case of human law, if you attempt to enumerate all cases, you will simply cause confusion. Thus, for example, someone might say that the problem in the case of Queen Elizabeth is that we simply don’t have a precise enough definition for “old,” and they might then attempt to give a precise definition. But this would have several results:

1. First, the new word “old” would not have the same meaning as the original word, because the very fact that the original word is vague is part of what the word is. It is not accidental; it is not meant to have a precise cut-off.

2. Someone might attempt to remedy the above flaw by enumerating various circumstances, rather than giving a precise cut-off. “If you are less then 10 years old and you say that someone is ‘old,’ it signifies someone who is at least 15.” “If you are in your 30s and you say that someone is ‘old’, it signifies that they are at least 67.” And so on. But attempting to fix the first problem, you have simply compounded it. The new word still does not have the same meaning as the original word, because the original word was meant to be flexible; even your new rules have too much rigidity.

You could attempt to remedy the above problems by listing all the situations where people in fact use the word “old,” but that is not a definition: it is just an indefinitely long list. What St. Thomas said about human law, that it “ought not to mention them all,” is equally true about this situation. The point of defining “old” is to provide an explanation which is both general and flexible. Someone might argue that we should provide a list of all possible circumstances and what should be done in those circumstances, in order to avoid the flexibility of “epikeia,” but such an attempt would be absurd, and harmful to a good life. And it is equally absurd when we attempt to apply the same process to logic or to definitions, and harmful to sensible thought and speech.

What about reality itself? Isn’t it an exceptionless reality that a thing is what it is? Indeed. But this is neither a rule of behavior nor of speech. Nor is it a rule making something be some way; reality does not need something else to make sure that it turns out to be reality rather than something else. There is simply nothing else to be. Parmenides was right at least to this degree.

Laws of Logic

In the last post, we quoted Carin Robinson’s claim:

For instance, where we use the laws of logic, let us remember that there are no known/knowable facts about logic. These laws are therefore, to the best of our knowledge, conventions not dissimilar to the rules of a game.

Law

I intend to discuss Robinson’s claim in a bit more detail shortly, but first consider the meaning of a law in its plainest sense. In the USA there is a law that you must pay your taxes for the previous year by mid April. What does this law do? Presumably the purpose of the law is to get people to pay their taxes by that time. Without the law, they would likely not pay by then, and if there were no rule that you have to pay taxes at all, people presumably would not pay taxes. So the law is meant to make something happen, namely the payment of taxes by a certain date, something that otherwise might not happen.

Rules of a game

What about the rules of a game? Consider the game of hide and seek. Wikipedia describes it in some detail:

Hide-and-seek, or hide-and-go-seek, is a popular children’s game in which any number of players (ideally at least three) conceal themselves in a set environment, to be found by one or more seekers. The game is played by one player chosen (designated as being “it”) closing their eyes and counting to a predetermined number while the other players hide. For example, count to 100 in units of 5 or count to 20, one two three and keep counting up till it reaches twenty. After reaching this number, the player who is “it” calls “Ready or not, here I come!” and then attempts to locate all concealed players.

This is partly a factual description, but it is also attempting to give the rules. It seems to be a rule that the players who are hiding have some amount of time to hide, and it would seem to be a violation of the rules if the seeker simply starts the game by announcing, “I see everyone here, so I’ve found everyone,” without there being any time to hide.

What do these rules do? Are they like the law?

Yes and no, in different respects. You can certainly imagine a player breaking the rules in the above manner. So the rules, like the law, are meant to make something happen, namely the players act in a certain manner, and they are meant to exclude what might happen without the rule, just like the law.

There is a difference, however. If a player did the above, they would not be playing the game at all. It is possible to go about your life and not pay any taxes; but it is not possible to play hide-and-seek without there being a space or time for people to hide. In this sense, the law excludes some possibilities for life, but the rule of the game does not exclude some possibilities for that game; it simply describes what the game is. It does exclude possibilities that would be rules for other games. So it excludes some possibilities; but not possibilities for the game of hide and seek.

Facts

Why does Robinson say that there are no “facts” about logic? The English word “fact” is taken from the Latin factum, which means “done” or “made.” This is not accidental to the claim here. There is nothing making things follow the rules of logic, and for this reason Robinson asserts that there are no facts, i.e. nothing made to be the case in the realm of logic. Precisely for this reason, you don’t have to go out and look at the “facts”, i.e. things that are made to be the case in the world, to determine whether or not a statement of logic or mathematics is correct or not.

Laws of Logic

Robinson argues that since the laws of logic don’t make anything be the case in the world, they must be conventional, like “rules of a game”. But in our discussion of the rules of a game, we saw that such rules do exclude certain types of possibility, while they constitute the game itself, and therefore do not exclude any possibilities for the game. How would this work if the rules of logic were rules of a game? What sorts of possibility are excluded by the rules, and what game is constituted by the rules?

As we said, it is possible to break the rules of a game, although when you do, you often stop playing the game by definition. It it similarly possible to break the laws of logic?

If we take the game to be a certain sort of speaking, yes, it is. It is possible for someone to say the words, “Blue things are not blue.” It is possible for someone to say the words, “All cats are mammals. Alvin is a cat. Therefore Alvin is not a mammal.” Someone doing this, however, is not playing the particular game in question. What is that game? I suggest we call it “speaking sensibly about reality.” Someone who breaks the laws of logic, by that very fact, fails to speak sensibly about reality, just as someone who breaks the rules of hide-and-seek fails to play the game.

The rules of hide-and-seek are conventional, in the sense that you could have other rules. But if you did have other rules, you would be playing a different game. In the same way, if you had rules other than the laws of logic for your speaking game, you would be doing something entirely different. You would not be doing what we are normally trying to do when we speak, namely speaking sensibly about reality.

Up to this point, we have actually succeeded in making a certain sort of sense out of Robinson’s claim. But does it follow, as supposed, that logic tells us nothing about reality? We pointed out in the previous post that this is not true. But why is it not, if the laws of logic are conventions about how to speak?

Do the rules of hide-and-seek tell us something about the game of hide-and-seek? Clearly they do, despite the fact that they are conventional. They tell us most of what there is to know about the game. They tell us what the game is, in fact. Likewise, the laws of logic tell us how to speak sensibly about reality. Do they also tell us about reality itself, or just about how to speak about it?

They do, in the way that considering the effect reveals the cause. Reality is what it is, and therefore certain ways of speaking are sensible and others are not. So to tell someone how to speak sensibly is to tell them something about reality. However, there is another difference between the laws of logic and the rules of a game. The rules of a game are conventional in the sense that we could have different rules and different games. And similarly, if we didn’t want to follow the “conventions” of logic, we could speak nonsensically instead of trying to speak sensibly about reality. But there is not some possible alternate reality which could be spoken of sensibly by using different “conventions.” In this sense, you can call the laws of logic rules of a game, if you wish. But they are the rules of the game of understanding, and there is only such game, not only in practice but in principle, and the rules could not have been otherwise.

Tautologies Not Trivial

In mathematics and logic, one sometimes speaks of a “trivial truth” or “trivial theorem”, referring to a tautology. Thus for example in this Quora question, Daniil Kozhemiachenko gives this example:

The fact that all groups of order 2 are isomorphic to one another and commutative entails that there are no non-Abelian groups of order 2.

This statement is a tautology because “Abelian group” here just means one that is commutative: the statement is like the customary example of asserting that “all bachelors are unmarried.”

Some extend this usage of “trivial” to refer to all statements that are true in virtue of the meaning of the terms, sometimes called “analytic.” The effect of this is to say that all statements that are logically necessary are trivial truths. An example of this usage can be seen in this paper by Carin Robinson. Robinson says at the end of the summary:

Firstly, I do not ask us to abandon any of the linguistic practises discussed; merely to adopt the correct attitude towards them. For instance, where we use the laws of logic, let us remember that there are no known/knowable facts about logic. These laws are therefore, to the best of our knowledge, conventions not dissimilar to the rules of a game. And, secondly, once we pass sentence on knowing, a priori, anything but trivial truths we shall have at our disposal the sharpest of philosophical tools. A tool which can only proffer a better brand of empiricism.

While the word “trivial” does have a corresponding Latin form that means ordinary or commonplace, the English word seems to be taken mainly from the “trivium” of grammar, rhetoric, and logic. This would seem to make some sense of calling logical necessities “trivial,” in the sense that they pertain to logic. Still, even here something is missing, since Robinson wants to include the truths of mathematics as trivial, and classically these did not pertain to the aforesaid trivium.

Nonetheless, overall Robinson’s intention, and presumably that of others who speak this way, is to suggest that such things are trivial in the English sense of “unimportant.” That is, they may be important tools, but they are not important for understanding. This is clear at least in our example: Robinson calls them trivial because “there are no known/knowable facts about logic.” Logical necessities tell us nothing about reality, and therefore they provide us with no knowledge. They are true by the meaning of the words, and therefore they cannot be true by reason of facts about reality.

Things that are logically necessary are not trivial in this sense. They are important, both in a practical way and directly for understanding the world.

Consider the failure of the Mars Climate Orbiter:

On November 10, 1999, the Mars Climate Orbiter Mishap Investigation Board released a Phase I report, detailing the suspected issues encountered with the loss of the spacecraft. Previously, on September 8, 1999, Trajectory Correction Maneuver-4 was computed and then executed on September 15, 1999. It was intended to place the spacecraft at an optimal position for an orbital insertion maneuver that would bring the spacecraft around Mars at an altitude of 226 km (140 mi) on September 23, 1999. However, during the week between TCM-4 and the orbital insertion maneuver, the navigation team indicated the altitude may be much lower than intended at 150 to 170 km (93 to 106 mi). Twenty-four hours prior to orbital insertion, calculations placed the orbiter at an altitude of 110 kilometers; 80 kilometers is the minimum altitude that Mars Climate Orbiter was thought to be capable of surviving during this maneuver. Post-failure calculations showed that the spacecraft was on a trajectory that would have taken the orbiter within 57 kilometers of the surface, where the spacecraft likely skipped violently on the uppermost atmosphere and was either destroyed in the atmosphere or re-entered heliocentric space.[1]

The primary cause of this discrepancy was that one piece of ground software supplied by Lockheed Martin produced results in a United States customary unit, contrary to its Software Interface Specification (SIS), while a second system, supplied by NASA, expected those results to be in SI units, in accordance with the SIS. Specifically, software that calculated the total impulse produced by thruster firings produced results in pound-force seconds. The trajectory calculation software then used these results – expected to be in newton seconds – to update the predicted position of the spacecraft.

It is presumably an analytic truth that the units defined in one way are unequal to the units defined in the other. But it was ignoring this analytic truth that was the primary cause of the space probe’s failure. So it is evident that analytic truths can be extremely important for practical purposes.

Such truths can also be important for understanding reality. In fact, they are typically more important for understanding than other truths. The argument against this is that if something is necessary in virtue of the meaning of the words, it cannot be telling us something about reality. But this argument is wrong for one simple reason: words and meaning themselves are both elements of reality, and so they do tell us something about reality, even when the truth is fully determinate given the meaning.

If one accepts the mistaken argument, in fact, sometimes one is led even further. Logically necessary truths cannot tell us anything important for understanding reality, since they are simply facts about the meaning of words. On the other hand, anything which is not logically necessary is in some sense accidental: it might have been otherwise. But accidental things that might have been otherwise cannot help us to understand reality in any deep way: it tells us nothing deep about reality to note that there is a tree outside my window at this moment, when this merely happens to be the case, and could easily have been otherwise. Therefore, since neither logically necessary things, nor logically contingent things, can help us to understand reality in any deep or important way, such understanding must be impossible.

It is fairly rare to make such an argument explicitly, but it is a common implication of many arguments that are actually made or suggested, or it at least influences the way people feel about arguments and understanding.  For example, consider this comment on an earlier post. Timocrates suggests that (1) if you have a first cause, it would have to be a brute fact, since it doesn’t have any other cause, and (2) describing reality can’t tell us any reasons but is “simply another description of how things are.” The suggestion behind these objections is that the very idea of understanding is incoherent. As I said there in response, it is true that every true statement is in some sense “just a description of how things are,” but that was what a true statement was meant to be in any case. It surely was not meant to be a description of how things are not.

That “analytic” or “tautologous” statements can indeed provide a non-trivial understanding of reality can also easily be seen by example. Some examples from this blog:

Good and being. The convertibility of being and goodness is “analytic,” in the sense that carefully thinking about the meaning of desire and the good reveals that a universe where existence as such was bad, or even failed to be good, is logically impossible. In particular, it would require a universe where there is no tendency to exist, and this is impossible given that it is posited that something exists.

Natural selection. One of the most important elements of Darwin’s theory of evolution is the following logically necessary statement: the things that have survived are more likely to be the things that were more likely to survive, and less likely to be the things that were less likely to survive.

Limits of discursive knowledge. Knowledge that uses distinct thoughts and concepts is necessarily limited by issues relating to self-reference. It is clear that this is both logically necessary, and tells us important things about our understanding and its limits.

Knowledge and being. Kant rightly recognized a sense in which it is logically impossible to “know things as they are in themselves,” as explained in this post. But as I said elsewhere, the logically impossible assertion that knowledge demands an identity between the mode of knowing and the mode of being is the basis for virtually every sort of philosophical error. So a grasp on the opposite “tautology” is extremely useful for understanding.

 

Perfectly Random

Suppose you have a string of random binary digits such as the following:

00111100010101001100011011001100110110010010100111

This string is 50 digits long, and was the result of a single attempt using the linked generator.

However, something seems distinctly non-random about it: there are exactly 25 zeros and exactly 25 ones. Naturally, this will not always happen, but most of the time the proportion of zeros will be fairly close to half. And evidently this is necessary, since if the proportion was usually much different from half, then the selection could not have been random in the first place.

There are other things about this string that are definitely not random. It contains only zeros and ones, and no other digits, much less items like letters from the alphabet, or items like ‘%’ and ‘$’.

Why do we have these apparently non-random characteristics? Both sorts of characteristics, the approximate and typical proportion, and the more rigid characteristics, are necessary consequences of the way we obtained or defined this number.

It is easy to see that such characteristics are inevitable. Suppose someone wants to choose something random without any non-random characteristics. Let’s suppose they want to avoid the first sort of characteristic, which is perhaps the “easier” task. They can certainly make the proportion of zeros approximately 75% or anything else that they please. But this will still be a non-random characteristic.

They try again. Suppose they succeed in preventing the series of digits from converging to any specific probability. If they do, there is one and only one way to do this. Much as in our discussion of the mathematical laws of nature, the only way to accomplish this will be to go back and forth between longer and longer strings of zeros and ones. But this is an extremely non-random characteristic. So they may have succeeded in avoiding one particular type of non-randomness, but only at the cost of adding something else very non-random.

Again, consider the second kind of characteristic. Here things are even clearer: the only way to avoid the second kind of characteristic is not to attempt any task in the first place. The only way to win is not to play. Once we have said “your task is to do such and such,” we have already specified some non-random characteristics of the second kind; to avoid such characteristics is to avoid the task completely.

“Completely random,” in fact, is an incoherent idea. No such thing can exist anywhere, in the same way that “formless matter” cannot actually exist, but all matter is formed in one way or another.

The same thing applies to David Hume’s supposed problem of induction. I ended that post with the remark that for his argument to work, he must be “absolutely certain that the future will resemble the past in no way.” But this of course is impossible in the first place; the past and the future are both defined as periods of time, and so there is some resemblance in their very definition, in the same way that any material thing must have some form in its definition, and any “random” thing must have something non-random in its definition.

 

Quantum Mechanics and Libertarian Free Will

In a passage quoted in the last post, Jerry Coyne claims that quantum indeterminacy is irrelevant to free will: “Even the pure indeterminism of quantum mechanics can’t give us free will, because that’s simple randomness, and not a result of our own ‘will.'”

Coyne seems to be thinking that since quantum indeterminism has fixed probabilities in any specific situation, the result for human behavior would necessarily be like our second imaginary situation in the last post. There might be a 20% chance that you would randomly do X, and an 80% chance that you would randomly do Y, and nothing can affect these probabilities. Consequently you cannot be morally responsible for doing X or for doing Y, nor should you be praised or blamed for them.

Wait, you might say. Coyne explicitly favors praise and blame in general. But why? If you would not praise or blame someone doing something randomly, why should you praise or blame someone doing something in a deterministic manner? As explained in the last post, the question is whether reasons have any influence on your behavior. Coyne is assuming that if your behavior is deterministic, it can still be influenced by reasons, but if it is indeterministic, it cannot be. But there is no reason for this to be case. Your behavior can be influenced by reasons whether it is deterministic or not.

St. Thomas argues for libertarian free will on the grounds that there can be reasons for opposite actions:

Man does not choose of necessity. And this is because that which is possible not to be, is not of necessity. Now the reason why it is possible not to choose, or to choose, may be gathered from a twofold power in man. For man can will and not will, act and not act; again, he can will this or that, and do this or that. The reason of this is seated in the very power of the reason. For the will can tend to whatever the reason can apprehend as good. Now the reason can apprehend as good, not only this, viz. “to will” or “to act,” but also this, viz. “not to will” or “not to act.” Again, in all particular goods, the reason can consider an aspect of some good, and the lack of some good, which has the aspect of evil: and in this respect, it can apprehend any single one of such goods as to be chosen or to be avoided. The perfect good alone, which is Happiness, cannot be apprehended by the reason as an evil, or as lacking in any way. Consequently man wills Happiness of necessity, nor can he will not to be happy, or to be unhappy. Now since choice is not of the end, but of the means, as stated above (Article 3); it is not of the perfect good, which is Happiness, but of other particular goods. Therefore man chooses not of necessity, but freely.

Someone might object that if both are possible, there cannot be a reason why someone chooses one rather than the other. This is basically the claim in the third objection:

Further, if two things are absolutely equal, man is not moved to one more than to the other; thus if a hungry man, as Plato says (Cf. De Coelo ii, 13), be confronted on either side with two portions of food equally appetizing and at an equal distance, he is not moved towards one more than to the other; and he finds the reason of this in the immobility of the earth in the middle of the world. Now, if that which is equally (eligible) with something else cannot be chosen, much less can that be chosen which appears as less (eligible). Therefore if two or more things are available, of which one appears to be more (eligible), it is impossible to choose any of the others. Therefore that which appears to hold the first place is chosen of necessity. But every act of choosing is in regard to something that seems in some way better. Therefore every choice is made necessarily.

St. Thomas responds to this that it is a question of what the person considers:

If two things be proposed as equal under one aspect, nothing hinders us from considering in one of them some particular point of superiority, so that the will has a bent towards that one rather than towards the other.

Thus for example, someone might decide to become a doctor because it pays well, or they might decide to become a truck driver because they enjoy driving. Whether they consider “what would I enjoy?” or “what would pay well?” will determine which choice they make.

The reader might notice a flaw, or at least a loose thread, in St. Thomas’s argument. In our example, what determines whether you think about what pays well or what you would enjoy? This could be yet another choice. I could create a spreadsheet of possible jobs and think, “What should I put on it? Should I put the pay? or should I put what I enjoy?” But obviously the question about necessity will simply be pushed back, in this case. Is this choice itself determinate or indeterminate? And what determines what choice I make in this case? Here we are discussing an actual temporal series of thoughts, and it absolutely must have a first, since human life has a beginning in time. Consequently there will have to be a point where, if there is the possibility of “doing A for reason B” and “doing C for reason D”, it cannot be any additional consideration which determines which one is done.

Now it is possible at this point that St. Thomas is mistaken. It might be that the hypothesis that both were “really” possible is mistaken, and something does determine one rather than the other with “necessity.” It is also possible that he is not mistaken. Either way, human reasons do not influence the determination, because reason B and/or reason D are the first reasons considered, by hypothesis (if they were not, we would simply push back the question.)

At this point someone might consider this lack of the influence of reasons to imply that people are not morally responsible for doing A or for doing C. The problem with this is that if you do something without a reason (and without potentially being influenced by a reason), then indeed you would not be morally responsible. But the person doing A or C is not uninfluenced by reasons. They are influenced by reason B, or by reason D. Consequently, they are responsible for their specific action, because they do it for a reason, despite the fact that there is some other general issue that they are not responsible for.

What influence could quantum indeterminacy have here? It might be responsible for deciding between “doing A for reason B” and “doing C for reason D.” And as Coyne says, this would be “simple randomness,” with fixed probabilities in any particular situation. But none of this would prevent this from being a situation that would include libertarian free will, since libertarian free will is precisely nothing but the situation where there are two real possibilities: you might do one thing for one reason, or another thing for another reason. And that is what we would have here.

Does quantum mechanics have this influence in fact, or is this just a theoretical possibility? It very likely does. Some argue that it probably doesn’t, on the grounds that quantum mechanics does not typically seem to imply much indeterminacy for macroscopic objects. The problem with this argument is that the only way of knowing that quantum indeterminacy rarely leads to large scale differences is by using humanly designed items like clocks or computers. And these are specifically designed to be determinate: whenever our artifact is not sufficiently determinate and predictable, we change the design until we get something predictable. If we look at something in nature uninfluenced by human design, like a waterfall, is details are highly unpredictable to us. Which drop of water will be the most distant from this particular point one hour from now? There is no way to know.

But how much real indeterminacy is in the waterfall, or in the human brain, due to quantum indeterminacy? Most likely nobody knows, but it is basically a question of timescales. Do you get a great deal of indeterminacy after one hour, or after several days? One way or another, with the passage of enough time, you will get a degree of real indeterminacy as high as you like. The same thing will be equally true of human behavior. We often notice, in fact, that at short timescales there is less indeterminacy than we subjectively feel. For example, if someone hesitates to accept an invitation, in many situations, others will know that the person is very likely to decline. But the person feels very uncertain, as though there were a 50/50 chance of accepting or declining. The real probabilities might be 90/10 or even more slanted. Nonetheless, the question is one of timescales and not of whether or not there is any indeterminacy. There is, this is basically settled, it will apply to human behavior, and there is little reason to doubt that it applies at relatively short timescales compared to the timescales at which it applies to clocks and computers or other things designed with predictability in mind.

In this sense, quantum indeterminacy strongly suggests that St. Thomas is basically correct about libertarian free will.

On the other hand, Coyne is also right about something here. While it is not true that such “randomness” removes moral responsibility, the fact that people do things for reasons, or that praise and blame is a fitting response to actions done for reasons, Coyne correctly notices that it does not add to the fact that someone is responsible. If there is no human reason for the fact that a person did A for reason B rather than C for reason D, this makes their actions less intelligible, and thus less subject to responsibility. In other words, the “libertarian” part of libertarian free will does not make the will more truly a will, but less truly. In this respect, Coyne is right. This however is unrelated to quantum mechanics or to any particular scientific account. The thoughtful person can understand this simply from general considerations about what it means to act for a reason.