Skip to main content.
October 15th, 2006

More on Rankings

I made one mistake in my note on Leiter’s M&E rankings yesterday. Texas is actually fifth, behind Notre Dame at outright fourth. My apologies for that.

As Aidan notes in the comments on that post, the rankings do come attached with the following disclaimer.

This measure obviously favors large departments (which can cover more areas) and does not discriminate between the relative importance and prestige of sub-fields within the metaphysics and epistemology category.

There’s a suggestion that this makes up for some of my criticisms. I rather think it doesn’t – it still looks like a junk stat to me. Much more as to why under the fold.

First, a few disclaimers.

I don’t think the overall departmental ratings are junk stats – I think they’re incredibly useful. And I don’t think that the individual ratings are junk stats – they’re really useful too. And I’m not displeased with where Cornell ended up. I’m not sure there is anything meaningful to be measured around here, but if there is, about 13th sounds not off by an order of magnitude to me. And as I said, there are good people at the departments that are overrated on this list.

Having said all that, here are eight worries about the stat.

The first thing to say is that if you need a disclaimer like that attached to the list, then it is probably best to not publish the list in the first place. Everyone knows that lists like this will be discussed over email, in department lounges etc, without the disclaimer. It’s a bad idea to prominently publish lists without being confident the list can stand on its own, because within minutes of it being published it will be having to stand on its own in conversations around the world.

Second, the disclaimer doesn’t really address the obviously bizarre claim, which is that Princeton is ranked so low. It isn’t that Princeton is a particularly small department. If someone chooses not to go to Princeton because of this list, that would be a really unfortunate result I think. (MIT is obviously a different case here, because the disclaimer, when it is attached, does apply there.) I’d also note the oddness of ranking Michigan 17th, and U-Mass 18th. Again, these can’t be written off as large department bias.

(Berkeley is a little low at 13th as well, but I think that has a different explanation. I actually thought Berkeley would have been helped by the odd way of drawing the boundaries, since it is strong in philosophical logic and, I thought, philosophy of action. Getting 0, or close to it, in philosophy of religion, however, really makes a difference. Going from 0 to 3.5 in one category, say philosophy of religion, lifts a department’s average by 0.5 points, or the difference between 5th and 17th. That seems excessive to me. Whether that’s exactly what’s happened to Berkeley is hard to tell without the raw data.)

Third, because ‘metaphysics & epistemology’ is not really a natural kind, it ends up being sensitive to what seem like the wrong things to me. I don’t know how well Princeton did on philosophy of action, but if I was rating it one of the large considerations would be Michael Smith’s work on moral psychology. Now I think Michael’s work on moral psychology is really very good, and very important, but it isn’t obvious to me that it’s value should play a particularly large role in determining how good his department is in metaphysics & epistemology. As things stand, it plays a pretty large role. That’s because having one person who does important work on action theory suffices to be pretty good in philosophy of action. Such a person may make as much difference to a department’s ranking in philosophy of action as three or four good epistemologists make to a department’s ranking in epistemology. But this is absurd.

Fourth, there is a striking bias here in favour of people who work across categories. If someone writes a great book on the metaphysics of free will, and how this relates to the problem of evil, that will improve their department’s ranking (perhaps considerably) on three of the seven rankings that go into this stat. But if someone writes a great book on scepticism, that only impacts the epistemology ranking, or one-seventh of the stat. But it is absurd that work on the metaphysics of free will should be three times as important to metaphysics and epistemology as work on scepticism.

Note that this is independent of worries about department size. The point is that the rank favours people whose work cuts across the different categories. Small departments can be helped in this way as much as large departments.

Fifth, the stat is incredibly dependent on just which categories we choose to measure. For various reasons, action theory and philosophy of religion are included. But there’s no category for, say, philosophy of perception. (Should it be wound into philosophy of mind or epistemology?) Including it would make a difference. (Probably helping Texas as it turns out.) So there’s some amount of arbitrariness in choosing what we did.

Sixth, the stat leads to obviously crazy results in some hypothetical situations. Consider the three hypothetical departments A, B and C in the following example.

 ABC
Metaphysics531.5
Epistemology531
Mind432
Language433
Philosophical Logic 335
Religion zero45
Action zero35

If we take the average of the seven categories, then they are ranked C, B, A. Now this is nonsense. One might object that C is obviously a fictional department. (For one thing, a department that good in action theory and philosophy of religion would already get a metaphysics ranking higher than 1.5, because of the double counting mentioned above.) But A and B seem not particularly unrealistic to me. There are two distinct problems this example brings out, that lead (finally!) to our last two objections.

Seventh, the five point scales that are used are not linear. Consider what it takes, in terms of adding people or publications or citations, to get from 4 to 5 in a category. Now consider what it takes to get from 1 to 2, or 2 to 3. When you have a set of non-linear measures like this, averaging them is a bad idea. (That’s why the A-B comparison looks so silly.)

Eighth, even if you are going to average them (despite the worries about double counting and non-linear scales) treating these categories all the same is a little absurd in terms of what is being intuitively captured. (This is what the A-C comparison brings out.) There should be some weighting of the categories to their relative importance in M&E broadly construed.

Which weighting? Good question! I think there’s no right answer here; differences about the appropriate weighting are really differences of taste. Here’s one natural suggestion, however, that is at least sorta kinda objective. Different categories in the Leiter report have different numbers of people contributing rankings. To a first approximation, the number of rankers is proportionate to the importance to the field. So we could use that as the weighting. This would give a slightly more sensible number, but it still wouldn’t deal with the double counting and non-linear scales problems.

Short version of my advice to undergraduates: Don’t use these “broad category” measures to make any decisions at all! Stick to the overall rankings and the speciality rankings.

Posted by Brian Weatherson in Uncategorized

13 Comments »

This entry was posted on Sunday, October 15th, 2006 at 12:20 pm and is filed under Uncategorized. You can follow any responses to this entry through the comments RSS 2.0 feed. You can skip to the end and leave a response. Pinging is currently not allowed.

13 Responses to “More on Rankings”

  1. Aidan McGlynn says:

    Well, there was a suggestion it might make some of the discrepencies less surprising. I didn’t suggest, and didn’t mean to suggest, that this meant they weren’t discrepencies.

    There are similar worries to be had with the science-maths rankings – what conclusions about, for example, OSU’s strength in philmath could one draw from their lack of inclusion in this top 20? Not much, I’d wager.

  2. Pablo Stafforini says:

    I don’t know how well Princeton did on philosophy of action, but if I was rating it one of the large considerations would be Michael Smith’s work on moral psychology. Now I think Michael’s work on moral psychology is really very good, and very important, but it isn’t obvious to me that it’s value should play a particularly large role in determining how good his department is in metaphysics & epistemology.

    Brian,

    In the 04/06 edition of the Report, moral psychology is grouped together with normative ethics. Michael Smith’s work on moral psychology, good and important as it might be, is therefore irrelevant for assessing Princeton’s ranking in philosophy of action. (Unless, of course, the category boundaries have been redrawn for this new edition.)

  3. Brian Weatherson says:

    Just because there is also a category for moral psychology doesn’t mean that people won’t pay attention to it when evaluating departments on philosophy of action. Especially the people who are evaluations for action theory but not moral psychology, it is hard to believe this wouldn’t have played some role in their judgments.

  4. Neil says:

    it is absurd that work on the metaphysics of free will should be three times as important to metaphysics and epistemology as work on scepticism

    .

    Agreed. Work in free will is no more than twice as important.

  5. Pablo Stafforini says:

    Just because there is also a category for moral psychology doesn’t mean that people won’t pay attention to it when evaluating departments on philosophy of action.

    Evaluators might have paid attention to moral psychology when assessing departments in philosophy of action. My point is that if they did, they shouldn’t have. In my intuitive understanding of the system of categorization adopted by the Report, explicitly including an area in a category tacitly excludes it from all the others. So, when there is a separate category for a particular specialty area, you are not supposed to subsume this area under any of the other available categories.

  6. Michael Kremer says:

    Pablo Stafforini,

    (1) Do evaluators assigned to evaluate philosophy of action know that moral psychology is assigned to normative ethics? (I think they don’t necessarily know that, especially if it is a new category; they are just given the list of categories for which they are to evaluate.)

    (2) Even if they did, why should they accept this categorization as determining that excellent work in moral psychology does not strengthen a department in philosophy of action? Suppose you are deciding how to advise a student who is interested in philosophy of action about whether to go to a particular department which has someone doing very good work in moral psychology, which you think would be important for that student’s education in philosophy of action. Do you say to yourself, “well, moral psychology is really a part of ethics, so I won’t tell the student about that”? Of course not. Yet that is what you’re doing, in effect (remember, one primary user of PGR is the student thinking about graduate schools) by ignoring moral psychology in ranking schools’ strength in philosophy of action.

  7. Pablo Stafforini says:

    Michael,

    (1) Do evaluators assigned to evaluate philosophy of action know that moral psychology is assigned to normative ethics?

    I assume they do: to evaluate departments in a given category implies knowing which category one is evaluating. And the category in question is \‘Normative Ethics and Moral Psychology\’ (see here)

    Even if they did, why should they accept this categorization as determining that excellent work in moral psychology does not strengthen a department in philosophy of action?

    I would say that if they don\‘t accept the categorization, then they shouldn\‘t evaluate departments in the categories whose boundaries they contest.

  8. Brian Weatherson says:

    Pablo’s position here seems to make very strong assumptions about how evaluators should operate. First, they are expected to know what the other categories are, which isn’t always obvious. Second, they’re meant to leave off stuff that falls into another category.

    I think the second would lead to absurd consequences. We wouldn’t count work in philosophy of physics for the philosophy of science evaluation. And we wouldn’t count work on free will or religion towards the metaphysics category. This would be very odd practice indeed. I’ll just say that I certainly took work on free will and philosophy of religion into account when evaluating how strong departments are in metaphysics. And I took work in decision theory into account in the epistemology rankings, when it was epistemologically relevant. It would be very bad to not do this I’d think.

  9. Brian Leiter says:

    We at least agree that students shouldn’t choose departments on the basis of this aggregated list rather than the overall and specialty rankings. Nonetheless, it seems to me this list, and the comparable lists for the other areas, provide information, that may be important to departments, and perhaps to prospective students.

    I’ll take some, not all your, points in order.

    The first point I find very puzzling. We long ago decided against excessive paternalism, i.e., not sharing data because the innocent might misuse it. We have always tried to present the data clearly, with appropriate caveats, as in this case. Why you think those discussing it would ignore the bolded caveat is a bit mysterious. The entire discussion on this blog is testimony to the fact that in discussing a list like this people note these, and other, caveats about the limitations of this measure.

    Second: Princeton is not ranked that low. Princeton is, in fact, much smaller than both Texas and Notre Dame. It was ranked only one-tenth of a point behind Texas. Texas performed better than Princeton for the specialty rankings in philosophy of language and philosophical logic, and was outperformed by Princeton in metaphysics and epistemology (I haven’t seen the phil of mind results, though based on 2004 I’d expect they’re at least competitive, perhaps Texas even came out ahead). Where Texas gets the extra edge is in virtue of its size, with people working in phil of religion and phil of action.

    Third: obviously “M&E across all the areas” lumped together in the PGR isn’t a natural kind; but you can’t really think that, e.g., “philosophy of language” is a natural kind either, do you? All these fields involve some arbitrary line-drawing (e.g., where does work on “vagueness” go? Phil of language? Metaphysics? Philosophical logic? In fact, it tends to turn up in all the assorted Blackwell Companions on these subjects). This aggregated category involves more arbitrary line-drawing than some of the existing specialty categories, but surely phil of action and phil of religion belong in M&E, and are often put there? Perhaps what is driving your comments is your low opinion of these fields, compared to “metaphysics proper” or epistemology; but that is consistent with the caveat I posted. Someone who is mainly interested in epistemology shouldn’t heed this list. But someone with wide-ranging interests in M&E might benefit from noticing how strong Oxford and Notre Dame and Rutgers are across this wide swath of fields. (The “phil of action” category was defined as to include work on “free will.” My guess from the results is that most evaluators did not take it to encompass moral psychology.)

    Fifth: of course the category is sensitive to what it measures. That’s why I mention what it measures. The categories are ones the Advisory Board settled on some time ago, and I assume they are familiar ones to everyone.

    Sixth: your hypothetical scenarios are not the actual ones.

    Your seventh point may be correct.

    Eighth: following this suggestion would be FAR more contentious. Why not just say explicitly: phil of action and religion aren’t as important as metaphysics and epistemology, therefore one should ignore any aggregation that includes all these fields on a par. That does seem to me to be what this (apart from your 7th point) boils down to.

    But back to the beginning: I don’t think these aggregations are as informative or useful as the overall or specialty rankings. But I’m not in a position to release any of the latter rankings yet. Those will continue to be the primary evaluative information in the PGR. What may prove useful and striking about the aggregated categories is the extent to which they will highlight programs with strengths in broader sub-fields: e.g., departments with a strong investment in history of phil (e.g., Chicago); departments with a strong investment in phil of the sciences and mathematics (e.g., Irvine); departments with a strong investment in value theory (e.g., Michigan and North Carolina).

  10. Jono says:

    Brian Leiter writes:
    “Someone with wide-ranging interests in M&E might benefit from noticing how strong Oxford and Notre Dame and Rutgers are across this wide swath of fields.”

    This is kind of true, maybe, but I don’t know what it establishes. I think students interested in anything can benefit from a lot of things, and I don’t think what’s included in M&E really captures what would most benefit people. Here are some kinds that seem better to me:

    phil language, phil mind, phil logic, phil cog sci, math. logic, metaethics, history analytic

    phil mind, phil language, phil cog sci, phil sci, metaphysics, early modern phil, modern phil

    epistemology, phil. sci, phil mind, phil language, metaphysics, early modern, modern, kant, ancient

    metaphysics, phil sci, phil mind, epistemology, metaethics, ancient, early modern, modern

    I think most of the choices of inclusion are obvious. Maybe metaethics is less so. But that field, I think, has made far more wide-ranging contributions to M&E in the last century than has the philosophy of religion or philosophy of action: many trace the notion of supervenience to metaethics. And, unless I’m getting the details mixed up, quasi-realism originated in metaethical debates.

    To motivate some of the exclusions: I would prefer a philosophy of language that was well-informed by cognitive science and the history of analytic philosophy and ignorant of, say, mereology or causation. I’ve never come across any mention of mereology or causation in my readings of, say, the semantics of natural languages. But the history of analytic philosophy (especially Frege and Russell) comes up fairly frequently.

    And much work in epistemology (for example, self-knowledge, and the McDowell-Brandom exchange) is closely connected to the philosophy of mind and therefore to philosophy of cognitive science. Philosophy of mind without philosophy of cognitive science is just a bad idea.

  11. Aidan McGlynn says:

    It would be hard not to come across any mention of causation reading semantics of natural language. Presumably that would mean not reading Kripke, to take an obvious example.

  12. Jono says:

    Aidan,

    Sorry, I miswrote. I thought my intention was clear. Obviously, there is fairly frequent use of ‘causation’. But there is little metaphysical analysis of what causation is. My point is that many areas in history of analytic philosophy and philosophy of cognitive science are far more relevant to philosophy of language that are many areas in metaphysics, such as metaphysical accounts of causation and natural laws.

  13. Keith DeRose says:

    I don’t know how sticky we’re being about what’s to count as “natural kind” here, but the interests of many faculty — and presumably of many potential students — reside in areas of overlap among the components of MEML, yet they don’t have similarly strong interests in the likes of ethics, history of philosophy, etc. For such people, a broader evaluation may be more appropriate than narrower ones concerning the individual components of MEML.

    But then the way to get such broader-but-not-yet-overall evaluations would seem to be to ask the evaluators how each dept. is in the broader areas: Since we are ultimately relying on the evaluators anyway, might as well let (each of) them decide how much weight to give each component — and how to take into account the work that falls in the areas of overlap.

    In the end, I doubt it would be worth it. For while for some, an MEML evaluation might be important, each potential grad student has her own mix of interests, so best just to give evaluations for lots of smaller areas (as are on the PGR currently) & let each student figure out for herself which mixes of strengths make sense for her. That’s basically what potential students do — though probably without working out all the math — when they use the PGR in the way that I think is most helpful for them. I advise students to go through the specialty rankings, and look for the departments that show up strong in the areas of most interest to them. That’s a great way of coming up with, or adding to, a list of programs to check out further.

Leave a Reply

You must be logged in to post a comment.