In Defense of My Defense of Beer-Drinking Scientists

My March 21 post on this topic generated an amazing amount of interest. I guess it didn’t hurt that a blurb about that post made the front page of Slashdot – and thus conferring near-god-like status on me according to my most geeky friends (“Chris – you’ve been slashdotted!”). The Slashdot post generated over 130 comments, and the post on my blog hit a record 35 comments as of today. Wow. Who’d of thought that a challenge to the beer-drinking habits of scientists would generate such interest?

To review, Czech ornithologist Tomas Grim published a paper showing a negative correlation between social beer drinking and scientific output. I then posted a critique of that study explaining why I thought that the data did not support his conclusions.

Many of the comments I received were fun, some were silly, and still others were just plain weird. Some comments made suggestions for future work (yes, I am considering carefully whether I should conduct my own study), while others criticized my analysis. I think some of these critiques are worth responding to.

Some people pointed out the obvious cause-and-effect relationship between drunkenness and poor thinking skills. Isn’t it obvious that drinking would lead to bad science? However, the paper gives enough information to conclude that the average rate of beer drinking among studied subjects is not much different from the Czech average – less than one pint (0.5 liters) per day. Although the raw data is not presented (to protect the identities of the subjects, according to the author), it seems unlikely that the heaviest drinkers in the study rose to the level of alcohol abusers. And since the survey only asked about average weekly beers consumed, there is no data on binging or even if any of the heaviest beer drinkers got a buzz!

Several people caught my error where I referred to R-squared as the “correlation coefficient”. I definitely deserve a reduction in my beer rations for that bit of sloppiness. R-squared (or R^2) is often called the “coefficient of determination” and, for the case of least-squares linear regression, R-squared is equal to the square of the correlation coefficient (r).

Several people jumped on my case of claiming that the five lowest scientific output data points were outliers that should be removed from the analysis. I never used the term outlier, and I never said they should be removed from the analysis. Instead, I claimed that these data points were extremely influential (without them the correlation nearly disappears). Properly dealing with outliers is extremely tricky and I’d be surprised if 1 in a 100 scientists could do it well. But understanding how to handle highly influential data points (and what that means for the interpretation of the fitting results) is even harder, and is almost always ignored. Pointing out that five highly influential beer guzzlers controlled the fit is very important when trying to draw conclusions that apply to tens of thousands of scientists.

Which brings me back to the main point of my critique of Dr. Grim’s study – the low number of subjects. I mentioned that there were 34 data points – but this is actually an exaggeration. Grim made two surveys of beer drinking habits, one in 2002 with 18 responses and a second one 2006 with 16 responses. Thus, there were 34 data points. But the people being surveyed were the same! Thus, the total number of people involved in the study looks to be 18. (I wonder which data point(s) belong to Dr. Grim.) The analysis of the 2002 data showed an R^2 of 0.34, while the 2006 survey showed an R^2 of 0.52. The correlation coefficients are just barely large enough to claim with 95% confidence that the results are statistically different from an R^2 of 0 (no correlation) – assuming all confounding factors have been properly eliminated.

But that’s the rub. There is no way to know whether all confounding factors (read systematic errors) have been eliminated. And it seems unlikely, as I mentioned before, that one can plausibly claim that the data points are independent, since it seems likely that at least some of this small group of Czech ornithologists socialized together. Thus, a low R^2 for a study with a small number of data points on a subject of great complexity is unlikely to be very revealing despite the most rigorous of statistical treatments.

Interestingly, the most revealing thing I learned in the past week about this study came from Dr. Grim’s website. The page on his beer study is labeled “Beer vs science – first laugh, then think (what to drink:-)”, which makes me suspect he is angling for an Ig Noble award. He posts his now-infamous paper for download, as well as giving links to many of the press reports about his work. Then come a collection of pictures showing that Dr. Grim’s data point would not lie on the low beer consumption side of the graph. If nothing else, it seems that Dr. Grim is having fun. Good for him!

By the way, I did not drink a single beer while I wrote this post. Only scotch.

6 thoughts on “In Defense of My Defense of Beer-Drinking Scientists”

  1. Hi Chris,
    Are you kidding me? I might accept arguments in your new „In Defense of My Defense of Beer-Drinking Scientists“ if it was posted today (i.e. on 1st April). But it was not. So, briefly:

    1. “cause-and-effect relationship … the average rate of beer drinking among studied subjects is not much different from the Czech average”. Stop and think for a while – what has population average (for any data set) to do with the relationship (between any variables)? Nothing, of course. What matters is the spread of data which you cannot determine from the Fig. 1 (because I used Box-Cox transformation; see http://en.wikipedia.org/wik…).

    2. “it seems unlikely that the heaviest drinkers in the study rose to the level of alcohol abusers”. But “unlikely” just means that you just do not know. Moreover, for a particular person drinking just 2 or beers may have the effect as 10 beers for another person. Anyway, I would be happy to see your mental productivity with a hangover after a night spent with some 5 liters of beer (I mean Czech beer, not the US crap:-). Just one night per week like that would decrease your productivity by some 20% in comparison (assuming you work 5 days/week) to another similar person that would not drink (so much or at all). I am eager to hear an argument that rejects that logic:-) (or the results of hundreds of studies of alcohol-related defects in human cognition).

    3. “five highly influential beer guzzlers controlled the fit”. Incorrect: I (just for fun) removed those people, recalculated the regression and the relationship was just close to the sacred cow named 0.05. But in my sample I would prefer to remove a different data points than you – I am joking of course.

    4. “the total number of people involved in the study looks to be 18”. Wrong. In 2006 I got 34 data points INCLUDING 18 old and 16 NEW researchers. See the bloody Fig. 1 properly, man!:-)

    5. “The correlation coefficients are just barely large enough to claim with 95% confidence that the results are statistically different from an R^2 of 0”. This is bullshit. PLEASE, read some statistical textbook first…

    6. “There is no way to know whether all confounding factors (read systematic errors) have been eliminated.” Agreed – just like in ANY ecological/behavioural study:-)

    7. “this small group of Czech ornithologists socialized together”. I checked those 5 people you were so much interested in – they work at 5 different institutions in 3 different towns. So they unlikely drink every night together:-)

    8. “small” data set. The sample of that magnitude (34) is quite good for an ecological study,
    in fact! (see any behavioural ecology journal and you will see that my sample is perhaps even above average:-) More importantly, I surveyed Czech avian ecologists and I had 90% (!) success in getting response from my respondents – thus, I had data for almost the whole population I studied! From this "sample quality" view my study is very much above average, of course:-) But still my work was just a preliminary survey which should be followed by wider sampling across countries and scientific professions, of course.

    9. The last paragraph is the only one in you text which is close to reality, sorry:-)

    Cheers, Tom

  2. So beer may or may not hinder a scientist’s creative abilities. On the flip side, will scientists ever start taking drugs in order to improve their skills? Would this ever lead to drug testing researchers that announce amazing new scientific breakthroughs? (sort of far fetched but an interesting idea nonetheless).

  3. this small group of Czech ornithologists socialized together”. I checked those 5 people you were so much interested in – they work at 5 different institutions in 3 different towns. So they unlikely drink every night together:-)

  4. n the flip side, will scientists ever start taking drugs in order to improve their skills? Would this ever lead to drug testing researchers that announce amazing new scientific breakthroughs?

  5. Ya, Scientists do take alcohol..But i think they only take that for to enjoy themselves or to get rid of some mental tension. I usually take alcohol to forget my bad memories. but it is only a matter of time.

  6. More importantly, I surveyed Czech avian ecologists and I had 90% (!) success in getting response from my respondents – thus, I had data for almost the whole population I studied! From this "sample quality" view my study is very much above average, of course:-) But still my work was just a preliminary survey which should be followed by wider sampling across countries and scientific professions, of course.

Leave a Reply

Your email address will not be published. Required fields are marked *