jetpack domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /mnt/stor08-wc1-ord1/694335/916773/www.tvhe.co.nz/web/content/wp-includes/functions.php on line 6131updraftplus domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /mnt/stor08-wc1-ord1/694335/916773/www.tvhe.co.nz/web/content/wp-includes/functions.php on line 6131avia_framework domain was triggered too early. This is usually an indicator for some code in the plugin or theme running too early. Translations should be loaded at the init action or later. Please see Debugging in WordPress for more information. (This message was added in version 6.7.0.) in /mnt/stor08-wc1-ord1/694335/916773/www.tvhe.co.nz/web/content/wp-includes/functions.php on line 6131Individual: “You are an economist! You must agree that GDP is crap.”
Nolan: “Ahh well, ummm, what question are you asking?”
Individual: “What, it is just crap though, you must agree that it is rubbish”
Nolan: “Well it depends on your question, why are we measur …”
Individual: “Come on, it is just rubbish, everyone agrees it is rubbish. I mean we care about so many other things”
Nolan: “Ahh so your question is about what we should value. Ok yeah it doesn’t measure all social value but …”
Individual: “Yeah, it’s rubbish, exactly”
I don’t have enough fingers and toes to count the number of times I’ve had this conversation – but in truth GDP is really good at measuring what it is supposed to measure … the problem is that people keep using it as a measure of something else.
But it is hardly the individual’s fault that they have come to this conclusion. Decades of GDP fetishisation by policy makers combined with economists who spend more time talking about (and in the extreme teaching) the shortcomings of GDP than actually teaching what GDP is supposed to measure has provided this great rule of thumb that people follow to understand what is going on.
So in this post let me do something novel – let me stand up for GDP.
What is GDP
Say that you were sitting around doing your nails and the government suddenly contacted you. They said “Dear TVHE reader, we really want to know the aggregate final production of goods and services that is taking place in the nation, can you help us”.
As you are the type of person who reads this blog, you are also the type of person that would know how to answer that question – you would tell them to go onto that statistics agency website and download the latest GDP data.
GDP is a measure of the market value of production in final goods and services that takes place in a period of time.
Ok, why are we using market value – is it some type of conspiracy? No. We use market value as a way of aggregating things that are otherwise incomparable – when thinking about production it is pretty hard to add up a chair and a haircut. By using the market price we are also incorporating the idea of the full set of resources used to construct it – if factor prices do not change, then this price tells us how much of one good could be translated into another. Of course, factor prices do change if we change relative demands so this isn’t a thought experiment we should push too far in any case.
Why do we care then? You know that circular flow we are all taught in stage one economics, the one that people keep making fun of saying we need to “include an ecosystem” or “include the value of social connections” or “make it into a doughnut”, it is those FLOWS that GDP is measuring.
I’m not saying that including other things is bad – they’re cool especially when there are feedbacks between these systems. Such modelling is completely legitimate (although lets note that it also requires measurement, not just pictures). What I am saying is that GDP is an aggregate that describes a very specific system. It measures that very specific flow, as it was that flow at a point in time that Kuznets was aiming to measure. As in the circular flow, GDP is measuring the monetary flow associated with the real flow of the production of goods and services (expenditure GDP) or the monetary flow associated with the real flow of factors of production used to make goods and services (income GDP) – in so far as we care about measuring this flow, and a question that deals with this flow in isolation, it is a brilliant measure!
Now if you are about to launch into a discussion of the fact that Kuznets was discussing National Income and that GNP was the original measure not GDP that is all good [great posts on it here and here] – I don’t disagree, and in a country like New Zealand the differences between these measures matter when truly considering access to resources, hence why trends in GNDI are what are often recommended by NZ economists. But these measures are all looking for the same core set of concepts – the market value of production (by national borders or national citizens) for a given nation – it is just that GDP does it most directly for the production flow.
What is it (GDP) good for?
Well GDP is good at measuring that flow we just talked about – or measuring the market value of production within national borders at a point in time. Why would we care about this measure?
GDP is surprisingly nifty for these things … as long as we don’t always start with the unconditional prior “more GDPs are good”. We need to ask “what is the trade-off that creates this GDP”. Furthermore, even if no other observed variables change when GDP goes up this still doesn’t imply rising GDP is good (note no observed changes, so higher GDP implies higher income for everyone) as it may be the result of unmeasured non-market activities being measured (eg illegal drug sales, raising of children) or market activities with an unmeasured negative externality.
So just enjoy it for what it is!
What about those shortcomings!
We can split the short comings in two – the shortcomings for measuring what it is supposed to, and the shortcomings associated with being a measure of “progress” or “wellbeing”.
As a measure of production aggregation is problematic. If a table is $50 and a TV is $200 our nation is unlikely to be able to say “I will stop building 40,000 tables to build 10,000 TVs”. However, it offers the best way I can think of to aggregate output at a point in time.
Furthermore, unmeasured output and externalities are missed. This is mostly problematic when we want to compare over time and the coverage of GDP or social environment has changed. For example the rise of the second earner has seen more household services provided by the market sector, increasing GDP although no more is really being produced. However, if we want to talk about it in terms of “consumption value” the externality issue becomes key.
Now what happens when we try to talk about GDP in terms of wellbeing. Ummm well GDP measures the flow of production created in national borders. If we were a factory owner that would be fairly useful, but also limited – but we are an entire nation in this discussion!
If we make more things then this is an intermediate target towards some true goal – life satisfaction. Now economists tend to frame all of these as “consumption opportunities”, but we aren’t talking about consuming more physical things per se – it is all different ways of an individual spending their limited time that give them satisfaction. This is complex both philosophically (what is wellbeing), practically (how do we influence wellbeing), and in terms of measurement (how do we measure wellbeing – pro-tip we can’t directly) so we use these intermediate values as heuristics for some underlying whole.
People want some set of consumption opportunities, and they always do want more (non-satiation). However, the value they receive depends on what others have, what they have, and it is far from just measured in terms of GDP – especially when things like community spirit are part of this consumption!
GDP measures a sub set of the consumption opportunities of individuals – yes more GDP with no change in other opportunities is great, I would appreciate a world where I had a better can opener or another pair of socks. But this alone is so far from the final set of consumption opportunities that greet me on a daily basis with all my choices (“if I do this how will it influence my relationship with my boss”, “should I go to this party to engage with this group, or stay home and spend mental health time with my toy dog”) for me to view it as a sole measure of wellbeing!
Note: This isn’t me accepting that policy “should” target some mythical goal of well-being … we are going to get back to this in the future, as there is a lot of ground to cover before we can get there.
We need wellbeing measures damnit!
Now there is your issue – you want an aggregate to tell you the world is getting better. The world is more complicated than that, with the way we value things that are not clearly comparable the biggest issue (eg comparing the wage in your job to the value of a second blanket on a hospital bed).
Yes, economists can lean towards markets to provide this information (given the way people make choices when facing trade-offs). But even this preference doesn’t lead to economists saying GDP, as a measure of market activity, measures anything like well-being.
The fact GDP doesn’t do this isn’t GDP’s fault – it is your fault for asking it to do this! It is policy analysts and politicians fault for focusing on the lazy way of describing progress. It is the public’s fault for not educating themselves on these measures. I realise people including myself have an incentive to avoid thinking, but that isn’t an excuse for it.
Honestly, if people want to describe what is going on and how it fits their own value system they have to do a lot of work – no-one can add it all up for you as only you know your values!!
Take the World Economic Forum, an organisation packed with economists who are infinitely smarter and more successful than I’ll ever be:
Is our love-affair with GDP coming to an end? As the business landscape reinvents itself, demographics shift, inequality expands, climate change gets worse and technology continues to advance at breakneck speed, Gross Domestic Product is struggling to stay relevant. In order to keep up with the changes wrought by the Fourth Industrial Revolution, many are arguing that we need to find a new measure to assess the health of our economies and – more importantly – the people living in them.
This is followed by the selection of five “indicators” which are, somehow, supposed to give an “alternative”.
Can no-one else see how utterly incoherent this framework is. GDP is a single measure of a single thing. Values are complicated. As a result, our solution is to pick random things that we can add up (let’s not pretend that they aren’t hinting we just add the five measures together with equal weights in that post – not saying it directly doesn’t change hinting it hard) so we feel like we have a measure of a complicated issue …
If you want to work out whether society is “progressing” you have to do work with value judgments. If you want to answer a smaller question you have to do the same.
Look it is about policy, we use GDP to tell if policies are good now and we shouldn’t
Hey ok, that is a good way of reframing the debate. So you are admitting GDP is fine, but it has excessive importance when we have policy debates as other things we value are not measured as well or as publicly.
Wait a second, what happens if we do measure those other things well and just describe the trade-offs when faced with an actual policy question?
Wait another second, what do these alternative measures we want to replace GDP with actually measure? A dashboard of incomparable statistics doesn’t measure anything let alone tell us anything?! We need a policy question and a way of describing trade-offs between outcomes!
Wait a third of these seconds, I don’t actually think I hear real life people talk about the consequences of policies on GDP or funky looking aggregate measures that much during an election – they will talk about take home incomes, often keeping GDP fixed. They will talk about investment in health and education because we see those as things we desire. They talk about jobs because people want to have the opportunity to work.
I find when I talk to non-economists/non-policy people they articulate their values in terms of these consequences, income, jobs, quality of life – and policy arguments based on those outcomes matter. They are interested in the distribution of outcomes, winners and losers, and they want quick ways for this information to be articulated. They realise not everything they care about can be cleanly quantified, and demand to be told about things they care about that we cannot measure as well.
Given this it isn’t clear that GDP is the issue, but instead people who aren’t able to persuade the public to agree with what they want have decided they need to create esoteric measures of “wellbeing” to convince others to agree with them 
I love the idea of measuring things better and giving more information by doing so. I love the idea that the public looks at more than just one measure to understand the society and economy around them. But GDP is actually one of the measures that fits within this concept, when many of the arbitrary aggregates being create to “replace it” – which don’t clearly measure anything – don’t.
In truth asking if we have “progressed” is subjective to the point where telling people we have measured it with an aggregate is misleading/misinformation. If someone has a question about specific policy aims (is there less crime, are we wealthier, what is happening with social activities) we should be able to pull together measures to give them an idea of that outcome – and that’s that.
Ramming together a measure to tell people if “the world is better” at a single viewing isn’t helpful – it is merely making a series of hidden value statements and telling people “believe them, it is data”. If you think that is persuasive, useful for the public, and useful for good policy making then I must say that I am far from convinced of your position so comment below 
Before you get angry at me
First off you may be frustrated by the way GDP has been used and obsessed with my policy makers and politicians. And you know what, hell yea I agree with you completely here – the “moar GDP” justification for policy always wound me up too! But I’m saying that maybe there is a lesson that single measures of progress are inherently flawed, and that we have a measure for a specific question – rather than this being a suggestion that GDP is rubbish and we just need some new measure to save the world.
You may have an alternative measure that you believe fairly represents a different question, say about social progress. Or a dashboard that you think articulates the varying outcomes society values clearly without imposing unnecessary value judgments on them. If that is the case link it below and make a little case for it in comments – as that sounds really cool and I’d love to learn about it.
I was just making an argument for GDP and for current data against the idea of an arbitrary dashboard such as that used by WEF – I have no doubt there are much better efforts out there that articulate important information clearly. I am open to your persuasion 
Of course, you know me. I find the “scoreboard format” a bit naff – making these things into strict “targets” can be a bit misleading. Furthermore, the report embodies a set of value judgments – and these just may not be the moral judgments of the public.
Here are the notes I quickly tied together when I found I would have to speak on the report. The notes have very little to do with anything I said – as they involved discussing and critiquing the report, when the interview ended up being a defence of the focus of the report.
I am not sure they are particularly useful, and would suggest reading the report yourself if you have a bit of time – however, it may serve as a marker of things to keep an eye out for when reading 
The purpose of studying economics is not to acquire a set of ready-made answers to economic questions, but to learn how to avoid being deceived by economists.

And in 2008 I made the same point I was about the make now:
Training to be an economist does not tell you the answer to any economic question – it gives you the tools with which to determine answers for yourself, given your own set of value judgments.
Although I would note “any” is a bit strong, it really depends what the question is
. So why was I going to post this quote again?
The reason I was going to post this is because I’m currently in the throws of reading a significant amount of ‘normative’ economics, specifically on issues such as household income distribution. In some (not the majority – but a significant minority) of these papers the authors will state we need value judgments in order to discuss these issues (true) and they then use that as an excuse to throw ANY value judgment out there as justified, often without an indication of the limitations of the assumption – and as a result, in a non-transparent way. This type of rhetorical sleight of hand saddens me greatly.
Studying economics allows you to spot when people are doing this, so you can ask yourself whether you agree, and then ask them why they hold that judgment. We can have rational discussion about value judgments, these things are not fixed for all of time within an individual or invariant to evidence.
In my ideal world we don’t study economics to win, or persuade others, on the role of policy. We study economics to help evaluate, understand, and justify both policy views, and views of social organisation that are of interest to all. We should appreciate that reasonable people may hold different value judgments, and thereby hold different views on what is ‘right’ or ‘good’ – that is information that helps to inform a public without the time and the inclination to every little element of policy. These judgments should be tempered by empirical data – but I avoid the term empirical ‘fact’ as we can often confuse the data with our interpretation of it. Our interpretation requires we put theory and beliefs into the data – and if we are not clear on that, we will often overestimate the strength of our own results!
By making our own value judgments for a policy transparent we should be less dogmatic about policy – and as a result, when I see rhetorical sleight of hands such as the ones I’m hinting about in some of the literature I’ve read I feel uncomfortable.
Update: The economics blog ‘sex, drugs, and economics’ by Michael Cameron from the University of Waikato discusses these points as well.
]]>Social capital is associated with a host of desirable outcomes:
- There is more trust and there are more blood donations in towns with lots of civic associations.
- Voter turnout is higher, and financial markets work better (Guiso, Sapienza, and Zingales 2008).
A growing literature has pointed out that social capital can also have a ‘dark side’ (Field 2003):
- The Ku Klux Klan, drug-dealers and the mafia rely on social cohesion to ensure co-operation.
- Also, important recent work shows that civic associations can lead to the entrenchment of existing leaders, undermining the quality of governance (Acemoglu, Reed, and Robinson 2013).
Recently when I reviewed “the Spirit Level” I specifically called it out for its cursory treatment of status goods – and what exactly they are and how society behaves with them. Then when I wrote up this article on inequality (with extra comment on TVHE) I stated:
And of course the elephant in the room is how government policy choices have an impact on trust and co-operation. What drives these matters, and would a society that tries to more closely control means of production and income actually create greater trust and social cohesion?
Often in public debate at present it is “taken as given” that social capital and the behaviour associated with “in group behaviour” is always a good. I was uncomfortable with this – and the VOX-EU post articulates the concept with evidence better than I could!
Social capital has many elements, and it includes the idea of using heuristics of action and ideas that are used for making choice. Defining such things as ‘inherently good’ doesn’t seem to fit into any consequentialist frame of moral worth, and trying to make it a universal good in this sense appears inappropriate.
Now I am NOT saying that economists “know” this and other disciplines don’t – as that is a pile of horse doo doos. In the Spirit Level book (and I note it in my review), sociologists paint this as a concern. Furthermore, I wasn’t taught these ideas while studying economics – I specifically remember talking about this sort of social capital and in group behaviour as a “bad” thing in the race and racism course I did in the Victoria University History school.
Yes co-operation and communication in society are a great thing – but there is no magic “silver bullet” like cutting inequality that does this. And furthermore, this co-operation should NOT be at the expense of groups we decide to define as “others”. The purpose of government, sentience, and civilised society is to move us past ‘prisoner’s dilemmas‘ that involve describing and fighting other groups – not to reinforce them!!!
Note: I realise in saying all this, I’m wandering into a minefield of moral arguments that I’m probably failing on – in which case I would enjoy being appropriately torn apart. I would just note that many of the people making relatively steep arguments about the moral value of “social capital” are on ground that is at least as shaky 
). I’ve decided to ignore that, and to read the book on the merits it gave in the text. Given this how did I find it?
Warning, this is an exceptionally long post given that I am putting down brief notes on each chapter (the review is over 4k, the summaries just that again). I have signposted sections so if there is something you want to swing to (tl;dr, conclusion, general issues) you can swing to that. Also note that I link to an excessive number of old posts from the blog – essentially many of the ideas noted down over the last few years here will come in useful for reviewing the book. For that reason, I’ve put a pdf of my blahing here MN Spirit level review.
I have also explicitly avoided reading other people’s reviews – except for the postscript that I have written later. As a result, I can’t promise I’ve added anything, but I can promise that I’m being honest in my views 
Note: This review is also a blog post – not some external thing you just read, but something you can interact with. If I’m wrong on statistics, on logic, on interpreting the authors, or I’m just plain immoral as a person, throw down a comment and tell me. If you convince me, I add updates onto the review to say you’ve changed my mind – party times! I am here to steal all your knowledge.
Tl;dr review
I have a list of concerns – excluding anything about the “countries in the sample”:
Conclusion: Contrary to what they say, this book is not “science” (in the way they try to make it sound – I don’t want to fight about “what science is” as I’m not the one defining it for the book!), it is thoughtless “silver bullet” posturing dressed up as analysis.
To the full review!
To review the book, I really need to frame it as an argument.
Base book argument
This is the very base argument, but they do attempt to give it some structure. Specifically they go along the lines:
The mixture of a static figure and a dynamic figure is a bit vexing. If I want to be generous (which I do) I would state they mean the following argument in the book, even though they don’t frame it this way due to a lack of time series data (supposedly).
This hypothesis is not just reasonable. It is akin to the way economists view how the social preference for “optimal levels of inequality” will function. I can get behind that (to a degree – not as an absolute). As a result, we need to ask whether the “scientific” analysis they promise they are giving us helps to show that.
Quick point on science: I’m down with the Popperian view that we can’t prove hypotheses, only reject them – and I’m down with the fact that in social science these come conditioned with protective sets of ceteris paribus assumptions. So I don’t expect this to prove anything conclusively. However, the way the arguments are treated has a distinct bearing on how much we update our beliefs around the likelihood the argument may be true!
Now I have severe problems with how they have shown this, and with how they described it. You may look at me, look at my job, and decided that this is due to my vested interests – I am an economist after all.
But before you judge what I am going to say on those credentials – it is important to note the large hippy streak I inherently hold. Specifically, I am a long-term proponent of land tax based on the idea that all land is communally held, but that we offer long-term leases to get the advantage of property rights. I term land-tax a rent that provides a “social dividend” that should be shared among all people – inherently I also believe in a guaranteed minimum level of income as a result.
Furthermore, I titled this blog. I gave it the name “the visible hand” because I see the government as a driver of good – it is a mechanism that people in society use to help us co-ordinate and create the social structure we desire. Yes government can be corrosive as well, like other institutions, however they form a central part of our “social contract”.
On an ideological basis I am actually, in some ways, in the same camp that Wilkinson and Pickett are – a normative base I try to avoid thrusting down the throats of people that read the blog, where I try to instead force the idea of “trade-offs”.
And it is in this regard that the Spirit Level, with its disregard for the literature and data on inequality, and the poor data analysis involved, leaves me cold. Yes I personally believe in redistribution, and yes I believe that trust, honesty, and social capital are central parts of society. But the way the book tries to force all this into “income inequality” is not just flawed, it is likely to lead to more middle-class welfare rather than a real focus on the poor.
They are determined to describe a “free lunch” stemming from “magically lowering inequality somehow” – but without more structure they are on very dangerous ground!
Issues with the data
It would be obvious for me to say there are severe data issue here – and I’m going to try to avoid going heavily down this path.
The first point to raise is that a lot of people have talked about the size of the sample, and the fact countries have been excluded that would influence the result. I haven’t been able to avoid hearing that point. I am going to ignore it.
They get points for sticking to the same series on countries the whole way through – and I am going to ASSUME this was the right set. This could well be wrong (many feel it is) but it will have no bearing on my review.
So what data issues did I have:
In truth this is a central criticism of the book – and I should have checked and put it in here, I’m sorry.General issues
These issues are other concerns I ran into:
“Second, it is a fundamental methodological principle of epidemiological analysis that you should not control for factors which form part of the causal chain – in this case explaining how inequality causes a particular problem. For example, if we think – as we do – that societies with greater income inequality have worse health because poorer social relations increase chronic stress, then we would need to be cautious about how to analyse that particular causal sequence. Fortunately, there are products like CBD Oil UK to counter that stress. Simply including measures of trust and social cohesion in a statistical model could remove the association between income inequality and health, even though it is likely that inequality actually leads to poorer health because it is socially divisive.”
. The “true model” they keep describing is between social capital and trust ideas and social ills – so this should be what they include. If your argument is that inequality is causing the change in social capital, show that separately (Note how this increases the importance of “through time” analysis for the argument!). If you put both in, and inequality isn’t significant it is because inequality only has an impact “through” the social channel – you don’t avoid doing the regression in that case, you just interpret the data correctly
. I will be honest here, this is a horrible error – data limitations make this hard, but it should do a LOT more to reduce their confidence in their results. Univariate analysis on limited data doesn’t make a “strong scientific case” no matter how many times they say it in the book.
. Essentially, the relationship between inequality and these outcomes is not invariant to the policies we use to get there. Think of it this way, the policy suggestions in this book are based on the first “argument” I have at the start (nom static income inequality to solve problems) – but the “structure” is based on the third one (lifetime inequalities based on certain relationships can influence trust and social structures, and we should try to ensure policy changes are consistent with their impact on these things using lifetime inequalities as a lens to understand them). We have an indirect reduced form relationship between these structural causes and static inequality – but unless the policies we use to target static inequality are the same ones we would use to target the structural model, we aren’t going to get the results we expect when we change policy settings. Note: Read this as “not all ways of changing inequality are created equally!

Oi, so what else can shift inequality that we should be looking at!
The book mentions two causes – changes in technology and skill, and changes in social norms and government policy. It says the first is exaggerated, and the second is what it is all on! They only do this briefly, surprisingly.
Now there is a pile of work going on at the moment with people breaking all this shiz down. The two things that come to mind this second are “Income inequality in the United States, 1913-1998” by Piketty and Saez (2003) and “Levels and Trends in United States Income and Its Distribution A Crosswalk from Market Income Towards a Comprehensive Haig-Simons Income Approach” Burkhauser and Larrimore (2013). I’ve put down two articles that come to different conclusions – and my impression is more of the literature is currently behind Piketty-Saez (although the Burkhauser-Larrimore paper is new).
That is lovely and all, but one of the key things is that when we start breaking down the income distribution into causes we start seeing that there are many more causes – population aging, household structure, changes in relative prices (such that the change in nominal income distributions may not actually mean different real purchasing powers), changes in tax, benefit and labour market policies, changes in other policies (macroeconomies are filled with unforeseeable distributional consequences). This is in addition to changes in skills, education, broad technology (both globally shared and nation specific), and demand for skills types.
Making the argument sound like there can only be two causes of changes in the inequality distribution allows them to exaggerate the policy relevance of what they are saying. If they actually had to go to the source, trust, status competition, and social capital the “solutions” would be as easy (although they would have a lot more validity). This is a FUNDAMENTAL methodological issue with what they are doing – reading this entire book, and seeing how they talked about inequality, it just sounds like they don’t understand the variable they want society to control. This is damning.
Alternate hypotheses:
Even if the fundamental result was robust – we still have an issue of causation! Here is a non-exhaustive list of alternative hypotheses that they do not cover off in the book, including in the post-script where they cover off other alternative hypotheses.

To be honest, I find many of these alternatives more compelling than inequality stuff did it – but testing any of them in a satisfactory manner, including inequality, is danged difficult. Given this, I am amazed at the confidence they have in their own results …
Conclusion: My concerns given the hypothesis
How is this result being interpreted? Just the way they want it to be – that changing inequality has a mystical magical impact on these social outcomes. Not only has this not been shown, but it merely leads people to stop thinking about the hard issues and trade-offs in order to try to push a button that fixes everything.
It just so happen that this button involves more transfers to the middle class (hello directors law) and less focus on the specific issues that exist in the health and education system. Now, the people that really get disadvantaged from this change in focus are the poor – and the people that win are the middle class people who received income transfers as a way of helping them deal with the stress of other people being richer than them.
Let that sink in, this is what the book is fundamentally saying. It is a non-scientific book (the number of data and statistical concerns indicate that) that supports transferring resources and policy effort towards the middle classes.
It is not an objective analysis of the way we build up trust and cooperation in society – which is unsurprising as that would be a very difficult thing to do a full analysis of. However, it pretends it is doing that – instead of placing its flag in the ground and admitting the trade-offs involved with the specific program of redistribution they are pushing they want to make they pretend there are none. Lovely.
Now do not get me wrong – many of the policies that we may decide to put in place to help groups will knock down inequality. The idea that, as a community we should try to build up trust and social capital has definite merit. The idea that progressive tax and transfer policy, that represents the social contract, can help in this end is also acceptable. But this book doesn’t show that – instead it gives a misleading idea that targeting one variable will create great happiness, the sort of blind ideology that leads to social harm.
My postscript – reading what others have said
I still haven’t read anything – and I’m pretty tired from putting together this review! I will check other reviewers and write about what they have said in a separate piece in a couple of weeks. However, here are some links that I have been sent:
Does anyone have pieces supporting their research – more stuff against them is fine, but I’d like to also see work challenging some of the concerns I’ve run into here.
Appendix chapter summaries
Here I will give you the brief notes of the chapters, and comments I had while reading those bits. This will hopefully add to what I have discussed in the review, rather than contradict it 
Prelude
Chapter 1
Chapter 2

Chapter 3
Chapter 4
Chapter 5
Chapter 6
Chapter 7

Chapter 8

Chapter 9
Chapter 10
Chapter 11
Chapter 12

Chapter 13

Alternative hypotheses:
Ok, that was it. Onwards
Alternative hypo again
Chapter 14

Chapter 15


Chapter 16
Postscript

I have heard a bunch of criticism of the Spirit Level book based on its “sample” and “cheery picking” (Note: This started life as a typo, but I liked it and used it throughout
). I’ve decided to ignore that, and to read the book on the merits it gave in the text. Given this how did I find it?
Warning, this is an exceptionally long post given that I am putting down brief notes on each chapter (the review is over 4k, the summaries just that again). I have signposted sections so if there is something you want to swing to (tl;dr, conclusion, general issues) you can swing to that. Also note that I link to an excessive number of old posts from the blog – essentially many of the ideas noted down over the last few years here will come in useful for reviewing the book. For that reason, I’ve put a pdf of my blahing here MN Spirit level review.
I have also explicitly avoided reading other people’s reviews – except for the postscript that I have written later. As a result, I can’t promise I’ve added anything, but I can promise that I’m being honest in my views 
Note: This review is also a blog post – not some external thing you just read, but something you can interact with. If I’m wrong on statistics, on logic, on interpreting the authors, or I’m just plain immoral as a person, throw down a comment and tell me. If you convince me, I add updates onto the review to say you’ve changed my mind – party times! I am here to steal all your knowledge.
Tl;dr review
I have a list of concerns – excluding anything about the “countries in the sample”:
Conclusion: Contrary to what they say, this book is not “science” (in the way they try to make it sound – I don’t want to fight about “what science is” as I’m not the one defining it for the book!), it is thoughtless “silver bullet” posturing dressed up as analysis.
To the full review!
To review the book, I really need to frame it as an argument.
Base book argument
This is the very base argument, but they do attempt to give it some structure. Specifically they go along the lines:
The mixture of a static figure and a dynamic figure is a bit vexing. If I want to be generous (which I do) I would state they mean the following argument in the book, even though they don’t frame it this way due to a lack of time series data (supposedly).
This hypothesis is not just reasonable. It is akin to the way economists view how the social preference for “optimal levels of inequality” will function. I can get behind that (to a degree – not as an absolute). As a result, we need to ask whether the “scientific” analysis they promise they are giving us helps to show that.
Quick point on science: I’m down with the Popperian view that we can’t prove hypotheses, only reject them – and I’m down with the fact that in social science these come conditioned with protective sets of ceteris paribus assumptions. So I don’t expect this to prove anything conclusively. However, the way the arguments are treated has a distinct bearing on how much we update our beliefs around the likelihood the argument may be true!
Now I have severe problems with how they have shown this, and with how they described it. You may look at me, look at my job, and decided that this is due to my vested interests – I am an economist after all.
But before you judge what I am going to say on those credentials – it is important to note the large hippy streak I inherently hold. Specifically, I am a long-term proponent of land tax based on the idea that all land is communally held, but that we offer long-term leases to get the advantage of property rights. I term land-tax a rent that provides a “social dividend” that should be shared among all people – inherently I also believe in a guaranteed minimum level of income as a result.
Furthermore, I titled this blog. I gave it the name “the visible hand” because I see the government as a driver of good – it is a mechanism that people in society use to help us co-ordinate and create the social structure we desire. Yes government can be corrosive as well, like other institutions, however they form a central part of our “social contract”.
On an ideological basis I am actually, in some ways, in the same camp that Wilkinson and Pickett are – a normative base I try to avoid thrusting down the throats of people that read the blog, where I try to instead force the idea of “trade-offs”.
And it is in this regard that the Spirit Level, with its disregard for the literature and data on inequality, and the poor data analysis involved, leaves me cold. Yes I personally believe in redistribution, and yes I believe that trust, honesty, and social capital are central parts of society. But the way the book tries to force all this into “income inequality” is not just flawed, it is likely to lead to more middle-class welfare rather than a real focus on the poor.
They are determined to describe a “free lunch” stemming from “magically lowering inequality somehow” – but without more structure they are on very dangerous ground!
Issues with the data
It would be obvious for me to say there are severe data issue here – and I’m going to try to avoid going heavily down this path.
The first point to raise is that a lot of people have talked about the size of the sample, and the fact countries have been excluded that would influence the result. I haven’t been able to avoid hearing that point. I am going to ignore it.
They get points for sticking to the same series on countries the whole way through – and I am going to ASSUME this was the right set. This could well be wrong (many feel it is) but it will have no bearing on my review.
So what data issues did I have:
In truth this is a central criticism of the book – and I should have checked and put it in here, I’m sorry.General issues
These issues are other concerns I ran into:
“Second, it is a fundamental methodological principle of epidemiological analysis that you should not control for factors which form part of the causal chain – in this case explaining how inequality causes a particular problem. For example, if we think – as we do – that societies with greater income inequality have worse health because poorer social relations increase chronic stress, then we would need to be cautious about how to analyse that particular causal sequence. Fortunately, there are products like CBD Oil UK to counter that stress. Simply including measures of trust and social cohesion in a statistical model could remove the association between income inequality and health, even though it is likely that inequality actually leads to poorer health because it is socially divisive.”
. The “true model” they keep describing is between social capital and trust ideas and social ills – so this should be what they include. If your argument is that inequality is causing the change in social capital, show that separately (Note how this increases the importance of “through time” analysis for the argument!). If you put both in, and inequality isn’t significant it is because inequality only has an impact “through” the social channel – you don’t avoid doing the regression in that case, you just interpret the data correctly
. I will be honest here, this is a horrible error – data limitations make this hard, but it should do a LOT more to reduce their confidence in their results. Univariate analysis on limited data doesn’t make a “strong scientific case” no matter how many times they say it in the book.
. Essentially, the relationship between inequality and these outcomes is not invariant to the policies we use to get there. Think of it this way, the policy suggestions in this book are based on the first “argument” I have at the start (nom static income inequality to solve problems) – but the “structure” is based on the third one (lifetime inequalities based on certain relationships can influence trust and social structures, and we should try to ensure policy changes are consistent with their impact on these things using lifetime inequalities as a lens to understand them). We have an indirect reduced form relationship between these structural causes and static inequality – but unless the policies we use to target static inequality are the same ones we would use to target the structural model, we aren’t going to get the results we expect when we change policy settings. Note: Read this as “not all ways of changing inequality are created equally!

Oi, so what else can shift inequality that we should be looking at!
The book mentions two causes – changes in technology and skill, and changes in social norms and government policy. It says the first is exaggerated, and the second is what it is all on! They only do this briefly, surprisingly.
Now there is a pile of work going on at the moment with people breaking all this shiz down. The two things that come to mind this second are “Income inequality in the United States, 1913-1998” by Piketty and Saez (2003) and “Levels and Trends in United States Income and Its Distribution A Crosswalk from Market Income Towards a Comprehensive Haig-Simons Income Approach” Burkhauser and Larrimore (2013). I’ve put down two articles that come to different conclusions – and my impression is more of the literature is currently behind Piketty-Saez (although the Burkhauser-Larrimore paper is new).
That is lovely and all, but one of the key things is that when we start breaking down the income distribution into causes we start seeing that there are many more causes – population aging, household structure, changes in relative prices (such that the change in nominal income distributions may not actually mean different real purchasing powers), changes in tax, benefit and labour market policies, changes in other policies (macroeconomies are filled with unforeseeable distributional consequences). This is in addition to changes in skills, education, broad technology (both globally shared and nation specific), and demand for skills types.
Making the argument sound like there can only be two causes of changes in the inequality distribution allows them to exaggerate the policy relevance of what they are saying. If they actually had to go to the source, trust, status competition, and social capital the “solutions” would be as easy (although they would have a lot more validity). This is a FUNDAMENTAL methodological issue with what they are doing – reading this entire book, and seeing how they talked about inequality, it just sounds like they don’t understand the variable they want society to control. This is damning.
Alternate hypotheses:
Even if the fundamental result was robust – we still have an issue of causation! Here is a non-exhaustive list of alternative hypotheses that they do not cover off in the book, including in the post-script where they cover off other alternative hypotheses.

To be honest, I find many of these alternatives more compelling than inequality stuff did it – but testing any of them in a satisfactory manner, including inequality, is danged difficult. Given this, I am amazed at the confidence they have in their own results …
Conclusion: My concerns given the hypothesis
How is this result being interpreted? Just the way they want it to be – that changing inequality has a mystical magical impact on these social outcomes. Not only has this not been shown, but it merely leads people to stop thinking about the hard issues and trade-offs in order to try to push a button that fixes everything.
It just so happen that this button involves more transfers to the middle class (hello directors law) and less focus on the specific issues that exist in the health and education system. Now, the people that really get disadvantaged from this change in focus are the poor – and the people that win are the middle class people who received income transfers as a way of helping them deal with the stress of other people being richer than them.
Let that sink in, this is what the book is fundamentally saying. It is a non-scientific book (the number of data and statistical concerns indicate that) that supports transferring resources and policy effort towards the middle classes.
It is not an objective analysis of the way we build up trust and cooperation in society – which is unsurprising as that would be a very difficult thing to do a full analysis of. However, it pretends it is doing that – instead of placing its flag in the ground and admitting the trade-offs involved with the specific program of redistribution they are pushing they want to make they pretend there are none. Lovely.
Now do not get me wrong – many of the policies that we may decide to put in place to help groups will knock down inequality. The idea that, as a community we should try to build up trust and social capital has definite merit. The idea that progressive tax and transfer policy, that represents the social contract, can help in this end is also acceptable. But this book doesn’t show that – instead it gives a misleading idea that targeting one variable will create great happiness, the sort of blind ideology that leads to social harm.
My postscript – reading what others have said
I still haven’t read anything – and I’m pretty tired from putting together this review! I will check other reviewers and write about what they have said in a separate piece in a couple of weeks. However, here are some links that I have been sent:
Does anyone have pieces supporting their research – more stuff against them is fine, but I’d like to also see work challenging some of the concerns I’ve run into here.
Appendix chapter summaries
Here I will give you the brief notes of the chapters, and comments I had while reading those bits. This will hopefully add to what I have discussed in the review, rather than contradict it 
Prelude
Chapter 1
Chapter 2

Chapter 3
Chapter 4
Chapter 5
Chapter 6
Chapter 7

Chapter 8

Chapter 9
Chapter 10
Chapter 11
Chapter 12

Chapter 13

Alternative hypotheses:
Ok, that was it. Onwards
Alternative hypo again
Chapter 14

Chapter 15


Chapter 16
Postscript

After all, why do we have such a problem with it. If elite athletes decide to take drugs and do really cool stuff that we can watch in our own homes isn’t this all good.
People say stuff like “it isn’t in the spirt of the game” or “they are supposed to be role models for kids” but this sounds like a bunch of blah – we want our elite athletes fighting in cages with robot body parts and hyped up on all sorts of drugs right!
Well let’s assume we do – just for fun. Even in this case there is a clear reason why we may want to try banning drugs – the prisoner’s dilemma faced by athletes.
Assume there is some inherent ranking of athletes, but that the difference in ability from bottom to top in some set is 20%. Then assume there is a drug that will boost performance by 21%. Then the “worst” athlete could take the drug and be the winner! Of course, all the athletes up to the “second-best” know they can improve their position by taking the drug – and they know, as does the “top” athlete, that if someone else is taking the drug the only way to maintain their position is to take the drug also. As a result, even though the drug taking has a cost (shorter life) it is a dominant strategy.
This is a bit loose, as it is not actually a dominant strategy for the worst athlete – as if everyone else is taking the drug it doesn’t change his position, he just has lower health, so won’t bother with it. But then we have the potential for a mixed strategy equilibrium with the second-to-worst athlete (as if the bottom athlete doesn’t SWA won’t want to, but if SWA doesn’t the worst athlete will want to, so they will both take drugs with some positive probablity less than 1) – and in the end we aren’t going to worry about this too much 
So say this prisoner’s dilemma holds – we can ensure that the same “outcome” in terms of ranking takes place, and that the athletes have better health than they would get taking drugs, if we just ban drugs and enforce it well. Drugs in this case are banned from the Olympics because we care about the health of athletes, isn’t that nice! This illustrates the useful role of regulators and government in the face of issues to do with co-ordination or prisoner’s dilemma types games!
Update: Now I’ve woken up, this story is dumb. Better game to get my point across – take two equal athletes that both have a 50% chance of winning. Taking a drug can make this 100% if the other person is not taking it, or 50% if the other person is, but there is some positive cost to taking it. In that case both athletes take the drug – both end up in the same position as not taking – and both have worse health than not taking. Why didn’t I just say that?
However, the idea that drug taking for the best athlete is a strategic complement (when someone else takes it increases the incentive for them to take), but for the worst it is a strategic substitute (when someone else takes it decreases the incentive for them to take), is pretty cool – that makes for a neat game to think about!
]]>New Zealand needs to get out of the “childish mentality” that wide-ranging benefit reforms which come into force today are punitive, Finance Minister Bill English says.
And also this via Kiwiblog:
Work and Income says the results are “some of the best from any case management trial” in recent years, with 6000 of the 10,000 people in the pilot no longer on a benefit. More than half of those people found work, the rest opted out or cancelled benefits for reasons such as no longer meeting eligibility requirements.
Hold up – hold the hang up. Opted out due in ineligibility? Don’t we mean they had their benefit canceled because they didn’t fit the new criteria? Nearly a third of beneficiaries were pushed off the benefit without employment income in other words – ummmm is that really a good result? Now note, this isn’t actually the case, in truth over that period of time some people would have become ineligible or found work irrespective of the benefit policy – I was just throwing this here to illustrate how they exaggerated the “benefit” of the policy, and how we could do the same thing to exaggerate the “cost”!
Look, all this makes sense if your priors are “the benefit is and hand up, not a hand out” and “get a job you hippies”. But we have two things other going on:
I find it incomprehensible to push people out of benefits in a situation where the labour market is distinctly weak. Also, in moral terms, I believe that society should offer people an outside option – there is something distinctly perverse about basing someones status and self-worth solely on our view of them as a “labour input”. Part of the reason for the benefit is to give workers an “outside option” to improve their bargaining position, it cannot be viewed independently of this.
Now a lot of people won’t agree with me on the second point, that’s all gravy I’m not the social planner. But when we have a very weak labour market these reforms do appear punitive – kicking people off benefits because they have taken drugs, because they are disenfranchised around spending a long time looking for work and being rejected, that is punitive. Look I 100% agree with some of the reforms – more targeted case management is nice (in line with trying to solve the “matching” problem in labour markets) – but mixing this in with punitive policies doesn’t stop them being punitive policies. Telling beneficiaries and those who feel uncomfortable with suddenly tightening requirements when the labour market is weak to stop being childish doesn’t change this interpretation either 
My views are coloured by the fact that I remember what happened during the early 1990s with benefit reform – I remember how policies were “interpreted” at the branch level, and how much changes in benefit eligibility created hardship in my community. But more generally I see government as a way society agrees to “co-operate” on some things, and I have an inherently different belief in the ways we should co-operate than some of these policies represent – if that makes me childish then I should spend more time talking to children about their views on the inherent social contract within society!
]]>I think it is pretty obvious to everyone what has to be done here … we need to legalise other drugs.
From what I can tell, K2 is inferior to other drugs for the person consuming them, and causes higher externalities than other drugs. The only reason people are choosing it is because other drugs have been made prohibitively expensive – by being made illegal.
You may say “no no, we should just ban it”. But isn’t the point that we really have a mental health issue here – people feel that they need to consume something to deal with the inane nature of life. If we keep banning the things we are consuming, we are just pushing many of these people towards other more harmful forms of consumption/addiction.
I’m not saying don’t help people who are facing these sorts of issues, and become addicted to substances as a way of dealing with things going on in their lives. What I am saying is that instead of criminalizing it, and making them criminals, we should perhaps look at it as a mental health issue and try to help them. Pretending we care, then choosing policy actions (criminalization) that actually hurt the individuals involved can’t even be called good intentions – it’s just sort of silly and harmful.
Legalise, tax, use the tax money to pay for treatment and to help fund general mental health work. At some level we all have mental health issues, just like we all have physical health issues. Lets destigmatize and actually accept these issues, allow actions that people take when they are trying to cope to be legal, but make sure that as part of our social security net we help people who are really struggling – in the same way we do when someone finds themselves out of work, or injures themselves physically in a work place accident. If you agree with me, you can fill out an online application that takes only 2 minutes to complete, asking our government to step up and help people battling mental illness.
]]>At the elite level, the difference between being a champion and an also-ran is so miniscule, and yet matters so much, that athletes are pressured to do whatever they can to gain the slightest edge over their competitors. It is reasonable to suspect that gold medals now go not to those who are drug-free, but to those who most successfully refine their drug use for maximum enhancement without detection.
…
Julian Savulescu proposes that instead of trying to detect whether an athlete has taken drugs, we should focus on measurable indications of whether an athlete is risking his or her health. So, if an athlete has a dangerously high level of red blood cells as a result of taking erythropoietin (EPO), he or she should not be allowed to compete. The issue is the red blood cell count, not the means used to elevate it.
…
To those who say that this will give drug users an unfair advantage, Savulescu replies that now, without drugs, those with the best genes have an unfair advantage… Setting a maximum level of red blood cells actually levels the playing field by reducing the impact of the genetic lottery. Effort then becomes more important than having the right genes.
This discussion of the value of genes vs effort, and the morality of using drugs to level the genetic field, seems to have parallels with social redistribution. We begin unequal because of genetic differences. Through a system of taxes, subsidies, and other transfers, we redistribute the fruits of effort, genetic talent, and opportunity.
What is interesting is that most people think it is fair to redistribute some of the fruits of talent to the least lucky in society. That is why we have progressive systems of taxation, for instance. But, in the sporting arena, pure talent and opportunity is glorified and there are few serious movements to redistribute the lottery. That’s not to say there isn’t some action, but the call for more equal outcomes in sporting contests clearly not as strong as in economic contests.
]]>If all these things are so obvious then the question is, why haven’t they been dealt with already? And, “because nobody else is smart enough to see the solution” is unlikely to be the correct answer. More often, the person with the ‘solution’ just doesn’t understand the complexity of the problem and the reasons why things are as they stand.
Now, that isn’t a new insight, but I’m sure I’m not the only one who’s been bothered by the lack of a snappy name for it. After all, it isn’t a real proposition until you can name it after someone. So I was very happy to come across this article by Megan McArdle discussing Chesterton’s Gate. I’m not sure if the name is widespread, but i’m already a fan. Apparently GK Chesterton gave a fairly eloquent description of the problem:
]]>In the matter of reforming things, as distinct from deforming them, there is one plain and simple principle; a principle which will probably be called a paradox. There exists in such a case a certain institution or law; let us say, for the sake of simplicity, a fence or gate erected across a road. The more modern type of reformer goes gaily up to it and says, “I don’t see the use of this; let us clear it away.” To which the more intelligent type of reformer will do well to answer: “If you don’t see the use of it, I certainly won’t let you clear it away. Go away and think. Then, when you can come back and tell me that you do see the use of it, I may allow you to destroy it.”
This paradox rests on the most elementary common sense. The gate or fence did not grow there. It was not set up by somnambulists who built it in their sleep. It is highly improbable that it was put there by escaped lunatics who were for some reason loose in the street. Some person had some reason for thinking it would be a good thing for somebody. And until we know what the reason was, we really cannot judge whether the reason was reasonable. It is extremely probable that we have overlooked some whole aspect of the question, if something set up by human beings like ourselves seems to be entirely meaningless and mysterious. There are reformers who get over this difficulty by assuming that all their fathers were fools; but if that be so, we can only say that folly appears to be a hereditary disease. But the truth is that nobody has any business to destroy a social institution until he has really seen it as an historical institution. If he knows how it arose, and what purposes it was supposed to serve, he may really be able to say that they were bad purposes, that they have since become bad purposes, or that they are purposes which are no longer served. But if he simply stares at the thing as a senseless monstrosity that has somehow sprung up in his path, it is he and not the traditionalist who is suffering from an illusion.