Mark Thompson: The electoral system and the expenses scandal correlation

Friday 5 February 2010

The electoral system and the expenses scandal correlation - Legg update

This post is co-written by me and Will Straw, editor of Left Foot Forward.

Last May, Mark did a series of blogposts which analysed the data from the expenses scandal and suggested that there could be a link between the first-past-the-post electoral system and the likelihood of an MP being involved in the scandal. New evidence from the Legg report suggests this is the case.

As the graph below shows, the average majority of the worst offenders is higher than that of those expenses-sinners who wrongly claimed smaller amounts.

Mark’s first post was published on May 17 at which point The Telegraph had named 74 MPs. The second post (an update to the first one) was published the next day and the number of MPs implicated was 94. The methodology used was that if an MP had been mentioned negatively at all in The Telegraph’s coverage with relation to expenses then a 1 was put next to their name otherwise there was a 0. The groups of MPs were then split up into quartiles ordered by how safe their seat was. The initial analysis suggested that MPs in the safest seats were 3 times as likely to have been implicated in the scandal as those in the least safe. A further update a week or so later by which point the number of MPs implicated had almost quadrupled to 243 suggested that the figure had come down to a bit less than 2 times as likely.

With the release of the Legg report yesterday – possibly the most definitive word we will get on the scandal – we decided to have another look at this situation and whether there still seems to be a link. The main problem is that more than half of all MPs are implicated in the scandal. There is also a disparity between the lowest erroneous claims, from Mike Gapes in Ilford who was asked to pay back 40p, and some MPs who are paying back £30,000 to £40,000. For this reason, and given all the data we now have, the original methodology seemed a bit blunt. Using Mark’s original analysis, there is still an observable effect but it is now very small (there are 87 MPs in both the safest and second safest quartiles, 74 in the third and 80 in the safest) as the Chart below shows.

In the original Telegraph data it was not possible to be robust about the amount of money MPs were going to have to pay back. This is because the amounts were not always listed or were disputed. Now, however, we have all the amounts down to the very last penny.

We thought about introducing a cut-off point to isolate the worst expenses excesses. If you take only those MPs who claimed £5,000 or more then the ratio of 2-1 between the highest and lowest quartile is still there (the figures are 21 MPs from the safest quartile, 13 in the second, 11 in the third and 11 in the safest). The selection of the cut-off point though is arbitrary and hence probably not greatly robust.

So in order to try and find a better way to see if the safety of an MP’s seat could be correlated with the amount of expenses money claimed we listed the 328 MPs who (after appeals and adjustments) have been asked to pay money back and ordered them by size of payment. This way we are now taking into account this wide range of difference in amounts paid back and including all the implicated MPs.

We then split this data into quartiles and looked at the average size of the majority for the MPs in each quartile. What we found is that for MPs in the top quartile (including Barbara Follett, Andrew Mackay, and many of the most controversial claims) the average majority is 8,678. In the second quartile the average majority is 7,534. In the third it is 7,705. And in the lowest quartile (including people like Mike Gapes and his 40p) the average majority is 7,276. So there is a fair bit of difference here but there is another point to note. The average size of majority for all 328 MPs implicated is 7,798 (7,613 for all MPs). This means that the top quartile is quite a way above this average (by nearly 1,000 depending on from which point you measure it) and the bottom quartile is a fair way below it (by close to 500). The two middle quartiles are clustered near the average(s).

As with Mark’s original posts, there will likely be debate over what these figures tell us the degree of statistical significance, but we feel that, at the margin, they show that there is a link between the expenses scandal and the size of an MP’s majority. Of those MPs implicated, on average the safer their seat, the more they wrongly claimed.

Credit: Jessica Asato and Bhumi Purohit of Progress for compiling the data.

Also cross-posted here on Left Foot Forward.

UPDATE: For those interested in doing their own statistical analysis, the data is available here.

4 comments:

sanbikinoraion said...: It seems to me that a scatter-plot of individual claims/majorities could be instructive; presumably one could calculate a regression (is that the right word for it?) and come up with a correlation number between majority and expense claim size. *That* would be really interesting.; 5 February 2010 at 11:21
emily said...: I'll repeat what I wrote on LFF.
You're using the wrong measure for seat safety - the size of the majority means very little if you don't take into account what proportion of the electorate that is. In short you need to use Percentage swing versus some other variable (eg amount claimed/amount claimed as a percentage of total possible claim etc). I also have to agree with above poster that a scatter plot would then be a better way to represent the resulting graph. Why the quartiles?; 5 February 2010 at 15:10
Mark Thompson said...: san/hyena,

Not knowing a huge amount about stats I was not sure how to go about doing things like regression etc.

I split them into quartiles to try and maintain some consistency with the method used previously.

As for the measure of the swing needed being more pertinent than the absolute size of the majority, I am certainly open to that idea but can you clarify why that is better? I don't have time right now to work out or collate all the swings to redo this, however the data is in the public domain now (see update to the post above) so anyone can take it and run any tests they want on it. I hope they do and look forward to hearing what other people find.

Oh, and at the suggestion of someone else on the LFF comment thread I did take the majority and expenses data and quickly run it through a Spearman's Rank test online which suggested a "weak positive correlation".; 5 February 2010 at 15:24
Phil Walker said...: Mark: what hyena said. The reason is that the average constituency is 74,000 electors, but they vary from Eileann na-Thingummybob at 22,000 to the Isle of Wight at 110,000. You can see that a majority of 1,000 looks far more ropey in the Scottish Isles than on the one of the south coast of England! I can sort out that problem quite quickly, I think: the ONS has a dataset with electors per constituency at the last election.

Also, you ought to take into account the length of time an MP has been sitting in the House. MPs who arrived during the last Parliament have had less time to abuse the system, but (in principle; I don't expect this in practice) could have been every bit as bad. That would be the big job, and you'd have to cut it off with the duration of time Legg was considering: I don't believe he considered claims going back to the Eighties, did he?; 6 February 2010 at 09:57