Sabermetric Research: Income inequality and the Fed report

The New York Yankees are struggling. Why don't they sign Reggie Jackson? Sure, he's 68 years old, but he'd still be a productive hitter if the Yankees signed him today.

Why do I say that? Because if you look at the data, you'll see that players' production doesn't decline over time. In 1974, the Oakland A's hit .247. In 2013, they hit .254. Their hitting was just as good -- actually, better -- even thirty-nine years later!

So how can you argue that players don't age gracefully?

------

It's obvious what's wrong with that argument: the 2013 Oakland A's aren't the same players as the 1974 Oakland A's. The team got better, but the individual players got worse -- much, much worse. Comparing the two teams doesn't tell us anything at all about aging.

The problem is ridiculously easy to see here. But it's less obvious in most articles I've seen that discuss trends in income inequality, even though it's *exactly the same flaw*.

Recently, the US Federal Reserve ("The Fed") published their regular report on the country's income distribution (.pdf). Here's a New York Times article reporting on it, which says,

"For the most affluent 10 percent of American families, average incomes rose by 10 percent from 2010 to 2013."

Well, that's not right. The Fed didn't actually study how family income changed over time. Instead, they looked at one random sample of families in 2010, and a *different* random sample of families in 2013.

The confusion stems from how they gave the two groups the same name. Instead of "Oakland A's," they called them "Top 10 Percent". But those are different families in the two groups.

Take the top decile both years, and call it the "Washington R's." What the Fed report says is that the 2013 R's hit for an average 10 points higher than the 2010 R's. But that does NOT mean that the average 2010 R family gained 10 points. In fact, it's theoretically possible that the 2010 R's all got poorer, just like the 1974 Oakland A's all got worse.

In one sense, the effect is stronger in the Fed survey than in MLB. If you're a .320 hitter who drops to .260 while playing for the A's, Billy Beane might still keep you on the team. But if you're a member of the 2010 R's, but wind up earning only an middle-class wage in 2013, the Fed *must* demote you to the minor-league M's, because you're not allowed to stay on the R's unless you're still top 10 percent.

The Fed showed that the Rs, as a team, had a higher income in 2013 than 2010. The individual Rs? They might have improved, or they might have declined. There's no way of knowing from this data alone.

-----

So that quote from the New York Times is not justified. In fact, if even one family dropped out of the top decile from 2010 to 2013, you can prove, mathematically, that the statement must be false.

That has nothing to do with any other assumptions about wealth or inequality in general. It's true regardless, as a mathematical fact.

Could it just be bad wording on the part of the Fed and the Times, that they understand this but just said it wrong? I don't think so. It sure seems like the Times writer believes the numbers apply to individuals. For instance, he also wrote,

"There is growing evidence that inequality may be weighing on economic growth by keeping money disproportionately in the hands of those who already have so much they are less inclined to spend it."

The phrase "already have so much" implies the author thinks they're the same people, doesn't it? Change the context a bit. "Lottery winners picked up 10 percent higher jackpots in 2013 than 2010, keeping winnings disproportionately in the hands of those who already won so much."

That would be an absurd thing to say for someone who realizes that the jackpot winners of 2013 are not necessarily the same people as the jackpot winners of 2010.

Anyway, I shouldn't fault the Times writer too much ... he's just accepting the incorrect statements he found in the Fed paper.

And I don't think any of the misstatements are deliberate. I suspect that the Fed writers were sometimes careless in their phrasing, and sometimes genuinely thought that "team" declines/increases implied family declines/increases.

Still, some of the statements, in both places, are clearly not justified by the data and should not have made it into print.

------

I've read articles in the past that made a similar point, that individuals and families might be improving significantly, even though the data appears to give the impression that their group is falling behind.

It's not hard to think of an example of how that might be possible.

Imagine that everyone gets richer every year. During the boom, immigration grows the population by 25 percent every year, and the new arrivals all start at $10 per hour.

What happens?

(a) the lowest bottom 20 percent of every year earn the same amount; but
(b) everyone gets richer every year

That is: *everyone* is better off *every year*, even though the data may make it falsely appear that the poor are stagnating.

(Note: the words "rich" and "poor" are defined as "high wealth" and "low wealth," but in this post, I'm also going to [mis]use them to mean "high income" and "low income." It should be obvious from the context which one I mean.)

-------

Now, even if you agree with everything I've said so far, you could still have other reasons to be concerned about the Fed report. For me, the me, the most important fact is the discovery that 2013's poor (bottom quintile) have 8 percent less income than 2010's poor.

You can't conclude that any particular family dropped, but you *can* conclude that, even if they're different people, the bottom families of 2013 are worse off than the bottom families of 2010. That's real, and that's something you could certainly be concerned about.

But, many people, like the New York Times writer, aren't just concerned about the poorer families -- they worry about how "income inequality" compares them to the richer ones. They're uncomfortable with the growing distance between top and bottom, even in good times where the "rising tide" lifts everyone's income. For them, even if every individual is made better off, it's the inequality that bothers them, not the absolute levels of income, or even now fast overall income is growing. If the "Washington R's" gain 20 percent, but the "Oakland P's" gain only 5 percent ... for them, that's something to correct.

They might say something like,

"It's nice that the overall pie is growing, and it's nice that the "P's" are getting more money than they used to. But, still, every year, it seems like the high-income "team" is getting bigger increases than the low-income "team". There must be something wrong with a system where, years ago, the top-to-bottom ratio used to be 5-to-1, but now it's 10-to-1 or 15-to-1 or higher."

"Clearly, the rich are getting richer faster than the poor are getting richer. There must be something wrong with a system that benefits the rich so much while the poor don't keep up."

Rebutting that argument is the main point of this post. Here's what I'm going to try to convince you:

Even when the rich/poor ratio increases over time, that does NOT necessarily imply that the rich are getting more benefit than the poor.

That is: *even if inequality is a bad thing*, it could still be that the changes in the income distribution have benefited the poor more than the rich.

I can even go further: even if ALL the benefits of increased income go to the poor, it's STILL possible for the rich/poor inequality gap to grow. The government could freeze the income of every worker in the top half, and increase the income of every worker in the bottom half. And even after that, the rich/poor income gap might still be *higher*.

-------

It seems that can't be possible. If everyone's income grows at the same rate, the ratio has to stay the same, right? If rich to poor is $200K / $20K one year, and rich and poor both double equally, you get $400K / $40K, and the ratio of 10:1 doesn't change. Mathematically, R/P has to equal xR/xP.

So if benefits that are equal keep the ratio equal, benefits that favor the poor have to change the ratio in favor of the poor. No?

No, not necessarily. For instance:

Suppose that in 2017, the ratio between rich and poor is 1.25. In 2018, the ratio between rich and poor is 1.60. Pundits say, "this is because the system only benefited the rich!"

But it could be that the pundits have it 100% backwards, and the system actually only favored the poor.

How? Here's one way.

There are two groups, with equal numbers of people in each group. In 2017, everyone in the bottom group made $40K, and everyone in the top group made $50K. That's how the ratio between rich group and poor group was 1.25.

The government instituted a program to help the poor, the bottom group. Within a year, the income of the poor doubled, from $40K to $80K, while the top group stagnated at $50K.

So, in 2018, the richest half of the population earned $80K, and the poorest half earned $50K. That's how inequality increased, from 1.25 to 1.60, only from helping the poor!

------

What happened? How did our intuition go wrong? For the same reason as before: we didn't immediately realize that the groups were different people in different years. The 2017 rich aren't the same as the 2018 rich.

When the pundits argued "the system only benefited the rich," whom did they mean? The "old" 2017 rich, or the "new" 2018 rich? Without specifying, the statement is ambiguous. So ambiguous, in fact, that it almost has no meaning.

What really happened is that the system benefited the old poor, who happen to be the new rich. It failed to benefit the old rich, which happen to be new poor.

Inequality increased from 1.25 to 1.60, but it's meaningless to say the increase benefited the "rich". Which rich? Obviously, it didn't benefit the "old rich."

But, isn't it true to say that the increase benefited the new rich?

It's true, but it doesn't tell us much -- it's true by definition! In retrospect, ANY change will have benefited the "new rich" more than the "new poor." If you used to be relatively poor, but now you're relatively rich, you must have benefited more than average. So when you say increasing inequality favors the "new rich," you're really saying "increasing inequality favors those who benefited the most from increasing inequality."

These examples sound absurd, but they're exact illustrations of what's happening:

-- You have a program to help disadvantaged students go to medical school. Ten years later, you follow up, and they're all earning six-figure incomes as doctors. "Damn!" you say. "It turns out that in retrospect, we only helped the rich!"

-- Or, you do a study of people who won the lottery jackpot last year, and find that most of them are rich, in the top 5%. "Damn!" you say. "Lotteries are just a subsidy for the rich!"

-- Or, you do a study of people who were treated for cancer 10 years ago, and you find most of them are healthy. "Damn!" you say. We wasted cancer treatments on healthy patients!

It makes no sense at all to regret a sequence of events on the grounds that, in retrospect, it helped the people with better outcomes more than it helped the people with worse outcomes. Because, that's EVERY sequence of events!

If you want to complain that increasing inequality is disproportionately benefiting well-off people, that can make sense only if you mean it's those who were well off *before* the increase. But the Fed data doesn't give you any way of knowing whether that's true. It might be happening; it might not be happening. But the Fed data can't prove it either way.

----

Here's an example that's a little more realistic.

Suppose that in 2010, there are five income quintiles, where people earn $20K, $40K, $60K, $80K, and $100K, respectively. I'll call them "Poor," "Lower Class," "Middle Class," "Upper Class," and "Rich", for short. We'll measure inequality by the R/P ratio, which is 5 (100 divided by 20).

Using three representative people in each group, here's what the distribution looks like:

2010 group, 2010 income
------------------------
P L M U R
------------------------
20 40 60 80 100
20 40 60 80 100
20 40 60 80 100
------------------------
R/P ratio: 5

From 2010 to 2013, people's incomes change, for the usual reasons -- school, life events, luck, shocks to the economy, whatever. In each group, it turns out that one-third of people make double what they did before, one third experience no change, and one third see their incomes drop in half.

Overall, that means incomes have grown by 16.7%: the average of +100%, 0%, and -50%. Workers have 1/6 more income, overall. But the change gets spread unevenly, since life is unpredictable.

Here are the 2013 incomes, but still based on the 2010 grouping. The top row are the people who dropped, the middle row are the status quo, and the bottom row are the ones who doubled.

2010 group, 2010 income
------------------------
P L M U R
------------------------
10 20 30 40 50
20 40 60 80 100
40 80 120 160 200
------------------------
R/P ratio: 5

You can easily calculate that every 2010 group got, on average, the same 16.7% increase. So, since life treated the groups equally, the 2010 rich/2010 poor ratio is still 5. In chart form:

2010 group, % change 2010-2013
------------------------------
P L M U R
------------------------------
+17% +17% +17% +17% +17%

But the Fed doesn't have any of those numbers, because it doesn't know which 2010 group the 2013 earners fell into. It just takes the 2013 data, and mixes it into brand new groups based on 2013 income:

2013 group, 2013 income
-------------------------
P L M U R
-------------------------
10 30 40 80 120
20 40 50 80 160
20 40 60 100 200
-------------------------
R/P ratio: 9.6

What does the Fed find? Much more inequality in 2013 than in 2010. The ratio between rich and poor is 9.6 -- almost double what it was!

The Fed method will also see that the bottom three groups are earning less than the corresponding group earned three years previous. Only the top two groups, the "upper class" and "rich," are higher. Here are the changes between each new group and the corresponding old group:

Perceived change 2010-2013
--------------------------
P L M U R
--------------------------
-17% -8% -8% +8% +60%

If you don't think about what's going on, you might be alarmed. You might conclude that none of the economy's growth benefited the lowest 60 percent at all -- that all the benefits accrued to the well off!

But, that's not right: as we saw, the benefits accrued equally. And, as we saw, the "R" group ALWAYS has to be high, by definition, since it's selectively comprised of those who benefited the most!

In effect, comparing the 2010 sample to the 2013 sample is a subtle "cheat," creating an illusion that can be used (perhaps unwittingly) to falsely exaggerate the differences. When the poor improve their lot, the method moves them to another group, and winds up ignoring that they benefited.

For instance, when a $30K earner moves to $90K, a $90K earner moves to $120K, and a $120K earner drops to $30K, the Fed method makes it look like they all benefited equally, at zero. In reality, the "poor" gained and the "rich" declined -- the $30K earner grew 200%, the $90K earner grew 33%, and the $120K earner dropped by -75%.

No matter how you choose the numbers, as long as there is any movement between groups, the method will invariably overestimate how much the "rich" benefited, and underestimate how much the "poor" benefited. It never works the other way.

--------

One last example.

This time, let's institute a policy that does something special for the disadvantaged groups, to try to make society more equal. For everyone in the P and L group in 2010, we institute a program that will double their eventual 2013 income. Starting with the same 20/40/60/80/100 distribution for 2010, here's what we see after the 2013 doubling:

2010 group, 2013 income
-----------------------
P L M U R
-----------------------
20 40 30 40 50
40 80 60 80 100
80 160 120 160 200
-----------------------
R/P ratio: 2.5

Based on the 2010 classes, we've cut the rich/poor ratio in half! But, as usual, the Fed doesn't know the 2010 classes, so they sort the data this way:

2013 group, 2013 income
-----------------------
P L M U R
-----------------------
20 40 60 80 160
30 40 80 100 160
40 50 80 120 200
-----------------------
R/P ratio: 5.8

Inequality has jumped from 5.0 to 5.8. That's even after we made a very, very serious attempt to lower it, doubling the incomes of the previous poorest 40 percent of the population!

-------

There's an easy, obvious mathematical explanation of why this happens.

When you look at income inequality, you're basically looking at the variance of the income distribution. But, changes from year-to-year are not equal, so they have their own built-in variance.

If the changes in income are independent of where you started -- that is, if the system treats rich and poor equally, in terms of unpredictability -- then

var(next year) = var(this year) + var(changes)

Which means, as long as rich and poor are equal in how their incomes change, inequality HAS TO INCREASE.

Take 100 people, start them with perfect equality, $1 each.

Every day, they roll a pair of dice. They multiply their money by the amount of the roll, then divide by 7.

Obviously, on Day 2, equality disappears: some people will have $12/7, while others will have only $2/7. The third day, they'll be even more unequal. The fourth day, even more so. Eventually, some of them will be filthy, filthy rich, having more money than exists on the planet, while others will have trillionths of a dollar, or less.

That's just the arithmetic of variation. Increasing inequality is what happens naturally, not just in incomes, but in everything -- everything where things change independently of each other and independently over time.

What if you want to fight nature, and keep inequality from growing? You have to arrange for year-to-year changes to benefit the poor more than the rich. That effect has to be large -- as we saw earlier, doubling the income of the 40 poorest percent wasn't enough. (It was a contrived example, but, still, it sure *seemed* like it should have been enough!)

-----

How much do you have to tilt the playing field in favor of the poor? Thinking out loud, scrawling equations ... I didn't double-check, so try this yourself because I may have screwed up ... but here's what I got:

Without independence,

var(next year) = var(this year) + var(changes) + 2 cov(this year, changes)

Solving on the back of my envelope ... if I've done it right, using logarithm of income and some rough assumptions ... I get that the correlation between this year's income and the change to next year's income has to be around -0.25.

My scrawls say that if you're in the top 2.5% of income, your next-year change has to be in the bottom 30%. And if you're in the bottom 2.5%, your next-year change has to be in the top 30%.

That seems really tough to do. In a typical year that the economy grows normally, what percentage of incomes in the Fed survey would be lower than last year's? If it's 30 percent, then ... to keep inequality constant, just ONE of the things you need to do is make sure high-income people, on average, never earn more this year than last year.

You'd almost have to repeal compound interest!

------

I don't mean to imply that increasing inequality is *completely* just the result of normal variation. There are lots of other factors. Progressive taxation creates a small effect on equality. Increased savings while the economy grows contributes to inequality. A growing population means that inequality increases where bestselling authors have a larger market. And so on.

But the point is: because increasing inequality happens naturally, you can't conclude anything just from *the fact that there's an increase*. At the very least, you have to back out the natural effects if you want to really explain what's going on. You have to do some math, and some arguing.

The argument, "Inequality is growing -- therefore, we must be unfairly favoring the rich" is not a valid one. It is true that inequality is growing. And it *might* be true that we are unfairly favoring the rich. But, the one doesn't necessarily follow from the other.

It's like saying, "Philadelphia was warmer in June than April; therefore, global warming must be happening."

------

Again, I'm not trying to argue that inequality is a good thing, or that you shouldn't be concerned about it. Rather, I'm arguing that increasing inequality does NOT tell you anything reliable about who benefits from the "system" or how much (if at all) the increase favors the rich over the poor.

I am arguing that, even if you think increasing inequality is a bad thing, the following are still, objectively, true:

-- increasing inequality is a natural mathematical consequence of variation;
-- it is not necessarily the result of any deliberate government policy;
-- it does not necessarily disproportionately favor the rich or hurt the poor;
-- there is no way to know which individuals it favors just from the Fed data;
-- the natural forces that cause inequality to increase are very strong;
-- natural inequality growth may be so strong that it will persist even after successful attempts to benefit the poor generously and significantly;
-- the poor could be gaining relative to the rich even while measured inequality increases.

As for the Fed study itself,

-- the Fed statistics do not measure income changes for any family or specific group of families;
-- the Fed statistics that measure distributional income changes for percentile groups are a biased, exaggerated estimate of the income changes for the average family starting in that percentile;
-- It is impossible to tell, from the Fed's numbers, how the poor are faring relative to the rich.

Finally, and most importantly,

-- all of these statements follow necessarily from basic logic and math -- and do not require any other arguments from politics, economics, compassion, greed, fairness, or partisanship.

Labels: income inequality

11 Comments:

At Monday, September 15, 2014 12:39:00 AM, Anonymous said...: Phil, I think you have a flaw in the "one last example" section. This is how I followed your argument. You're starting with this table:

P L M U R
20 40 60 80 100
20 40 60 80 100
20 40 60 80 100

R/P of 5

Then applying one year of variation to get this table in the original groups:

P L M U R
10 20 30 40 50
20 40 60 80 100
40 80 120 160 200

R/P of 5

Then the Fed regroups to perform its analysis, resulting in this table:

P L M U R
10 30 40 80 120
20 40 50 80 160
20 40 60 100 200

R/P of 9.6

But then to get to "what we see after the 2013 doubling" in the first table you post in that section, you're using this table:

P L M U R
20 40 30 40 50
40 80 60 80 100
80 160 120 160 200

R/P of 2.5

The mistake I think you're making here is that you're keeping the original groups when you double the P and L groups. When the policy goes into effect to double the P and L incomes, they'll use the actual P and L groups. They won't double the income of someone who WAS in the bottom 40% last year but this year due to variation is in the top 60%. So your table with doubled income for P and L should look like this instead:

P L M U R
20 60 40 80 120
40 80 50 80 160
40 80 60 100 200

R/P of 4.8

And the regrouped table looks like this:

P L M U R
20 40 60 80 120
40 50 80 80 160
40 60 80 100 200

R/P of 4.8

So the policy of doubling the income of the bottom 40% would in fact reduce income inequality, although it's very close to breaking even.

Let me know if I've missed something. Either way, this doesn't detract from your larger point.
At Monday, September 15, 2014 1:33:00 AM, Phil Birnbaum said...: Hi, SrMeowMeow,

It was actually my intention to keep the original groups when doubling. My thinking was, the policy goes into effect this year on this year's poor, and it takes a year for the effects to happen.

My thinking was that they WOULD improve the fortunes of someone who WAS in the bottom but is now in the top, because they didn't know that person would be in the top when they instituted the policy.

As you say, it doesn't really detract from the larger point, so either method works. But I was thinking more of policies that would permanently improve the income of the poor (say, scholarships, or job programs), which would unavoidably wind up helping poor people who would have done well anyway. Your method, on the other hand, seems more like a welfare-type thing that tops up income, which doesn't seem like the kind of thing people are talking about when they want to make incomes more equal. (In fact, we have good reason to believe "take money from the high-income families and give it to the low-income families" is NOT what they're thinking, since they measure inequality in pre-tax income (which, I'm pretty sure is what the Fed measured). And taxing the rich to give to the poor obviously doesn't affect pre-tax inequality at all.

But, either way works. As you calculated, even doubling the incomes of *this year's* poor, rather than last year's poor, is still just barely able to lower the ratio.
At Tuesday, September 16, 2014 12:40:00 PM, Anonymous said...: Your argument is mathematically sound, they're potentially comparing 2 different groups of people, but in reality the conclusion is likely the same because there are strong economic forces that perpetuate the income of the top earners.

The wealthy derive a large percentage of their income from returns on capital which grows exponentially year after year and from generation to generation. If the return on capital outpaces the growth rate, the stock market rises faster than the GDP which it has, then it's expected that the incomes of those who owned capital 10 years ago would grow faster than the majority who own essentially nothing. In the absence of very progressive taxation systems on income, wealth, and inheritance a growing income inequality between those who currently own capital and those who don't is actually what the system is designed to do.
At Sunday, September 28, 2014 1:35:00 PM, Kevin L said...: The stock market is subject to the same measurement problems. Corporations that lose big our lose continually drop out of the traded market.
At Sunday, September 28, 2014 1:40:00 PM, Kevin L said...: Also, if capital stocks keep growing, it can't defy the law of diminishing returns. The reason capital gets more valuable is because it's not just more of the same kinds of capital, it's better capital. And wealthy people who don't transform their capital into the newer, valuable forms will no longer be wealthy, and will be replaced with those who did.
At Sunday, September 28, 2014 1:41:00 PM, Phil Birnbaum said...: Agreed. When they say, "The value of the most valuable 500 stocks was $X trillion in 1990, but $Y trillion now." You can't conclude that the market is up by a factor of Y/X.

For specific stock indexes, it's not a problem, because the ones that drop out are still included in the index on the day they leave. When Enron goes bankrupt, the index drops to reflect its new value of zero. Only afterwards do Mr. Standard and Mr. Poor replace it with another stock.

The first case is like the Fed study. The second case is like following the same families, and making appropriate non-retroactive statistical adjustments for the ones that drop out.
At Sunday, September 28, 2014 1:41:00 PM, Phil Birnbaum said...: My 1:41 comment above refers to Kevin's 1:35 comment.
At Monday, September 29, 2014 5:36:00 PM, Unknown said...: This is such a common error and drives me nuts seeing it in media/news reports for decades. Phil, kudos to you for the simplest most intuitive explanation I have yet read.
At Monday, September 29, 2014 8:00:00 PM, Phil Birnbaum said...: Thanks, Michael!
At Monday, September 29, 2014 8:12:00 PM, Phil Birnbaum said...: Oops, I found arithmetical errors in the "perceived change" table. It was

-13% -8% 0% +29% +60%

I have changed it to

-17% -8% -8% +8% +60%

I've also changed "bottom two" groups to "bottom three" in the paragraph immediately above the table to reflect the correction.
At Saturday, November 01, 2014 2:07:00 PM, Josh H said...: There are a few things to keep in mind:

1) The Fed is not implying that they have time-series data on a cohort of top earners.

2) Samples are used to study populations. Samples can be statistically significant representations of a population.

2) There is a good deal of turn over in the income deciles. Not so much in the 1%. The share of income the 1% receives is growing:

http://www.decisionsonevidence.com/wp-content/uploads/2011/08/Decomposing-the-Top-Decile-US-Income-Share-into-3-Groups-1913-2007.png

3) The concentration of wealth is the driving force behind inequality, not incomes. There is no denying that wealth is concentrating.

<< Home

Sabermetric Research

Sunday, September 14, 2014

Income inequality and the Fed report

11 Comments:

About Me

Previous Posts