Noahpinion: Big TFP data mystery!

Tuesday, November 03, 2015

Big TFP data mystery!

NOTE: Mystery probably resolved! See update below. Here was the original post, for posterity:

While recently complaining about the overselling of static-efficiency policies, I asserted that rich countries have all grown at about the same long-term rate, despite decade-long divergences. I was talking, of course, about Total Factor Productivity, which at long horizons should be determined by technology.

I had been under the impression that over the last three decades or so, the rich countries had all experienced similar rates of TFP growth. My source for that was the OECD's time-series on multifactor productivity (another name for TFP). Here is a chart of those OECD productivity numbers since 1985:

As you can see, most rich countries grew their TFP at the same average rate, consistent with the idea that TFP mostly measures technology in the long term, and that technology spreads rather easily between rich countries. A few countries, like Korea, Ireland, and Finland, did much better over this period, and a few countries, like Italy, Spain, and Portugal, lagged behind. But most rich countries were clustered along the same basic line. The U.S., UK, France, and Germany (highlighted on the graph) all stayed very close to each other.

But I now see that FRED has its own TFP numbers for various countries, taken from the Penn World Tables. And here's what happens when I plot the TFP numbers for the U.S., UK, France, and Germany over the same time period (1985-2011):

What??!!

The U.S. and UK lines match up as before, but the Germany and France lines are wildly, totally different! In fact, according to the Penn World Tables, Germany's TFP actually steadily declined from the mid-80s to 2011!

What on Earth is going on here?? Obviously the two measurement methodologies are very different. So I tried to track down the source of the discrepancy, and I found some interesting stuff.

First of all, it turns out that the Penn World Tables, currently assembled by a team of economists from UC Davis and the University of Groningen, have undergone substantial revisions to their methodology in recent years. They switched to a new growth accounting method developed by Francesco Caselli in the early 2000s (which I plan to study in detail when I get the chance). As Antonio Ciccone and Marek Jarociński pointed out in 2010, these revisions were enough to substantially change the results of all cross-country growth regressions. Simon Johnson, William Larson, Chris Papageorgiou, and Arvind Subramanian criticized the new Penn methodology, and suggested possible changes.

Meanwhile, the OECD methodology for calculating TFP has some questions surrounding it as well. To get TFP you need measures of labor and capital inputs. The OECD uses a pretty textbook method for doing this - simply stick in the raw estimates for the dollar values of labor and capital. But when they tried using another database called EU-KLEMS that tries to adjust for "quality" of inputs, they found totally different numbers.
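To make the input-measurement issue concrete, here is a minimal sketch of how a TFP series falls out as a residual. It assumes Cobb-Douglas production with a constant capital share of 1/3, and every number in it is hypothetical; it is not the OECD's or EU-KLEMS's actual procedure. The function name `solow_residual` and the two labor series are made up for illustration.

```python
import math

def solow_residual(output, capital, labor, capital_share=1/3):
    """Per-period TFP growth in log points from level series.

    Assumes Cobb-Douglas production with a constant capital share
    (an illustrative assumption, not any agency's method).
    """
    growth = []
    for t in range(1, len(output)):
        g_y = math.log(output[t] / output[t - 1])
        g_k = math.log(capital[t] / capital[t - 1])
        g_l = math.log(labor[t] / labor[t - 1])
        # TFP growth is whatever output growth the inputs do not explain.
        growth.append(g_y - capital_share * g_k - (1 - capital_share) * g_l)
    return growth

# Hypothetical index numbers, not real OECD or EU-KLEMS data.
output = [100.0, 102.0, 104.0]
capital = [100.0, 102.0, 104.0]
raw_labor = [100.0, 100.0, 100.0]      # unadjusted labor input
quality_labor = [100.0, 101.0, 102.0]  # same workers, rising measured "quality"

tfp_raw = solow_residual(output, capital, raw_labor)
tfp_adjusted = solow_residual(output, capital, quality_labor)

for period, (raw, adj) in enumerate(zip(tfp_raw, tfp_adjusted), start=1):
    print(f"period {period}: raw {raw:.4f}, quality-adjusted {adj:.4f}")
```

With identical output and capital, the quality-adjusted labor series grows faster, so more of the output growth is attributed to inputs and less is left over as TFP. Which labor measure you pick changes the answer.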

I am not experienced enough in growth accounting to wade into these disputes in a substantive manner; it would take me at least a month of serious study to be able to say with any confidence which of these methodologies I believe most. The real takeaway here, though, is that TFP measurements are HIGHLY suspect, and will continue to be so for the foreseeable future.

That is bad news for most of modern macroeconomics, both on the growth theory and on the business cycle theory side of things. If differing methodologies for measuring labor and capital inputs diverge by this much, it means that any series you use probably has tons of stuff in it that it shouldn't have. That means that changes in the series at business-cycle frequencies - the good old TFP shocks of RBC models, which are also part of "kitchen sink" DSGE models like Smets-Wouters - are also unreliable. Basically, all those "shocks" are as likely as not to just be noise. That's probably true whether you compare across countries or look only at one country.

So this is a very pessimistic finding, and a huge challenge for the growth accounting field. Hopefully, a meeting of the brightest minds will get to the bottom of the problem and arrive at a consensus solution. If not, it means that any model that relies on measures of aggregate TFP, or factor inputs in general, is unreliable until the accounting problems are worked out.

Updates

Robert Inklaar of the University of Groningen contacted me and explained what was wrong! The most recent version of the Penn World Tables, version 8, did not take into account changes in average hours worked in some countries. Also, it used a Barro-Lee data source that apparently had some questionable data on trends in education. Inklaar says that the next version of the PWT, version 9, will fix the problems, and recommends using the OECD data until it comes out.
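A rough sketch of why ignoring hours worked matters: if employment is flat but average hours per worker fall, a headcount-based labor input overstates labor growth and pushes measured TFP growth down. The growth rates below are hypothetical and the 1/3 capital share is an assumption; this is an illustration of the mechanism, not the PWT's actual calculation.

```python
def tfp_growth(g_output, g_capital, g_labor, capital_share=1/3):
    """TFP growth as a residual, assuming Cobb-Douglas with a fixed capital share."""
    return g_output - capital_share * g_capital - (1 - capital_share) * g_labor

# Hypothetical annual growth rates (log points), not real data for any country.
g_output = 0.01       # output grows 1% a year
g_capital = 0.01      # capital grows 1% a year
g_employment = 0.00   # number of workers is flat
g_avg_hours = -0.01   # average hours per worker fall 1% a year

# Labor measured by headcount only (hours ignored).
headcount_tfp = tfp_growth(g_output, g_capital, g_employment)

# Labor measured as total hours = workers x average hours.
hours_tfp = tfp_growth(g_output, g_capital, g_employment + g_avg_hours)

print(f"headcount-based TFP growth: {headcount_tfp:.4f}")
print(f"hours-based TFP growth:     {hours_tfp:.4f}")
```

In this toy setup the headcount-based figure is lower than the hours-based one, which is the direction you would expect if falling average hours were left out of the labor input.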

Well, I am mostly relieved. It's not really a methodology disagreement (except for the Barro-Lee education data). All of macro does not have to be scrapped, just yet. :-)

Thanks to Robert Inklaar for helping me out!

...But the growth economists I talked to about this mystery all expressed deep skepticism about these TFP data sets in general...

52 comments:

Krzys 8:54 PM
It's very simple. TFP is a residual, not a factor.

pithom 9:42 PM
I looked at some studies once, but they had contradictory methods of measuring capital.

pithom 9:45 PM
Also, this is Google. Blogger is a part of Google, which is a part of Alphabet.

rickstersherpa@msn.com 9:49 PM
You are not the only one puzzling about it. http://www.wsj.com/articles/alan-blinder-the-unsettling-mystery-of-productivity-1416873038

http://nakedkeynesianism.blogspot.com/2014/12/the-mystery-of-productivity-what-mystery.html

http://www.voxeu.org/article/uk-productivity-and-job-puzzle

One explanation that might make for interesting research is to see if a prolonged period of declining real wages decreases the incentive to invest in business and train workers to make them more productive. To quote the voxeu article:

"Lower real wages seem to have helped reduce lay-offs because they have made it cheaper for firms to hang on to their workers even in the face of falling demand. But low wages also made it relatively cheaper for firms to take on more workers rather than invest in new capital or technologies."

See also falling labor share and slower productivity growth. http://angrybearblog.com/?s=productivity

JJF 10:43 PM
You could always be a dick about it and get him a cheaper, lower quality beer from a low TFP country...

mOOm 10:59 PM
I've looked at the PWT TFP numbers before and decided they made little sense...

Efi 12:12 AM
Cool post. I've wondered about the underlying numbers on TFP, and what actually goes into them. But I'm not sure what it says about the field if you already fear political motivations for the differences between the two measures prior to digging into the details.

Anonymous 10:44 AM
This is my second try at posting a comment. On the first try I accidentally hit the 'Sign out' button below instead of the 'Publish' button, and there was no confirmation screen to decline. Yuck!

Anyway, my point is that the US uses deliberately deceptive methods of measuring labor inputs due to foreign labor, and I would suspect the UK follows suit. The US doesn't even measure much of the foreign labor inputs to US production as labor, rather they measure it as capital inputs. While this wouldn't change the TFP calculation, it would change the perception that TFP measures technology gains.

Note that this accounting problem doesn't affect all trade relations, but it does affect the relations with countries of low wages compared to the US. The effect here depends upon the source of commodities and the destination of finished goods.

France and Germany are not nearly as dependent upon foreign labor as the US and the UK are, so this should be the first place to look for more specific insight.

JJF 11:57 AM
I'll admit that the only other time I've heard about a debate on Total Factor Productivity calculation has been in relation to the book "Time on the Cross", which makes the (extremely difficult for me to accept) assertion that slave labor was more economically efficient than free labor in mid-19th century America.

https://en.wikipedia.org/wiki/Time_on_the_Cross

One has to ask "Then where were the great slave-run factories, and why was slave-raised wheat and corn not driving Northern produce out of the market with its cheapness?"

Anonymous 2:51 PM
In your update, I suspect that Groningen U is just trying to cross its t's and dot its i's. (How do you write this phrase properly?). If they're going to be getting a lot more attention they're going to need a new level of internal scrutiny that they don't have in place yet. I don't doubt their numbers in their general implications. I suspect they will be corrected somewhat closer to OECD, but will they totally correct the problem?

They could be mistaken in trying to address some things I've said in terms of productivity alone, and most likely productivity calculations are not going to show the problem we're trying to get at here. My main point is that productivity measures don't necessarily measure technology. They can measure other things, such as shifts in accounting methods due to corporate inversion.

david 6:34 PM
May I suggest http://www.iariw.org/papers/2013/Weipaper.pdf

Anonymous 9:15 PM
I think the subject here is Wisconsin policy in this district.

Where are these people? Sean, want a debate?

Fellows, want a debate?

Vulgar Economics 10:41 PM
How many times do I have to tell you that TFP is just the geometrically share-weighted factor prices? I know you've seen my paper on twitter: https://ideas.repec.org/p/new/wpaper/1513.html

Tom Warner 11:46 PM
If you think the fate of macro hangs on the accuracy of measuring TFP "shocks," then yes, of course, you have been miseducated and you need to start over from scratch. There are no TFP shocks driving business cycles, unless maybe that once if you counted the oil embargo as a TFP shock, in which case you probably don't need to measure it in terms of TFP.

I'm not sure if it's one of the reasons for that particular dataset to be nonsense, but generally the main reason the Penn World Tables are garbage is the PPP.

reason 4:13 AM
Noah,
I'm not so sure why you are so keen on Cochrane. My view is that good guys are:
1. honest
2. want rising median welfare
Bad guys are
1. dishonest
2. uninterested in distribution of benefits.

I'm not sure how Cochrane comes out a good guy on that basis.

Unknown 6:51 AM
Duh, of course the OECD TFP numbers converge. They have set 2010 as 100 for all countries. By design all countries must then converge (and have not had much time to diverge). It's an issue of relative vs absolute effects .....

Longtooth 12:38 PM
The residuals (TFP) are just the composite of the effect of variables for which you don't account. Why don't you account for them? Because economists can't be bothered to systematically investigate the unknowns.

Example of TFP

A guy already working in a company and already being paid for doing X comes up with an idea (Y) that will improve labor productivity at no measurable additional cost of either labor or capital, and with no additional input labor effort, difficulty, stress, morale, etc., but simply by utilizing an already highly developed labor skill set that nobody prior had thought of using in this application.

Three simple examples of Y:
1. Utilizing intricate sewing skills in an industrial high volume fine wire winding application. -- a 200% productivity improvement occurred. Real life example.... no exaggeration.
2. Rearranging points in a process to weed out stuff that would ultimately fail quality criteria at points further along in the process at higher value added loss. Net greater productivity per unit produced and shipped by 100%. Another real life example, no exaggeration. This can be called "paying more attention to detail".
3. Similar to 2) above: Implementing ongoing statistical trend data in real time processes to identify systematic drifts and spurious outliers followed by rigorous identification of root cause(s) and tooling adjustments to eliminate them. No additional engineering effort -- just more productive use of same. Imperceptible increase in labor with 1000% productivity improvement over 2 years. Real life example... no exaggeration. And BTW, Japan used this in a far more systematic and rigorous form which did add measurable labor and capital following WWII and was the predominant reason for Japan's post-war production machine and exports. The common name for it was "control charting".

These productivity improvements spread rapidly intra-company and then even more rapidly thereafter inter-company, then inter-industry and ultimately inter-globally.

These end up in the residuals (TFP). They only become apparent economically in creeping form over time as the effects spread from one unit of a company to the entire company, then to more companies in an industry, then to more industries, etc.

These types of improvements can be simply classified as "human efficiency" or "human factor yield improvement", where there is no perceptible or measurable increase in human effort applied, no or virtually no measurable capital expense. I simply refer to them as "attention to detail". An early example of this kind of efficiency gain which was widely proliferated for a long time was known as "time & motion" implementations... Smith's "pin" production example is probably the earliest reference to it in macro economics.

Ray Lopez 10:59 PM
What b.s....this is becoming my favorite blog. Like Scott Sumner's blog and Tyler Cowen's blog, this idiot says things that are provocative, that make you want to reach out and comment even when you wish to just lurk!

"the real takeaway here, though, is that TFP measurements are HIGHLY suspect, and will continue to be so for the foreseeable future" - yes, and I knew that as a non-economist. TFP is measured as a 'residual' meaning it's not directly measured, hence prone to error due to everything and the kitchen sink included in it, as a residual. It's a fudge factor, akin to 'expectations' in macro models, or "natural interest rates", that also can't be measured. Actually even physics has this, with occult variables in quantum entanglement, but the difference is physicists can actually test their models, instead of just debating them online, and achieve results. Peace!