They don't mind GMM and spending a week or two teaching it, but they tend to think that making GMM the basis of all econometrics is putting the cart before the horse.

Why do the old timers shake their heads at Hayashi? Yes, Fama and Shiller made perfect sense as Nobel winners, and Hansen made perfect sense as a Nobel winner, but the three of them together was puzzling to me until I read your explanation. But even so, I think Hansen should've won a separate Nobel, or one in conjunction with other econometricians, rather than being lumped in with finance guys.

Seems like much ado about nothing. A sort of least squares fit algorithm for crappy parametric models with sparse data. This is worthy of a Nobel? Even more scary, central bank models depend on this? Egad, we're in worse shape than I thought...

I haven't touched on the computational aspects of GMM. As you can imagine, for linear (and possibly other) models, there are closed-form solutions. (Obviously, in the case of models that reduce to OLS and such.) Otherwise it's a matter of using numerical root-finding methods.
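To make the closed-form versus root-finding point concrete, here is a minimal sketch, not anything from the original post. It assumes a made-up linear model y = beta*x + e with one endogenous regressor and one instrument z, so the single moment condition E[z(y - beta*x)] = 0 can be solved either analytically or by bisection. All names (`simulate`, `sample_moment`, `bisect_root`) and the simulated data are hypothetical.

```python
import random

def simulate(n=2000, beta=2.0, seed=0):
    """Simulate y = beta*x + e, where x is correlated with e but z is a valid instrument."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        z = rng.gauss(0, 1)
        e = rng.gauss(0, 1)
        x = z + 0.5 * e + rng.gauss(0, 1)  # endogenous regressor
        data.append((beta * x + e, x, z))
    return data

def sample_moment(b, data):
    """Sample analogue of E[z*(y - b*x)]: (1/T) * sum_t z_t*(y_t - b*x_t)."""
    return sum(z * (y - b * x) for y, x, z in data) / len(data)

def closed_form(data):
    """Setting the sample moment to zero gives b = sum(z*y) / sum(z*x)."""
    return sum(z * y for y, x, z in data) / sum(z * x for y, x, z in data)

def bisect_root(f, lo, hi, tol=1e-10):
    """Plain bisection; assumes f(lo) and f(hi) have opposite signs."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_lo > 0) == (f_mid > 0):
            lo, f_lo = mid, f_mid  # root lies in the upper half
        else:
            hi = mid               # root lies in the lower half
    return 0.5 * (lo + hi)

data = simulate()
b_closed = closed_form(data)
b_numeric = bisect_root(lambda b: sample_moment(b, data), -10.0, 10.0)
print(b_closed, b_numeric)
```

Both routes should land on the same estimate here. With a nonlinear moment function only the numerical route would be available, and with more moments than parameters one would minimize a quadratic form in the moments rather than look for an exact root.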

GMM has its origins in asset pricing. Every econometrics sequence teaches some GMM, but not all professors make it the unifying framework in the way that Hayashi does, and in some departments that approach would be regarded as a little eccentric.

I use MathJax for math in blog posts.

This is very interesting. I'm a statistician, and basically all statistical inference that is taught in statistics is maximum likelihood. You mention that a weakness of maximum likelihood is the need to make parametric assumptions. In statistics, fitting non-parametric models is usually done with spline methods.

I've never seen GMM taught before. So this post makes me wonder why this isn't taught in statistics, given that GMM is also a generalization of maximum likelihood according to this post. Any thoughts?

Also, how do you include LaTeX in blog posts?

Good to see one of the backup team interacting with the prizes and trying to provide some background. Extra kudos for taking on the hardest one. A bit disappointing that with 8 authors on the list to the right of the page, and 3 winners, we only got one post on the prize today.

Aaaaah right. Many thanks.

That, and no t superscript on beta. I added 1/T in front, is that what you meant?

Shouldn't (4) have 1/T instead of beta^t if it's taking a sample average?

The coefficient of relative risk aversion (-cu''/u') isn't gamma in your example, it's 1-gamma.
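For readers checking this, a short derivation under one assumption: that the post's example uses power utility of the form u(c) = c^γ/γ (the post's exact utility function is not quoted in this thread, so treat that form as a guess that is consistent with the comment).

```latex
u(c) = \frac{c^{\gamma}}{\gamma}, \qquad
u'(c) = c^{\gamma-1}, \qquad
u''(c) = (\gamma-1)\,c^{\gamma-2}
\quad\Longrightarrow\quad
-\frac{c\,u''(c)}{u'(c)}
  = -\frac{c\,(\gamma-1)\,c^{\gamma-2}}{c^{\gamma-1}}
  = 1-\gamma .
```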

I wouldn't say Hansen's insight was that moments could be used for estimation. As the link indicates, that was known for a while. It was about how one could use the 'extra' moment conditions that often crop up.
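As a rough illustration of what using the "extra" moment conditions looks like, here is a minimal two-step GMM sketch. It assumes a made-up linear model with one parameter and two instruments, so there is one more moment than parameters. Nothing here comes from the post: the data are simulated and every function name is hypothetical.

```python
import random

def simulate(n=3000, beta=2.0, seed=1):
    """y = beta*x + e with endogenous x and two valid instruments z1, z2."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        z1, z2, e = rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)
        x = z1 + 0.5 * z2 + 0.5 * e + rng.gauss(0, 1)
        rows.append((beta * x + e, x, z1, z2))
    return rows

def inv2(m):
    """Inverse of a 2x2 matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return ((d / det, -b / det), (-c / det, a / det))

def gmm_linear(rows, W):
    """Minimise gbar(b)' W gbar(b), where gbar(b) = mean(z*y) - b*mean(z*x).

    For a linear model the minimiser has the closed form
    b = (szx' W szy) / (szx' W szx).
    """
    n = len(rows)
    szy = [sum(r[2 + j] * r[0] for r in rows) / n for j in range(2)]
    szx = [sum(r[2 + j] * r[1] for r in rows) / n for j in range(2)]
    num = sum(szx[i] * W[i][j] * szy[j] for i in range(2) for j in range(2))
    den = sum(szx[i] * W[i][j] * szx[j] for i in range(2) for j in range(2))
    return num / den

def moment_cov(b, rows):
    """Estimate S = (1/T) * sum_t u_t^2 * z_t z_t', with u_t = y_t - b*x_t."""
    n = len(rows)
    s = [[0.0, 0.0], [0.0, 0.0]]
    for y, x, z1, z2 in rows:
        u, z = y - b * x, (z1, z2)
        for i in range(2):
            for j in range(2):
                s[i][j] += u * u * z[i] * z[j] / n
    return s

rows = simulate()
identity = ((1.0, 0.0), (0.0, 1.0))
b_step1 = gmm_linear(rows, identity)        # step 1: any positive definite weight
W_opt = inv2(moment_cov(b_step1, rows))     # step 2: weight by the inverse moment covariance
b_step2 = gmm_linear(rows, W_opt)
print(b_step1, b_step2)
```

With two moments and one parameter, the sample moments generally cannot both be set to zero, so the weighting matrix decides how the estimator trades them off. The second step uses the inverse of the estimated moment covariance, which is the asymptotically efficient choice.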

Good catch, I have fixed that. Equation 3 shouldn't, because it describes the optimization problem over a single period, from time t to time t + 1.

The Nobel Prize committee honored Lars Peter Hansen for his work in developing a statistical method for testing rational theories of asset price movements. The statistical method Hansen developed is the Generalized Method of Moments (GMM). The fact that Hansen won the Nobel Prize for his "empirical analysis of asset prices" caught me off guard, as I did not realize this was the original application of GMM.

GMM is used in the estimation of the New Keynesian Phillips Curve. The New Keynesian Phillips Curve includes expectations of future inflation as an independent variable. Since inflation expectations cannot really be observed, GMM offers a way around this difficulty.

The New Keynesian Phillips Curve, which was developed in 1995, is integral to most of the DSGE models on which central banks across the globe are increasingly dependent. Thus it's hard to imagine modern central banking without Hansen's contributions to econometrics. So for Hansen to have won the prize for his empirical analysis of asset prices strikes me as somewhat ironic.

Thanks for the helpful primer! Quick q, shouldn't equations 2, 3 and 4 have beta^t so that later periods are discounted more heavily?

Nice explanation, but let's not forget that the J test is really just Sargan's old test of over-identification, re-visited.
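A small numerical check of this point, as a sketch under stated assumptions: in a made-up linear model with two instruments, estimated by 2SLS, Hansen's J statistic computed with a homoskedastic weighting matrix coincides with Sargan's statistic (T times the uncentered R-squared from regressing the residuals on the instruments). The data are simulated and all names are hypothetical.

```python
import random

def simulate(n=2000, beta=2.0, seed=2):
    """y = beta*x + e with endogenous x and two valid instruments."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        z1, z2, e = rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)
        x = z1 + 0.5 * z2 + 0.5 * e + rng.gauss(0, 1)
        rows.append((beta * x + e, x, (z1, z2)))
    return rows

def solve2(m, v):
    """Solve the 2x2 linear system m @ c = v by Cramer's rule."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return ((d * v[0] - b * v[1]) / det, (a * v[1] - c * v[0]) / det)

rows = simulate()
n = len(rows)
ztz = [[sum(r[2][i] * r[2][j] for r in rows) for j in range(2)] for i in range(2)]
ztx = tuple(sum(r[2][j] * r[1] for r in rows) for j in range(2))
zty = tuple(sum(r[2][j] * r[0] for r in rows) for j in range(2))

# 2SLS, i.e. GMM with a weighting matrix proportional to (Z'Z)^-1.
proj = solve2(ztz, ztx)  # (Z'Z)^-1 Z'x
b_2sls = (proj[0] * zty[0] + proj[1] * zty[1]) / (proj[0] * ztx[0] + proj[1] * ztx[1])

resid = [r[0] - b_2sls * r[1] for r in rows]
ztu = tuple(sum(r[2][j] * u for r, u in zip(rows, resid)) for j in range(2))
ssr = sum(u * u for u in resid)

# Sargan: T times the uncentered R^2 from regressing residuals on the instruments.
coef = solve2(ztz, ztu)
fitted = [coef[0] * r[2][0] + coef[1] * r[2][1] for r in rows]
sargan = n * sum(f * f for f in fitted) / ssr

# Hansen's J with S = sigma^2 * Z'Z / T, i.e. assuming homoskedastic errors.
sigma2 = ssr / n
gbar = (ztu[0] / n, ztu[1] / n)
S = [[sigma2 * ztz[i][j] / n for j in range(2)] for i in range(2)]
s_inv_g = solve2(S, gbar)
hansen_j = n * (gbar[0] * s_inv_g[0] + gbar[1] * s_inv_g[1])

print(sargan, hansen_j)
```

The two numbers agree up to rounding, and with two instruments and one parameter the statistic would be compared to a chi-squared distribution with one degree of freedom. The statistics part ways once the weighting matrix is built from a heteroskedasticity-robust estimate of the moment covariance, which is the generalization that Hansen's version allows.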

Great explanation. I think one of the great things about GMM is that it allows us to estimate a single equation from a model without assuming that the entire model is "true." Additionally, an underappreciated aspect of GMM is that it shows how silly the structural vs. reduced form debate is. Viewed through the lens of GMM (which as you note is a generalization of OLS), this debate reduces to a difference in functional form.