I gave a talk about the paper in our internal journal club recently (we work on similar problems, usually using stereo imagery though).
It's a nice piece of work. I especially like the sections on data cleaning and registration, as that seemed to have been one of the limiting factors of the previous approaches.
I am sceptical about how accurately you can predict heights for specific trees from mono-images, but I think for cases where you just need to be right on average (e.g. biomass estimation, fuel load estimates) it's a great approach.