Fascinating work and inspiring application of the underlying DINOv3 image segmentation model!
The blog post and paper [1] describe a promising approach to solving related problems at previously impossible scale and quality: I am currently exploring methods to better represent seasonal land cover changes that would improve wind power generation forecasting and this paper provides a great starting point.
I hope DINOv3 can inspire more work like this - and I would encourage any curious mind to play with that model! I was amazed by its capability to distinguish between fine object details. For example, in a photo of a bicycle, the patch embeddings cleanly separated the background from the individual spokes of the wheel.
This is really cool, I wonder how old the satellite data they used is, it’s a bit unclear
Related: Just the other day I used USGS 3DEP LiDAR data + Claude Code to get a sense for the number of trees on my property. Diffing terrain map and canopy map gives tree elevation. It was a fun project to explore, primarily because I set CC loose and said "here is the bounding box of my property, pad it by 50 feet and then go absolutely nuts against government datasets gathering as much open data as you can" - it figured out the rest. Dug into soil maps, historical satellite imagery, and lidar data.
Here are the visuals re: trees - https://i.imgur.com/R0W4q4O.png
I gave a talk about the paper in our internal journal club recently (we work on similar problems, usually using stereo imagery though).
It's a nice piece of work. I especially like the sections on data cleaning and registration, as that seemed to have been one of the limiting factors of the previous approaches.
I am sceptical about how accurately you can predict heights for specific trees from mono-images, but I think for cases where you just need to be right on average (e.g. biomass estimation, fuel load estimates) it's a great approach.