How to estimate an object's distance?
I know there are models like DepthAnything or VGGT, but the problem is they don't have semantic understanding. I was thinking of combining a detector like YOLO to get an object bounding box with a depth model, but then you don't know where within the bounding box to sample the depth, since there's often background or an occluder inside the box that isn't the actual object. Does anyone know a good way of doing this?
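To make the setup concrete, here is a rough sketch of the naive version, not a real solution. It assumes you already have a per-pixel depth map as a NumPy array from some depth model and a pixel-space box from some detector; `box_depth` and `shrink` are made-up names for illustration. It just takes the median depth over the central part of the box, which is exactly the step that falls apart when the center is background or an occluder.

```python
import numpy as np

def box_depth(depth, box, shrink=0.5):
    """Median depth over the central part of a bounding box.

    depth  : (H, W) array from a depth model (assumed input)
    box    : (x1, y1, x2, y2) in pixels, e.g. from a detector
    shrink : fraction of the box width/height kept around the center
    """
    h, w = depth.shape
    x1, y1, x2, y2 = box
    cx, cy = (x1 + x2) / 2.0, (y1 + y2) / 2.0
    half_w = (x2 - x1) * shrink / 2.0
    half_h = (y2 - y1) * shrink / 2.0

    # Clamp the shrunken box to the image bounds.
    xa = max(int(round(cx - half_w)), 0)
    xb = min(int(round(cx + half_w)), w)
    ya = max(int(round(cy - half_h)), 0)
    yb = min(int(round(cy + half_h)), h)

    # Ignore invalid depth values (NaN / inf) before taking the median.
    patch = depth[ya:yb, xa:xb]
    patch = patch[np.isfinite(patch)]
    if patch.size == 0:
        return float("nan")
    return float(np.median(patch))
```

With a small object in a loose box, the full box (`shrink=1.0`) mostly sees background, while a tighter central crop only helps if the object actually sits in the middle of the box.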
u/boringblobking — 4 hours ago