I'm not quite sure what is the main reason why YOLO uses multiple bounding boxes for a grid cell. An answer I can find on web is for multiple aspect ratio in the prediction. However I don’t think it’s completely true because if I understand correctly, YOLO optimizes the offset of a prior proportional to the size of the image, and then the box can expand to anywhere in the image, and the number of boxes doesn’t really matter. My guess is that maybe it’s a heuristic move to speed up the computation or to avoid sticking to a bad local optima. Or is there another good explanation here?
Asked
Active
Viewed 574 times
1
-
Related: [What is the need of multiple Bounding Boxes per grid cell in YOLO v1?](https://stats.stackexchange.com/q/339754/1352) – Stephan Kolassa May 15 '19 at 09:17
-
I already looked at that thread, not helping much. I mentioned it in the question: "An answer I can find on web is for multiple aspect ratio in the prediction" – StapleStable May 17 '19 at 22:44