Andrew Ng’s class on deep learning with convolutional neural nets covers object detection and localization with the YOLO algorithm, non-max suppression and anchor boxes.

YOLO

Using the pretrained network from the YOLOv2 model, I picked a random street scene image and it did this:

Object detection and localization

References: