Highest Voted 'yolo' Questions - Statistical Analysis Stack Exchange

31

votes

5 answers

Yolo Loss function explanation

I am trying to understand the Yolo v2 loss function: \begin{align} &\lambda_{coord} \sum_{i=0}^{S^2}\sum_{j=0}^B \mathbb{1}_{ij}^{obj}[(x_i-\hat{x}_i)^2 + (y_i-\hat{y}_i)^2 ] \\&+ \lambda_{coord} \sum_{i=0}^{S^2}\sum_{j=0}^B…

asked Jun 27 '17 at 01:56

Kamel BOUYACOUB

341
1
3
3

9

votes

1 answer

Yolo v3 loss function

The original loss function can be seen here and is more or less explained in Yolo Loss function explanation: \begin{align} &\lambda_{coord} \sum_{i=0}^{S^2}\sum_{j=0}^B \mathbb{1}_{ij}^{obj}[(x_i-\hat{x}_i)^2 + (y_i-\hat{y}_i)^2 ] \\&+…

neural-networks loss-functions object-detection yolo

asked Oct 23 '18 at 03:55

sachinruk

1,113
1
9
21

7

votes

1 answer

On Yolo, and its loss function

I'm having a hard time understanding some on the inner-working of YOLO, especially the loss function depicted in this seminal paper. Bear in mind that I'm nowhere closed to being a specialist in deep learning and computer vision (in fact, I started…

neural-networks loss-functions yolo

asked Jun 28 '18 at 15:54

Hermès

173
1
5

5

votes

2 answers

What is sigma function in the YOLO object detector?

I have gone through the YOLO9000 paper, in that they have mentioned that network predicts 5 coordinates of the bounding box, and from that we find the exact centre coordinates and the width and height. I'm confused with those…

neural-networks image-processing notation object-detection yolo

asked Apr 18 '19 at 05:53

bibinwilson

153
3

4

votes

2 answers

How are YOLO anchor boxes generated?

I am recently trying out darkflow, a Tensorflow implementation of Darknet written by Joseph Redmon. Looking at the configuration files, I noticed a section called region as shown below. [region] anchors = 1.08,1.19, 3.42,4.41, 6.63,11.38, …

computer-vision object-detection yolo

asked May 13 '19 at 10:42

user3600725

41
1
3

4

votes

1 answer

Yolo loss function for detecting 1 class

I'm trying to work on a Yolo implementation which searches a 19x19 grid to find a specific item. There is only a single class in all of these images I am looking to get bounding boxes for. I'm a little confused about the calculation of the loss…

neural-networks conv-neural-network loss-functions yolo

asked Nov 09 '17 at 18:34

Michael Bawol

151
4

4

votes

2 answers

Adjusting Grid Size in YOLO?

I was going through the YOLO Object Detection Paper by Joseph Redmon. The authors use a grid size of $S = 7$. If I am not wrong, the network architecture has carefully been curated for this specific grid size (Since the output of the final conv…

machine-learning neural-networks computer-vision object-detection yolo

asked Sep 04 '17 at 07:35

Banach Tarski

233
2
8

3

votes

1 answer

What are "Grids" and Detection at different scales" in YOLOV3?

I've recently started working with Yolov3 and the more I go in depth, the more confused I get. In the simplest terms what I think about YOLOV3 (On 416 input, 80 classes, 3 BB) is that: It Extract features from DarkNet-53 and then again use 53…

neural-networks convolution computer-vision object-detection yolo

asked Jan 29 '21 at 05:19

Deshwal

234
1
8

3

votes

1 answer

Why use multiple anchor boxes with the same positions in a multi-box detector?

What is the benefit of using multiple anchor boxes with the same positions in a single-shot multi-box detector model, like YOLO? In particular, I notice Google's BlazeFace model does this. If the purpose of an anchor box in a multi-box detector is…

machine-learning computer-vision yolo

asked Oct 07 '20 at 17:29

algal

133
3

3

votes

1 answer

1 neuron BCE loss VS 2 neurons CE loss

I built a custom version of YOLO that should only detect one type of objects, where the objectness measure (which tells how likely a bounding-box contains an object of any type), is learned using a logistic regression (BCE loss), with a single…

logistic cross-entropy object-detection yolo

asked Sep 01 '20 at 13:39

SomethingSomething

145
6

3

votes

0 answers

Yolov3: Single-class vs multiple class

I'm really new to object detection with Yolov3. Let's say I have 10 classes and the amount of data is approximately the same. Do I achieve better average precision when I use 10 Yolo models and train them always with the data of just one class or…

machine-learning object-detection yolo

asked Feb 05 '20 at 08:25

kai Herbst

31
2

3

votes

0 answers

What is the need of multiple Bounding Boxes per grid cell in YOLO v1?

This question is regarding YOLO v1 architecture as in here. I am confused as to why authors have used 2 bounding boxes per grid cell for training. Assuming there can be only one object per grid cell. Would two boxes would be relevant? If yes:…

conv-neural-network object-detection yolo

asked Apr 10 '18 at 20:43

GeneX

622
7
14

3

votes

1 answer

YOLO loss function width and height component explanation

I am reading this paper on how yolo defines loss function. https://arxiv.org/abs/1506.02640 I did research on other posts, but these posts did not seem to answer my confusion: (How to calculate the class probability of a grid cell in YOLO object…

neural-networks object-detection yolo

asked Mar 12 '18 at 04:39

Earthgod

133
4

2

votes

1 answer

Question YOLOv2 bounding box prior

In the YOLO9000 paper, they define the distance between a box and centroid as $d(box, centroid) = 1 - IOU(box, centroid)$. I think box here is a ground truth bounding box, but what is centroid?

neural-networks computer-vision yolo

asked Jan 02 '22 at 21:51

Yandle

743
2
12

2

votes

1 answer

Performance of MaskRCNN/YOLO as a function of object size in pixels

I am trying to find references on how the resolution of an object affects the ability of object detection systems such as MaskRCNN and YOLO to correctly identify the object. For example, if the camera is zoomed further and further out, the number…

neural-networks computer-vision object-detection yolo

asked Jan 09 '20 at 21:50

Surgical Commander

369
1
3
9

Questions tagged [yolo]