
I'm a beginner who just started to study deep learning. I recently learned that in a feedforward neural network with a binary output and a Bernoulli distribution, the output of the sigmoid function represents the probability that the label is 1. I'm curious why it cannot be the other way round (the probability of the label being 0). Is it just a convention?

Comments:

  • Related...you don't even have to use the 0/1 convention. – Commented Jul 24, 2024 at 16:27
  • The post helped greatly. Thanks! – Commented Jul 24, 2024 at 23:31

1 Answer


It is a convention.

Ultimately, what matters is that the objective function (likely the log-likelihood in your context) is computed consistently with the convention you have chosen.

That said, it is best to follow the convention most people have adopted, to reduce the risk of miscommunication or misinterpretation.
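To make the point concrete, here is a minimal NumPy sketch (with made-up logits and labels, not from any particular model) showing that flipping the convention leaves the loss unchanged, as long as the log-likelihood is written against the same convention:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z = np.array([-1.2, 0.4, 2.0])   # hypothetical logits from the network
y = np.array([0.0, 1.0, 1.0])    # true labels

# Convention A: the sigmoid output is P(y = 1).
p1 = sigmoid(z)
loss_a = -np.mean(y * np.log(p1) + (1 - y) * np.log(1 - p1))

# Convention B: the sigmoid output is P(y = 0). Since
# sigmoid(-z) = 1 - sigmoid(z), this is the same model with
# negated logits, and the loss must pair P(y = 0) with the
# event "y == 0" rather than "y == 1".
p0 = sigmoid(-z)
loss_b = -np.mean((1 - y) * np.log(p0) + y * np.log(1 - p0))

print(loss_a, loss_b)  # identical up to floating-point error
```

Because `sigmoid(-z) = 1 - sigmoid(z)`, the two conventions differ only in a sign flip of the logits; the fitted probabilities and the optimum are the same either way.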

