Binary classification inconsistencies #1059
So I had to implement a workaround in the meantime.
Looking at the Accuracy docs (https://torchmetrics.readthedocs.io/en/stable/classification/accuracy.html), it seems like comparing equal-length tensors is reserved for ordinal ints. Seems like a symptom of the same inconsistency.
Replies: 1 comment
The inconsistencies you are experiencing likely stem from passing raw softmax outputs (probabilities) directly to the Accuracy metric. As per the TorchMetrics Accuracy docs, the metric expects integer class labels or binary labels for comparison, not probability tensors.

```python
import torch
from torchmetrics.classification import BinaryAccuracy

# Initialize the metric
accuracy = BinaryAccuracy()

# Model outputs as probabilities (2D, softmax over dim=1)
probs = torch.tensor([[0.3, 0.7], [0.6, 0.4], [0.8, 0.2]])

# Convert to predicted classes: argmax for two-column outputs (shown here),
# or a threshold for single-channel binary outputs (shown below)
preds = torch.argmax(probs, dim=1)

# True labels as integer class indices
target = torch.tensor([1, 0, 0])

# Update the metric and compute accuracy
acc_val = accuracy(preds, target)
print(f"Accuracy: {acc_val.item()}")
```

If you’re using single-channel sigmoid outputs for binary classification, apply a threshold instead:

```python
sigmoid_out = torch.tensor([0.3, 0.7, 0.6])
preds = (sigmoid_out > 0.5).int()
```

Note that `BinaryAccuracy` will also accept float probabilities directly and threshold them at 0.5 (configurable via its `threshold` argument), so the explicit conversion mainly makes the comparison unambiguous. Avoid passing raw two-column softmax outputs directly; converting them to class indices or thresholded binary predictions as above should resolve the issues you described.
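To make the comparison semantics concrete, here is a plain-Python sketch of what binary accuracy over probabilities boils down to: threshold, then compare against integer labels. The function name `binary_accuracy` is hypothetical (not a TorchMetrics API), and this ignores details the real metric handles (logit detection, state accumulation across batches).

```python
def binary_accuracy(probs, targets, threshold=0.5):
    """Hypothetical helper: accuracy from raw probabilities.

    Thresholds each probability into a 0/1 prediction, then
    compares against integer targets.
    """
    preds = [1 if p > threshold else 0 for p in probs]
    correct = sum(p == t for p, t in zip(preds, targets))
    return correct / len(targets)

print(binary_accuracy([0.3, 0.7, 0.6], [0, 1, 1]))  # 1.0
```

Passing the probabilities straight into an equality check against int labels (without the threshold step) is exactly what produces the inconsistent results described above.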