Newest 'loss-function' Questions - Data Science Stack Exchange

3 votes

2 answers

91 views

Imbalanced classes and ML set up

I’m working on a MarTech use case (predicting customer conversions to a certain product). I’m not used to working in this domain, so I’m looking for critical questions about my setup. Context: ...

Henri

133

asked Oct 10, 2025 at 7:49

3 votes

1 answer

96 views

What loss functions are suitable for a YOLO-like architecture in TensorFlow/Keras, especially for fine-tuning on an imbalanced dataset?

I'm working with a custom YOLO-like architecture implemented in TensorFlow/Keras. While pretraining on the COCO dataset works, I plan to fine-tune the model on a highly imbalanced dataset. ...

chhu

141

asked Aug 6, 2025 at 15:12

3 votes

1 answer

90 views

How do you differentiate population count/Hamming weight?

I've come across a loss regularizing function that uses population counts (i.e., bits that are one, Hamming weight) of activations: $$ L_\mathrm{reg} = H(\max(\lfloor x \rceil, 0)), $$ where $x$ is an ...

Gaslight Deceive Subvert

227

asked Jul 27, 2025 at 22:21

2 votes

2 answers

129 views

Best Practice for Group Based splitting (Train / Val / Test)

As an intro, group-based splitting means splitting data into Train / Test (Val) by some attribute such as patient_id, item_id or similar, to ensure that the same person ...

Michael D

209

asked Jun 10, 2025 at 10:43
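
For readers unfamiliar with the idea in the excerpt above, here is a minimal sketch of a group-based train/test split in plain Python. It is not taken from the question or its answers; the function name `group_split`, its parameters, and the example patient IDs are all hypothetical, and it only covers a two-way split (a validation set could be carved out by applying the same idea to the training groups again).

```python
import random


def group_split(groups, test_fraction=0.2, seed=0):
    """Split row indices so that no group appears in both train and test.

    `groups` holds one group label per row (e.g. a patient_id).
    This is an illustrative helper, not a library API.
    """
    # Shuffle the distinct group labels reproducibly.
    unique_groups = sorted(set(groups))
    rng = random.Random(seed)
    rng.shuffle(unique_groups)

    # Assign whole groups (not individual rows) to the test set.
    n_test = max(1, round(len(unique_groups) * test_fraction))
    test_groups = set(unique_groups[:n_test])

    train_idx = [i for i, g in enumerate(groups) if g not in test_groups]
    test_idx = [i for i, g in enumerate(groups) if g in test_groups]
    return train_idx, test_idx


# Hypothetical data: five patients with two rows each.
patient_ids = ["a", "a", "b", "b", "c", "c", "d", "d", "e", "e"]
train_idx, test_idx = group_split(patient_ids, test_fraction=0.2)
```

Because whole groups are assigned to one side, the realized test fraction in rows can differ from `test_fraction` when group sizes are uneven.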

6 votes

1 answer

128 views

I wrote a code in R language to download PDF files from a website automatically, but the code didn't find the PDF file links, although there are links

I want to download PDF files from the website "https://register.awmf.org/de/start", but the code didn't find any PDF links, although the site does link to PDF files indirectly. I want to download all ...

Ward Khedr

61

asked Jun 7, 2025 at 13:05

0 votes

0 answers

34 views

Custom loss function not behaving as expected in PyTorch but does in TensorFlow

I tried modifying the reconstruction loss so that values pushed out of bounds do not contribute to the loss, and it works as expected in TensorFlow after training an autoencoder. However, ...

zvxayr

1

asked Apr 14, 2025 at 4:16

1 vote

0 answers

53 views

Using a differentiable Self-Organizing Map loss in a CNN

I've been trying to aggregate a normal CNN loss with a loss that quantifies how well we can cluster the second-to-last layer embeddings (i.e. feed the embeddings to a 2D Self Organizing Map (SOM) and ...

catalyst

11

asked Mar 30, 2025 at 13:43

6 votes

2 answers

90 views

Does it make sense to mix the labels in each batch?

When training a deep binary classification model, the model receives a batch at each training step (e.g. a batch of 32 samples). Let's assume that in each training batch there are ...

user3668129

829

asked Mar 24, 2025 at 14:28

3 votes

1 answer

143 views

How to incorporate weights (probability measurements) of data into a mean squared error loss function

I am training a CNN to regress on 4 targets related to a given image. Within the image is a point of interest whose position can be defined by phi and theta (corresponding to x and y of a normal ...

Jack Stethem

31

asked Jan 31, 2025 at 22:15
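
As a rough illustration of the title above, one common way to fold per-sample weights into a mean squared error is to scale each squared residual by its weight and normalize by the total weight. The sketch below is plain Python and is not taken from the question or its answer; the name `weighted_mse` and the choice to normalize by the sum of weights are assumptions.

```python
def weighted_mse(y_true, y_pred, weights):
    """Weighted mean squared error: sum(w * (t - p)^2) / sum(w).

    Illustrative helper; normalizing by the total weight is one of
    several possible conventions.
    """
    if not (len(y_true) == len(y_pred) == len(weights)):
        raise ValueError("y_true, y_pred and weights must have equal length")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    # Each squared residual contributes in proportion to its weight.
    weighted_sq_err = sum(
        w * (t - p) ** 2 for t, p, w in zip(y_true, y_pred, weights)
    )
    return weighted_sq_err / total_weight


# With equal weights this reduces to the ordinary MSE.
loss = weighted_mse([1.0, 2.0], [1.0, 4.0], [1.0, 1.0])
```

Setting a sample's weight to zero removes it from the loss entirely, which is why the weights should reflect how much each measurement is trusted.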

5 votes

2 answers

702 views

Is there any advantage of a lower value of a loss function?

I have two loss functions $\mathcal{L}_1$ and $\mathcal{L}_2$ to train my model. The model is predominantly a classification model. Both $\mathcal{L}_1$ and $\mathcal{L}_2$ are two variants of ...

Aleph

205

asked Jan 6, 2025 at 3:05

5 votes

1 answer

98 views

Taking into account instance cost in learning?

I am generally trying to take costs into account in learning. The setup is as follows: a statistical learning problem with the usual X and y, where y is imbalanced (roughly 1% of ones). Scikit-learn ...

Lucas Morin

3,274

asked Dec 27, 2024 at 9:47

1 vote

0 answers

57 views

Per Channel loss or Per Sample Loss

I am currently tackling a semantic segmentation problem where, for each sample, my goal is to segment two masks corresponding to two objects. Notably, object two is typically located inside object one,...

Ahmed Mohamed

251

asked Dec 27, 2024 at 2:10

3 votes

1 answer

123 views

Why softmax training is more stable

I'm wondering which activation function is easier to train with (better accuracy / smaller loss): softmax or sigmoid, for a multiclass classification problem. According to: https://...

user3668129

829

asked Nov 17, 2024 at 9:09

0 votes

1 answer

117 views

What exactly is a true distribution in ML problems?

I define a classification problem as a problem of calculating a function $h$ that approximates a function $f$ that classifies data. The approximation is calculated by taking a set of training samples ...

Leandro

25

asked Oct 11, 2024 at 14:07

0 votes

1 answer

102 views

Gradient output through custom loss function

I’m very new to PyTorch (and ML in general), so I’m having difficulty understanding what is going on with respect to a custom loss/cost function I’m looking at. I understand what’s going on in the function, but I ...

user3460324

1

asked Jul 18, 2024 at 20:59
