Newest 'machine-learning-model' Questions - Data Science Stack Exchange

3 votes

0 answers

35 views

What is the correct model selection protocol in order to generate the best prediction model?

We're evaluating a novel machine learning algorithm and we would like to ensure that its predictions are comparable to, if not better than, those of the baseline models. Let's say that we have a generic dataset &...

Filippo Portera

157

asked Feb 25 at 9:45

3 votes

1 answer

37 views

Why does my model perform worse after transforming the target?

I have a target with a skewed distribution, so I tried applying TransformedTargetRegressor from scikit-learn, using np.log1p as the function and np.expm1 as the inverse function. However, when I evaluate it, ...

Ocean

705

asked Jan 19 at 5:13
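
For readers unfamiliar with the setup this question describes, here is a minimal sketch of wrapping a regressor in TransformedTargetRegressor with np.log1p / np.expm1. The synthetic data and the choice of LinearRegression are assumptions for illustration only; the excerpt does not say which regressor or dataset the asker used.

```python
import numpy as np
from sklearn.compose import TransformedTargetRegressor
from sklearn.linear_model import LinearRegression

# Synthetic, right-skewed target (hypothetical data, not from the question).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = np.expm1(0.5 * X[:, 0] + 0.1 * rng.normal(size=200) + 1.0)

# The wrapper fits the inner regressor on log1p(y) and applies expm1
# to its predictions, so predict() returns values on the original scale.
model = TransformedTargetRegressor(
    regressor=LinearRegression(),  # assumed regressor, for illustration
    func=np.log1p,
    inverse_func=np.expm1,
)
model.fit(X, y)
pred = model.predict(X)
```

Note that the inner regressor minimizes error in log space, so a metric computed on the original scale (for example RMSE) is not what is being optimized and may look worse than for an untransformed fit, even when the model fits the transformed target well.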

12 votes

1 answer

2k views

Use of training data that has been labeled by the AI model itself

I'm a software engineer working with medical device AI models that predict diseases and other conditions. For the most part, I don't design the models but I help with getting FDA clearance for them. ...

raner

223

asked Nov 3, 2025 at 20:32

10 votes

1 answer

324 views

How do I train a regression model on time series data containing a band of zeros?

I am trying to create some kind of regression model. The target is continuous and can be both negative and positive. However, the issue is that there is a region/band that I know is roughly -50 to 50, ...

Denver Dang

213

asked Sep 13, 2025 at 13:22

33 votes

3 answers

5k views

Is class imbalance really a problem in machine learning?

Following on from my recent post on the topic, my goal here is to synthesise the excellent community wisdom on it over at Cross Validated into a "canonical" Q&A for the data science SE :)...

Robert Long

6,775

asked Sep 2, 2025 at 13:49

6 votes

2 answers

271 views

What are some good resources to read about recommendation systems to help build your own?

I am working on a content-based recommendation system. I am planning to frame this as a binary classification problem (1 = click, 0 = no click), and I was looking for papers/readings on feature ...

louise_vuitton

101

asked Aug 25, 2025 at 9:57

2 votes

0 answers

39 views

How to train Vanna AI to distinguish between two similar tables and their column values?

I am working with Vanna AI (text-to-SQL) and I have two problems regarding my database schema and how the model interprets it: Problem 1: Two similar tables I have two tables: SellingDocuments, ...

Joshie

21

asked Aug 24, 2025 at 15:25

0 votes

0 answers

73 views

RL - Updating rewards at every step based on filtering model, how to evaluate policies?

I am trying to apply Reinforcement Learning (RL) to the following partially observed setting. I would really appreciate hearing your thoughts on my question. I have a Markov process that evolves as $p(...

Uomond

1

asked Aug 20, 2025 at 19:21

0 votes

0 answers

45 views

churn prediction machine learning low precision

I am working on a churn prediction project, but my data is very imbalanced. I have tried many things, but this is the best model I can get. My main problem is that I want recall and precision ...

AW FOUR

1

asked Jul 15, 2025 at 19:34

0 votes

0 answers

32 views

Discrete Feature Imputation: How to Choose an Appropriate Data Distribution Model?

I am working on a dataset containing features that are discrete frequency counts. I understand that knowing the underlying data distribution is important for selecting an appropriate imputation method....

Emre

1

asked Jul 11, 2025 at 10:22

1 vote

0 answers

39 views

Fine-tuning Llama 3 to generate task dependencies (industrial planning)

I'm working on fine-tuning a language model (Meta-Llama-3-8B-Instruct) to generate a dependency graph for industrial tasks. The idea is: given a list of unordered tasks, the model should output a ...

lili

371

asked Jun 17, 2025 at 7:42

0 votes

0 answers

35 views

How to properly set up your X matrix for time-series classification

I am making predictions at the entity level, and for simplicity's sake, suppose there is only one feature. My goal is to set up my X matrix such that I can capture changes to the entity over different ...

Andrew Bell

1

asked Jun 9, 2025 at 17:52

1 vote

1 answer

79 views

Tips on how to fix sampling bias

I am trying to improve a classification model with a highly imbalanced dataset — the positive class has very few samples. To compensate, I added more positive-class samples to the training set only, ...

louise_vuitton

101

asked Jun 1, 2025 at 12:25

4 votes

1 answer

131 views

Which model is the best suitable for generating edges?

I'm trying to develop a model that would be able to generate dependencies between industrial tasks. To do that, I went for the GNN solution: I have nodes = tasks, dependencies = edges, and have ...

lili

371

asked May 22, 2025 at 9:25

2 votes

0 answers

68 views

DenseNet169 model accuracy not increasing on medical classification dataset

I am training a DenseNet model on a medical dataset which has gold-standard annotations. After training, I noticed the accuracy is just 60%. Later I made the following changes, but still no luck. ...

NIrbhay Mathur

123

asked May 22, 2025 at 4:15
