Questions tagged [predictive-modeling]
Statistical techniques used for predicting outcomes.
1,196 questions
0
votes
1
answer
28
views
What could be a dataset in which the presence of an outlier or a null value dramatically affects the performance of the decision tree?
I am tasked with giving an example of a dataset in which the presence of an outlier or a null value dramatically affects the performance of
a decision tree. I've searched and searched the web and I ...
1
vote
0
answers
46
views
Training with rich features, predicting with limited plan features
Introduction
I haven’t had the chance to work on a real data-science project yet — this would be my first one. I hope someone here can help.
Problem
For past years we have plan data ...
6
votes
1
answer
61
views
Suitable method to disaggregate time-series from national level to regional level?
I have data for five variables (in the form of time-series) that are reported at both national and regional levels. The response variable (also a time-series) is only reported at the national level- ...
3
votes
1
answer
103
views
Principal Data Analysis - how to determine the key features contribute to PC1 using scikit-learn python
I struggle to select the key features that contribute to PC1. I will use the public breast cancer dataset to illustrate the issue. Please feel free to point me to previous post if this question has ...
4
votes
1
answer
77
views
How to build model with smoothness via various data point
I am trying to model the arch of a basketball free throw projectory. Usually per person, this dataset has 6 points each where it is the height of the basketball via various seconds after the player ...
0
votes
0
answers
33
views
How to properly set up your X matrix for time-series classification
I am making predictions at the entity level, and for simplicity's sake, suppose there is only one feature. My goal is to set up my X matrix such that I can capture changes to the entity over different ...
0
votes
0
answers
40
views
ML Algorithms for Energy Market Price Forecasting During Volatile Periods (Post-Liberation)
just looking for some inspiration and to better understand the available options for modeling during volatile periods
I’ve already explored traditional ML models (e.g., linear/logistic regression, ...
2
votes
0
answers
35
views
Predicting when a variable length subscription will be cancelled
I have subscriptions that can have different duration/length. Some will have 1 month, some 3 months, some 12 months and so on.
How can I predict when a customer can cancel his subscription within the ...
1
vote
0
answers
38
views
How to correctly use a transformer model for a generating dependencies project
I'm currectly trying to train a model in order to predict dependencies between text, here it's industrials tasks, based on historical data. The goal is to learn that "Task A precedes Task B for ...
3
votes
0
answers
46
views
suppose 1 category in a variable create data leakage, can we use other categories in the same variable as dummy to predict?
We are predicting conversion. Conversion means customer converted from paying one-off to paying regular (subscribe)
If one feature is categorical feature "Activity" , consisting 15+ ...
6
votes
1
answer
126
views
What are some popular but outdated or ineffective practices in data science?
I was taught stepwise feature selection (like forward and backward selection) during college, and at the time, it seemed like a really effective way to pick features. But recently i have been reading ...
1
vote
0
answers
29
views
Can i use historical error occurrence count data every day from a machine to predict when the errors will cross a certain threshold?
I have been working on a project for predictive maintenance and have been studying research papers on it. According to my observation, predictive maintenance is mainly done using sensor data tracking ...
3
votes
0
answers
84
views
How can I link tasks using machine learning / ai based on historical task sequences?
I'm working on an AI model to predict dependency links between tasks for industrial plannifications, based on historical project data. I have two tables:
Task Table (15 sheets, one sheet = one ...
1
vote
0
answers
40
views
Need help with a task dependency links model - part 2
Sorry for the confusion — English is not my first language and I may have expressed myself poorly. Lost access to my guest account so i'll be giving more details about my previous question ( Need help ...
5
votes
1
answer
146
views
How to do Exploratory Data Analysis when my response variable is binary?
I am doing a multilevel regression, and my response variable is binary (presence of females on a tech board). all the EDA methods i know are about plotting correlation, but this as this is a binary i ...