Skip to main content
Tooling
0 votes
3 replies
57 views

I have a raw time series dataset. Can I use classification models on time series data? If yes which classification models can I use without forecasting? Most examples I’ve seen focus on forecasting or ...
Rifa Martha's user avatar
Advice
0 votes
0 replies
32 views

Recently I have been practicing with AutoEncoders and trying to use them for anomaly detection. I have been told that using a threshold on the loss to classify anomalies (calibrated with roc-curve) is ...
wat's user avatar
  • 1
Best practices
1 vote
2 replies
58 views

I am working on a categorical classification problem using the Portugal Bank Marketing Dataset. The target variable is binary (yes / no), and the dataset is highly imbalanced (far more no than yes). I ...
Dinithi's user avatar
Tooling
0 votes
1 replies
55 views

I’m working on a prompt-based binary classification task using an LLM, where the main goal is to maximize precision. Instead of assigning a label to every input, I want the system to: • Assign a ...
Raviteja Mn's user avatar
Best practices
0 votes
2 replies
46 views

I created a random forest classifier in R intended to identify individual urban tree species/genera. I have a large train/test dataset (n= 200k) with about 30 predicotrs that are mostly spectral ...
Bob SomeAle's user avatar
Advice
0 votes
2 replies
85 views

I am currently trying the semi-supervised classification (SSC) library in R using code from the vignette. The vignette removes some observations from the Wine dataset such that it's partially labelled ...
EB3112's user avatar
  • 339
1 vote
0 answers
56 views

For a bit of context, I am working in a lab where we use a dissimilarity map to characterize textures between them, called LDM (Local Dissimilarity Maps)[1]. Recently, this was further enhanced by ...
mosfet's user avatar
  • 66
Tooling
0 votes
1 replies
80 views

I have a large dataset containing small-scale / local company names, and I need to categorize each company into sectors such as Tech, Industrial, Finance, Retail, etc. The problem is: These companies ...
Preksha Parekh's user avatar
0 votes
0 answers
38 views

am performing regression analysis using the fitnet function to develop a supervised neural network that acts as a surrogate model. The training target data have specific constraints: all outputs must ...
shiyu shukla's user avatar
0 votes
1 answer
96 views

If I am using stratified 10-folds for classification/regression tasks, where do I need to define the logic for hyperparameter tuning using Scikit or Wandb? Should it be inside the loop or outside? I ...
Ayesha Kiran's user avatar
0 votes
1 answer
87 views

I’m developing a tree-based model classifier (XGBoost) using some healthcare (patient visits) data. The data has a time dimension, and I want to observe if there is a longitudinal effect for the ...
Sasoo's user avatar
  • 1
0 votes
0 answers
60 views

I’m building a PyTorch binary classifier using ~9 months of daily data. There’s extremely strong seasonality in the positive rate, and I only have 9 months total, so a whole year of training data is ...
Diddley4209's user avatar
0 votes
1 answer
58 views

I am a little bit lost in tidymodels. I have a some data from topicmodeling: prevalent_topic: factor variable with most prevalent topic, ranging from "Topic_1" to "Topic_5" value1 ...
PsyR's user avatar
  • 35
1 vote
0 answers
97 views

When trying to fit scikit-learn DecisionTreeClassifier on my data, I am observing some weird behavior. x[54] (a boolan feature) is used to break the 19 samples into 2 and 17 on top left node. Then ...
Krishna's user avatar
  • 1,682
0 votes
2 answers
112 views

I have a dataset of olive oil samples and the goal of creating a classification model for oil quality. I'm having trouble deciding how to deal with missing data. have a look at the data here if you ...
BOBTHEBUILDER's user avatar

15 30 50 per page
1
2 3 4 5
517