Skip to main content

Questions tagged [random-forest]

Random forest is a machine-learning method based on combining the outputs of many decision trees.

1 vote
0 answers
34 views

I have a nice multiclass random forest model in R (using the packages ranger and caret) but I think this question applies to any random forest logic. When I use my RF to label unknown data I want to ...
Dr Egg's user avatar
  • 11
3 votes
0 answers
32 views

Is there any existing open source software implementation of mixed effects random forest regression (for clustered data) that employs conditional inference decision trees as base learners, and enables ...
Mike's user avatar
  • 63
0 votes
0 answers
61 views

I used a robust linear regression to evaluate the impact of some variables on a dependent variable, their linear correlation being tested and proven. Now, I want to compute an importance score of ...
Corina's user avatar
  • 1
0 votes
0 answers
89 views

TLDR : confusion matrix is used to validate a model. But I also want to make predictions using my models. Can I use the confusion matrix to make predictions? I don't see any other way to do it, but I ...
Siva Kg's user avatar
  • 23
0 votes
0 answers
55 views

Given these two target representations for the same underlying data: Target A : Minority class samples (Cluster 5) isolated in distribution tail, majority class samples (Clusters 3+6) shifted toward ...
n0rdp0l's user avatar
1 vote
0 answers
62 views

I am learning currently about decision trees and have read about bagging and random forests method. Since bagging and random forests rely on the fact that data is IID, so that bootstrapping makes ...
Daniil's user avatar
  • 111
8 votes
1 answer
207 views

A typical workflow in machine learning is to split data into train and test sets, using the former to develop a model and the latter to evaluate its ability to generalize. Some dispute this as a best ...
Dave's user avatar
  • 72.9k
1 vote
0 answers
76 views

I’m working on species distribution modeling with binary data (presence / absence, 1 / 0). My target species is extremely rare (prevalence ~0.014), so my dataset is almost all zeros and just a handful ...
LolaRT96's user avatar
0 votes
0 answers
46 views

I'm building a classification pipeline that evaluates multiple predictive models across different feature sets, each generated using a distinct feature selection method. Feature selection methods: ...
randomstate42's user avatar
0 votes
0 answers
34 views

I’m using randomForest in R solely for feature selection, not for prediction. The model is trained on all available data, and variable importance is assessed using <...
Bertram's user avatar
2 votes
0 answers
70 views

I’m working with a large dataset (≈700k observations) from an experiment, involving ≈5k patients and repeated trials across ≈50 covariates. The data structure includes multiple levels of clustering, ...
CapsLock's user avatar
1 vote
1 answer
61 views

I have a binary outcome and multiple covariates. I am calculating the AUC for a fitted random forest model (using the party::cforest function to fit the random forest). Some of my covariates are ...
user70810's user avatar
0 votes
0 answers
55 views

I am rather new to the world of random forests and have been using the tidy models package in R. For context I am running a random forest with 7 predictors on a testing data set of 5,122 observations. ...
antR's user avatar
  • 354
1 vote
1 answer
118 views

I'm working on fitting a random forest model using the caret library in R with a repeated cross-validation design to select hyperparameters. I've also experimented with adjusting the number of trees (...
Mdhale's user avatar
  • 133
0 votes
0 answers
46 views

Context I'm trying to create an Ensemble survival neural network with a custom loss function which consist of 3 base models, Random Survival Forest (RSF), Gradient Boosting Survival Model (GBSM) and a ...
Yugan Gogul Muthukumar's user avatar

15 30 50 per page
1
2 3 4 5
167