Skip to main content

Questions tagged [model-selection]

Model selection is a problem of judging which model from some set performs best. Popular methods include $R^2$, AIC and BIC criteria, test sets, and cross-validation. To some extent, feature selection is a subproblem of model selection.

0 votes
0 answers
44 views

Suppose I was given a data set, say, golf, in the form of an MLR model. Given that best subset selection is choosing the top 5 best models of each size, how would ...
DavyJonessss's user avatar
1 vote
0 answers
22 views

The paper https://arxiv.org/pdf/1811.12808 by Sebastian Raschka explains how to perform 3-way holdout method, and also how to compute the final model (used in production). During computation of the ...
Ayrat's user avatar
  • 43
2 votes
0 answers
61 views

In cross-validation, $k$-folds are a common way to train, compare and validate models. Often we want to find an optimal set of hyperparameters for our models. There are many ways to probe the ...
Markus Klyver's user avatar
0 votes
0 answers
52 views

I am using GAMMs to model the probability of occurrence of a species, applying logistic regressions with mgcv::bam() to presence-pseudoabsence data. The dataset ...
airC's user avatar
  • 41
0 votes
0 answers
41 views

I have run an OLS regression and detected that it contains autocorrelation and heteroskedasticity. To deal with this I intend to use Newey-West standard errors. But I am not sure what is the proper ...
Mateo Bergman's user avatar
0 votes
0 answers
55 views

I want to simulate data with missing values and use them to compare the predictive performance of several machine learning algorithms, including LASSO. All analyses will be performed in R, using the ...
Benykō-Zamurai's user avatar
0 votes
1 answer
76 views

I'm analyzing an experiment I ran with bumblebees, and really struggling with choosing the appropriate model. In the experiment, each bee made feeder choices across two temperature conditions: ...
bee-researcher's user avatar
1 vote
0 answers
63 views

I'm building a species distribution model using MaxEnt with 260 presence points, collected opportunistically within a relatively small study area (a single administrative department in France). I'm ...
Martin Eden's user avatar
0 votes
0 answers
41 views

I have a model set with 36 candidate models and 4 models with an AIC less than or equal to 2.0. I do not want to model average because I don't think my candidate set really fits in with the caveats ...
Amanda Goldberg's user avatar
1 vote
1 answer
43 views

Most DCC-GARCH tutorials and guides I found online often use "replicate" in creating their DCC specification, i.e. ...
Matt's user avatar
  • 43
0 votes
1 answer
93 views

DCC-GARCH is comprised of two stages: (1) estimating the univariate GARCH and (2) estimating the correlations through DCC. My time series (bond yields) is not normally distributed, as they rejected ...
Matt's user avatar
  • 43
1 vote
1 answer
65 views

I estimated the univariate GARCH models for each series, and all coefficients are statistically significant. However, upon putting them into one DCC-GARCH model with a DCC(1,1) spec, the individual ...
Matt's user avatar
  • 43
1 vote
1 answer
79 views

I would like to know whether Goodness of Fit Tests (like Pearson's Chi-squared test or Kolmogorov-Smirnov Test) be used to select which probabilistic distribution model certain empirical observation ...
Luthfi Ahmad's user avatar
0 votes
1 answer
52 views

Learning about EM algorithms and finite mixture models and I've run into a particularly unintuitive problem. I'm trying to fit a finite mixture regression model on simulated data, where the true ...
dancing_monkeys's user avatar
0 votes
0 answers
76 views

I’m currently working on multiple regression analyses with a small sample (n = 36), using multiple imputation via the mice package in R (5 imputed datasets). The ...
statsInPractice's user avatar

15 30 50 per page
1
2 3 4 5
136