Skip to main content

Questions tagged [model-selection]

Model selection is a problem of judging which model from some set performs best. Popular methods include $R^2$, AIC and BIC criteria, test sets, and cross-validation. To some extent, feature selection is a subproblem of model selection.

Filter by
Sorted by
Tagged with
0 votes
0 answers
44 views

Suppose I was given a data set, say, golf, in the form of an MLR model. Given that best subset selection is choosing the top 5 best models of each size, how would ...
DavyJonessss's user avatar
1 vote
0 answers
22 views

The paper https://arxiv.org/pdf/1811.12808 by Sebastian Raschka explains how to perform 3-way holdout method, and also how to compute the final model (used in production). During computation of the ...
Ayrat's user avatar
  • 43
2 votes
0 answers
61 views

In cross-validation, $k$-folds are a common way to train, compare and validate models. Often we want to find an optimal set of hyperparameters for our models. There are many ways to probe the ...
Markus Klyver's user avatar
0 votes
0 answers
52 views

I am using GAMMs to model the probability of occurrence of a species, applying logistic regressions with mgcv::bam() to presence-pseudoabsence data. The dataset ...
airC's user avatar
  • 41
0 votes
0 answers
41 views

I have run an OLS regression and detected that it contains autocorrelation and heteroskedasticity. To deal with this I intend to use Newey-West standard errors. But I am not sure what is the proper ...
Mateo Bergman's user avatar
0 votes
0 answers
55 views

I want to simulate data with missing values and use them to compare the predictive performance of several machine learning algorithms, including LASSO. All analyses will be performed in R, using the ...
Benykō-Zamurai's user avatar
0 votes
1 answer
76 views

I'm analyzing an experiment I ran with bumblebees, and really struggling with choosing the appropriate model. In the experiment, each bee made feeder choices across two temperature conditions: ...
bee-researcher's user avatar
1 vote
0 answers
64 views

I'm building a species distribution model using MaxEnt with 260 presence points, collected opportunistically within a relatively small study area (a single administrative department in France). I'm ...
Martin Eden's user avatar
0 votes
0 answers
41 views

I have a model set with 36 candidate models and 4 models with an AIC less than or equal to 2.0. I do not want to model average because I don't think my candidate set really fits in with the caveats ...
Amanda Goldberg's user avatar
1 vote
1 answer
43 views

Most DCC-GARCH tutorials and guides I found online often use "replicate" in creating their DCC specification, i.e. ...
Matt's user avatar
  • 43
0 votes
1 answer
93 views

DCC-GARCH is comprised of two stages: (1) estimating the univariate GARCH and (2) estimating the correlations through DCC. My time series (bond yields) is not normally distributed, as they rejected ...
Matt's user avatar
  • 43
1 vote
1 answer
65 views

I estimated the univariate GARCH models for each series, and all coefficients are statistically significant. However, upon putting them into one DCC-GARCH model with a DCC(1,1) spec, the individual ...
Matt's user avatar
  • 43
1 vote
1 answer
79 views

I would like to know whether Goodness of Fit Tests (like Pearson's Chi-squared test or Kolmogorov-Smirnov Test) be used to select which probabilistic distribution model certain empirical observation ...
Luthfi Ahmad's user avatar
0 votes
1 answer
52 views

Learning about EM algorithms and finite mixture models and I've run into a particularly unintuitive problem. I'm trying to fit a finite mixture regression model on simulated data, where the true ...
dancing_monkeys's user avatar
0 votes
0 answers
76 views

I’m currently working on multiple regression analyses with a small sample (n = 36), using multiple imputation via the mice package in R (5 imputed datasets). The ...
statsInPractice's user avatar
1 vote
0 answers
42 views

I'm attempting to train a model to parse maritime location ranges. These are strings that can be resolved into a geographical area or a list of shipping ports. An example could be ...
Stromgren's user avatar
  • 119
6 votes
1 answer
280 views

There are many resources explaining why automatic variable selection is bad (e.g. here). Regarding the selection of $p$, $d$, $q$ parameters in ARIMA models, the Hyndman-Khandakar algorithm combines ...
Thomas's user avatar
  • 600
0 votes
0 answers
46 views

I’m modeling longitudinal substance use (number of days consumed over 30 days) for ~930 patients with repeated measures. The outcome is modeled with a beta-binomial distribution (logit link, glmmTMB ...
W. IC.'s user avatar
  • 23
0 votes
0 answers
51 views

I am currently trying to build a model to link water quality metrics (e.g. biochemical oxygen demand, chemical oxygen demand) with regional characteristics data (e.g. population, GDP) through multiple ...
Osuke Miyamaru's user avatar
6 votes
2 answers
521 views

From Shmueli's paper "To Explain or to Predict?", which also has a section about descriptive modeling (section 1.3): (see also this page) [Descriptive modeling] is aimed at summarizing or ...
Thomas's user avatar
  • 600
1 vote
1 answer
133 views

I'm currently studying the Least Angle Regression algorithm by Efron et al. (https://arxiv.org/abs/math/0406456). After equation (2.22) in Efron et al., the authors claim the following: It is easy to ...
flushel's user avatar
  • 155
2 votes
0 answers
80 views

Can AUC be used for model selection, and how can the excessive number of features/parameters be penalized in this case? In frequentist framework we have various model selection criteria, like AIC, BIC,...
Roger V.'s user avatar
  • 5,091
5 votes
0 answers
144 views

In R, I want to use a repeated measures analysis with a mixed regression model to analyze how the mean of my response variable (mean bee pollination score) varies based on 1) week, 2) number of bee ...
Emily's user avatar
  • 51
0 votes
0 answers
48 views

I'm new to (generalized) linear mixed effects models. Any help would be appreciated! Below is my study design with dummy data. I'm exploring the effects of the parameters I manipulated in game 1 on ...
fox_jane's user avatar
2 votes
1 answer
117 views

I would like to run a model in R with two binary dependent variables. I know how to model an interaction on the independent variable, but is it possible to do this on the dependent variable too? If my ...
Milli's user avatar
  • 21
0 votes
0 answers
31 views

If I have a single model say y = ax^2 + bx + c, can I use 3 linear regression algorithms y=ax^2, y=ax and y=a to learn the original function if use the same data set. Please help me out here.
Neelesh Samptur's user avatar
0 votes
0 answers
53 views

I'm doing project using ARIMA and i face a problem where I cannot choose the order for ARIMA model. I know that i had to choose the order by identifying the significant lag, but the PACF plot showing ...
Milda KS's user avatar
4 votes
1 answer
114 views

Suppose I have a family of $N$ models for the same data, indexed by $n\in\{1,\dots,N\}$. And suppose that model $n\in\{1,\dots,N\}$ has log-likelihood given by: $$L(X_n \theta_n),$$ where $L:\mathbb{R}...
cfp's user avatar
  • 565
1 vote
1 answer
71 views

Let's say I pick any of the winning surrogate models in my nested cv (in theory if you do k outer folds you could have k surrogate models) to simplify things, lets say I pick the first model and just ...
iYOA's user avatar
  • 185
2 votes
0 answers
87 views

Let $\mathcal{D}_{\text{train}}$ be a training dataset, and let $D_{\text{test}} = \{(x_{\text{test}}, y_{\text{test}})\}$ be a single holdout test point drawn independently from the same distribution ...
iYOA's user avatar
  • 185
0 votes
0 answers
78 views

In nested cross validation, I'm seeing an interesting scenario that I'd like to understand better: Using 4-fold outer CV, my model selection process chose Model A overall (it performed best on average ...
iYOA's user avatar
  • 185
1 vote
1 answer
141 views

I am trying to fit the following panel regression with fixed entity effects $$Y_{it} = \alpha_i + \sum_j \beta_jX^{(j)}_{it} + \epsilon_{it},$$ where the index $j$ labels the different features. Some ...
Mark Dubin's user avatar
0 votes
0 answers
45 views

I want to study the Conditional Variance of various crypto-currencies returns series (13, of which 5 meme, 8 "serious"). Since my main focus is the asymmetric response of the variance ...
Hodmezor's user avatar
1 vote
1 answer
107 views

I want to fit a mixture of Gaussian to simulated data. Then, I need to calculate the Bayesian information criteria for each mixture component. My point is that, after the model convergence, I ...
Dr. Statistics's user avatar
6 votes
1 answer
476 views

I am working on a fault detection problem for a mechanical system where the goal is to determine the fault type. I use a dataset that for each type of fault (target label) has three sizes and each ...
S.H.W's user avatar
  • 99
1 vote
1 answer
104 views

Apologies for cross-posting I am starting to use Lasso and cross validation for model selection to explain a dependent variable using linear models, but I can not understand why all p-values ...
Rodrigo Badilla's user avatar
1 vote
1 answer
93 views

In Andrews Ng's machine learning notes (https://cs229.stanford.edu/main_notes.pdf), he introduced the following bound for the difference between generalization error and training error (see the ...
ExcitedSnail's user avatar
  • 3,090
0 votes
0 answers
90 views

I'm studying which variant of variational autoencoders (VAE) gives better expected calibration error (ECE) (see also this doc) under small dataset. According to google's tuning playbook, to compare ...
Kaiwen's user avatar
  • 307
5 votes
1 answer
320 views

I have a follow-up question to this OP. I hope to understand the difference between comparing 2 models with AIC, and interpreting the summary output of the full model - specifically for GAMs. Gavin ...
Nate's user avatar
  • 2,537
1 vote
1 answer
92 views

Admittedly, I am a bit inexperienced in the world of statistics and data modeling but am trying my best to learn on the job. As a first time user, I apologize if there are any formatting errors here! ...
LonelyBadger12's user avatar
23 votes
4 answers
3k views

When it comes to data exploration, aside from checking for outliers (human error), correlated covariates, and missing values, is there a downside to viewing relationships between a response variable ...
Nate's user avatar
  • 2,537
16 votes
2 answers
841 views

I understand AIC is asymptotically equivalent to leave-one-out cross-validation and that BIC has a similar asymptotic equivalence to leave-k-out cross-validation. My question is, other than ...
Louis F-H's user avatar
  • 271
0 votes
1 answer
111 views

This is the ACF and PACF for my the first difference of my variable $\Delta y_t,$ I used the ADF test, the PP test, the Schmidt Phillips test and the DFGLS test, and got the same result that my ...
alyosha's user avatar
3 votes
2 answers
309 views

The monograph Cross Validation contains a section on nested cross-validation for hyper-parameter optimisation (page 6). The author refers to this paper for a reason why it is better to decouple hp-...
Ayrat's user avatar
  • 43
2 votes
0 answers
59 views

I am trying to select the number of Principal Components of this data following the optimality criteria of Bai and Ng (2002), on R. The function ICr from the ...
oibaFox's user avatar
  • 21
1 vote
0 answers
56 views

I’m using nested cross-validation to evaluate multiple models and hyperparameter configurations. After running trials with different random seeds (outer: 3-fold with 10 seeds, inner: 5-fold with 50 ...
iYOA's user avatar
  • 185
0 votes
0 answers
32 views

I am looking for documents and online sources to understand whether or not I should exclude variables from my model through model selection (variable selection). I also tried to use methods of Least ...
Student coding's user avatar
6 votes
2 answers
330 views

In many medical science fields "hierarchical regression" is a popular method. The approach is to break variables into categories, add one category of variables at a time and then remove ...
purple-blade's user avatar
2 votes
1 answer
153 views

I have a quick question concerning model selection for linear mixed effects models: When directly comparing AICs of two models (either including or excluding an additional fixed effect) versus ...
Julia's user avatar
  • 41
3 votes
1 answer
135 views

I have on more than one occasion come across both recently-published textbooks and classes that teach the use of stepwise methods for model construction. Why is this still done, given the problems ...

1
2 3 4 5
41