Questions tagged [cross-validation]
Repeatedly withholding subsets of the data during model fitting in order to quantify the model performance on the withheld data subsets.
3,518 questions
6
votes
1
answer
71
views
How can I evaluate a time‑series forecasting model when I must train on the entire small dataset?
I’m building a Python forecasting pipeline that tries several models:
Holt‑Winters (tuned with Optuna)
ARIMA (via pmdarima.auto_arima)
XGBoost (tuned with Optuna)
...
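A minimal sketch of rolling-origin (expanding-window) evaluation, one common answer to this situation; the series below is synthetic and the naive last-value forecast is only a stand-in for the Holt-Winters / ARIMA / XGBoost candidates (assumes scikit-learn):

```python
# Rolling-origin evaluation: every validation block lies strictly after
# the data used to fit it, and the final model is refit on everything.
import numpy as np
from sklearn.model_selection import TimeSeriesSplit
from sklearn.metrics import mean_absolute_error

rng = np.random.default_rng(0)
y = np.cumsum(rng.normal(size=80))            # stand-in for the small series

errors = []
for train_idx, test_idx in TimeSeriesSplit(n_splits=5).split(y):
    y_train, y_test = y[train_idx], y[test_idx]
    # fit the candidate model on y_train here; placeholder forecast:
    y_pred = np.repeat(y_train[-1], len(y_test))
    errors.append(mean_absolute_error(y_test, y_pred))

print("mean MAE over splits:", np.mean(errors))
# after model selection, refit the chosen model on the entire series
```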
2
votes
0
answers
61
views
Do k-folds risk sampling bias and, if so, how do we avoid it?
In cross-validation, $k$-folds are a common way to train, compare and validate models. Often we want to find an optimal set of hyperparameters for our models. There are many ways to probe the ...
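One widely used safeguard against unevenly sampled folds is stratification. A minimal sketch, assuming a scikit-learn classification setting with synthetic imbalanced data:

```python
# Stratified folds keep each fold's class proportions close to the full
# data's, removing one common source of fold-to-fold sampling bias.
from sklearn.datasets import make_classification
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, weights=[0.8, 0.2], random_state=0)
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=cv)
print(scores.mean(), scores.std())
```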
2
votes
1
answer
58
views
Should differential expression analysis be incorporated in cross validation for training machine learning models?
I'm conducting some experiments using TCGA-LUAD clinical and RNA-Seq count data. I'm building machine learning models for survival prediction (Random Survival Forests, Survival Support Vector Machines,...
2
votes
0
answers
59
views
Cross-validating multi-output models: importance + SHAP
I am currently developing a project that deals with multiple targets which can have different cardinalities. The idea is to use different ML models (e.g. Random Forest, SVM, AdaBoost) and ...
0
votes
0
answers
25
views
What is the best way to determine if cross validated R-squared scores are significantly different? [duplicate]
I'm comparing, pairwise, the results of Linear Regression models with transformations applied to one numerical feature and the target. I'm using K-fold cross-validation scoring with R-squared. The ...
1
vote
0
answers
56
views
How to choose between ARIMA and ARFIMA?
I am in the position of having a time series dataset that I can model well using either an Autoregressive Fractionally Integrated Moving Average (ARFIMA) model or an ARIMA model. I'm asking for ways to ...
4
votes
1
answer
519
views
Should I normalize both train and validation sets or only the train set?
I have a question about normalization when merging training and validation sets for cross-validation.
Normally, I normalize using re-scaling (Min-Max Normalization) calculated from the training set ...
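The usual recommendation is to fit the scaler on the training folds only and reuse its parameters on the validation fold, which a Pipeline does automatically. A minimal sketch with a built-in dataset:

```python
# Wrapping MinMaxScaler in a Pipeline means each CV split fits the
# scaler on the training folds only and merely applies it to the
# validation fold, so no validation statistics leak into training.
from sklearn.datasets import load_diabetes
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MinMaxScaler
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

X, y = load_diabetes(return_X_y=True)
model = make_pipeline(MinMaxScaler(), Ridge())
print(cross_val_score(model, X, y, cv=5, scoring="r2").mean())
```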
1
vote
2
answers
273
views
A proper approach to K-fold cross validation on imbalanced data
What is the proper algorithm for k-fold CV in the case of class balancing (under-/over-sampling)?
Variant 1:
split data into train and test set
balance classes in the train set
run k-fold CV
Variant 2:
...
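A common answer is that resampling belongs inside each training fold, with the validation fold left untouched. A minimal sketch using plain scikit-learn (synthetic imbalanced data, random oversampling of the minority class):

```python
# "Balance inside each fold": the minority class is oversampled on the
# training folds only, and the validation fold keeps the true class mix,
# so the reported scores are not distorted by the resampling.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import StratifiedKFold
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import balanced_accuracy_score
from sklearn.utils import resample

X, y = make_classification(n_samples=300, weights=[0.9, 0.1], random_state=0)
scores = []
for tr, va in StratifiedKFold(n_splits=5, shuffle=True, random_state=0).split(X, y):
    X_tr, y_tr = X[tr], y[tr]
    minority = y_tr == 1
    X_min, y_min = resample(X_tr[minority], y_tr[minority],
                            replace=True, n_samples=int((~minority).sum()),
                            random_state=0)
    X_bal = np.vstack([X_tr[~minority], X_min])
    y_bal = np.concatenate([y_tr[~minority], y_min])
    clf = LogisticRegression(max_iter=1000).fit(X_bal, y_bal)
    scores.append(balanced_accuracy_score(y[va], clf.predict(X[va])))
print(np.mean(scores))
```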
4
votes
1
answer
133
views
When and how can unsupervised preprocessing before splitting data lead to overoptimistic model performance?
Conceptually, I understand that models should be built totally blind to the test set in order to most faithfully estimate performance on future data. However, I'm struggling to understand the extent ...
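A small experiment makes the extent of the effect concrete. The sketch below, on synthetic data, contrasts fitting PCA on all rows before CV with fitting it inside each training fold via a Pipeline; for label-blind steps like PCA the gap is often modest, but it can grow when n is small relative to the dimensionality or the data are not i.i.d.:

```python
# "leaky" fits PCA on all rows before CV; "clean" fits PCA inside each
# training fold via a Pipeline. Numbers vary by dataset; the point is
# only how to set up the comparison.
from sklearn.datasets import make_classification
from sklearn.decomposition import PCA
from sklearn.pipeline import make_pipeline
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=100, n_features=50, random_state=0)

leaky = cross_val_score(LogisticRegression(max_iter=1000),
                        PCA(n_components=10).fit_transform(X), y, cv=5)
clean = cross_val_score(make_pipeline(PCA(n_components=10),
                                      LogisticRegression(max_iter=1000)),
                        X, y, cv=5)
print(leaky.mean(), clean.mean())
```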
0
votes
0
answers
55
views
LASSO and cross validation when dealing with missing data
I want to simulate data with missing values and use them to compare the predictive performance of several machine learning algorithms, including LASSO. All analyses will be performed in R, using the ...
4
votes
1
answer
89
views
Confused about the utility of nested cross-validation vs k-fold cross-validation
I am using nested cross validation in mlr3 to tune my model's hyperparameters and gauge its out-of-sample performance. Previously, when I was performing regular k-fold CV, my understanding was that ...
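A minimal nested-CV sketch in scikit-learn (rather than mlr3), showing the division of labour: the inner GridSearchCV tunes, while the outer loop scores the whole tuning procedure.

```python
# Nested CV: the inner loop picks the hyperparameter, the outer loop
# estimates how well "tune-then-fit" generalises, so the outer score is
# not biased by the tuning itself.
from sklearn.datasets import load_breast_cancer
from sklearn.svm import SVC
from sklearn.model_selection import GridSearchCV, cross_val_score, KFold

X, y = load_breast_cancer(return_X_y=True)
inner = GridSearchCV(SVC(), {"C": [0.1, 1, 10]}, cv=3)
outer_scores = cross_val_score(inner, X, y,
                               cv=KFold(n_splits=5, shuffle=True, random_state=0))
print(outer_scores.mean(), outer_scores.std())
```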
1
vote
1
answer
122
views
How to choose and structure a GLM for species richness with non-normal distribution? [closed]
I know my next steps involve using a GLM and selecting the type of GLM based on my response variables (possibly gamma or Poisson regression?).
I also need to standardise explanatory variables to be ...
0
votes
1
answer
147
views
Comparing AUROCs of binary classifiers across cross-validation folds: alternatives to DeLong
I have two binary classifiers and would like to check whether there is a statistically significant difference between the area under the ROC curve (AUROC). I have reason to opt for AUROC as my ...
2
votes
0
answers
31
views
How can one statistically compare machine learning models based on the results of a cross validation? [duplicate]
It is often recommended that one use k-fold cross-validation to estimate the generalisation ability of a machine learning model. Most resources I've found, however, do not address what one should do after ...
0
votes
0
answers
64
views
Time series LASSO K-fold cross validation
This topic has been discussed before but I couldn't find a specific answer.
Here's my approach to forecasting QoQ values:
Run the usual LASSO K-fold CV on timeseries data and generate a one-step ahead ...
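If the penalty is to be chosen on time-series data, one option is to replace the usual K-fold splitter with forward-chaining splits. A minimal sketch with synthetic stand-in regressors (building the actual one-step-ahead lag features is not shown):

```python
# Choosing the LASSO penalty with forward-chaining splits instead of
# ordinary K-fold, so each validation block always comes after the data
# used to fit it.
import numpy as np
from sklearn.linear_model import LassoCV
from sklearn.model_selection import TimeSeriesSplit

rng = np.random.default_rng(0)
X = rng.normal(size=(120, 8))                 # stand-in lagged regressors
y = 0.5 * X[:, 0] + rng.normal(size=120)

model = LassoCV(cv=TimeSeriesSplit(n_splits=5)).fit(X, y)
print(model.alpha_)
```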
0
votes
1
answer
60
views
Data cross validation to predict label from cluster analysis [closed]
My project has the following steps:
Use the elbow method to determine the features and number of clusters for kmeans.
Run kmeans on the data (with the determined features and n clusters), which gives the ...
2
votes
1
answer
111
views
When Does a Mediation Model Qualify as SEM Without a Direct IV→DV Path?
I’m trying to understand structural equation modeling (SEM) for hypothesis models and have questions about when to apply SEM. I have three models in mind:
• Model 1: IV → M → DV
• Model 2: IV → M1 →...
2
votes
1
answer
160
views
How many folds should an unnested CV have compared to a nested CV?
I read in the mlr3 book about nested resampling that:
Nested resampling is a method to compare models and to estimate the generalization
performance of a tuned model, however, this is the performance ...
1
vote
1
answer
122
views
Huge steps in AUROC plot
I'm building a model for a binary classification task. Because my dataset is pretty small (~86 samples with 68 class 0 and 18 class 1), I'm using a nested k-fold cross validation (5-inner loops and 5-...
1
vote
0
answers
55
views
How to compare two kappa statistics from the same group of raters, rating the same subjects, but under two different conditions?
Is there a statistical way to compare two kappa statistics from the same group of raters, rating the same subjects, but under two different conditions (low vs. high field strength MRIs)? We can't ...
3
votes
1
answer
96
views
K-folds cross validation application
We have a small dataset of n=130. The current step is exploring the data, looking for anything interesting. Our primary aim is to compare whether using an additional variable helps improve model ...
4
votes
1
answer
274
views
Simple procedure for feature selection given correlated predictors
I am trying to make a linear regression predictive model between a continuous dependent variable and a set of continuous predictors. I have a large number (~5000) of these predictor variables (...
1
vote
0
answers
42
views
Calibrated Classifier on Training Data [closed]
If I am using GridSearchCV to find hyperparameters on a training set, and I then run a CalibratedClassifierCV to tune my probabilities, would it suffice to fit the CalibratedClassifierCV with ...
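A minimal sketch of one common pattern for this setup, not necessarily the only valid one: tune with GridSearchCV on the training split, then wrap an unfitted clone of the winner in CalibratedClassifierCV, which refits it on internal folds and learns the calibration map on the held-out parts of those folds. Whether that suffices depends on how much data the training split can spare for calibration.

```python
# Tune on the training split, then calibrate on the same split via
# CalibratedClassifierCV's internal cross-fitting; probabilities are
# then inspected on a held-out test split.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.ensemble import RandomForestClassifier
from sklearn.calibration import CalibratedClassifierCV
from sklearn.base import clone

X, y = make_classification(n_samples=1000, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

grid = GridSearchCV(RandomForestClassifier(random_state=0),
                    {"max_depth": [3, 5, None]}, cv=5).fit(X_tr, y_tr)

calibrated = CalibratedClassifierCV(clone(grid.best_estimator_),
                                    method="isotonic", cv=5).fit(X_tr, y_tr)
proba = calibrated.predict_proba(X_te)[:, 1]
```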
0
votes
0
answers
51
views
Calculating Standard Deviation of RMSE of an unsupervised algorithm
If there is an ML model, the standard deviation (SD) of the root mean squared error (RMSE) can be calculated using time series splits by fitting the model on different training sets and evaluating it ...
1
vote
1
answer
87
views
Comparing two cross validation methods for hyperparameter tuning
For cross validation of hyperparameters, I have a question about which approach is generally considered better in the context of running regularized regression (specifically elastic net l1, l2 ...
6
votes
1
answer
133
views
Evaluating a model in a small sample using a test set: bootstrap vs. LOOCV
The thread Evaluating a classifier with small samples considers the problem in its title. Specifically, the question is about splitting off the test set from the rest of the data many times instead of ...
6
votes
2
answers
203
views
Evaluating classifier with small samples
I'm trying to evaluate two classifiers by splitting the sample into training and test samples with a 50-50 split. The classifiers are fitted and tuned with K-fold CV on the training sample. The ...
0
votes
0
answers
34
views
Nested linear model comparison and regression parameter testing in LOOCV setting?
How do I obtain a reasonable parameter estimate (regression beta) for the single predictor of interest in a multiple regression model and appropriate standard errors for this estimate using holdout ...
1
vote
0
answers
57
views
The use of cross-validation and a hold-out set
I've been thinking about the use of cross-validation and hold-out sets and I don't really see the use of a randomly selected hold-out test set. I have to say, though, that when the hold-out is not ...
1
vote
1
answer
71
views
Is it okay to select any of the surrogate models in nested cv?
Let's say I pick any of the winning surrogate models in my nested CV (in theory, with k outer folds you could have k surrogate models). To simplify things, let's say I pick the first model and just ...
0
votes
0
answers
78
views
Interpreting Nested CV Results When Selected Model Didn't Win All Outer Folds
In nested cross validation, I'm seeing an interesting scenario that I'd like to understand better:
Using 4-fold outer CV, my model selection process chose Model A overall (it performed best on average ...
0
votes
0
answers
88
views
Use cross validation to determine the number of factors in factor analysis: why isn't it simply the case that more factors give a larger likelihood?
Consider a factor analysis model
$$X = \mu + L \cdot f + u,$$
where $X$ is $p\times 1$, $\mu$ is $p\times 1$, $L$ is $p\times k$, ...
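The usual resolution is that the in-sample likelihood never decreases as $k$ grows, but the held-out (cross-validated) likelihood does once extra factors mainly fit noise. A minimal sketch on synthetic data, assuming scikit-learn's FactorAnalysis, whose score() is the average log-likelihood of the data under the fitted model:

```python
# Cross-validated log-likelihood vs. number of factors: it rises up to
# roughly the true k and then flattens or drops, unlike the in-sample
# likelihood, which only increases with k.
import numpy as np
from sklearn.decomposition import FactorAnalysis
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(0)
true_k, p, n = 3, 15, 300
L = rng.normal(size=(p, true_k))
X = rng.normal(size=(n, true_k)) @ L.T + rng.normal(size=(n, p))

for k in [1, 2, 3, 5, 8, 12]:
    ll = cross_val_score(FactorAnalysis(n_components=k), X, cv=5).mean()
    print(k, round(ll, 2))
```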
0
votes
0
answers
58
views
Can't understand the evaluation approach used in this paper
In this paper, two deep learning models were proposed: Hybrid-AttUnet++ and EH-AttUnet++. The first model, Hybrid-AttUnet++, is simply a modified U-net model, and the second model is an ensemble ...
4
votes
1
answer
146
views
Use cross validation to select the ridge regression parameter $k$: what if the means of $\mathbf x_i$ and $y_i$ might be non-zero on the test/training set?
Consider a regression model $$ Y= X\beta+ u. \tag{$\star$} $$
$Y$ is a column vector with length $n$ containing $n$ observations.
$X$ is an $n\times p$ matrix with each row corresponding to a ...
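One standard way to handle non-zero means is to make centering part of the pipeline, so each training fold's column means are estimated on that fold and reused on its validation fold. A minimal sketch (the ridge penalty called $k$ in the question is alpha in scikit-learn):

```python
# Centering lives inside the pipeline, so every CV training fold
# estimates its own means and applies them to its validation fold; the
# penalty is then chosen by grid search over those folds.
from sklearn.datasets import load_diabetes
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV

X, y = load_diabetes(return_X_y=True)
pipe = make_pipeline(StandardScaler(with_std=False), Ridge())
grid = GridSearchCV(pipe, {"ridge__alpha": [0.01, 0.1, 1, 10, 100]}, cv=5)
grid.fit(X, y)
print(grid.best_params_)
```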
0
votes
0
answers
36
views
Error when using stratified samples with MERT in LongituRF package in R
I'm using the LongituRF package in R to fit a MERT (Mixed effects regression trees) model to my data. While I have no issues ...
0
votes
0
answers
54
views
Classification strategies for small biomedical dataset with imbalanced classes
I have spectroscopy data measured from 10 different porcine subjects. The goal is to analyse three different tissue types. However, not all tissues were measured from each subject. The total numbers are Fat: 3,...
2
votes
1
answer
187
views
Multinomial logistic regression, Ridge regression
I am currently working with a dataset that includes sociodemographic information about each student in a class (X variables) and information about whom each student votes for as class speaker (Y ...
0
votes
0
answers
43
views
Safe to break up k-fold cross validation grid search into separate chunks?
I'm performing gradient boosting machine modeling on a large dataset (700k+ records) with several hundred variables on a work laptop with limited memory. I'm coding in R v2022.02.2.
I've found running ...
1
vote
0
answers
34
views
Do I have to get another separate test set that is independent of the dataset I used in cross-validation?
What I'm doing
I am writing an undergraduate thesis about audio classification using SVM. My goal is to identify whether adding Feature X to the feature matrix could improve the performance of the ...
0
votes
0
answers
26
views
Does model retrain frequency in time series CV have to match production retrain frequency?
Let's assume that we retrain the model every year in production and we have accumulated 50 years of data. If using a time series CV (e.g. TimeSeriesSplit in sklearn) for hyperparameter recalibration at ...
1
vote
2
answers
284
views
GAM Leave one out cross validation (LOOCV) for biggish models
I have fitted a relatively complex/large generalized additive model for prediction purposes but would like to assess its predictive power/cross-validate it. Due to variability in observed data and the ...
3
votes
1
answer
372
views
What should the objective be when tuning hyperparameters to minimize overfitting?
I'm working on a classification problem with ~90k data rows and 12 features. I'm trying to tune the hyperparameters of an XGBoost model to minimize the overfitting. I use ROC_AUC as the metric to ...
0
votes
0
answers
76
views
How do I find correlation between variables in a time series across multiple days?
I have data for each day, with a date/time, event, and when a secondary event gets triggered.
...
1
vote
1
answer
104
views
Lasso and cross validation: model selection
Apologies for cross-posting
I am starting to use Lasso and cross validation for model selection to explain a dependent variable using linear models, but I cannot understand why all p-values ...
5
votes
3
answers
277
views
Cross-validated bandwidth for the derivative of the function with local quadratic estimation
I am trying to estimate nonparametrically the first-order derivative of a function $g(x)$. I am estimating $g(x)$ using a local polynomial (quadratic) procedure. I know how to compute the leave-one-out ...
0
votes
0
answers
45
views
Youtube Spam Classifier - Different Methods yielding the same accuracy (94%)
(CONTEXT)
I'm currently doing a report project at my university to build a classifier model that classifies a comment as spam or ham (non-spam) using this data set, and then submit a prediction csv ...
0
votes
1
answer
225
views
What’s the appropriate statistical test to compare ML model performance over CV folds?
I’m comparing the performance of 10 ML models across 15-fold cross-validation, using metrics like MSE. Each model’s performance is ranked per fold, and I want to determine if there are significant ...
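For several models ranked within each fold, the Friedman test is the commonly suggested omnibus choice, followed by a post-hoc procedure if it rejects. A minimal sketch with scipy; the fold-by-model MSE matrix below is purely illustrative:

```python
# Friedman test: folds are the blocks, models are the treatments, and
# performance is ranked within each fold.
import numpy as np
from scipy.stats import friedmanchisquare

rng = np.random.default_rng(0)
mse = rng.normal(loc=[1.0, 1.1, 1.3], scale=0.05, size=(15, 3))  # 15 folds x 3 models

stat, p = friedmanchisquare(*(mse[:, j] for j in range(mse.shape[1])))
print(stat, p)
# If p is small, follow up with pairwise post-hoc comparisons
# (e.g. Nemenyi, or Wilcoxon with a multiplicity correction).
```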
1
vote
0
answers
201
views
Error in fitting Zero inflated negative binomial in Python using cross validation
I want to assess the predictive power of a zero-inflated negative binomial model in Python. My steps are listed below:
Regarding 5-fold cross-validation:
Fit multiple Zero-Inflated Negative Binomial (...
16
votes
2
answers
841
views
Advantages of information criteria over cross-validation
I understand AIC is asymptotically equivalent to leave-one-out cross-validation and that BIC has a similar asymptotic equivalence to leave-k-out cross-validation. My question is, other than ...
0
votes
1
answer
71
views
Separate Test Set for Cross-Validation for Small Sample (n=140)
I’m working on a survival analysis model with a small internal dataset (n=140). An outside researcher suggests splitting the dataset into train/val and setting aside a separate test set (e.g., ~10%, ...