Questions tagged [standardization]
Usually refers to "z-standardization" which is shifting and rescaling data to assure they have zero mean and unit variance. Other "standardizations" are possible, too.
853 questions
3
votes
3
answers
113
views
which normalization strategy is preferable when clustering territorial units according to hazard exposure?
I am working on a project to cluster provinces according to their exposure to river floods. I am considering the following indicators:
Total number of flood events / total provincial area
Total ...
0
votes
1
answer
598
views
Re-scaling standardized values
I am constructing an index composed of around 100 indicators aggregated in sub-index and these in an Index, among the indicators different types can be found: categorical, ordinal, interval and ratio.
...
0
votes
1
answer
197
views
Can a variable be statistically insignificant but its components be significant? [duplicate]
There is a paper (here) which shows that the beta coefficient of a standardised variable (see equation 3 in the paper), which represents the correlation between the standardised variables, can be ...
1
vote
2
answers
307
views
How to obtain uncentered factor scores from Factor analysis
The Exploratory Factor Analysis has the following mathematical formulation as in the screenshot from wikipedia (https://en.wikipedia.org/wiki/Factor_analysis):
That means the factors in F are ...
6
votes
3
answers
532
views
Meaning of “standardized random variable” in multiple choice question
I would like to clarify the meaning of this question.
If Z is a standardized random variable, which of the following
statements is correct?
A) Its distribution is always Normal.
B) We always have E(Z²...
1
vote
1
answer
351
views
How to normalize/standardize when the number of observations is different?
I want to measure sentiment for reviews. I have the sentiment measure for each single sentence in the review, but the length of the reviews are different. I cannot simply add up the scores because ...
2
votes
1
answer
964
views
Bounded Anomaly Score between 0 and 1
I am using a KNN anomaly detection approach, where the distance to my nearest neighbor is an indication for an anomaly.
I am wondering how I can normalize the score between 0 and 1. I can use a test ...
1
vote
1
answer
374
views
Should I standardize age in the following situation? [closed]
I am a graduate student in biochemistry, and I am looking at cancer death rates for the Canadian population in 2010. Basically, I have acquired from Statistics Canada:
1) The estimates of individuals ...
3
votes
1
answer
3k
views
How to pre-process data for partial least square PLS regression in R?
I have a data frame that is consisted of 20 observations and 35 variables.
I want to prepare the data for partial least square regression PLS in R.
Many authors suggest:
Check whether the variables ...
9
votes
4
answers
524
views
Should I standardize or only center "Age" in an ordinal semiparametric regression model?
I am using Ordinal Semiparametric Regression (Frank Harrell's rms package) to model overall survival in patients with brain tumor. I am thinking of centering the Age covariate, because I want Age = 0 ...
0
votes
1
answer
232
views
Understanding different methods of standardization
I am currently working on a scatterplot with two standardized values. One axis is compromised of the mean value of several differences between traits, a index so to say.
Index = ((A/B) + (C/D)) / 2
...
1
vote
2
answers
368
views
Should I standardize all variables before a PCA separately if some share the same units
I have a matrix that contains >2000 variables which can be divided in 4 groups of ~500 variables with each group having a distinct unit. I need to standardize the matrix before running a PCA, but when ...
29
votes
2
answers
35k
views
Question about standardizing in ridge regression
Hey guys I found one or two papers which use ridge regression (for basketball data). I was always told to standardize my variables if I ran a ridge regression, but I was simply told to do this because ...
4
votes
1
answer
1k
views
CFA: scaling of measured variables/indicators
I'm running a latent variable analysis with:
166 observations
21 continuous variables
using the R package lavaan
A simple run of ...
2
votes
1
answer
210
views
Using random intercepts or z-standardizing within factors: Two identical ways to account for variance between factors?
I have the following (simple?) question about statistics: I have a dataset where I look for correlations between variables and would like to control for differences between factor levels. For ...