Questions tagged [standardization]

Usually refers to "z-standardization", which shifts and rescales data so that they have zero mean and unit variance. Other "standardizations" are possible, too.
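As a minimal sketch of z-standardization, the snippet below subtracts the mean and divides by the standard deviation. The function name `z_standardize` and the sample data are illustrative assumptions, and the choice of the population standard deviation (rather than the sample one) is also an assumption; either convention appears in practice.

```python
from statistics import fmean, pstdev


def z_standardize(values):
    """Shift and rescale values to zero mean and unit variance.

    Assumption: uses the population standard deviation (divisor n).
    """
    mean = fmean(values)
    sd = pstdev(values)
    if sd == 0:
        # Constant data cannot be rescaled to unit variance.
        raise ValueError("cannot standardize constant data")
    return [(x - mean) / sd for x in values]


# Hypothetical example data: mean 5, population SD 2.
data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
z = z_standardize(data)
```

After this transformation the values in `z` have mean 0 and population variance 1, but the shape of the distribution is unchanged.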

3 votes
3 answers
113 views

I am working on a project to cluster provinces according to their exposure to river floods. I am considering the following indicators: Total number of flood events / total provincial area Total ...
0 votes
1 answer
598 views

I am constructing an index composed of around 100 indicators, aggregated into sub-indices and these into an overall index. The indicators are of different types: categorical, ordinal, interval and ratio. ...
0 votes
1 answer
197 views

There is a paper (here) which shows that the beta coefficient of a standardised variable (see equation 3 in the paper), which represents the correlation between the standardised variables, can be ...
1 vote
2 answers
307 views

The Exploratory Factor Analysis has the following mathematical formulation as in the screenshot from wikipedia (https://en.wikipedia.org/wiki/Factor_analysis): That means the factors in F are ...
6 votes
3 answers
532 views

I would like to clarify the meaning of this question. If Z is a standardized random variable, which of the following statements is correct? A) Its distribution is always Normal. B) We always have E(Z²...
1 vote
1 answer
351 views

I want to measure sentiment for reviews. I have the sentiment measure for each single sentence in the review, but the reviews differ in length. I cannot simply add up the scores because ...
2 votes
1 answer
964 views

I am using a KNN anomaly detection approach, where the distance to my nearest neighbor is an indication for an anomaly. I am wondering how I can normalize the score between 0 and 1. I can use a test ...
1 vote
1 answer
374 views

I am a graduate student in biochemistry, and I am looking at cancer death rates for the Canadian population in 2010. Basically, I have acquired from Statistics Canada: 1) The estimates of individuals ...
3 votes
1 answer
3k views

I have a data frame that consists of 20 observations and 35 variables. I want to prepare the data for partial least squares (PLS) regression in R. Many authors suggest: Check whether the variables ...
9 votes
4 answers
524 views

I am using Ordinal Semiparametric Regression (Frank Harrell's rms package) to model overall survival in patients with brain tumor. I am thinking of centering the Age covariate, because I want Age = 0 ...
0 votes
1 answer
232 views

I am currently working on a scatterplot with two standardized values. One axis is composed of the mean value of several differences between traits, an index so to say. Index = ((A/B) + (C/D)) / 2 ...
1 vote
2 answers
368 views

I have a matrix that contains >2000 variables which can be divided in 4 groups of ~500 variables with each group having a distinct unit. I need to standardize the matrix before running a PCA, but when ...
29 votes
2 answers
35k views

Hey guys I found one or two papers which use ridge regression (for basketball data). I was always told to standardize my variables if I ran a ridge regression, but I was simply told to do this because ...
4 votes
1 answer
1k views

I'm running a latent variable analysis with 166 observations and 21 continuous variables, using the R package lavaan. A simple run of ...
2 votes
1 answer
210 views

I have the following (simple?) question about statistics: I have a dataset where I look for correlations between variables and would like to control for differences between factor levels. For ...
