Use clusters as dependent variables

Question

I wanted to ask anyone was aware of a type of two-stage analysis where clusters are used as a dependent variable in prediction models?

For example, suppose I had used an unsupervised model based on five categorical covariates, and I generated 3 clusters as a consequence.

Is it possible to use a representation of one of these clusters as a dependent variable in another model, to evaluate how well another set of mixed covariates would predict the cluster?

Sounds potentially outrageous, but would welcome comments and feedback.

Can't think of any example but not outrageous at all in my opinion. — Erwan
– Erwan, Commented Jul 29, 2021 at 22:14

Nicolas Martin · Accepted Answer · 2021-07-29 14:54:42Z

0

Some unsupervised models use random functions and you might not have the same clusters as before.

Nevertheless, you can apply some functions to know the clusters features'ranges and define them with specific labels, so that you can identify future clusters easily (but not the ones out of the ranges, in that case you migh group them in a label "other" and reorganise them later).

answered Jul 29, 2021 at 14:54

Nicolas Martin

5,3931 gold badge8 silver badges16 bronze badges

$\begingroup$ Thanks Nicolas, I appreciate your contribution there $\endgroup$

EB3112
– EB3112

2021-07-30 11:09:25 +00:00
Commented Jul 30, 2021 at 11:09

Add a comment |

Stack Exchange Network

Use clusters as dependent variables

1 Answer 1

Hot Network Questions

Use clusters as dependent variables

1 Answer 1

Related

Hot Network Questions