All Questions
51 questions
0
votes
0
answers
82
views
Agglomerative ward linkage criteria
I am asking about ward linkage criteria how can i computing it ? i understand that single linkage merge the two clusters that have minimum distance and complete linkage merge the clusters have maximum ...
0
votes
1
answer
31
views
Can the parent nodes of clusters formed using disjoint set forest be used as cluster representative?
The intention is to merge clusters which have similarity higher than the Jaccard similarity based on pairwise comparison of cluster representative. My logic here is that because the child nodes are ...
-2
votes
1
answer
902
views
Which algorithm can handle multiple columns as response variables? [closed]
I am trying to build a cross selling model where I have customers and some of their attributes along with the products that they have bought. SO basically I will be having multiple response variables ...
0
votes
1
answer
77
views
Is that possible that we use bagging on the combination of a decision tree and a linear separator?
Is that possible that we use bagging on the combination of a decision tree and a linear separator?
I know bagging could be used for the decision tree or linear separator individually. but how about ...
0
votes
4
answers
2k
views
Algorithm that finds similar words based on their letters
I am looking for a way to detect the similarity of words (textstrings) based on their shared letters.
I was looking into Hash functions and especially the Rabin-Karp algorithm to find similar words ...
-3
votes
2
answers
452
views
Machine Learning: NBA data to find MVP of season
I want to use NBA data to find the MVP for a particular season. I was looking at a couple of algorithms like Neural Networks, AdaBoost, Random Forest, and Decision Trees. I really want to use AdaBoost ...
1
vote
3
answers
4k
views
Writing own kmeans algorithm in R
I am trying to write my first own kmeans algorithm in R. I am new in this field, so please don't judge me for don't seeing the obvious.
In its current state, the algorithm takes two vectors x, y, ...
0
votes
0
answers
37
views
Name for this adaptive weight updating rule?
I have the following adaptive weight updating rule.
The intuition is when recent samples are normal, gradually decrease the weight. When an abnormal occurred, increase the weight by a constant number....
1
vote
1
answer
228
views
What is the meaning of this paragraph of code for calculating auc in ctr prediction?
Please forgive me for my broken English.
This is the code.
def scoreClickAUC(num_clicks, num_impressions, predicted_ctr):
"""
Calculates the area under the ROC curve (AUC) for click ...
0
votes
2
answers
128
views
Clustering+Regression-the right approach or not?
I have a task of prognosing the quickness of selling goods (for example, in one category). E.g, the client inputs the price that he wants his item to be sold and the algorithm should displays that it ...
-1
votes
1
answer
29
views
Determininig Association Rules with a given item set
Im trying to learn how to generate associate rules. If I say for example i have an item set (a1=yes,b2=yellow, x3=round)? what kind of rules would be generated.
1
vote
2
answers
2k
views
Clustering algorithms for strings
I have to implement a module in which i need to group sentences(strings) having similar meaning into different clusters. I read about k-means , EM clustering etc. But the problem which i am facing is ...
0
votes
1
answer
74
views
algorithm to find blocks of trends
Let's say I have a a 24 lines of data, such that each line represents an hour in the day. What I want to achieve is to implement an algorithm that can detect trends in the data and can divide it into ...
14
votes
2
answers
12k
views
What is stratified bootstrap?
I have learned bootstrap and stratification. But what is stratified bootstrap? And how does it work?
Let's say we have a dataset of n instances (observations), and m is the number of classes. How ...
-3
votes
1
answer
152
views
Community detection: Any algorithm that assigns a person to multiple communities?
I feel in some cases it is reasonable for a person to be assigned to multiple communities. For example in the science field a professor may work in multiple research areas. Is there an algorithm that ...