Skip to main content

Questions tagged [data-mining]

An activity that seeks patterns in large, complex data sets. It usually emphasizes algorithmic techniques, but may also involve any set of related skills, applications, or methodologies with that goal.

4 votes
1 answer
68 views

In Orange data mining (GUI), what is the default number of iterations for the data sampler bootstrap? And is there a way to increase it?
lala's user avatar
  • 41
3 votes
1 answer
85 views

I'm working on a project which works on loop control, when I try to implement that in the orange platform, I'm unable to connect one widget (python script) to another in loop, as the connection is ...
Anto Delin Xavier's user avatar
2 votes
1 answer
50 views

I am working with time series data in R and converting them to symbolic strings using the Symbolic Aggregate Approximation(SAX) algorithm. I have tried two different R packages for SAX: TSclust ...
sgourosf's user avatar
3 votes
1 answer
81 views

I see that both of following arrangements work in Orange software to give score for a model: and Both above work but which of above two is the correct method? Does the selection of model (Tree, ...
rnso's user avatar
  • 1,648
0 votes
0 answers
35 views

I am making predictions at the entity level, and for simplicity's sake, suppose there is only one feature. My goal is to set up my X matrix such that I can capture changes to the entity over different ...
Andrew Bell's user avatar
0 votes
0 answers
31 views

I am seeking high-quality datasets for my PhD dissertation on developing data mining models for diabetes prediction and treatment. Given the sensitivity of medical data, I am aware that accessing ...
user139289's user avatar
0 votes
0 answers
22 views

Given a sequence shown as follows, what are the normal approaches to automatically identify all the points that are suddenly have a big change.
user297850's user avatar
5 votes
1 answer
60 views

I am hoping to reach someone who knows how to interpret data, if not, someone with better logic than me would still help :) I had around 9000 users paying for monthly subscriptions for a service on ...
adrianTNT's user avatar
  • 151
0 votes
1 answer
61 views

I have a sequence dataset as the following. These sequences are statuses got approved by clients and they are ordered by date/time. A client can get multiple statuses and jump back to the same status ...
Totura's user avatar
  • 31
3 votes
1 answer
51 views

The Question I'm not super familiar with the name's of common algorithms in Data Science, and I feel like this would be something that is commonly used, and so should have a name - want to refer to ...
Mike Kennard's user avatar
2 votes
1 answer
59 views

I have big dataset (hundreds of millions of records, counted in dozens of GBs) and I would like to perform LOF for the problem of anomaly detection (testing different methods for academic purposes) ...
Asic's user avatar
  • 21
3 votes
1 answer
117 views

I am trying to build a small healthcare fact table with the following information [patientid], [organid], [value] Each [patientid] is unique to that patient, but there are only 10 available [organid] ...
A. Romain's user avatar
1 vote
1 answer
43 views

i have plan to start my career on data analytics and i need a guildline how to start and where to start ,if you are ready to give some hints through that I'll get some clarity and i'll start my ...
Preethi S's user avatar
0 votes
1 answer
88 views

I am currently doing my thesis on Natural Language Processing and it involves studying how people text online in a community so that it can be used to simulate conversational agents that can mimic ...
The Limit Does Not Exist's user avatar
5 votes
1 answer
307 views

I work in the sales department of electronics component manufacturing company and we do data science projects using traditional algorithm like Random forests (success likelihood of design project), ...
The Great's user avatar
  • 2,815

15 30 50 per page
1
2 3 4 5
79