Skip to main content

Questions tagged [reference-request]

"References" is our generic tag for questions seeking information about books, papers, presentations, videos of lectures, on-line tutorials, etc., regarding any subject matter that is on-topic for Data Science.

6 votes
1 answer
103 views

I am looking for a handbook for quick revision of statistics that is straightforward, offering concise explanations, key assumptions, formulas, and possibly one exercise for practice. If you have any ...
Guna's user avatar
  • 947
11 votes
1 answer
883 views

My data set consists of an output variable which is categorical with 4 different values and the input variables of which there are roughly 100 and they are boolean, ie True/False. The data set has ...
quarague's user avatar
  • 778
0 votes
2 answers
76 views

I need some help in clearing out basics of plotting in various plotting packages. Specifically matplotlib, seaborn and plotly. Certain fundamental principles are always same across all packages. Can ...
user3851878's user avatar
0 votes
0 answers
60 views

Nowadays python is mostly used for machine learning and i think it is also used in new ML courses of Andrew Ng https://www.quora.com/Why-was-MATLAB-not-used-in-the-Andrew-Ng-course-of-deep-learning ...
DSP_CS's user avatar
  • 111
0 votes
1 answer
102 views

In my experiences, binary classifiers tend do better in terms of F1 scores when the class imbalance is at least reduced. However, this leads to over-predicting in the test data. (Thought) Example: If ...
yurnero's user avatar
  • 141
1 vote
0 answers
165 views

I am working on a problem for which no datasets exist. I have obtained several examples from this domain, and so far have been using them in Large Language Model (LLM) prompts(few shot learning) but I ...
Karl 17302's user avatar
1 vote
1 answer
4k views

I have recently read through a lot of documentation and articles about Large Language Models (LLMs), and I have come to the conclusion that 0.7 is, most of the time, the default value for the ...
jmpion's user avatar
  • 11
0 votes
1 answer
78 views

I've been learning machine learning for the past few weeks from books and online courses. The books I've been reading, and currently still reading is "Hands-On Machine Learning with Scikit-Learn ...
Justin Jonany's user avatar
0 votes
0 answers
58 views

Question 1. Does clustering belong to the domain of data mining or to the domain of machine learning? Or to both domains? Question 2. Depending on the answer to Question 1, could you please suggest a ...
Ommo's user avatar
  • 103
1 vote
0 answers
33 views

The above backpropagation algorithm is taken from Shalev Shwartz and Ben-David's textbook: Understanding Machine Learning. This algorithm is described in the same way as the one in Mostafa's textbook, ...
Fraïssé's user avatar
  • 119
0 votes
1 answer
96 views

Some websites provide a link to a dataset (in Excel sheets format) which allows the dataset to be downloaded. But some others additionally provide API documentation, like this site. Can you please ...
IwillLearn's user avatar
1 vote
1 answer
178 views

I'm trying to make a sales prediction using the column X = item_amount and y = item_price_total, I'm confused whether it's okay to normalize data on the dependent variable using minmaxscalar? With the ...
Fatur's user avatar
  • 11
4 votes
1 answer
82 views

We are looking to design a system where specific items or categories of items can be boosted/promoted up or relegated/demoted down the recommendation order. What are the common strategies or standards ...
JPTheEngineer's user avatar
0 votes
3 answers
2k views

I was wondering what is the best practice for removing outliers from data. Plotting a boxplot for each feature (column of the dataset) and removing data that fall outside the whiskers seems like a ...
Erik M's user avatar
  • 93
1 vote
1 answer
75 views

I have been trying to understand this for a long time, but this information proves to be incredibly elusive online. What are possible jobs that a pure Data Scientist, without much background knowledge,...
temporario1001's user avatar

15 30 50 per page
1
2 3 4 5 6