Skip to main content

Questions tagged [scikit-learn]

scikit-learn is a popular machine learning package for Python that has simple and efficient tools for predictive data analysis. Topics include classification, regression, clustering, dimensionality reduction, model selection, and preprocessing.

0 votes
0 answers
45 views

I'm currently making a small binary classification program using Quantum Machine Learning (EstimatorQNN to be more specific). My program classifies data inside the Wisconsin Breast Cancer database and ...
Andrea's user avatar
  • 1
5 votes
1 answer
92 views

I am using sklearn.metrics.roc_curve to calculate the points of a ROC curve. This is the output I obtain. This plot does not look as I would expect it to. The line ...
user2138149's user avatar
3 votes
1 answer
103 views

I struggle to select the key features that contribute to PC1. I will use the public breast cancer dataset to illustrate the issue. Please feel free to point me to previous post if this question has ...
WhiskerFeatures's user avatar
0 votes
0 answers
21 views

I’m working through the runtime analysis of scikit-learn’s OneVsRestClassifier for two cases: LogisticRegression (solver=lbfgs, ...
user184658's user avatar
1 vote
1 answer
92 views

I would appreciate your advice on how to resolve the following issue. I am working with a dataset that contains two categorical features (actually, more than two, but two are enough to illustrate the ...
S. N.'s user avatar
  • 131
0 votes
0 answers
30 views

day Modified today Viewed 25 times 0 I want to build a model that forecasts ticket resolution time for a data science software support tickets . I’ve calculated queuing time and resolution time from ...
Rebel Royals's user avatar
2 votes
1 answer
56 views

I am a lot confused about the pre-processing scaling process. I have a dataset with several meteorological quantities (pressure, temperature, wind direction, etc.) and I am using it to forecast the ...
cicciodevoto's user avatar
4 votes
1 answer
77 views

I am trying to model the arch of a basketball free throw projectory. Usually per person, this dataset has 6 points each where it is the height of the basketball via various seconds after the player ...
ChairmanMeow's user avatar
1 vote
0 answers
40 views

I am learning Machine Learning and exploring nested cross-validation. I don't understand the example given in scikit-learn as the model seems to learn from the whole dataset and the evaluation is not ...
SamGG's user avatar
  • 11
3 votes
2 answers
144 views

I'm training a classifier on the DAIGT dataset. The objective is to differentiate human from AI text and so this is a binary classification problem. As a baseline before I move onto an LLM classifier, ...
saladmobster's user avatar
7 votes
2 answers
159 views

I'm building a machine learning model to predict loan approval rate. My dataset includes features like: Credit_History ...
Muhammed Erbay's user avatar
5 votes
1 answer
81 views

LinearRegression has an attribute singular_ which returns "singular values of x". According to a definition I found: "singularity is ... when a ...
Moti's user avatar
  • 53
4 votes
0 answers
79 views

When trying to fit scikit-learn DecisionTreeClassifier on my data, I am observing some weird behavior. x[54] (a boolan feature) ...
Krishna's user avatar
  • 141
4 votes
0 answers
27 views

I am doing a geospatial assessment integrated with ML modeling. The problem is the very low accuracy percentage, as more training features increases, it gets lower. What could be the solution to such ...
Reem 's user avatar
  • 41
1 vote
0 answers
38 views

I am using sklearn's Isolation Forest as a model to detect anomalies. My dataset is relatively small, 50 records with only 2-3 features. To prevent any overfitting, what would you recommend to tune ...
Mar's user avatar
  • 165

15 30 50 per page
1
2 3 4 5
154