Questions tagged [r]
R is a free, open-source programming language and software environment for statistical computing, bioinformatics, and graphics.
1,449 questions
4
votes
1
answer
28
views
R Chart Overlay data
I have some data in a CSV that pertains to bandwidth tests, like so:
Date.Time
Mean
Standard.Deviation
2025/12/24 12:06:46
88382
6046
2025/12/24 12:22:59
93813
3986
2025/12/24 13:36:06
91530
8136
2025/...
4
votes
1
answer
52
views
Use of R caret pacage and xgbTree method to predict probabilities/proportions
I enjoy using the R package caret to streamline my workflows when doing machine learning in R. I currently have a problem where I would like to use extreme gradient ...
6
votes
1
answer
47
views
How can data science techniques be used to improve SEO and digital marketing performance?
How machine learning models are used for keyword analysis and ranking predictions
I work in digital marketing and I’m trying to understand how data science can be practically applied to improve SEO ...
2
votes
1
answer
40
views
R - xgboost::xgb.train() with callback raises error
I am playing with xgboost and callbacks in xgb.train. The code below raises an error:
"Error in predict(env$model, dval) :
argument "env" is missing, with no default"
Can someone ...
4
votes
0
answers
44
views
Why can zero-inflated generalized Poisson model not capture the overdispersion in the count data?
I have fitted a zero-inflated Poisson (ZIP) model to my count data with an excess of zeros (figure 1). Since it didn't capture the overdispersion in the data, and overdispersion is indeed ...
2
votes
1
answer
289
views
CRAN Mirror for Rstudio. Which country should I choose?
I am from Malaysia. I would like to download the R-studio package, but the first thing I need to do is to download the CRAN Mirror. In the list of R studio, there is no country Malaysia available. Can ...
2
votes
1
answer
50
views
Why do TSclust's diss.MINDIST.SAX() and jmotif's min_dist() give different results for the same SAX strings?
I am working with time series data in R and converting them to symbolic strings using the Symbolic Aggregate Approximation(SAX) algorithm.
I have tried two different R packages for SAX:
TSclust
...
5
votes
1
answer
87
views
Changes over time is significant
I am not sure if this is the right place to ask, but I have two fecundity datasets per year. One for males, the other for females:
To give an excerpt of the data:
Gender
year
number born
M
1990
1
M
...
0
votes
0
answers
37
views
Boruta vs Multiple Linear Regression
I am using the red wine quality dataset from Kaggle 'Red Wine Quality' for regression as self-exploration.
I did the analysis using multiple linear regression and found that '***' appeared against ...
1
vote
0
answers
120
views
Latent Class Analysis with weighted survey data
I'm intending to attempt some LCA on weighted survey data. I can see that Latent Gold (which I do not have a access to) allows the use of "case weights" which seem specially designed for ...
8
votes
2
answers
806
views
How to deal with lab analysis results that provide an inequality instead of a discrete value
I am working with some soil data that I had analysed for my postgraduate research project. The data involves the concentration of specific ions within the soil. Some of the ions which were determined ...
0
votes
1
answer
58
views
cluster named towns using their Latitude and longitude or radian values
I would like to cluster named towns using their Latitude and longitude or radian values. I am new to R and would like some advice.
0
votes
0
answers
37
views
Performing piecewise polynomial regression on the Auto dataset (ISLR2)
I am trying to analyze some data from the Auto data set (from ISLR2). I am trying to fit a piecewise polynomial to the acceleration column. But every time I run my code, R throws this error:
...
1
vote
0
answers
88
views
How to calculate odds ratios from Shap values?
Hope everyone is doing well. For my logistic regression model predicting hypertension, I want to calculate Shapley-adjusted odds ratios (ORs) and 95% confidence intervals (CIs) for each predictor in ...
2
votes
1
answer
100
views
New Variables to Add to Model GLM/GBM
I already have a GLM model in place to predict claims frequency. I know have access to many new variables (a mix of categorical and continuous variables, some of which are likely correlated). I wish ...