Skip to main content

Questions tagged [r]

Use this tag for any *on-topic* question that (a) involves `R` either as a critical part of the question or expected answer, & (b) is not *just* about how to use `R`.

1 vote
1 answer
22 views

I know SHAP (or shapley) values are the contribution of each input variable to the model prediction. Adding the base values to the sum of all SHAP values gives you the model prediction for any data ...
lsr729's user avatar
  • 261
1 vote
2 answers
53 views

Let's assume the non-monotonic data below (right graph and data from here). I would like to test if the two variables x and y are correlated or at least not independent, given the non-monotonic ...
denis's user avatar
  • 305
0 votes
1 answer
43 views

I would like to estimate asample size to compare the means of three groups(A,B and C) taking into account three pairwise comparisons (A/B, B/C, A/C) given a coefficient variation and an effect size as ...
Mubita 's user avatar
  • 191
0 votes
0 answers
37 views

The Union of India has undergone frequent political re-organizations since independence. The problem today (for me) is that, I've been unable to account for certain data values of the following states/...
Mithu's user avatar
  • 1
6 votes
2 answers
208 views

I'm performing an IPD meta-analysis, and need to fit my models with study-specific variances (which is why I need to fit with nlme::gls instead of ...
slammaster's user avatar
7 votes
2 answers
275 views

I have fit interaction models of the form: phenotype ~ genotype * environment, based on theory. I am assigning environment (GFR, in this case) as the moderator. I have three scenarios: A: Non-parallel ...
Mubita 's user avatar
  • 191
2 votes
1 answer
51 views

I aim to build a model that includes 2D smooth by different factors (to check for smooth differences; e.g., level 1 of factor 1 vs. level 2 of factor 1, level 1 of factor 2 vs. level 2 of factor 2), ...
David's user avatar
  • 135
2 votes
1 answer
45 views

How do these two differ in terms of interpretation? When should one be used over the other? ...
esss123's user avatar
  • 21
1 vote
1 answer
50 views

Crossposting from https://stackoverflow.com/q/79913674/19231816 I am building a GLMM to answer an ecological question. Most of my predictors were log-transformed and then z-score standardised ((x - ...
msug's user avatar
  • 11
6 votes
2 answers
252 views

In a multivariable logistic regression model, some continuous predictors are modeled using spline transformations to allow for nonlinear effects, while other continuous predictors are entered as ...
Konstantinos Gkirgkiris's user avatar
0 votes
0 answers
41 views

Let $X$ be a random variable and having quantile function $Q$ and quantile density function $q$. Let $X_1, X_2,..., X_n$ be independent and identically distributed random variables from $F(x)$. I ...
Unknown's user avatar
  • 133
6 votes
1 answer
149 views

Considering the histograms and Q-Q-plots of these 6 response variables, which distribution families would you recommend for fitting GAMs with mgcv? Var3 and Var6 follow a somewhat normal distribution ...
David's user avatar
  • 135
4 votes
1 answer
200 views

I commonly work with cancer data that is on the patient-lesion level, as patients with metastases often have multiple treated lesions. While looking at patient survival is easy, I get a bit stuck with ...
scott9's user avatar
  • 135
2 votes
1 answer
29 views

I am trying to test for vocal differences between 5 primate individuals with sample sizes of n = 102, 86, 115, 45, 12 recordings of their calls. The data is not normally distributed and the variance ...
Ema C.'s user avatar
  • 21
3 votes
1 answer
44 views

I came across the article from Bender et al.(2005) and attempted to put this into R code to simulate survival times based on an empirical baseline hazard from existing data. I compute survival times ...
peer's user avatar
  • 43

15 30 50 per page