Skip to main content
Became Hot Network Question
formatted table
Source Link
EdM
  • 116.5k
  • 11
  • 121
  • 378

SOORTCODE | Code | management | R10 | abundance:

carexpan | A | mid | 105 | 12

carexpan | A | mid | 106 | 8

carexpan | B | extensive | 210 | 3

carexnig | A | mid | 107 | 5

carexnig | B | no management | 305 | 2

SOORTCODECodemanagementR10abundance
carexpanAmid10512
carexpanAmid1068
carexpanBextensive2103
carexnigAmid1075
carexnigBno management3052

SOORTCODE | Code | management | R10 | abundance:

carexpan | A | mid | 105 | 12

carexpan | A | mid | 106 | 8

carexpan | B | extensive | 210 | 3

carexnig | A | mid | 107 | 5

carexnig | B | no management | 305 | 2

SOORTCODECodemanagementR10abundance
carexpanAmid10512
carexpanAmid1068
carexpanBextensive2103
carexnigAmid1075
carexnigBno management3052
added 213 characters in body
Source Link
fleur
  • 41
  • 2

I have a question about how to analyse my dataset and would really appreciate your advice.

My data consist of observations of a set of target plant species collected during field surveys. The surveys were not strictly standardized: observers walked through the study area and recorded species when encountered. Therefore, the dataset mainly contains presence records.

In terms of structure, the dataset includes:

  • SPECIESCODE: species identity
  • Code: sampling unit (there are only two distinct codes, corresponding to two survey areas)
  • Management intensity: Extensive, No management, Moderately intensive, Intensive
  • R10: raster cell (10 m × 10 m grid) in which the observation was located
  • Abundance: number of individuals recorded per observation (if two of the same species were present in the same R10, then the max abundance was taken)

Each row corresponds to a species observation within a given sampling unit, management type, and raster cell.

The goal is to estimate the preference per species per management intensity. That is why I transformed my data into a presence–absence format by assuming that if a species was not recorded within a given sampling unit (Code + management), it was absent.

data_full_heide <- data_full_heide %>%
  mutate(presence = 1) %>%
  complete(
    SOORTCODE,
    nesting(Code, management), #nesting met R10 needed?
    fill = list(presence = 0)
  )

Then I performed a logistic regression, with code as fixed variable instead of a random variable because it consists of only two categories.

model <- glmer(
  presence ~ management + Code + (1 | SOORTCODE), 
  family = binomial,
  data = data_full_heide
)

But how can I now test the individual preference per species? Do I conduct a GLM per species? But this gives very high standard errors and many p-values are not significant.

glm_per_species <- data_full_heide %>%
  group_by(SOORTCODE) %>%
  group_modify(~ {
    mod <- glm(
      presence ~ management,
      family = binomial,
      data = .x
    )
    broom::tidy(mod)
  })

Let me know if you need additional information!

SOORTCODE | Code | management | R10 | abundance:

carexpan | A | mid | 105 | 12

carexpan | A | mid | 106 | 8

carexpan | B | extensive | 210 | 3

carexnig | A | mid | 107 | 5

carexnig | B | no management | 305 | 2

I have a question about how to analyse my dataset and would really appreciate your advice.

My data consist of observations of a set of target plant species collected during field surveys. The surveys were not strictly standardized: observers walked through the study area and recorded species when encountered. Therefore, the dataset mainly contains presence records.

In terms of structure, the dataset includes:

  • SPECIESCODE: species identity
  • Code: sampling unit (there are only two distinct codes, corresponding to two survey areas)
  • Management intensity: Extensive, No management, Moderately intensive, Intensive
  • R10: raster cell (10 m × 10 m grid) in which the observation was located
  • Abundance: number of individuals recorded per observation (if two of the same species were present in the same R10, then the max abundance was taken)

Each row corresponds to a species observation within a given sampling unit, management type, and raster cell.

The goal is to estimate the preference per species per management intensity. That is why I transformed my data into a presence–absence format by assuming that if a species was not recorded within a given sampling unit (Code + management), it was absent.

data_full_heide <- data_full_heide %>%
  mutate(presence = 1) %>%
  complete(
    SOORTCODE,
    nesting(Code, management), #nesting met R10 needed?
    fill = list(presence = 0)
  )

Then I performed a logistic regression, with code as fixed variable instead of a random variable because it consists of only two categories.

model <- glmer(
  presence ~ management + Code + (1 | SOORTCODE), 
  family = binomial,
  data = data_full_heide
)

But how can I now test the individual preference per species? Do I conduct a GLM per species? But this gives very high standard errors and many p-values are not significant.

glm_per_species <- data_full_heide %>%
  group_by(SOORTCODE) %>%
  group_modify(~ {
    mod <- glm(
      presence ~ management,
      family = binomial,
      data = .x
    )
    broom::tidy(mod)
  })

Let me know if you need additional information!

I have a question about how to analyse my dataset and would really appreciate your advice.

My data consist of observations of a set of target plant species collected during field surveys. The surveys were not strictly standardized: observers walked through the study area and recorded species when encountered. Therefore, the dataset mainly contains presence records.

In terms of structure, the dataset includes:

  • SPECIESCODE: species identity
  • Code: sampling unit (there are only two distinct codes, corresponding to two survey areas)
  • Management intensity: Extensive, No management, Moderately intensive, Intensive
  • R10: raster cell (10 m × 10 m grid) in which the observation was located
  • Abundance: number of individuals recorded per observation (if two of the same species were present in the same R10, then the max abundance was taken)

Each row corresponds to a species observation within a given sampling unit, management type, and raster cell.

The goal is to estimate the preference per species per management intensity. That is why I transformed my data into a presence–absence format by assuming that if a species was not recorded within a given sampling unit (Code + management), it was absent.

data_full_heide <- data_full_heide %>%
  mutate(presence = 1) %>%
  complete(
    SOORTCODE,
    nesting(Code, management), #nesting met R10 needed?
    fill = list(presence = 0)
  )

Then I performed a logistic regression, with code as fixed variable instead of a random variable because it consists of only two categories.

model <- glmer(
  presence ~ management + Code + (1 | SOORTCODE), 
  family = binomial,
  data = data_full_heide
)

But how can I now test the individual preference per species? Do I conduct a GLM per species? But this gives very high standard errors and many p-values are not significant.

glm_per_species <- data_full_heide %>%
  group_by(SOORTCODE) %>%
  group_modify(~ {
    mod <- glm(
      presence ~ management,
      family = binomial,
      data = .x
    )
    broom::tidy(mod)
  })

Let me know if you need additional information!

SOORTCODE | Code | management | R10 | abundance:

carexpan | A | mid | 105 | 12

carexpan | A | mid | 106 | 8

carexpan | B | extensive | 210 | 3

carexnig | A | mid | 107 | 5

carexnig | B | no management | 305 | 2

deleted 4 characters in body; edited title
Source Link
Nick Cox
  • 63.1k
  • 8
  • 151
  • 240

which Which test do iI use to estimate the preference of species?

I have a question about how to analyse my dataset and would really appreciate your advice.

My data consist of observations of a set of target plant species collected during field surveys. The surveys were not strictly standardized: observers walked through the study area and recorded species when encountered. Therefore, the dataset mainly contains presence records.

In terms of structure, the dataset includes:

  • SPECIESCODE: species identity
  • Code: sampling unit (there are only two distinct codes, corresponding to two survey areas)
  • managementManagement intensity: Extensive, No management, Moderately intensive, Intensive
  • R10: raster cell (10 m × 10 m grid) in which the observation was located
  • Abundance: number of individuals recorded per observation (if two of the same species wherewere present in the same R10, thanthen the max abundance was taken)

Each row corresponds to a species observation within a given sampling unit, management type, and raster cell.

The goal is to estimate the preference per species per management intensity. That is why I transformed my data into a presence–absence format by assuming that if a species was not recorded within a given sampling unit (Code + management), it was absent.

data_full_heide <- data_full_heide %>%
  mutate(presence = 1) %>%
  complete(
    SOORTCODE,
    nesting(Code, management), #nesting met R10 needed?
    fill = list(presence = 0)
  )

Then I performed a logistic regression, with code as fixed variable instead of a random variable because it only consists of only two categories.

model <- glmer(
  presence ~ management + Code + (1 | SOORTCODE), 
  family = binomial,
  data = data_full_heide
)

But how can I now test the individual preference per species? Do I conduct a GLM per species? But this givegives very high standard errors and many p-values are not significant.

glm_per_species <- data_full_heide %>%
  group_by(SOORTCODE) %>%
  group_modify(~ {
    mod <- glm(
      presence ~ management,
      family = binomial,
      data = .x
    )
    broom::tidy(mod)
  })

Let me know if you need additional information!

Fleur

which test do i use to estimate the preference of species?

I have a question about how to analyse my dataset and would really appreciate your advice.

My data consist of observations of a set of target plant species collected during field surveys. The surveys were not strictly standardized: observers walked through the study area and recorded species when encountered. Therefore, the dataset mainly contains presence records.

In terms of structure, the dataset includes:

  • SPECIESCODE: species identity
  • Code: sampling unit (there are only two distinct codes, corresponding to two survey areas)
  • management intensity: Extensive, No management, Moderately intensive, Intensive
  • R10: raster cell (10 × 10 m grid) in which the observation was located
  • Abundance: number of individuals recorded per observation (if two of the same species where present in the same R10, than the max abundance was taken)

Each row corresponds to a species observation within a given sampling unit, management type, and raster cell.

The goal is to estimate the preference per species per management intensity. That is why I transformed my data into a presence–absence format by assuming that if a species was not recorded within a given sampling unit (Code + management), it was absent.

data_full_heide <- data_full_heide %>%
  mutate(presence = 1) %>%
  complete(
    SOORTCODE,
    nesting(Code, management), #nesting met R10 needed?
    fill = list(presence = 0)
  )

Then I performed a logistic regression, with code as fixed variable instead of a random variable because it only consists of two categories.

model <- glmer(
  presence ~ management + Code + (1 | SOORTCODE), 
  family = binomial,
  data = data_full_heide
)

But how can I now test the individual preference per species? Do I conduct a GLM per species? But this give very high standard errors and many p-values are not significant.

glm_per_species <- data_full_heide %>%
  group_by(SOORTCODE) %>%
  group_modify(~ {
    mod <- glm(
      presence ~ management,
      family = binomial,
      data = .x
    )
    broom::tidy(mod)
  })

Let me know if you need additional information!

Fleur

Which test do I use to estimate the preference of species?

I have a question about how to analyse my dataset and would really appreciate your advice.

My data consist of observations of a set of target plant species collected during field surveys. The surveys were not strictly standardized: observers walked through the study area and recorded species when encountered. Therefore, the dataset mainly contains presence records.

In terms of structure, the dataset includes:

  • SPECIESCODE: species identity
  • Code: sampling unit (there are only two distinct codes, corresponding to two survey areas)
  • Management intensity: Extensive, No management, Moderately intensive, Intensive
  • R10: raster cell (10 m × 10 m grid) in which the observation was located
  • Abundance: number of individuals recorded per observation (if two of the same species were present in the same R10, then the max abundance was taken)

Each row corresponds to a species observation within a given sampling unit, management type, and raster cell.

The goal is to estimate the preference per species per management intensity. That is why I transformed my data into a presence–absence format by assuming that if a species was not recorded within a given sampling unit (Code + management), it was absent.

data_full_heide <- data_full_heide %>%
  mutate(presence = 1) %>%
  complete(
    SOORTCODE,
    nesting(Code, management), #nesting met R10 needed?
    fill = list(presence = 0)
  )

Then I performed a logistic regression, with code as fixed variable instead of a random variable because it consists of only two categories.

model <- glmer(
  presence ~ management + Code + (1 | SOORTCODE), 
  family = binomial,
  data = data_full_heide
)

But how can I now test the individual preference per species? Do I conduct a GLM per species? But this gives very high standard errors and many p-values are not significant.

glm_per_species <- data_full_heide %>%
  group_by(SOORTCODE) %>%
  group_modify(~ {
    mod <- glm(
      presence ~ management,
      family = binomial,
      data = .x
    )
    broom::tidy(mod)
  })

Let me know if you need additional information!

formatted code, other minor edits
Source Link
EdM
  • 116.5k
  • 11
  • 121
  • 378
Loading
Source Link
fleur
  • 41
  • 2
Loading