Demographic Requirements
a03_require_demographics.Rmd
For this example we’ll use the Eunomia synthetic data from the CDMConnector package.
con <- DBI::dbConnect(duckdb::duckdb(), dbdir = eunomia_dir())
cdm <- cdm_from_con(con, cdm_schema = "main",
write_schema = c(prefix = "my_study_", schema = "main"))
Let’s start by creating two drug cohorts, one for users of diclofenac and another for users of acetaminophen.
cdm$medications <- conceptCohort(cdm = cdm,
conceptSet = list("diclofenac" = 1124300,
"acetaminophen" = 1127433),
name = "medications")
settings(cdm$medications)
#> # A tibble: 2 × 4
#> cohort_definition_id cohort_name cdm_version vocabulary_version
#> <int> <chr> <chr> <chr>
#> 1 1 acetaminophen 5.3 v5.0 18-JAN-19
#> 2 2 diclofenac 5.3 v5.0 18-JAN-19
cohortCount(cdm$medications)
#> # A tibble: 2 × 3
#> cohort_definition_id number_records number_subjects
#> <int> <int> <int>
#> 1 1 9365 2580
#> 2 2 830 830
Cohort 1 contains users of acetaminophen and has 2580 subjects and 9365 records. Cohort 2 contains users of diclofenac and has 830 subjects and 830 records.
Restrict cohort by age
We can choose a specific age range for individuals in our cohort
using requireAge()
from CohortConstructor.
cdm$medications <- cdm$medications %>%
requireAge(indexDate = "cohort_start_date",
ageRange = list(c(18,100)))
summary_attrition <- summariseCohortAttrition(cdm$medications)
plotCohortAttrition(summary_attrition, cohortId = 1)
The flow chart above illustrates the changes to cohort 1 (users of acetaminophen) when restricted to only include individuals aged 18 to 90. 226 individuals and 2,863 records were excluded.
The variable ‘cohort_start_date’ is used so that individuals are filtered based on the age they were when they entered the cohort.
Restrict cohort by sex
We can also specify a sex criteria for individuals in our cohort
using requireSex()
from CohortConstructor.
cdm$medications <- cdm$medications %>%
requireSex(sex = "Female")
summary_attrition <- summariseCohortAttrition(cdm$medications)
plotCohortAttrition(summary_attrition, cohortId = 1)
The flow chart above illustrates the changes to cohort 1 when restricted to only include ‘female’ individuals. 1,264 individuals and 4,647 records were excluded.
Restrict cohort by number of prior observations
We can also specify a minimum number of days of prior observations
for each individual using requirePriorObservation()
from
CohortConstructor.
cdm$medications <- cdm$medications %>%
requirePriorObservation(indexDate = "cohort_start_date",
minPriorObservation = 365)
summary_attrition <- summariseCohortAttrition(cdm$medications)
plotCohortAttrition(summary_attrition, cohortId = 1)
The flow chart above illustrates the changes to cohort 1 when restricted to only include individuals with at least 365 days of prior observations. 5 individuals and 109 records were excluded.
Restrict cohort by number of future observations
We can also specify a minimum number of days of prior observations
for each individual using requireFutureObservation()
from
CohortConstructor.
cdm$medications <- cdm$medications %>%
requireFutureObservation(indexDate = "cohort_start_date",
minFutureObservation = 365)
summary_attrition <- summariseCohortAttrition(cdm$medications)
plotCohortAttrition(summary_attrition, cohortId = 1)
The flow chart above illustrates the changes to cohort 1 when restricted to only include individuals with at least 365 days of future observations. 14 individuals and 206 records were excluded.
Applying multiple demographic requirements to a cohort
We can implement multiple demographic requirements on a cohort using
requireDemographics()
from CohortConstructor.
cdm$medications <- cdm$medications %>%
requireDemographics(indexDate = "cohort_start_date",
ageRange = c(18,100),
sex = "Female",
minPriorObservation = 365,
minFutureObservation = 365)
summary_attrition <- summariseCohortAttrition(cdm$medications)
plotCohortAttrition(summary_attrition, cohortId = 1)
The flow chart above illustrates the changes to cohort 1 when multiple demographic restrictions, so that only female individuals between 18 and 100 years old, with at least 365 days of prior and future observations are included. 1,413 individuals and 6156 records were excluded.