Introduction
The OHDSI Characterization package lets users extract descriptive
analyses from observational healthcare data sets mapped to the OMOP CDM.
There are currently four different types of characterizations analyses
(incidence rates, time-to-event, dechallenge-rechallenge and various
aggregate covariate cohort comparisons).
The Characterization package currently lets users answer the
following questions:
-
Incidence Rate: How often does
<add outcome>
occur within
<add time-at-risk>
after first record of
<add exposure/indication>
?
-
Time-to-event: When does
<add outcome>
occur relative to the first recorded of
<add exposure/indication>
? Is it more common before
or after <add exposure>
?
-
Dechallenge-rechallenge: Is there any evidence of
<add outcome>
causing
<add exposure>
to be discontinued and then
<add outcome>
re-occurring once
<add exposure>
restarts?
-
Cohort Comparison: What is different at index
between patients in
<add exposure/outcome/indication>
and patients in
<add different exposure/outcome/indication>
?
-
Database Comparison: What is different at index
between patients in
<add exposure/outcome/indication>
across two or more OMOP CDM databases?
-
Risk factors: What are the risk factors of
<add outcome>
occurring within
<add time-at-risk>
for those exposed to
<add exposure>
?
-
Case-series: What happens to cases (those exposure
to
<add exposure>
who have
<add outcome>
during
<add time-at-risk>
) before exposure, between exposure
and outcome start and after outcome start? How bad are the cases
prognosis?
Features and Functionalities
Defining a target cohort as a set of patients with
an exposure or interest and/or with evidence of having an indication of
interest and an outcome cohort as a set of patients
with evidence of the outcome of interest, we run the following
analyses:
-
Cohort Summary - computes aggregate covariate
summaries for cohorts (targets and/or outcomes), offering a granular
view of the cohort’s demographics, conditions, drug exposures, and more.
This enables a deeper understanding of the cohort’s characteristics at
various time points at or relative to the index date.:
-
Database Comparison Lets you compare the same
cohort across two or more databases and adds in the standardized mean
different calculation when exactly two databases are selected. This is a
measure of association between the feature and the cohort, therefore
identifying which features differ across databases.
-
Cohort Comparison Lets you compare two or more
cohorts across a database and adds in the standardized mean different
calculation when exactly two cohort are selected. This is a measure of
association between the feature and the cohort, therefore identifying
which features differ across databases.
-
Exposed Case Series - characterizations that look
at people in the target cohort who have the outcome during some
specified time-at-risk:
-
Risk Factor Compares aggregate covariate summaries
for patients in the target who have the outcome during the time-at-risk
period vs patients in the target who do not have the outcome during the
time-at-risk period. The standardized mean difference is added to
identify covariates that differ between the cohorts.
-
Case Series Compares aggregate covariate summaries
before target start, between target start up to outcome start and after
outcome start for people in the target cohort who have the outcome
during some specified time-at-risk. This lets you see covariates that
are common before exposure and what happens afterwards.
-
Time to Event: Shows the distribution of when the
outcome occurs relative to the target start. This can show you whether
the outcome occurs more after or before target exposure.
-
Dechallenge Rechallenge: Offers the ability to
compute dechallenge (withdrawal of a drug or treatment) and rechallenge
(reintroduction) results. This analysis is critical for understanding
the causality between exposures and outcomes, especially in
pharmacovigilance studies and when adverse events following exposure to
a drug may occur.
-
Incidence Rate: Utilizing the CohortIncidence R package, this set of
analyses computes incidence rates for both target and outcome cohorts
during the time at risk selected. This feature is essential for
assessing the frequency of outcomes or conditions within the specified
timeframe, providing a quantitative measure of risk or occurrence.
Incidence measures are provided in both tabular and graphical form, and
can be stratified across calendar year, age, and sex.
Utility and Application
Characterization serves as a powerful tool for researchers aiming to
dissect and understand the nuances of patient cohorts in observational
health data. Its capabilities allow for the detailed examination of
cohort attributes, the incidence of health outcomes, and the effects of
treatment exposures over time. By facilitating a comprehensive analysis
of target and comparator cohorts, Characterization enables researchers
to draw meaningful conclusions about patient care, treatment efficacy,
and health outcomes, thereby contributing to the advancement of
evidence-based medicine.
To find out more about the analyses execution details and see
examples, please see here.
To see the code behind the Characterization R package, please see here.