summarise_large_scale_characteristics

summarise_large_scale_characteristics(
    cohort,
    cdm,
    *,
    cohort_id=None,
    strata=None,
    window=None,
    event_in_window=None,
    episode_in_window=None,
    index_date='cohort_start_date',
    minimum_frequency=0.005,
    excluded_codes=(0,),
)

Summarise large-scale characteristics for cohorts.

For each clinical domain table, counts the concepts recorded within each specified time window relative to the index date, then reports counts and percentages for the concepts above the minimum_frequency threshold. Concept IDs listed in excluded_codes are left out.
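
The counting step can be illustrated with a minimal, pure-Python sketch. It is not the library's implementation: the helper `concept_frequencies`, the toy data, the inclusive window bounds, the choice to count distinct subjects rather than records, and the use of `>=` for the threshold are all assumptions made for illustration.

```python
from datetime import date

# Hypothetical toy cohort; column names follow OMOP conventions.
cohort = [
    {"subject_id": 1, "cohort_start_date": date(2020, 1, 10)},
    {"subject_id": 2, "cohort_start_date": date(2020, 3, 1)},
    {"subject_id": 3, "cohort_start_date": date(2020, 6, 15)},
    {"subject_id": 4, "cohort_start_date": date(2020, 9, 30)},
]

# Hypothetical events from one domain table: (subject_id, concept_id, event_date).
events = [
    (1, 111, date(2020, 1, 5)),   # 5 days before index
    (1, 111, date(2020, 1, 8)),   # same subject and concept, counted once
    (2, 111, date(2020, 2, 20)),  # 10 days before index
    (3, 222, date(2020, 6, 1)),   # 14 days before index
    (4, 0, date(2020, 9, 29)),    # concept 0 is excluded
    (4, 222, date(2019, 1, 1)),   # outside the window
]


def concept_frequencies(cohort, events, window, index_date="cohort_start_date",
                        minimum_frequency=0.005, excluded_codes=(0,)):
    """Return {concept_id: (count, percentage)} for one window (illustrative only)."""
    lower, upper = window
    index_by_subject = {row["subject_id"]: row[index_date] for row in cohort}

    # Collect the distinct subjects with at least one in-window event per concept.
    subjects_by_concept = {}
    for subject_id, concept_id, event_date in events:
        if concept_id in excluded_codes or subject_id not in index_by_subject:
            continue
        offset = (event_date - index_by_subject[subject_id]).days
        if lower <= offset <= upper:  # assumed inclusive on both ends
            subjects_by_concept.setdefault(concept_id, set()).add(subject_id)

    # Keep only concepts whose proportion of the cohort reaches the threshold.
    n_subjects = len(index_by_subject)
    result = {}
    for concept_id, subjects in subjects_by_concept.items():
        proportion = len(subjects) / n_subjects
        if proportion >= minimum_frequency:
            result[concept_id] = (len(subjects), 100 * proportion)
    return result


frequencies = concept_frequencies(cohort, events, window=(-30, -1),
                                  minimum_frequency=0.3)
print(frequencies)
```

With the 30-day pre-index window and a threshold of 0.3, only concept 111 (two of four subjects) is kept in this toy data; concept 222 (one of four subjects) falls below the threshold.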

Parameters

| Name | Type | Description | Default |
|---|---|---|---|
| cohort | | Cohort table. | required |
| cdm | | CDM reference containing the clinical tables. | required |
| cohort_id | | Optional cohort_definition_id(s) to include. | None |
| strata | | Optional stratification columns. | None |
| window | | Time windows as (lower, upper) day offsets relative to index_date. If None, the default windows span -Inf to Inf with standard breaks. | None |
| event_in_window | | Domain tables to analyze (e.g. ["condition_occurrence"]). If None, all available domain tables are used. | None |
| episode_in_window | | Not yet implemented. | None |
| index_date | | Name of the column holding the reference date. | 'cohort_start_date' |
| minimum_frequency | | Minimum proportion required to keep a concept. | 0.005 |
| excluded_codes | | Concept IDs to exclude. | (0,) |
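
As a rough illustration of the window argument, the sketch below writes windows as (lower, upper) tuples of day offsets, using `math.inf` for the open ends. The specific breaks, the helper `windows_containing`, and the inclusive bounds are assumptions for illustration; they are not confirmed to match the library's defaults or internals.

```python
import math

# Illustrative windows only: contiguous (lower, upper) day offsets that
# together span -inf to inf. The actual default breaks may differ.
windows = [
    (-math.inf, -366),
    (-365, -31),
    (-30, -1),
    (0, 0),          # the index date itself
    (1, 30),
    (31, 365),
    (366, math.inf),
]


def windows_containing(offset_days, windows):
    """Return every window whose bounds (assumed inclusive) contain the offset."""
    return [w for w in windows if w[0] <= offset_days <= w[1]]


# An event 10 days before the index date falls in the (-30, -1) window.
print(windows_containing(-10, windows))
```

Because these windows do not overlap, each offset maps to exactly one of them. Whether overlapping windows are accepted by the function is not stated here.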

Returns

| Name | Type | Description |
|---|---|---|
| | SummarisedResult | |