splitSettings
R/DataSplitting.R
splitData.Rd
Split the plpData into test/train sets using a splitting settings of class splitSettings
splitData(
plpData = plpData,
population = population,
splitSettings = splitSettings
)
An object of type plpData
- the patient level prediction
data extracted from the CDM.
The population created using createStudyPopulation
that define who will be used to develop the model
An object of type splitSettings
specifying the split - the default can be created using createDefaultSplitSetting
An object of class splitSettings
Returns a list containing the training data (Train) and optionally the test data (Test). Train is an Andromeda object containing
covariateRef: a table with the covariate information
labels: a table (rowId, outcomeCount, ...) for each data point in the train data (outcomeCount is the class label)
folds: a table (rowId, index) specifying which training fold each data point is in.
Test is an Andromeda object containing
covariateRef: a table with the covariate information
labels: a table (rowId, outcomeCount, ...) for each data point in the test data (outcomeCount is the class label)