apply several clustering methods and append results to a dataframe; usually will include one subject per row
consensusSubtypingTrain(
dataToClust,
featureNames,
clustVec,
ktrain,
reorderingVariable,
mvcl = "MVST",
ksearch = 3,
verbose = FALSE
)
dataframe input that contains the relevant variables (may have others as well)
names to use in the clustering
names of the clustering methods to use
the number of clusters
the name of the column to use to reorder the cluster names
character prefix for the new cluster column names
the cluster number(s) for clustering of the methods by concordance
boolean
a list with newdata: dataframe with new variables attached; models contains the trained models; reorderers contains a dataframe for reordering cluster levels; quality measurements and clustering based on cluster concordance (adjusted rand index) are also returned.
mydf = generateSubtyperData( 100 )