generate input for consensus clustering give several clustering algorithms
consensusSubtypingPrep(
dataToTrain,
dataToPredict,
featureNames,
clustVec,
maxK,
reorderingVariable,
mvcl,
idvar,
visitName,
baselineVisit,
whichrank = 0,
ntoreturnperk = 2,
verbose = FALSE
)
dataframe input that contains the relevant variables (may have others as well) on which training will be based
dataframe input that contains the relevant variables (may have others as well) for which prediction will be done
names to use in the clustering
names of the clustering methods to use
the maximum desired number of classes
the name of the column to use to reorder the cluster names
character prefix for the new cluster column names
variable name for unique subject identifier column
the column name defining the visit variables
the string naming the baseline visit
allows user to get 2nd (or 3rd) rank set of methods
number of method results per k to return
boolean
new dataframe with new variables attached
mydf = generateSubtyperData( 100 )