R/subtyper.R
select_important_variables.Rd
This function selects important variables from a dataset based on stabilized correlations computed using the Sinkhorn method.
select_important_variables(
data,
cols,
threshold = 0.5,
epsilon = 0.001,
max_iter = 0,
use_glasso = 0.1,
least = FALSE,
return_raw_importance = FALSE
)
A data frame containing the variables.
A character vector specifying which columns (variables) to consider.
Proportion of variables to select based on importance (default: 0.2).
Convergence threshold for Sinkhorn iterations (default: 1e-3).
Maximum number of Sinkhorn iterations (default: 100).
set a scalar greater than zero to use graphical LASSO; this parameter relates to sparseness levels
boolean take least explanatory (most uncorrelated) variables
boolean
A character vector of selected variable names.