Read distance matrices from different models,
run dimensional reduction for visualization based on different techniques
and store the coordinates corresponding to each model in a dataframe per technique.
The names of the models will be found in the models
file and their paths will be
searched for in input_dir
: if a file is not found, a warning will be issued.
Usage
getClouds(
input_dir,
output_dir,
files_list,
lemma,
solutions,
logrank = TRUE,
logdist = TRUE,
type = "token",
row_selection = vector()
)
Arguments
- input_dir
Directory where the token distance matrices are stored.
- output_dir
Directory where the data will be stored.
- files_list
Liste of filenames within
input_dir
.- lemma
Name of the lemma, for filenames
- solutions
Named list of techniques to run for visualization possible
technique
values ingetFit
.- logrank
Whether to transform the matrices with
transformMats
.- logdist
Whether euclidean distances should be computed between the rows of the transformed matrices (when
logrank
isTRUE
). Otherwise, the matrix of log-transformed ranks is only made symmetric.- type
Whether to open the files with
tokensFromPac
(for "token") orfocdistsFromCsv
(otherwise).- row_selection
List of row (and column) names to subset the matrices.