Compute Semvar values on clusters
Arguments
- m
Tibble with one token per row and HDBSCAN information. The
coords
element of a model resulting fromsummarizeHDBSCAN
.- k
Number of tokens to compute
separationkNN
Value
A tibble with one row per cluster and output from
separationkNN
and silhouette
for each class, based on the coordinates in the input and both including
and excluding noise tokens.