Maximum Homogeneity Clustering for One-Dimensional Data
Introduction
Feature engineering for high-cardinality categorical features
  Simulated postal code data
  Recode the training set
  Recode the test set
Grouping coefficients in regression models
  Simulated regression data
  Raw estimates
  Soft-thresholding
  Clustering (k = 2)
  Clustering (k = 3)
  Clustering while preserving data order (k = 5)
Sequential data peak calling and segmentation
  Simulated time series data
  Peak calling (k = 2)
  Peak calling (k = 4)
  Segmentation (k = 6)
References