Paper ID | MLSP-32.2 |
Paper Title |
Minimizing Weighted Concave Impurity Partition Under Constraints |
Authors |
Thuan Nguyen, Thinh Nguyen, Oregon State University, United States |
Session | MLSP-32: Optimization Algorithms for Machine Learning |
Location | Gather.Town |
Session Time: | Thursday, 10 June, 15:30 - 16:15 |
Presentation Time: | Thursday, 10 June, 15:30 - 16:15 |
Presentation |
Poster
|
Topic |
Machine Learning for Signal Processing: [MLR-LEAR] Learning theory and algorithms |
IEEE Xplore Open Preview |
Click here to view in IEEE Xplore |
Virtual Presentation |
Click here to watch in the Virtual Conference |
Abstract |
Set partitioning is a key component of many algorithms in machine learning, signal processing, and communications. In general, the problem of finding a partition that minimizes a given impurity (loss function) is NP-hard. As such, there exists a wealth of literature on approximate algorithms and theoretical analysis for the partitioning problem under different settings. In this paper, we formulate and solve a variant of the partition problem called the minimum weighted concave impurity partition under constraint (MIPUC). MIPUC finds an optimal partition that minimizes a given weighted concave loss function under a given concave constraint. MIPUC generalizes the recently proposed Deterministic Information Bottleneck problem which finds an optimal partition that maximizes the mutual information between the input and partitioned output while minimizing the partitioned output entropy. Our proposed algorithm is based on an optimality condition, which allows us to find a locally optimal solution efficiently. We also show that the optimal partitions are separated by some hyperplanes in the space of posterior probability mass functions. |