Understanding the importance of initialization and iterative pruning in neural networks.
The SMART pruner ranks weight importance with a separate, learnable probability mask. Using a differentiable Top-k operator together with a dynamic temperature schedule, it reaches the target sparsity while escaping non-sparse local minima, achieving state-of-the-art results on block and output-channel pruning across a range of computer vision tasks and models.
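To make the idea concrete, here is a minimal sketch of one common way to build a differentiable Top-k mask with a temperature parameter: a sigmoid centered between the k-th and (k+1)-th largest importance scores, which relaxes toward a hard top-k selection as the temperature is annealed to zero. The function name `soft_topk_mask` and the exact form of the relaxation are illustrative assumptions, not the specific operator used by SMART.

```python
import numpy as np

def soft_topk_mask(scores, k, temperature):
    """Differentiable relaxation of a top-k binary mask (illustrative sketch).

    Centers a sigmoid between the k-th and (k+1)-th largest scores, so
    roughly the k highest-scoring entries receive mask values near 1.
    As temperature -> 0, the mask approaches a hard top-k selection.
    """
    sorted_scores = np.sort(scores)
    # Threshold halfway between the k-th and (k+1)-th largest scores.
    tau = 0.5 * (sorted_scores[-k] + sorted_scores[-k - 1])
    # Clip the argument to keep np.exp numerically safe at tiny temperatures.
    z = np.clip((scores - tau) / temperature, -60.0, 60.0)
    return 1.0 / (1.0 + np.exp(-z))

# Annealing the temperature sharpens the mask toward a hard top-2 selection.
scores = np.array([0.1, 2.0, -1.0, 3.0, 0.5])
for T in (1.0, 0.1, 0.01):
    print(T, np.round(soft_topk_mask(scores, k=2, temperature=T), 3))
```

At high temperature the mask is soft, so gradients flow to all importance scores; annealing gradually commits the mask to the target sparsity, which is the mechanism that lets such pruners escape non-sparse local minima.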