Efficient Block-Max Pruning for Faster Learned Sparse Retrieval
Block-Max Pruning (BMP) is a dynamic pruning strategy for efficiently processing indexes generated by learned sparse retrieval models. It outperforms existing methods in both safe and approximate retrieval scenarios.
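
To make the terms "block-max", "safe", and "approximate" concrete, the following is a minimal Python sketch of block-based dynamic pruning in general. It is not the BMP implementation. It assumes non-negative term weights, a toy in-memory index of the form `{term: {doc_id: weight}}`, and fixed-size blocks of consecutive document IDs. The names `build_block_max` and `search`, and the parameters `block_size` and `alpha`, are hypothetical and chosen for illustration only.

```python
import heapq
from collections import defaultdict


def build_block_max(postings, block_size):
    """Map each term to {block_id: largest weight of that term in the block}."""
    block_max = {}
    for term, docs in postings.items():
        per_block = defaultdict(float)
        for doc_id, weight in docs.items():
            block_id = doc_id // block_size
            per_block[block_id] = max(per_block[block_id], weight)
        block_max[term] = dict(per_block)
    return block_max


def search(query, postings, block_max, block_size, k, alpha=1.0):
    """Return the top-k (score, doc_id) pairs, highest score first.

    alpha == 1.0 gives safe pruning (same top-k scores as exhaustive search,
    assuming non-negative weights). alpha < 1.0 is an assumed relaxation that
    stops earlier and may miss some results.
    """
    # Upper bound on the score of any document inside each block.
    bounds = defaultdict(float)
    for term, q_weight in query.items():
        for block_id, max_weight in block_max.get(term, {}).items():
            bounds[block_id] += q_weight * max_weight

    heap = []  # min-heap of (score, doc_id); heap[0] is the current threshold
    # Visit blocks from the most to the least promising.
    for block_id, upper in sorted(bounds.items(), key=lambda item: -item[1]):
        # No remaining block can beat the k-th best score: stop early.
        if len(heap) == k and alpha * upper <= heap[0][0]:
            break
        start = block_id * block_size
        for doc_id in range(start, start + block_size):
            score = sum(
                q_weight * postings.get(term, {}).get(doc_id, 0.0)
                for term, q_weight in query.items()
            )
            if score <= 0.0:
                continue
            if len(heap) < k:
                heapq.heappush(heap, (score, doc_id))
            elif score > heap[0][0]:
                heapq.heappushpop(heap, (score, doc_id))
    return sorted(heap, reverse=True)


if __name__ == "__main__":
    postings = {
        "a": {0: 1.0, 5: 3.0, 9: 0.5},
        "b": {1: 2.0, 5: 1.0, 12: 4.0},
    }
    block_max = build_block_max(postings, block_size=4)
    print(search({"a": 1.0, "b": 0.5}, postings, block_max, block_size=4, k=2))
```

In this sketch, sorting blocks by their upper bound means the loop can stop at the first block whose bound does not exceed the current k-th best score, since every later block has an equal or smaller bound. Scoring every document ID in a block by dictionary lookup keeps the example short; a real index would traverse compressed posting lists instead.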