Efficient Multi-Vector Retrieval by Rethinking Token Retrieval
XTR, a simplified and efficient method for multi-vector retrieval, improves the initial token retrieval stage to enable scoring documents solely based on the retrieved tokens, greatly reducing the computational cost while achieving state-of-the-art performance.