Core Concepts
The author presents Starling, an I/O-efficient disk-resident graph index framework optimizing data layout and search strategy for high-dimensional vector similarity search on data segments.
Abstract
The content discusses the challenges of high-dimensional vector similarity search (HVSS) on data segments and introduces Starling as a solution. It addresses issues with existing disk-based methods and proposes a new framework for efficient HVSS.
High-dimensional vector similarity search (HVSS) is crucial in various domains, but in-memory indexes face challenges with large datasets. Disk-based solutions like Starling optimize data layout and search strategy to improve efficiency. Existing methods fall short in balancing accuracy, efficiency, and space cost on data segments.
Starling's approach involves an in-memory navigation graph and reordered disk-based graph to reduce search path length and minimize disk bandwidth wastage. The block search strategy aims to minimize costly disk I/O operations during query execution. Through experiments, Starling shows superior performance compared to state-of-the-art methods.
Key metrics:
On a data segment with 2GB memory and 10GB disk capacity, Starling accommodates up to 33 million vectors in 128 dimensions.
Offers HVSS with over 0.9 average precision and top-10 recall rate.
Achieves latency under 1 millisecond.
Exhibits 43.9× higher throughput with 98% lower query latency compared to existing methods while maintaining accuracy.
Stats
On a data segment with 2GB memory and 10GB disk capacity, Starling can accommodate up to 33 million vectors in 128 dimensions.
Offers HVSS with over 0.9 average precision and top-10 recall rate.
Achieves latency under 1 millisecond.
Exhibits 43.9× higher throughput with 98% lower query latency compared to existing methods while maintaining accuracy.