Fusion-in-T5 (FiT5) is a unified document ranking model that integrates text matching, ranking features, and global document information through attention fusion, outperforming complex cascade ranking pipelines.
Generative relevance feedback and adaptive re-ranking can improve passage retrieval performance, with the most effective approach combining generative pseudo-relevance feedback and adaptive re-ranking over a large corpus graph.
Seismic, a novel approximate nearest neighbor algorithm, enables efficient and effective retrieval over learned sparse embeddings by leveraging the concentration of importance property in these representations.
KamerRaad is an AI tool that leverages large language models and hierarchical summarization to enhance citizens' ability to interactively engage with and understand Belgian parliamentary proceedings.
Providing counterfactual queries as explanations to help users understand and interact with search engine relevance decisions.
Existing retrieval models struggle to effectively comprehend exclusionary queries, where users explicitly express what they do not want to retrieve. Generative retrieval models exhibit unique advantages in handling such queries compared to sparse and dense retrieval methods.
A two-step approach to efficiently approximate SPLADE, a learned sparse retrieval model, while maintaining its effectiveness. The first step uses a pruned and reweighted version of SPLADE vectors for fast retrieval, and the second step rescores a sample of documents using the original SPLADE vectors.
A novel framework, Term-Set Generation (TSGen), that uses a set of terms as the document identifier (DocID) to address the false pruning problem in existing generative retrieval methods.
Varying the query generation techniques significantly impacts the computed retrievability scores, posing challenges for result reproducibility across different document collections.
Expanding the corpus with linked entity names can boost the performance of both sparse and dense retrievers in the early stage of cascaded ranking architectures.