แนวคิดหลัก
Integrating multi-aspect query generation with advanced retrieval and reranking models, particularly learned sparse retrieval, significantly improves conversational search performance, surpassing even human-level query rewriting.
สถิติ
Automatic runs using the MQ4CS framework with learned sparse retrieval and ensemble reranking demonstrated a 2.3-point increase in Recall@100 and a 3.2-point increase in mAP compared to using a single query rewrite.
The best performing automatic run achieved a 1.5-point gain in nDCG@5 and a 6.8-point increase in nDCG compared to using a single query rewrite with the same reranking model.
Automatic runs outperformed manual runs (using human-generated rewrites) on nDCG, MRR, P@20, and mAP metrics.
Ensembling multiple cross-encoders for reranking consistently improved performance compared to using a single reranker.