ProCQA is a large-scale dataset extracted from StackOverflow that offers mixed-modal (text and code) QA pairs for programming question answering; models trained on it show significant performance improvements on code retrieval benchmarks.
ProCQA introduces a large-scale programming question answering dataset from StackOverflow, improving code retrieval models.