The performance of tools that analyze Stack Overflow content depends heavily on the choice of representation model for Stack Overflow posts. This study evaluates a range of representation models, including Stack Overflow-specific models and general and domain-specific transformer-based models, and proposes SOBERT, a model obtained by further pre-training on Stack Overflow data that consistently outperforms the others.




Evaluating Representation Models for Analyzing Stack Overflow Posts