Disaggregating Response-Level Feedback into Sentence-Level Scores for Improved Language Model Tuning
Methods to disaggregate response-level labels into sentence-level (pseudo-)labels, leveraging multiple instance learning, learning from label proportions, and prior information, to train specialized models for improved sentence-level scoring across various natural language tasks.