Evaluating Masked and Generative Language Models on Natural Language Inference for Clinical Trial Data
Comparing the performance of masked language models and generative language models on a natural language inference task for clinical trial data, focusing on metrics of faithfulness and consistency.