The sender aims to make persuasive action recommendations to the receiver over time, while gradually learning the unknown prior distribution of the payoff-relevant state.
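
To make the two ingredients concrete, the following is a minimal sketch rather than the method of this work. It assumes a finite state space, a finite action set, and that the sender observes past state realizations. The names `empirical_prior`, `is_persuasive`, `scheme`, and `receiver_utility` are hypothetical. Here a recommendation scheme is called persuasive under a prior if, for every recommended action, the receiver cannot gain in expectation by deviating to another action.

```python
from collections import Counter


def empirical_prior(observed_states, num_states):
    """Estimate the prior from past state realizations (assumed observable)."""
    total = len(observed_states)
    if total == 0:
        # No data yet: fall back to a uniform guess (an assumption of this sketch).
        return [1.0 / num_states] * num_states
    counts = Counter(observed_states)
    return [counts[w] / total for w in range(num_states)]


def is_persuasive(prior, scheme, receiver_utility, tol=1e-12):
    """Check the receiver's obedience constraints under a given prior.

    scheme[w][a]           : probability of recommending action a in state w
    receiver_utility[a][w] : receiver's payoff from action a in state w
    """
    num_states = len(prior)
    num_actions = len(receiver_utility)
    for a in range(num_actions):
        for b in range(num_actions):
            # Unnormalized expected gain from following a instead of deviating to b.
            gain = sum(
                prior[w] * scheme[w][a]
                * (receiver_utility[a][w] - receiver_utility[b][w])
                for w in range(num_states)
            )
            if gain < -tol:
                return False
    return True


# Hypothetical example: state 0 = "bad", state 1 = "good";
# action 0 = "reject", action 1 = "accept".
receiver_utility = [[0.0, 0.0], [-1.0, 1.0]]
# Recommend "accept" always in the good state and with probability 0.4 in the bad state.
scheme = [[0.6, 0.4], [0.0, 1.0]]

prior_hat = empirical_prior([1, 0, 0, 0, 1, 0, 0, 0, 0, 1], num_states=2)
print(prior_hat, is_persuasive(prior_hat, scheme, receiver_utility))
```

In this toy instance the same scheme satisfies the constraints when the good state has estimated probability 0.3 but violates them at 0.2, which illustrates why errors in the estimated prior matter for persuasiveness.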