Introducing Smart, a framework to minimize inference costs of Large Language Models while ensuring accuracy guarantees.