AudioChatLlama: Extending Large Language Models with General-Purpose Speech Capabilities
AudioChatLlama is an end-to-end large language model that can directly process and respond to audio prompts, maintaining the wide range of original text-based capabilities without using carefully curated paired data.