Meta Llama3 8b with Llava Multimodal capabilities
Clone a voice and generate speech from text
Generate images from text prompts