Google has launched Gemini Embedding 2, its first natively multimodal embedding model supporting text, images, video, audio, ...
OpenAI has released a new version of its text-to-video AI model, Sora, for ChatGPT Plus and Pro users, marking another step in expansion into multimodal AI technologies. The original Sora model, ...
Microsoft has introduced a new AI model that, it says, can process speech, vision, and text locally on-device using less compute capacity than previous models. Innovation in generative artificial ...