Alibaba Qwen 2.5 Omni AI Model With Real-Time Speech Generation Released

Alibaba’s Qwen team released a new artificial intelligence (AI) model in the Qwen 2.5 family on Wednesday. Dubbed Qwen 2.5 Omni, it is a flagship-tier end-to-end multimodal model. The company claims it can process a wide range of inputs, including text, images, audio, and video, while generating text and natural speech responses in real time.

OpenAI Adds Image Generation Capability to GPT-4o, Can Render Text and Offers Prompt-Based Editing

OpenAI added image generation capability to its existing GPT-4o artificial intelligence (AI) model on Tuesday. The San Francisco-based AI firm released the 4o Image Generation model and integrated it into GPT-4o. The company said the image generator focuses on usefulness rather than decoration. It comes with accurate text rendering, high prompt ad...

Nvidia Releases Cosmos-Transfer1 AI Model That Can Be Used for Simulation-Based Training for Robots

Nvidia released a new artificial intelligence (AI) model last week that can be used to train robots in simulation. Dubbed Cosmos-Transfer1, the new model is aimed at AI-powered robotics hardware, also known as physical AI. The company has released the model in open source with a permissive licence, and interested individuals can download it from ...