OpenAI Releases GPT-4.5 with Broad Reasoning Improvements
GPT-4.5 brings stronger multimodal capabilities and long-context reasoning, making it one of the most anticipated model updates for developers.
NaviAI: Stay up to date with the latest AI developments
Google Gemini 2.0 Ultra outperforms rival models on multiple benchmarks, with standout video understanding and code generation performance.
Sora 2.0 introduces longer-duration, higher-resolution video generation and can now create clips up to five minutes long.
Midjourney v7 introduces a new rendering engine with three times faster generation and much better visual consistency.
Claude 3.7 Sonnet stands out in coding and mathematical reasoning, and its extended thinking mode digs deeper into complex problems.
Chinese large-model vendors continue pushing voice interaction and multimodal understanding, with Doubao and Kimi both shipping updates.
AI agents are moving from single tasks to multi-step autonomous execution, and multiple companies are launching products that can replace manual operations.
Stability AI releases Stable Diffusion 4.0 with higher-resolution output and finer style control.