Visual Spatial Reasoning

Causal reasoning meets visual representation learning: A prospective study

With the emergence of huge amounts of heterogeneous multi-modal data, including images, videos, texts/languages, audios, and multi-sensor data, deep learning-based methods have shown promising ...

Las Vegas Sun

Perceptron AI Launches Physical AI Model That Matches Frontier Labs at a Fraction of the Cost

Perceptron AI today announced the launch of its model purpose-built for video understanding and embodied reasoning. It delivers performance competitive with leading frontier models – including Google, ...

20 天

ChatGPT Image 2.0 Signals Visual Reasoning To Solve Real-World Tasks

ChatGPT Image 2.0 suggests that AI image generation is evolving into visual reasoning and verifiable AI, with implications for the future of physical intelligence.

来自MSN

Claude Code struggles with visual puzzles despite coding strengths

Claude Code, Anthropic’s AI coding assistant, excelled in text-based problem solving but faltered when tackling children’s visual puzzles like mazes and word placement. While it quickly generated ...

来自MSN

GPT Image 2 Transforms Creative Workflows with Precision and Reasoning

GPT Image 2 combines advanced reasoning, spatial accuracy, and multi-image generation to deliver production-ready visuals from complex prompts. Its flexible modes and integration into platforms like ...

28 天

Google Gemini Robotics-ER 1.6 update adds advanced reasoning and label identification for ...

Gemini Robotics-ER 1.6 improves robot reasoning, spatial understanding, and label identification, while adding instrument reading and safety upgrades to help machines perform real-world tasks with ...

一些您可能无法访问的结果已被隐去。

显示无法访问的结果