Robotics Diffusion Transformer

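Xiaomi open-sources its first-generation robot VLA large model, Xiaomi-Robotics-0

ITHome (IT之家), February 12: Xiaomi today released the open-source VLA model Xiaomi-Robotics-0, which has 4.7 billion parameters, combines vision-language understanding with high-performance real-time execution, and sets new records on multiple ...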

9 days ago

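Xiaomi open-sources its first robot VLA large model, Xiaomi-Robotics-0: backed by 4.7 billion parameters, physical intelligence ...

Xiaomi today officially open-sourced its first robot vision-language-action (VLA) large model, Xiaomi-Robotics-0, a move the article says injects new vitality into the robotics field. With 4.7 billion parameters, the model makes notable progress in vision-language understanding and high-performance real-time execution, and on multiple SOTA ...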

9 days ago

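Xiaomi open-sources a 4.7-billion-parameter robot large model; "brain + cerebellum" architecture sets new records on three tests ...

On February 12, Xiaomi officially announced that it is open-sourcing its first-generation robot VLA (Vision-Language-Action) large model, Xiaomi-Robotics-0. According to Xiaomi's official technology channel, the model has 4.7 billion parameters, combines vision-language understanding with high-performance real-time execution, and has set new records on multiple benchmarks. Architecturally, Xiaomi-Robotics-0 uses a "brain + cerebellum" hybrid design. The vision-language "brain" is built on a multimodal VLM as its base and is responsible for understanding human natural language ...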

9 days ago on MSN

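Xiaomi releases and open-sources the VLA model Xiaomi-Robotics-0, combining high performance with generalizable physical intelligence

Xiaomi today officially launched the open-source vision-language-action (VLA) model Xiaomi-Robotics-0. With 4.7 billion parameters and a distinctive architecture, the model achieved breakthrough results in both simulation tests and real-robot tasks. Its core strength is physical intelligence that closes the "perception-decision-execution" loop. It can run real-time inference on consumer-grade GPUs, bringing a new technical paradigm to robotics.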

Gasgoo (盖世汽车) on MSN

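Xiaomi Robotics open-sources the VLA model Xiaomi-Robotics-0

On February 12, Xiaomi's Lei Jun disclosed on Weibo that the Xiaomi robotics team has officially open-sourced Xiaomi-Robotics-0, a 4.7-billion-parameter embodied-intelligence VLA model. The model uses a Mixture-of-Transformers hybrid architecture, and on the three major simulation ... LIBERO, CALVIN, and SimplerEnv ...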

9 days ago on MSN

Xiaomi announces Xiaomi-Robotics-0, its first-generation robot large-scale model

Xiaomi is best known for smartphones, smart home gear, and the occasional electric vehicle update. Now it wants a place in robotics research too. The company has announced Xiaomi-Robotics-0, an ...

CU Boulder News & Events

CSCI 7000 - Transformers for Robotics

Deep neural networks based on self-attention are revolutionizing robotics with their ability to perform "open world" reasoning across multiple modalities including text and images, and their ability ...

Yahoo

Diffusion transformers are the key behind OpenAI's Sora -- and they're set to upend GenAI

OpenAI's Sora, which can generate videos and interactive 3D environments on the fly, is a remarkable demonstration of the cutting edge in GenAI -- a bona fide milestone. But curiously, one of the ...
