Close Menu
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    TIMES24H
    • Hot!
      1. Vietnam
      2. Asia
      3. Video
      Featured
      Hai Sau Sau (266) Partners with Samsung to Drive “One Samsung” Strategy in Vietnam

      Hai Sau Sau (266) Partners with Samsung to Drive “One Samsung” Strategy in Vietnam

      By Mike HarrisonNovember 13, 20250
      Recent
      Hai Sau Sau (266) Partners with Samsung to Drive “One Samsung” Strategy in Vietnam

      Hai Sau Sau (266) Partners with Samsung to Drive “One Samsung” Strategy in Vietnam

      November 13, 2025
      TechTimes Editors’ Choice 2024: 9Fit eBiz Mag Stand NFC Wallet – The Most Unique Mobile Accessory

      TechTimes Editors’ Choice 2024: 9Fit eBiz Mag Stand NFC Wallet – The Most Unique Mobile Accessory

      January 8, 2025

      BCP Vietnam and Vitalify Asia Launch the First A.I-Powered Business Matching Platform

      December 20, 2024
    • World
      • PR Newswire
      • Media Outreach
      • GLOBENEWSWIRE
    • Business
      Taiwan: The Global Powerhouse Shaping the Future of AI

      Taiwan: The Global Powerhouse Shaping the Future of AI

      August 29, 2025
      MEGA US EXPO 2025: A Hub for Innovation and Business Collaboration Between Vietnam and Korea

      MEGA US EXPO 2025: A Hub for Innovation and Business Collaboration Between Vietnam and Korea

      July 31, 2025
      Vietnamese Enterprises Engage with Global AI Innovations at COMPUTEX TAIPEI 2025

      Vietnamese Enterprises Engage with Global AI Innovations at COMPUTEX TAIPEI 2025

      May 19, 2025

      BCP Vietnam and Vitalify Asia Launch the First A.I-Powered Business Matching Platform

      December 20, 2024

      POPS Reaches Huge Milestone with 10,000 Enrolled Students

      December 16, 2021
    • Life
      1. Lifestyle
      2. Recipes
      3. Fashion
      4. View All
      3E Accounting Marks 15 Years of Excellence, Accelerating Global Business Growth with AI-Powered Efficiency

      3E Accounting Marks 15 Years of Excellence, Accelerating Global Business Growth with AI-Powered Efficiency

      May 25, 2026
      Phancy Group Announces Strong 2026 First Quarter Results

      Phancy Group Announces Strong 2026 First Quarter Results

      May 22, 2026
      Save the Children Hong Kong Releases "Hearing Children" – Child-led Research Report: How Family Interactions Affect Youth Mental Health

      Save the Children Hong Kong Releases “Hearing Children” – Child-led Research Report: How Family Interactions Affect Youth Mental Health

      May 22, 2026
      TCMA Marks National Milestone, Driving Thailand’s Cement Industry toward Net Zero 2050

      TCMA Marks National Milestone, Driving Thailand’s Cement Industry toward Net Zero 2050

      May 22, 2026

      Cooking tips for a smaller Thanksgiving celebration

      November 18, 2020

      Hanoi: A capital, and a kingdom of egg coffee shops

      November 16, 2020

      4 must-try recipes when you travel to Vietnam

      November 7, 2020

      Cutting-Edge Technology for Top Dentists

      December 24, 2021

      H&M faces boycott in Vietnam over “problematic map”

      April 7, 2021
      Pierre Cardin

      Ground-breaking French designer Pierre Cardin dies aged 98

      December 30, 2020
      JESSICA SIMPSON

      #HealthGoals: Jessica Simpson shows off 100 lbs weight loss in Christmas pajamas

      December 27, 2020

      Plane captain dies during Miami-Chile flight

      August 17, 2023

      French paintings of Vietnamese life a century ago exhibited in HCMC

      August 17, 2023

      Judge says accused TV contest not rigged

      August 17, 2023

      I don’t know how to tell my Christian parents-in-law I want a divorce

      August 17, 2023
    • Sport
    • Tech
      1. Gadgets
      2. View All
      9Fit and DTR Launch Vietnam’s First Smart Ring: A Leap Towards the Future of Wearable Technology

      9Fit and DTR Launch Vietnam’s First Smart Ring: A Leap Towards the Future of Wearable Technology

      December 12, 2024

      “Stupid windman” PC assembly experience based on Newegg ChatGPT

      March 29, 2023

      The value of the industrial cloud as an example of “the power of ecosystem, the power of expertise”

      March 29, 2023

      Machbase Releases Open Source Structured Time Series Database “Macbase Neo”

      March 28, 2023
      Taiwan Digital Day 2025

      Taiwan Digital Day 2025: Driving Vietnam-Taiwan Tech Collaboration in Ho Chi Minh City

      July 30, 2025
      Vietnamese Enterprises Engage with Global AI Innovations at COMPUTEX TAIPEI 2025

      Vietnamese Enterprises Engage with Global AI Innovations at COMPUTEX TAIPEI 2025

      May 19, 2025
      9Fit and DTR Launch Vietnam’s First Smart Ring: A Leap Towards the Future of Wearable Technology

      9Fit and DTR Launch Vietnam’s First Smart Ring: A Leap Towards the Future of Wearable Technology

      December 12, 2024

      “Stupid windman” PC assembly experience based on Newegg ChatGPT

      March 29, 2023
    Media Outreach Newswire
    TIMES24H
    Home»GLOBENEWSWIRE»The Evolution of Generalist Embodied
    GLOBENEWSWIRE

    The Evolution of Generalist Embodied

    GLOBENEWSWIREBy GLOBENEWSWIREMarch 11, 2025No Comments6 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    The Evolution of Generalist Embodied
    Share
    Facebook Twitter LinkedIn Pinterest Email

    Shanghai, China , March 11, 2025 (GLOBE NEWSWIRE) — Today, AgiBot launches Genie Operator-1 (GO-1), an innovative generalist embodied foundation model. GO-1 introduces the novel Vision-Language-Latent-Action (ViLLA) framework, combining a Vision-Language Model (VLM) and Mixture of Experts (MoE). The VLM utilizes internet-scale heterogeneous data to establish a solid foundation for scene and object understanding. The MoE consists of two key components: the Latent Planner, which learns from cross-embodiment and human operation data to develop general action understanding, and the Action Expert, which uses over a million real robot demonstrations to achieve high-frequency and dexterous manipulation. 

    These components work in synergy, providing GO-1’s unique capabilities:

    • Learning from Human Videos
    • Few-shot Generalization
    • Cross-Embodiment Adaptation
    • Continuous Self-Evolution

    Paper: https://agibot-world.com/blog/agibot_go1.pdf

    YouTube Link: https://youtu.be/9dvygD4G93c

    At the end of 2024, AgiBot launched the AgiBot World dataset, a large-scale, high-quality real world robotics dataset comprising over 1 million trajectories across 217 tasks in five application domains. Building on top of AgiBot World, today AgiBot introduces Genie Operator-1 (GO-1), a generalist embodied foundation model.

    GO-1: An Evolution from VLA to ViLLA

    To maximize the value of the high-quality AgiBot World dataset as well as web-scale heterogeneous videos while improving the policy’s generalization capability, AgiBot proposes a hierarchical Vision-Language-Latent-Action (ViLLA) framework. Compared to the Vision-Language-Action (VLA) model, where actions are directly conditioned on vision and language inputs, the ViLLA model predicts latent action tokens, bridging the gap between image-text inputs and robot actions generated by the action expert.

    The ViLLA framework consists of a VLM and MoE. The VLM uses massive multimodal data on the internet to obtain general scene understanding and language comprehension. The Latent Planner in MoE harnesses data from various embodiments and human actions to build action comprehension. Meanwhile, the Action Expert, trained with over a million real world robot demonstrations, refines action execution. During inference, the VLM, Latent Planner, and Action Expert cooperate as follows:

    1. VLM: Using the InternVL-2B model, it processes multi-view images, force signals and language inputs to provide scene understanding and instruction comprehension.
    2. Latent Planner: This expert predicts Latent Action Tokens based on intermediate outputs from the VLM, forming a Chain of Planning (CoP) for general action understanding and planning.
    3. Action Expert: It generates the final fine-grained action sequences based on intermediate outputs from the VLM and the Latent Action Tokens.

     

    The following is an introduction to the two key components of MoE: Latent Planner and Action Expert.

     

    Expert 1: Latent Planner

    Although the AgiBot World dataset is the largest real world robot dataset globally, the volume of action-labeled robot data remains limited relative to internet-scale datasets. To address this, AgiBot employs latent actions to model the inverse dynamics of consecutive frames. This approach enables the transfer of real-world dynamics from heterogeneous data sources into universal manipulation knowledge.

    • Latent Action Model (LAM): This model extracts the ground truth of Latent Actions between current and historical frames, consisting of an encoder and a decoder.
    1. The encoder employs a spatial-temporal transformer with causal temporal masks.
    2. The decoder uses a spatial transformer, taking the initial frame and discretizing Latent Action Tokens as input.
    3. Latent Action Tokens are quantized using VQ-VAE.
    • Latent Planner: The Latent Planner is responsible for predicting discrete Latent Action Tokens. It shares the same Transformer architecture as the VLM backbone but utilizes two independent sets of Feed-Forward Networks (FFN) and Q/K/V/O (Query, Key, Value, Output) projection matrices. The Latent Planner integrates intermediate VLM outputs layer-by-layer and  is trained using cross entropy loss.

    Expert 2: Action Expert

    To achieve high-frequency and dexterous manipulation, AgiBot integrates an action expert that utilizes a diffusion objective to model the continuous distribution of low-level actions.

    • The Action Expert shares the same architectural design as the Latent Planner, utilizing the same Transformer backbone as the VLM but with two independent sets of Feed-Forward Networks (FFN) and Q/K/V/O (Query, Key, Value, Output) projection matrices. It employs a denoising process to iteratively regress the action sequence.
    • The Action Expert is hierarchically integrated with the VLM and Latent Planner, ensuring consistency in information flow and collaborative optimization.

    Experimental Results

     

    Using the novel Vision-Language-Latent-Action (ViLLA) framework, AgiBot evaluated GO-1 across five tasks of varying complexity. Compared to current state-of-the-art models, GO-1 significantly outperforms them, increasing success rates by 32% (46% → 78%). Notably, tasks like “Pour Water” and “Restock Beverage” showed remarkable improvements. Furthermore, AgiBot validated the contribution of the Latent Planner within the ViLLA framework, showing a 12% success rate improvement (66% → 78%).

    GO-1: Comprehensive Innovation of Embodied Intelligence

    AgiBot GO-1 leverages human and diverse types of robot data, enabling robots to acquire revolutionary learning capabilities. It can generalize across various environments and objects, quickly adapt to new tasks, and learn new skills. At the same time, it can be deployed across various robotic embodiments, enabling efficient implementation and continuous evolution in real-world environments.

    The key characteristics of GO-1 can be summarized as follows:

    • Learning from Human Videos:GO-1 can learn from internet videos and real human demonstrations to enhance its understanding of human actions.
    • Few-Shot Generalization:GO-1’s strong generalization ability enables fast adaptation to new scenes and tasks with minimal data, even in zero-shot scenarios, resulting in very low post-training costs.
    • Cross-Embodiment Adaptation:GO-1 is a generalist robot policy model, capable of transferring between different kinds of robots and quickly adapting to various embodiments.
    • Continuous Self-Evolution:GO-1 can continuously evolve from data generated by issues encountered during real-world execution, within AgiBot’s complete data feedback system.

    The launch of GO-1 marks a rapid advancement of embodied intelligence towards generalization, openness, and enhanced capabilities: 

    • From Single Task to Multi-Task: Robots can now perform multiple tasks across diverse scenarios without needing to retrain for each new task.
    • From Closed Environments to Open Worlds: Robots are no longer limited to controlled lab settings but can operate in dynamic real-world environments.
    • From Predefined Programs to Instruction Generalization: Robots can now understand and follow natural language instructions, reasoning and combining tasks based on semantics, rather than being confined to predefined programs.

    AgiBot GO-1 will accelerate the widespread adoption of embodied intelligence, transforming robots from task-specific tools into autonomous agents with general intelligence. It will play a greater role across various domains, including manufacturing, service, and household applications, paving the way for a more versatile and intelligent future.

    AgiBot official website:

    https://www.linkedin.com/feed/update/urn:li:activity:7304747190139150338

    https://fb.watch/yefx6B0bsC/

    
                

    Nguồn: GLOBENEWSWIRE – Đơn vị phát hành hoàn toàn chịu trách nhiệm về nội dung thông báo này.

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

    Related Posts

    CADian

    WIZCORE giới thiệu CADian – giải pháp thay thế AutoCAD trong xu hướng chuyển đổi sang CAD hợp pháp tại Việt Nam

    May 26, 2026
    Google Cloud Security Uses Instruqt Platform to Train 150+ Practitioners on Agentic AI at Google Next 2026

    Google Cloud Security Uses Instruqt Platform to Train 150+ Practitioners on Agentic AI at Google Next 2026

    May 24, 2026
    U.S. Air Force 309th Software Engineering Group Selects Rise8 to Accelerate Delivery of Torque, the Enterprise System Powering Aircraft Readiness

    U.S. Air Force 309th Software Engineering Group Selects Rise8 to Accelerate Delivery of Torque, the Enterprise System Powering Aircraft Readiness

    May 21, 2026
    Leave A Reply Cancel Reply

    Latest News
    CADian

    WIZCORE giới thiệu CADian – giải pháp thay thế AutoCAD trong xu hướng chuyển đổi sang CAD hợp pháp tại Việt Nam

    May 26, 2026

    Xinhua Silk Road: Tea-themed educational tour boosts economy in Mengzhuang Town, Shandong Province

    May 26, 2026

    Huawei Hosts 3rd Global C&I Visionaries Summit, Shaping a Greener Future Across Diverse Industries

    May 25, 2026

    Cyient Semiconductors Announces Strategic Financing with Edelweiss at ~ USD 500 Mn. Equity Valuation

    May 25, 2026
    DMCA.com Protection Status
    Facebook X (Twitter) Instagram Pinterest

    © 2026 TIMES24H. All rights reserved

    TIMES24H is a global news platform delivering timely, reliable, and insightful coverage across technology, business, lifestyle, and current affairs. Our mission is to provide readers with clear perspectives and trusted information to navigate a fast-changing world.

    Type above and press Enter to search. Press Esc to cancel.