VLA 線 · Vision-Language-Action

VLA 研究日報VLA 研究日报

VISION-LANGUAGE-ACTION · cs.RO + cs.AI + cs.LG

Vision-Language-Action（VLA）機器人系統 — 整合 cs.RO、cs.AI、cs.LG 三條 arxiv 流。重點追蹤 flow matching、世界模型、具身推理等前沿方向，每日 09:00 CST 由 Qwen3.5-Plus 自動評級Vision-Language-Action（VLA）机器人系统 — 整合 cs.RO、cs.AI、cs.LG 三条 arxiv 流。重点追踪 flow matching、世界模型、具身推理等前沿方向，每日 09:00 CST 由 Qwen3.5-Plus 自动评级。

— 2026 年 4 月 —

2026 · 04 · 24 今天

🔧 15 📖 24

Object-centric task representation and transfer using diffused orientation fields

2026 · 04 · 23 昨天

FASTER: Value-Guided Sampling for Fast RL

2026 · 04 · 22 2 天前

Demonstrate once, execute on many: Kinematic intelligence for cross-robot skill transfer

2026 · 04 · 18 6 天前

⚡ 2 🔧 4 📖 9

Model-Based Reinforcement Learning Exploits Passive Body Dynamics for High-Performance Biped Robot Locomotion

2026 · 04 · 17 7 天前

⚡ 1 🔧 7 📖 12

Jump-Start Reinforcement Learning with Vision-Language-Action Regularization

2026 · 04 · 16

⚡ 1 🔧 5 📖 11

Habitat-GS: A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting

2026 · 04 · 14

AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly

2026 · 04 · 12

LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation

2026 · 04 · 11

LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation

2026 · 04 · 10

⚡ 1 🔧 5 📖 14

RichMap: A Reachability Map Balancing Precision, Efficiency, and Flexibility for Rich Robot Manipulation Tasks

2026 · 04 · 09

ICR-Drive: Instruction Counterfactual Robustness for End-to-End Language-Driven Autonomous Driving

2026 · 04 · 08

Diffusion Policy with Bayesian Expert Selection for Active Multi-Target Tracking

2026 · 04 · 07

F2F-AP: Flow-to-Future Asynchronous Policy for Real-time Dynamic Manipulation

2026 · 04 · 04

Causal Scene Narration with Runtime Safety Supervision for Vision-Language-Action Driving

2026 · 04 · 03

Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

2026 · 04 · 02

Object Affordance Recognition and Grounding via Multi-scale Cross-modal Representation Learning

2026 · 04 · 01

ViPRA: Video Prediction for Robot Actions

— 2026 年 3 月 —

2026 · 03 · 31

MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model

2026 · 03 · 28

⚡ 1 🔧 5 📖 29

FODMP: Fast One-Step Diffusion of Movement Primitives Generation for Time-Dependent Robot Actions

2026 · 03 · 27

NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

2026 · 03 · 26

Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning

2026 · 03 · 25

⚡ 2 🔧 4 📖 5

Causal World Modeling for Robot Control

2026 · 03 · 24

R2-Dreamer: Redundancy-Reduced World Models without Decoders or Augmentation

2026 · 03 · 21

FASTER: Rethinking Real-Time Flow VLAs

2026 · 03 · 20

Grounding Robot Generalization in Training Data via Retrieval-Augmented VLMs

2026 · 03 · 19

⚡ 2 🔧 5 📖 5

DexGrasp-Zero: A Morphology-Aligned Policy for Zero-Shot Cross-Embodiment Dexterous Grasping

2026 · 03 · 18

Panoramic Affordance Prediction

2026 · 03 · 17

MoE-ACT: Scaling Multi-Task Bimanual Manipulation with Sparse Language-Conditioned Mixture-of-Experts Transformers

2026 · 03 · 14

Robot-mediated haptic feedback outperforms vision in violin duo coordination

2026 · 03 · 13

⚡ 1 🔧 2 📖 8

Cross-embodied Co-design for Dexterous Hands

2026 · 03 · 12

RL-100: Performant Robotic Manipulation with Real-World Reinforcement Learning

2026 · 03 · 11

⚡ 1 🔧 3 📖 25

Evolving Diffusion and Flow Matching Policies for Online Reinforcement Learning

2026 · 03 · 10

⚡ 1 🔧 5 📖 10

What if? Emulative Simulation with World Models for Situated Reasoning

2026 · 03 · 07

Beyond the Patch: Exploring Vulnerabilities of Visuomotor Policies via Viewpoint-Consistent 3D Adversarial Object

2026 · 03 · 06

Beyond Pixel Histories: World Models with Persistent 3D State

2026 · 03 · 05

Next Embedding Prediction Makes World Models Stronger

2026 · 03 · 04

Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons

2026 · 03 · 03

Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parameteric Policies

2026 · 03 · 02

UCM: Unifying Camera Control and Memory with Time-aware Positional Encoding Warping for World Models

2026 · 03 · 01

When to Act, Ask, or Learn: Uncertainty-Aware Policy Steering

— 2026 年 2 月 —

2026 · 02 · 28

On Sample-Efficient Generalized Planning via Learned Transition Models

2026 · 02 · 27

Provably Safe Generative Sampling with Constricting Barrier Functions

2026 · 02 · 26

🔧 11 📖 15

Path-Decoupled Hyperbolic Flow Matching for Few-Shot Adaptation