ViPRA: Video Prediction for Robot Actions
Sandeep Routray et al. · Video prediction to policy conversion, unlabeled video usage.
Sandeep Routray et al. · Video prediction to policy conversion, unlabeled video usage.
Haonan Chen et al. · Generative VLM as world model for zero-shot manipulation.
Ningyu Yan et al. · GPU-accelerated visuotactile sim for online RL and zero-shot deployment.
Hongyu Yan et al. · Progress guidance for VLA diffusion policy, solves long-horizon termination.
Daichi Yashima et al. · Tokenization-free flow matching policy, efficient autoregressive learning.
Yuan Xu et al. · Egocentric viewpoint augmentation for IL robustness.
Masaki Murooka et al. · Open-source IL framework for real/sim pipeline.
Vineet Bhat et al. · Integrates depth/3D context into VLA for task generalization.
Weiwei Gu et al. · Dialogue-based continual skill learning, interaction focus.
Yichi Zhang et al. · VLA attention improvement, overlaps with tracked Selective Perception [💧灌水].
Philip Schroeder et al. · Uses VLM reasoning as reward signal for on-robot RL.
Yiran Shi et al. · Overlaps with tracked StreamVLA, efficiency optimization [💧灌水].
Yu Sun et al. · New benchmark for reasoning-oriented manipulation evaluation.
Feiyu Jia et al. · Tactile glove hardware for dexterous teleoperation data collection.
Weiguang Zhao et al. · Teleoperation framework for dynamic object catching tasks.
Robin Kühn et al. · Empirical study on sensor setups for ACT on humanoids.
Akash Anand et al. · Routing mechanism for selecting 3D reconstruction models based on cost.
Solvin Sigurdson et al. · Spectral decomposition for inverse dynamics in model-based control.
Siyi Lang et al. · Control policy for space robotic manipulation of dynamic targets.
Yu Zhang et al. · Experience replay modification for stable RL in manipulation.
Hao Li et al. · Extends UMI framework to underwater manipulation without teleop.
Irvin Steve Cardenas et al. · ROS 2 framework for agentic robot control integration.
Boyang Cai et al. · Systematic study on multi-view demonstration benefits and synthesis.
Sanjay Oruganti et al. · Cognitive architecture for safety-critical robot teaming with LLMs.
Farong Wang et al. · Safe trajectory execution via surface-constrained warping and pose projection.
Christopher Agia · Uses 3D scene graphs for task-driven perception and planning.
Uses VLE to reduce oracle feedback cost in preference RL.
Benchmark for paraphrase robustness in VLA models on LIBERO.