研究日報 — 近7日匯總 | AI App + VLA | Pulsar

近 7 日匯總近 7 日汇总 04-14 → 04-24

33 AI精選AI精选

157 VLA論文VLA论文

⚡ 4 突破

🔧 63 工具/技術工具/技术

📖 90 背景/觀點背景/观点

⚡ 本週最強信號本周最强信号

World-Value-Action Model: Implicit Planning for Vision-Language-Action Systems CS.RO · 04-18
Towards Deploying VLA without Fine-Tuning: Plug-and-Play Inference-Time VLA Policy Steering via Embodied Evolutionary Diffusion CS.RO · 04-18

AI 今日 · 04-24 VLA 今日 · 04-24

近 3 天內容近 3 天内容 04-22 → 04-24

2026-04-24 最新 44 條条

小米 MiMo-V2.5 发布：罗福莉领衔，对标 Claude Opus 4.6 / GPT-5.4 AI OpenAI Codex 推出 Automations：定时/触发式自动执行任务 AI OpenAI Codex 上线 Plugins & Skills 体系：连接外部工具 + 可复用工作流 AI LiteParse for the Web：纯浏览器端 PDF 空间文本解析 AI GitHub Trending：huggingface/ml-intern（开源ML工程师）和 claude-context（代码搜索MCP） AI Object-centric task representation and transfer using diffused orientation fields VLA From autonomy to alliance: Robotic foundation models must learn with us, not just for us VLA Boston Dynamics and Google DeepMind Teach Spot to Reason VLA JoyAI-RA 0.1: A Foundation Model for Robotic Autonomy VLA LLM-Guided Safety Agent for Edge Robotics with an ISO-Compliant Perception-Compute-Control Architecture VLA Cortex 2.0: Grounding World Models in Real-World Industrial Deployment VLA A Vision-Language-Action Model for Adaptive Ultrasound-Guided Needle Insertion and Needle Tracking VLA Bimanual Robot Manipulation via Multi-Agent In-Context Learning VLA Temporal Difference Calibration in Sequential Tasks: Application to Vision-Language-Action Models VLA FingerEye: Continuous and Unified Vision-Tactile Sensing for Dexterous Manipulation VLA Visual-Tactile Peg-in-Hole Assembly Learning from Peg-out-of-Hole Disassembly VLA PokeVLA: Empowering Pocket-Sized Vision-Language-Action Model with Comprehensive World Knowledge Guidance VLA Rodrigues Network for Learning Robot Actions VLA MATT-Diff: Multimodal Active Target Tracking by Diffusion Policy VLA OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction VLA Evolvable Embodied Agent for Robotic Manipulation via Long Short-Term Reflection and Optimization VLA HELM: Harness-Enhanced Long-horizon Memory for Vision-Language-Action Manipulation VLA Gated Memory Policy VLA RoboWM-Bench: A Benchmark for Evaluating World Models in Robotic Manipulation VLA VLA Foundry: A Unified Framework for Training Vision-Language-Action Models VLA FASTER: Value-Guided Sampling for Fast RL VLA UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling VLA PhysMem: Scaling Test-time Physical Memory for Robot Manipulation VLA ExpertGen: Scalable Sim-to-Real Expert Policy Learning from Imperfect Behavior Priors VLA ARM: Advantage Reward Modeling for Long-Horizon Manipulation VLA If you're waiting for a sign... that might not be it! Mitigating Trust Boundary Confusion from Visual Injections on Vision-Language Agentic Systems VLA EmbodiedMidtrain: Bridging the Gap between Vision-Language Models and Vision-Language-Action Models via Mid-training VLA DeVI: Physics-based Dexterous Human-Object Interaction via Synthetic Video Imitation VLA Closed-loop tactile-visual interactivity via chip-free luminescent fibers enabled by capacitive coupling VLA A time-stamping tactile sensor enabled by pseudoconductive interface design at dielectric heterojunctions VLA ETac: A Lightweight and Efficient Tactile Simulation Framework for Learning Dexterous Manipulation VLA VTouch++: A Multimodal Dataset with Vision-Based Tactile Enhancement for Bimanual Manipulation VLA CubeDAgger: Interactive Imitation Learning for Dynamic Systems with Efficient yet Low-risk Interaction VLA CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence VLA Mask World Model: Predicting What Matters for Robust Robot Policy Learning VLA Curiosity-Critic: Cumulative Prediction Error Improvement as a Tractable Intrinsic Reward for World Model Training VLA Safety-Critical Contextual Control via Online Riemannian Optimization with World Models VLA Cloning Deterministic Worlds: The Critical Role of Latent Geometry in Long-Horizon World Models VLA X-Cache: Cross-Chunk Block Caching for Few-Step Autoregressive World Models Inference VLA

2026-04-23 昨日 24 條条

ChatGPT Images 2.0 发布：首个「会思考」的图像模型 AI Firefox 150 用 Claude Mythos 发现 271 个漏洞 AI OpenAI 推出 ChatGPT for Clinicians，对认证医师免费 AI AWS SageMaker AI 推出 GenAI 推理优化推荐 AI FASTER: Value-Guided Sampling for Fast RL VLA Gated Memory Policy VLA RoboWM-Bench: A Benchmark for Evaluating World Models in Robotic Manipulation VLA Wrench-Aware Admittance Control for Unknown-Payload Manipulation VLA VLA Foundry: A Unified Framework for Training Vision-Language-Action Models VLA UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling VLA PhysMem: Scaling Test-time Physical Memory for Robot Manipulation VLA ExpertGen: Scalable Sim-to-Real Expert Policy Learning from Imperfect Behavior Priors VLA ARM: Advantage Reward Modeling for Long-Horizon Manipulation VLA Drift-Based Policy Optimization: Native One-Step Policy Learning for Online Robot Control VLA HELM: Harness-Enhanced Long-horizon Memory for Vision-Language-Action Manipulation VLA LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation VLA MultiWorld: Scalable Multi-Agent Multi-View Video World Models VLA Closed-loop tactile-visual interactivity via chip-free luminescent fibers enabled by capacitive coupling VLA A time-stamping tactile sensor enabled by pseudoconductive interface design at dielectric heterojunctions VLA Mask World Model: Predicting What Matters for Robust Robot Policy Learning VLA Flow-Opt: Scalable Centralized Multi-Robot Trajectory Optimization with Flow Matching and Differentiable Optimization VLA Curiosity-Critic: Cumulative Prediction Error Improvement as a Tractable Intrinsic Reward for World Model Training VLA Safety-Critical Contextual Control via Online Riemannian Optimization with World Models VLA Cloning Deterministic Worlds: The Critical Role of Latent Geometry in Long-Horizon World Models VLA

2026-04-22 2日前 35 條条

SpaceX 洽以 600 亿美元收购 Cursor AI Kimi K2.6 正式开源：SWE-Bench Pro 58.6%、300 Agent 集群 AI Claude Code 从 Pro 套餐中移除 AI Vercel OAuth 供应链攻击：Context.ai 泄露成突破口 AI CrabTrap：Brex 开源 LLM-as-a-judge Agent 安全代理 AI nateherkai/token-dashboard：Claude Code token 消耗可视化 AI Demonstrate once, execute on many: Kinematic intelligence for cross-robot skill transfer VLA A careful examination of large behavior models for multitask dexterous manipulation VLA XEmbodied: A Foundation Model with Enhanced Geometric and Physical Cues for Large-Scale Embodied Environments VLA Test-Time Perturbation Learning with Delayed Feedback for Vision-Language-Action Models VLA Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining VLA ReconVLA: An Uncertainty-Guided and Failure-Aware Vision-Language-Action Framework for Robotic Control VLA Rewind-IL: Online Failure Detection and State Respawning for Imitation Learning VLA LongBench: Evaluating Robotic Manipulation Policies on Real-World Long-Horizon Tasks VLA Chain Of Interaction Benchmark (COIN): When Reasoning meets Embodied Interaction VLA GaLa: Hypergraph-Guided Visual Language Models for Procedural Planning VLA FLASH: Fast Learning via GPU-Accelerated Simulation for High-Fidelity Deformable Manipulation in Minutes VLA OmniVLA-RL: A Vision-Language-Action Model with Spatial Understanding and Online RL VLA AnchorRefine: Synergy-Manipulation Based on Trajectory Anchor and Residual Refinement for Vision-Language-Action Models VLA ReFineVLA: Multimodal Reasoning-Aware Generalist Robotic Policies via Teacher-Guided Fine-Tuning VLA OFlow: Injecting Object-Aware Temporal Flow Matching for Robust Robotic Manipulation VLA ST-$\pi$: Structured SpatioTemporal VLA for Robotic Manipulation VLA StableIDM: Stabilizing Inverse Dynamics Model against Manipulator Truncation via Spatio-Temporal Refinement VLA Unmasking the Illusion of Embodied Reasoning in Vision-Language-Action Models VLA COFFAIL: A Dataset of Successful and Anomalous Robot Skill Executions in the Context of Coffee Preparation VLA Can Explicit Physical Feasibility Benefit VLA Learning? An Empirical Study VLA SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning VLA On the Importance of Tactile Sensing for Imitation Learning: A Case Study on Robotic Match Lighting VLA UniDomain: Pretraining a Unified PDDL Domain from Real-World Demonstrations for Generalizable Robot Task Planning VLA Contact-Rich Robotic Assembly in Construction via Diffusion Policy Learning VLA Stable Language Guidance for Vision-Language-Action Models VLA ROBOGATE: Adaptive Failure Discovery for Safe Robot Policy Deployment via Two-Stage Boundary-Focused Sampling VLA World-Value-Action Model: Implicit Planning for Vision-Language-Action Systems VLA InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts VLA DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models VLA

歸檔索引归档索引

73 期

— 2026 年 4 月 —

AI 2026 · 04 · 24 最新

小米 MiMo-V2.5 发布：罗福莉领衔，对标 Claude Opus 4.6 / GPT-5.4

VLA 2026 · 04 · 24 最新

🔧 15 📖 24 Object-centric task representation and transfer using diffused orientation fields

AI 2026 · 04 · 23 昨日

ChatGPT Images 2.0 发布：首个「会思考」的图像模型

VLA 2026 · 04 · 23 昨日

🔧 6 📖 14 FASTER: Value-Guided Sampling for Fast RL

AI 2026 · 04 · 22 2日前

SpaceX 洽以 600 亿美元收购 Cursor

VLA 2026 · 04 · 22 2日前

🔧 21 📖 8 Demonstrate once, execute on many: Kinematic intelligence for cross-robot skill transfer

AI 2026 · 04 · 21

Anthropic 未来十年将在 AWS 投入超 1000 亿美元

AI 2026 · 04 · 20

Headless everything for personal AI

AI 2026 · 04 · 19

DeepSeek 启动首轮外部融资，估值超 100 亿美元

AI 2026 · 04 · 18

Cursor 洽談融資 20 億美元，NVIDIA 計劃參與

VLA 2026 · 04 · 18

⚡ 2 🔧 4 📖 9 Model-Based Reinforcement Learning Exploits Passive Body Dynamics for High-Performance Biped Robot Locomotion

AI 2026 · 04 · 17

OpenAI 发布 Codex 重大更新：可与用户协同操作电脑进行持续性工作

VLA 2026 · 04 · 17

⚡ 1 🔧 7 📖 12 Jump-Start Reinforcement Learning with Vision-Language-Action Regularization

AI 2026 · 04 · 16

Claude Opus 4.7 刚刚曝光，Claude Code 一夜重构，7x24 小时替你打工

VLA 2026 · 04 · 16

⚡ 1 🔧 5 📖 11 Habitat-GS: A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting

AI 2026 · 04 · 15

Tell HN: Fiverr left customer files public and searchable via Cloudinary

AI 2026 · 04 · 14

OpenAI：微软限制了我们接触客户的能力

VLA 2026 · 04 · 14

🔧 5 📖 12 AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly

AI 2026 · 04 · 13

OpenAI 估值 8520 亿美元：CEO 零持股，股东博弈白热化

AI 2026 · 04 · 12

Small models also found the vulnerabilities that Mythos found

VLA 2026 · 04 · 12

📖 2 LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation

AI 2026 · 04 · 11

Sam Altman 住所遭燃烧瓶袭击，嫌疑人被捕

VLA 2026 · 04 · 11

🔧 5 📖 23 LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation

AI 2026 · 04 · 10

AWS Bedrock AgentCore：在 React 应用中嵌入实时 AI 浏览器 Agent

VLA 2026 · 04 · 10

⚡ 1 🔧 5 📖 14 RichMap: A Reachability Map Balancing Precision, Efficiency, and Flexibility for Rich Robot Manipulation Tasks

AI 2026 · 04 · 09

OpenAI 发布儿童安全蓝图（Child Safety Blueprint）

VLA 2026 · 04 · 09

🔧 8 📖 18 ICR-Drive: Instruction Counterfactual Robustness for End-to-End Language-Driven Autonomous Driving

AI 2026 · 04 · 08

Anthropic Claude Mythos Preview 发布（安全研究限定）

VLA 2026 · 04 · 08

🔧 6 📖 31 Diffusion Policy with Bayesian Expert Selection for Active Multi-Target Tracking

AI 2026 · 04 · 07

博通 - 谷歌-Anthropic 达成长期 TPU 供应协议（至 2031 年）

VLA 2026 · 04 · 07

🔧 3 📖 12 F2F-AP: Flow-to-Future Asynchronous Policy for Real-time Dynamic Manipulation

AI 2026 · 04 · 06

2M 周健康保险咨询：ChatGPT 成美国医疗「隐形基础设施」

AI 2026 · 04 · 05

Anthropic 封杀 OpenClaw 等第三方 Claude API 访问

AI 2026 · 04 · 04

Vulnerability Research Is Cooked

VLA 2026 · 04 · 04

🔧 6 📖 19 Causal Scene Narration with Runtime Safety Supervision for Vision-Language-Action Driving

AI 2026 · 04 · 03

Codex now offers more flexible pricing for teams

VLA 2026 · 04 · 03

🔧 5 📖 5 Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

AI 2026 · 04 · 02

OpenAI 为 Claude Code 发布 Codex 插件

VLA 2026 · 04 · 02

🔧 2 📖 6 Object Affordance Recognition and Grounding via Multi-scale Cross-modal Representation Learning

AI 2026 · 04 · 01

OpenAI 完成新一轮融资，估值达 $852B

VLA 2026 · 04 · 01

🔧 5 📖 23 ViPRA: Video Prediction for Robot Actions

— 2026 年 3 月 —

AI 2026 · 03 · 31

Mr. Chatterbox：维多利亚时代伦理训练的可本地运行模型

VLA 2026 · 03 · 31

🔧 3 📖 14 MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model

AI 2026 · 03 · 30

ChatGPT won't let you type until Cloudflare reads your React state

AI 2026 · 03 · 29

Domscribe：给 AI 编程 Agent 装上「前端透视眼」

AI 2026 · 03 · 28

We Rewrote JSONata with AI in a Day, Saved $500K/Year

VLA 2026 · 03 · 28

⚡ 1 🔧 5 📖 29 FODMP: Fast One-Step Diffusion of Movement Primitives Generation for Time-Dependent Robot Actions

VLA 2026 · 03 · 27

🔧 5 📖 19 NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

AI 2026 · 03 · 26

谷歌发布 Lyria 3 Pro 音乐生成模型，支持 3 分钟曲目

VLA 2026 · 03 · 26

🔧 3 📖 19 Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning

AI 2026 · 03 · 25

OpenAI 宣布关闭 Sora 视频生成服务

VLA 2026 · 03 · 25

⚡ 2 🔧 4 📖 5 Causal World Modeling for Robot Control

VLA 2026 · 03 · 24

🔧 2 📖 8 R2-Dreamer: Redundancy-Reduced World Models without Decoders or Augmentation

VLA 2026 · 03 · 21

📖 1 FASTER: Rethinking Real-Time Flow VLAs

VLA 2026 · 03 · 20

🔧 2 📖 11 Grounding Robot Generalization in Training Data via Retrieval-Augmented VLMs

VLA 2026 · 03 · 19

⚡ 2 🔧 5 📖 5 DexGrasp-Zero: A Morphology-Aligned Policy for Zero-Shot Cross-Embodiment Dexterous Grasping

VLA 2026 · 03 · 18

📖 3 Panoramic Affordance Prediction

VLA 2026 · 03 · 17

📖 30 MoE-ACT: Scaling Multi-Task Bimanual Manipulation with Sparse Language-Conditioned Mixture-of-Experts Transformers

VLA 2026 · 03 · 14

📖 3 Robot-mediated haptic feedback outperforms vision in violin duo coordination

VLA 2026 · 03 · 13

⚡ 1 🔧 2 📖 8 Cross-embodied Co-design for Dexterous Hands

VLA 2026 · 03 · 12

🔧 3 📖 8 RL-100: Performant Robotic Manipulation with Real-World Reinforcement Learning

VLA 2026 · 03 · 11

⚡ 1 🔧 3 📖 25 Evolving Diffusion and Flow Matching Policies for Online Reinforcement Learning

VLA 2026 · 03 · 10

⚡ 1 🔧 5 📖 10 What if? Emulative Simulation with World Models for Situated Reasoning

VLA 2026 · 03 · 07

📖 1 Beyond the Patch: Exploring Vulnerabilities of Visuomotor Policies via Viewpoint-Consistent 3D Adversarial Object

VLA 2026 · 03 · 06

📖 9 Beyond Pixel Histories: World Models with Persistent 3D State

VLA 2026 · 03 · 05

📖 9 Next Embedding Prediction Makes World Models Stronger

VLA 2026 · 03 · 04

🔧 5 📖 19 Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons

VLA 2026 · 03 · 03

🔧 5 📖 15 Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parameteric Policies

VLA 2026 · 03 · 02

📖 1 UCM: Unifying Camera Control and Memory with Time-aware Positional Encoding Warping for World Models

VLA 2026 · 03 · 01

📖 1 When to Act, Ask, or Learn: Uncertainty-Aware Policy Steering

— 2026 年 2 月 —

VLA 2026 · 02 · 28

🔧 2 📖 1 On Sample-Efficient Generalized Planning via Learned Transition Models

VLA 2026 · 02 · 27

🔧 4 📖 21 Provably Safe Generative Sampling with Constricting Barrier Functions

VLA 2026 · 02 · 26

🔧 11 📖 15 Path-Decoupled Hyperbolic Flow Matching for Few-Shot Adaptation