Skip to content

DAILY INTELLIGENCE · AI APP + VLA

研究日報研究日报

AI App 精選與 VLA 論文評級的每日合輯AI App 精选与 VLA 论文评级的每日合辑下方匯總近 7 天的信號總覽,歸檔索引可按領域過濾下方汇总近 7 天的信号总览,归档索引可按领域过滤

近 7 日匯總近 7 日汇总 04-14 → 04-24
33 AI精選AI精选
157 VLA論文VLA论文
⚡ 4 突破
🔧 63 工具/技術工具/技术
📖 90 背景/觀點背景/观点
近 3 天內容近 3 天内容 04-22 → 04-24
2026-04-24 最新 44
小米 MiMo-V2.5 发布:罗福莉领衔,对标 Claude Opus 4.6 / GPT-5.4 AI OpenAI Codex 推出 Automations:定时/触发式自动执行任务 AI OpenAI Codex 上线 Plugins & Skills 体系:连接外部工具 + 可复用工作流 AI LiteParse for the Web:纯浏览器端 PDF 空间文本解析 AI GitHub Trending:huggingface/ml-intern(开源ML工程师)和 claude-context(代码搜索MCP) AI Object-centric task representation and transfer using diffused orientation fields VLA From autonomy to alliance: Robotic foundation models must learn with us, not just for us VLA ​Boston Dynamics and Google DeepMind Teach Spot to Reason​ VLA JoyAI-RA 0.1: A Foundation Model for Robotic Autonomy VLA LLM-Guided Safety Agent for Edge Robotics with an ISO-Compliant Perception-Compute-Control Architecture VLA Cortex 2.0: Grounding World Models in Real-World Industrial Deployment VLA A Vision-Language-Action Model for Adaptive Ultrasound-Guided Needle Insertion and Needle Tracking VLA Bimanual Robot Manipulation via Multi-Agent In-Context Learning VLA Temporal Difference Calibration in Sequential Tasks: Application to Vision-Language-Action Models VLA FingerEye: Continuous and Unified Vision-Tactile Sensing for Dexterous Manipulation VLA Visual-Tactile Peg-in-Hole Assembly Learning from Peg-out-of-Hole Disassembly VLA PokeVLA: Empowering Pocket-Sized Vision-Language-Action Model with Comprehensive World Knowledge Guidance VLA Rodrigues Network for Learning Robot Actions VLA MATT-Diff: Multimodal Active Target Tracking by Diffusion Policy VLA OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction VLA Evolvable Embodied Agent for Robotic Manipulation via Long Short-Term Reflection and Optimization VLA HELM: Harness-Enhanced Long-horizon Memory for Vision-Language-Action Manipulation VLA Gated Memory Policy VLA RoboWM-Bench: A Benchmark for Evaluating World Models in Robotic Manipulation VLA VLA Foundry: A Unified Framework for Training Vision-Language-Action Models VLA FASTER: Value-Guided Sampling for Fast RL VLA UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling VLA PhysMem: Scaling Test-time Physical Memory for Robot Manipulation VLA ExpertGen: Scalable Sim-to-Real Expert Policy Learning from Imperfect Behavior Priors VLA ARM: Advantage Reward Modeling for Long-Horizon Manipulation VLA If you're waiting for a sign... that might not be it! Mitigating Trust Boundary Confusion from Visual Injections on Vision-Language Agentic Systems VLA EmbodiedMidtrain: Bridging the Gap between Vision-Language Models and Vision-Language-Action Models via Mid-training VLA DeVI: Physics-based Dexterous Human-Object Interaction via Synthetic Video Imitation VLA Closed-loop tactile-visual interactivity via chip-free luminescent fibers enabled by capacitive coupling VLA A time-stamping tactile sensor enabled by pseudoconductive interface design at dielectric heterojunctions VLA ETac: A Lightweight and Efficient Tactile Simulation Framework for Learning Dexterous Manipulation VLA VTouch++: A Multimodal Dataset with Vision-Based Tactile Enhancement for Bimanual Manipulation VLA CubeDAgger: Interactive Imitation Learning for Dynamic Systems with Efficient yet Low-risk Interaction VLA CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence VLA Mask World Model: Predicting What Matters for Robust Robot Policy Learning VLA Curiosity-Critic: Cumulative Prediction Error Improvement as a Tractable Intrinsic Reward for World Model Training VLA Safety-Critical Contextual Control via Online Riemannian Optimization with World Models VLA Cloning Deterministic Worlds: The Critical Role of Latent Geometry in Long-Horizon World Models VLA X-Cache: Cross-Chunk Block Caching for Few-Step Autoregressive World Models Inference VLA
2026-04-23 昨日 24
ChatGPT Images 2.0 发布:首个「会思考」的图像模型 AI Firefox 150 用 Claude Mythos 发现 271 个漏洞 AI OpenAI 推出 ChatGPT for Clinicians,对认证医师免费 AI AWS SageMaker AI 推出 GenAI 推理优化推荐 AI FASTER: Value-Guided Sampling for Fast RL VLA Gated Memory Policy VLA RoboWM-Bench: A Benchmark for Evaluating World Models in Robotic Manipulation VLA Wrench-Aware Admittance Control for Unknown-Payload Manipulation VLA VLA Foundry: A Unified Framework for Training Vision-Language-Action Models VLA UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling VLA PhysMem: Scaling Test-time Physical Memory for Robot Manipulation VLA ExpertGen: Scalable Sim-to-Real Expert Policy Learning from Imperfect Behavior Priors VLA ARM: Advantage Reward Modeling for Long-Horizon Manipulation VLA Drift-Based Policy Optimization: Native One-Step Policy Learning for Online Robot Control VLA HELM: Harness-Enhanced Long-horizon Memory for Vision-Language-Action Manipulation VLA LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation VLA MultiWorld: Scalable Multi-Agent Multi-View Video World Models VLA Closed-loop tactile-visual interactivity via chip-free luminescent fibers enabled by capacitive coupling VLA A time-stamping tactile sensor enabled by pseudoconductive interface design at dielectric heterojunctions VLA Mask World Model: Predicting What Matters for Robust Robot Policy Learning VLA Flow-Opt: Scalable Centralized Multi-Robot Trajectory Optimization with Flow Matching and Differentiable Optimization VLA Curiosity-Critic: Cumulative Prediction Error Improvement as a Tractable Intrinsic Reward for World Model Training VLA Safety-Critical Contextual Control via Online Riemannian Optimization with World Models VLA Cloning Deterministic Worlds: The Critical Role of Latent Geometry in Long-Horizon World Models VLA
2026-04-22 2日前 35
SpaceX 洽以 600 亿美元收购 Cursor AI Kimi K2.6 正式开源:SWE-Bench Pro 58.6%、300 Agent 集群 AI Claude Code 从 Pro 套餐中移除 AI Vercel OAuth 供应链攻击:Context.ai 泄露成突破口 AI CrabTrap:Brex 开源 LLM-as-a-judge Agent 安全代理 AI nateherkai/token-dashboard:Claude Code token 消耗可视化 AI Demonstrate once, execute on many: Kinematic intelligence for cross-robot skill transfer VLA A careful examination of large behavior models for multitask dexterous manipulation VLA XEmbodied: A Foundation Model with Enhanced Geometric and Physical Cues for Large-Scale Embodied Environments VLA Test-Time Perturbation Learning with Delayed Feedback for Vision-Language-Action Models VLA Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining VLA ReconVLA: An Uncertainty-Guided and Failure-Aware Vision-Language-Action Framework for Robotic Control VLA Rewind-IL: Online Failure Detection and State Respawning for Imitation Learning VLA LongBench: Evaluating Robotic Manipulation Policies on Real-World Long-Horizon Tasks VLA Chain Of Interaction Benchmark (COIN): When Reasoning meets Embodied Interaction VLA GaLa: Hypergraph-Guided Visual Language Models for Procedural Planning VLA FLASH: Fast Learning via GPU-Accelerated Simulation for High-Fidelity Deformable Manipulation in Minutes VLA OmniVLA-RL: A Vision-Language-Action Model with Spatial Understanding and Online RL VLA AnchorRefine: Synergy-Manipulation Based on Trajectory Anchor and Residual Refinement for Vision-Language-Action Models VLA ReFineVLA: Multimodal Reasoning-Aware Generalist Robotic Policies via Teacher-Guided Fine-Tuning VLA OFlow: Injecting Object-Aware Temporal Flow Matching for Robust Robotic Manipulation VLA ST-$\pi$: Structured SpatioTemporal VLA for Robotic Manipulation VLA StableIDM: Stabilizing Inverse Dynamics Model against Manipulator Truncation via Spatio-Temporal Refinement VLA Unmasking the Illusion of Embodied Reasoning in Vision-Language-Action Models VLA COFFAIL: A Dataset of Successful and Anomalous Robot Skill Executions in the Context of Coffee Preparation VLA Can Explicit Physical Feasibility Benefit VLA Learning? An Empirical Study VLA SafeVLA: Towards Safety Alignment of Vision-Language-Action Model via Constrained Learning VLA On the Importance of Tactile Sensing for Imitation Learning: A Case Study on Robotic Match Lighting VLA UniDomain: Pretraining a Unified PDDL Domain from Real-World Demonstrations for Generalizable Robot Task Planning VLA Contact-Rich Robotic Assembly in Construction via Diffusion Policy Learning VLA Stable Language Guidance for Vision-Language-Action Models VLA ROBOGATE: Adaptive Failure Discovery for Safe Robot Policy Deployment via Two-Stage Boundary-Focused Sampling VLA World-Value-Action Model: Implicit Planning for Vision-Language-Action Systems VLA InternScenes: A Large-scale Simulatable Indoor Scene Dataset with Realistic Layouts VLA DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models VLA

歸檔索引归档索引

73 期
— 2026 年 4 月 —
AI 最新
小米 MiMo-V2.5 发布:罗福莉领衔,对标 Claude Opus 4.6 / GPT-5.4
5 篇
VLA 最新
🔧 15 📖 24 Object-centric task representation and transfer using diffused orientation fields
39 篇
AI 昨日
ChatGPT Images 2.0 发布:首个「会思考」的图像模型
4 篇
VLA 昨日
🔧 6 📖 14 FASTER: Value-Guided Sampling for Fast RL
20 篇
AI 2日前
SpaceX 洽以 600 亿美元收购 Cursor
6 篇
VLA 2日前
🔧 21 📖 8 Demonstrate once, execute on many: Kinematic intelligence for cross-robot skill transfer
29 篇
AI
Anthropic 未来十年将在 AWS 投入超 1000 亿美元
6 篇
AI
Headless everything for personal AI
3 篇
AI
DeepSeek 启动首轮外部融资,估值超 100 亿美元
3 篇
AI
Cursor 洽談融資 20 億美元,NVIDIA 計劃參與
6 篇
VLA
⚡ 2 🔧 4 📖 9 Model-Based Reinforcement Learning Exploits Passive Body Dynamics for High-Performance Biped Robot Locomotion
15 篇
AI
OpenAI 发布 Codex 重大更新:可与用户协同操作电脑进行持续性工作
6 篇
VLA
⚡ 1 🔧 7 📖 12 Jump-Start Reinforcement Learning with Vision-Language-Action Regularization
20 篇
AI
Claude Opus 4.7 刚刚曝光,Claude Code 一夜重构,7x24 小时替你打工
8 篇
VLA
⚡ 1 🔧 5 📖 11 Habitat-GS: A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting
17 篇
AI
Tell HN: Fiverr left customer files public and searchable via Cloudinary
6 篇
AI
OpenAI:微软限制了我们接触客户的能力
5 篇
VLA
🔧 5 📖 12 AssemLM: Spatial Reasoning Multimodal Large Language Models for Robotic Assembly
17 篇
AI
OpenAI 估值 8520 亿美元:CEO 零持股,股东博弈白热化
2 篇
AI
Small models also found the vulnerabilities that Mythos found
3 篇
VLA
📖 2 LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation
2 篇
AI
Sam Altman 住所遭燃烧瓶袭击,嫌疑人被捕
3 篇
VLA
🔧 5 📖 23 LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation
28 篇
AI
AWS Bedrock AgentCore:在 React 应用中嵌入实时 AI 浏览器 Agent
6 篇
VLA
⚡ 1 🔧 5 📖 14 RichMap: A Reachability Map Balancing Precision, Efficiency, and Flexibility for Rich Robot Manipulation Tasks
20 篇
AI
OpenAI 发布儿童安全蓝图(Child Safety Blueprint)
7 篇
VLA
🔧 8 📖 18 ICR-Drive: Instruction Counterfactual Robustness for End-to-End Language-Driven Autonomous Driving
26 篇
AI
Anthropic Claude Mythos Preview 发布(安全研究限定)
6 篇
VLA
🔧 6 📖 31 Diffusion Policy with Bayesian Expert Selection for Active Multi-Target Tracking
37 篇
AI
博通 - 谷歌-Anthropic 达成长期 TPU 供应协议(至 2031 年)
5 篇
VLA
🔧 3 📖 12 F2F-AP: Flow-to-Future Asynchronous Policy for Real-time Dynamic Manipulation
15 篇
AI
2M 周健康保险咨询:ChatGPT 成美国医疗「隐形基础设施」
5 篇
AI
Anthropic 封杀 OpenClaw 等第三方 Claude API 访问
6 篇
AI
Vulnerability Research Is Cooked
6 篇
VLA
🔧 6 📖 19 Causal Scene Narration with Runtime Safety Supervision for Vision-Language-Action Driving
25 篇
AI
Codex now offers more flexible pricing for teams
5 篇
VLA
🔧 5 📖 5 Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning
10 篇
AI
OpenAI 为 Claude Code 发布 Codex 插件
4 篇
VLA
🔧 2 📖 6 Object Affordance Recognition and Grounding via Multi-scale Cross-modal Representation Learning
8 篇
AI
OpenAI 完成新一轮融资,估值达 $852B
7 篇
VLA
🔧 5 📖 23 ViPRA: Video Prediction for Robot Actions
28 篇
— 2026 年 3 月 —
AI
Mr. Chatterbox:维多利亚时代伦理训练的可本地运行模型
5 篇
VLA
🔧 3 📖 14 MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model
17 篇
AI
ChatGPT won't let you type until Cloudflare reads your React state
6 篇
AI
Domscribe:给 AI 编程 Agent 装上「前端透视眼」
6 篇
AI
We Rewrote JSONata with AI in a Day, Saved $500K/Year
6 篇
VLA
⚡ 1 🔧 5 📖 29 FODMP: Fast One-Step Diffusion of Movement Primitives Generation for Time-Dependent Robot Actions
35 篇
VLA
🔧 5 📖 19 NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
24 篇
AI
谷歌发布 Lyria 3 Pro 音乐生成模型,支持 3 分钟曲目
8 篇
VLA
🔧 3 📖 19 Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning
22 篇
AI
OpenAI 宣布关闭 Sora 视频生成服务
8 篇
VLA
⚡ 2 🔧 4 📖 5 Causal World Modeling for Robot Control
11 篇
VLA
🔧 2 📖 8 R2-Dreamer: Redundancy-Reduced World Models without Decoders or Augmentation
10 篇
VLA
📖 1 FASTER: Rethinking Real-Time Flow VLAs
1 篇
VLA
🔧 2 📖 11 Grounding Robot Generalization in Training Data via Retrieval-Augmented VLMs
13 篇
VLA
⚡ 2 🔧 5 📖 5 DexGrasp-Zero: A Morphology-Aligned Policy for Zero-Shot Cross-Embodiment Dexterous Grasping
12 篇
VLA
📖 3 Panoramic Affordance Prediction
3 篇
VLA
📖 30 MoE-ACT: Scaling Multi-Task Bimanual Manipulation with Sparse Language-Conditioned Mixture-of-Experts Transformers
30 篇
VLA
📖 3 Robot-mediated haptic feedback outperforms vision in violin duo coordination
3 篇
VLA
⚡ 1 🔧 2 📖 8 Cross-embodied Co-design for Dexterous Hands
11 篇
VLA
🔧 3 📖 8 RL-100: Performant Robotic Manipulation with Real-World Reinforcement Learning
11 篇
VLA
⚡ 1 🔧 3 📖 25 Evolving Diffusion and Flow Matching Policies for Online Reinforcement Learning
29 篇
VLA
⚡ 1 🔧 5 📖 10 What if? Emulative Simulation with World Models for Situated Reasoning
16 篇
VLA
📖 1 Beyond the Patch: Exploring Vulnerabilities of Visuomotor Policies via Viewpoint-Consistent 3D Adversarial Object
1 篇
VLA
📖 9 Beyond Pixel Histories: World Models with Persistent 3D State
9 篇
VLA
📖 9 Next Embedding Prediction Makes World Models Stronger
9 篇
VLA
🔧 5 📖 19 Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons
24 篇
VLA
🔧 5 📖 15 Beyond State-Wise Mirror Descent: Offline Policy Optimization with Parameteric Policies
20 篇
VLA
📖 1 UCM: Unifying Camera Control and Memory with Time-aware Positional Encoding Warping for World Models
1 篇
VLA
📖 1 When to Act, Ask, or Learn: Uncertainty-Aware Policy Steering
1 篇
— 2026 年 2 月 —
VLA
🔧 2 📖 1 On Sample-Efficient Generalized Planning via Learned Transition Models
3 篇
VLA
🔧 4 📖 21 Provably Safe Generative Sampling with Constricting Barrier Functions
25 篇
VLA
🔧 11 📖 15 Path-Decoupled Hyperbolic Flow Matching for Few-Shot Adaptation
26 篇