VLA(Finetuned Flower)
Base VLA policy
1Toyota Technological Institute at Chicago 2Stony Brook University
*Equal contribution
Base VLA policy
Flow-prior action sampling
Energy-guided flow editing
Inversion-free editing
Collision failure
Minor collision
Successful serving
Successful serving
Leveraging prior knowledge from pretrained policies, foundation models, or human operators offers an efficient alternative to learning robot skills from scratch. However, these agents often provide actions that are suboptimal, noisy, or misaligned with task-specific expert behavior. We propose GLOVES, a family of flow-based adaptation methods that correct non-expert actions by transporting them toward an expert action distribution. Rather than replacing agentic control with full autonomy, GLOVES performs selective action-level adaptation, improving task success while preserving agent intent. The learned flow also provides a natural in-distribution scoring mechanism through reverse flow evaluation. We use this signal as an intervention gate: actions that appear consistent with the expert distribution are passed through unchanged, while anomalous or out-of-distribution (OOD) actions are corrected. In this way, assistance is only provided when necessary. GLOVES requires only limited expert supervision, using a small number of demonstrations or reusable successful skill segments. By learning local expert action patterns and stitching them during execution, GLOVES provides a lightweight shared-control module for robust action adaptation across tasks and environments.
Interactive Results
Hover over a bar to inspect values.
†denotes OOD-gated variants.
Leaderboard
| Method | Top1 | Top2 | Top3 | Rank | Success | Gain |
|---|
†denotes OOD-gated variants.