Overview

Overview of flow-based action adaptation with GLOVES.

Method Comparison

Real Robot: Charger Insertion

01

VLA(Finetuned Flower)

Base VLA policy

Zoom-in view
Normal view
02

VLA+FPAS

Flow-prior action sampling

Zoom-in view
Normal view
03

VLA+FEEG

Energy-guided flow editing

Zoom-in view
Normal view
04

VLA+IFAE

Inversion-free editing

Zoom-in view
Normal view

Real Robot: Cup Serve

01

VLA(Finetuned Flower)

Collision failure

Red: collision after 2.5s
02

VLA+FPAS

Minor collision

Yellow: minor collision after 2.5s
03

VLA+FEEG

Successful serving

Green: final success state
04

VLA+IFAE

Successful serving

Green: final success state

Abstract

Leveraging prior knowledge from pretrained policies, foundation models, or human operators offers an efficient alternative to learning robot skills from scratch. However, these agents often provide actions that are suboptimal, noisy, or misaligned with task-specific expert behavior. We propose GLOVES, a family of flow-based adaptation methods that correct non-expert actions by transporting them toward an expert action distribution. Rather than replacing agentic control with full autonomy, GLOVES performs selective action-level adaptation, improving task success while preserving agent intent. The learned flow also provides a natural in-distribution scoring mechanism through reverse flow evaluation. We use this signal as an intervention gate: actions that appear consistent with the expert distribution are passed through unchanged, while anomalous or out-of-distribution (OOD) actions are corrected. In this way, assistance is only provided when necessary. GLOVES requires only limited expert supervision, using a small number of demonstrations or reusable successful skill segments. By learning local expert action patterns and stitching them during execution, GLOVES provides a lightweight shared-control module for robust action adaptation across tasks and environments.

Methods

A. FPAS

B. FEEG

C. IFAE

VLA + Assistive Policy

Interactive Results

Slalom

Hover over a bar to inspect values.

† denotes OOD-gated variants.

IL + Wrapper Agents

Leaderboard

Method Top1 Top2 Top3 Rank Success Gain

† denotes OOD-gated variants.

Condition Top 3