One-liner. The first survey to treat variable impedance from both the control and the learning side jointly, proposing a clean taxonomy — Variable Impedance Control (VIC), Variable Impedance Learning (VIL), and the merged Variable Impedance Learning Control (VILC) — that gives you a vocabulary for "how should a robot modulate its stiffness/damping during contact, and where does that modulation policy come from?"
Robots leaving the cage and into unstructured human spaces must exploit
contact forces safely. Impedance control (Hogan 1985) makes the end-effector
behave like a virtual spring–damper, trading off compliance vs. tracking
to avoid large impact forces and absorb position uncertainty. But a
fixed impedance is rarely right for a whole task: turning a valve,
inserting a peg, opening a door, or co-lifting a table with a human each demand
different stiffness at different phases. Hence variable impedance
— modulating K (stiffness) and D (damping)
online. The authors note prior surveys (Vanderborght et al. 2013, Calanca et
al. 2015, Keemink et al. 2018, Song et al. 2019) cover impedance hardware,
compliance, admittance, or general impedance control, but none focuses on the
control + learning approaches to variability specifically (Table 1).
This is positioned as the first such review.
The organizing contribution is the taxonomy in Fig 2. Mechanical impedance splits into Constant (standard impedance control, out of scope) and Variable impedance, which the survey divides into three branches:
VIC — Variable Impedance Control (Section 2). The
control-theoretic branch: hand-designed laws that vary the gains. The standard
interaction model (Eqs 3–4) is the spring–damper of Eqs 1–2
with time-subscripted gains KPt,
DVt. A representative online rule is
kt = k0 + α et2
(Eq 5): stiffen as tracking error grows. Two sub-themes: stability
& passivity (§2.1 — Lyapunov- and passivity-based
guarantees for time-varying gains; energy-tank / port-Hamiltonian methods,
Ferraguti et al.; state-independent stability conditions, Kronander &
Billard) and human-in-the-loop (§2.2 — tele-impedance,
EMG-estimated human stiffness as an "intention" signal, multimodal HRI
interfaces).
VIL — Variable Impedance Learning (Section 3). Treats
finding the gains as a supervised-learning problem over human demonstrations,
decoupled from the controller: learn a (nonlinear) mapping
K̂Pt = φK(xt, ẋt, fet, θK)
(Eqs 6–7) from demos, then feed it to a separate VIC at run time
(Fig 4). Dominated by imitation learning / LfD:
GMM/GMR encodings (Calinon, Kronander & Billard), and crucially the
Riemannian / SPD-manifold insight — stiffness and damping are
Symmetric Positive Definite matrices, so naive vectorization discards geometric
structure; tensor-based GMM/GMR and geometry-aware DMPs (Abu-Dakka et al. 2018;
Abu-Dakka & Kyrki 2020) learn directly on the manifold.
VILC — Variable Impedance Learning Control (Section 4).
The branch where learning and control cannot be cleanly separated
because data collection depends on the controller (Fig 5).
Three sub-categories: imitation (§4.1 — e.g.
i-MOGIC, Khansari-Zadeh et al. 2014, with Lyapunov-proved stability),
iterative learning (§4.2 — ILC-based, Eq 9:
ur+1,t = ur,t + γr er,t,
modified to track an impedance target rather than a trajectory), and
reinforcement learning (§4.3 — VIC as a parameterized
policy, Eq 10; PI2 with DMP/SEDS parameterizations, Buchli et al.
2011, Rey et al. 2018; Natural Actor-Critic; the "safe exploration" problem of
keeping the robot in a Lyapunov-stable safe set during learning, Khader et al.
2021).
The headline deliverables are the three-way taxonomy (Fig 2) and the advantages/disadvantages comparison (Table 2), reproduced in condensed form:
| Approach | Advantages | Disadvantages |
|---|---|---|
| Stability & passivity of VIC (control) | Efficient, accurate; provable stability/passivity for safe interaction | Rely on accurate system models, which are nontrivial to derive; less general |
| Human-in-the-loop | Human reacts where AI is unsure; human impedance is a good target | Needs anatomy priors, complex multi-sensor setup, calibration; human error / poor repeatability |
| Imitation learning (VIL) | User-friendly; humans naturally demonstrate good impedance | Quality bounded by teacher; some tasks hard to demonstrate; human→robot transfer imperfect |
| Iterative learning (VILC) | Compute- and data-efficient; analytic convergence proofs | Target impedance must be hand-defined; needs many same-condition repetitions; poor generalization |
| Reinforcement learning (VILC) | Can discover policies for complex, hard-to-model tasks; good transferability | Data-hungry (esp. model-free); safety constraints limit exploration → risk of suboptimal policies |
Cross-cutting takeaways the survey emphasizes:
From the authors:
What I noticed reading it:
This is a classical-foundations anchor, not a manipulation
result to build on directly — but it is the load-bearing reference for the
control-theory half of the batch thesis. The big idea I'm chasing is that many
manipulation predicates (is_inserted, is_screwed_tight,
surface_is_rough) are not visually evaluable — they
live in touch/force. Variable impedance is the classical machinery for
acting on exactly those force-defined situations: when a peg
is_inserted, you want low stiffness to comply; when it isn't yet,
you want a search behavior with the right damping. Where
BLADE
hides all continuous parameters (force, grasp pose, pour amount) inside a
diffusion policy and keeps the abstraction layer purely categorical, this survey
is the catalogue of how the field has historically parameterized and learned
the force-modulation policy that such a body would need. The "Why I care"
of the BLADE summary flags exactly this — force/continuous parameters as a
new contribution area, marrying a symbolic body with continuous constraint specs
— and VIC/VIL/VILC is the prior-art vocabulary for that body's controller.
Concretely useful framings: (i) the SPD-manifold point (impedance gains have
geometric structure a flat policy discards) generalizes beyond impedance to any
learned continuous controller parameter; (ii) the VIL/VILC split mirrors the
"learn the parameters separately vs. jointly with the controller" question that
recurs in Reactive
Diffusion Policy and force-aware VLAs. It is firmly a control/learning
survey with no language and no perception — I'm filing it as the classical
counterweight that tells me what the modern force-aware-VLA papers are (often
implicitly) re-deriving.
In this context, variable impedance control arises as a powerful tool to modulate the robot's behavior in response to variations in its surroundings. — Abstract
Approaches for Variable Impedance Learning (VIL) treat the problem of finding variable impedance gains as a supervised learning problem … On the contrary, Variable Impedance Learning Control (VILC) approaches attempt to directly learn a variable impedance control law. — §1 / p.4 (taxonomy)
At the current stage, none of the approaches has all the features that a variable impedance behavior requires. — §6 Concluding remarks / p.19
Papers cited here that should likely be ingested next (forward references):
Newly ingested in 2026-06-24 batch — directly relevant: