netvsp & net_mana - instrument VTL0 & host calls at risk of hanging by ben-zen · Pull Request #3170 · microsoft/openvmm

ben-zen · 2026-04-01T00:20:33Z

There's a bunch of async operations with long or no timeout which are at risk of hanging the NetVSP code; adding spans to those actions makes them more obvious in tracing, with additional context to track down miscreant calls or make a misbehaving VM more obvious.

Copilot

Pull request overview

This PR improves observability around NetVSP and MANA endpoint operations that can take a long time or potentially hang, by adding tracing spans with relevant identifiers (instance/vport/VF IDs). This helps correlate stalls in async shutdown/reconfiguration paths when debugging misbehaving VMs or host/VTL0 interactions.

Changes:

Add an info_span around NetVSP coordinator endpoint shutdown.
Add info_span instrumentation around MANA endpoint teardown steps (RX disable, vport resource destruction, queue drain wait).
Add info_span instrumentation around Underhill VTL0 VF offer/remove notification calls, plus an explicit log when VTL0 bus isn’t present.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.

File	Description
vm/devices/net/netvsp/src/lib.rs	Instruments coordinator endpoint `stop().await` with an info span including `instance_id`.
vm/devices/net/net_mana/src/lib.rs	Adds `tracing::Instrument` and spans around endpoint stop/cleanup operations with `vport_id`.
openhcl/underhill_core/src/emuplat/netvsp.rs	Adds spans around VTL0 VF removal notice and VF offer call to better pinpoint stalls and add context.

vm/devices/net/netvsp/src/lib.rs

erfrimod

Added a comment to trace one more thing, but otherwise ready to go. :)

openhcl/underhill_core/src/emuplat/netvsp.rs

take Brian's better comment. Co-authored-by: Brian Perkins <Brian-Perkins@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

Copilot · 2026-04-07T18:33:52Z

openhcl/underhill_core/src/emuplat/netvsp.rs

+                        } else {
+                            tracing::info!(
+                                vtl2_vfid,
+                                vtl0_vfid = vtl0_vfid_from_bus_control(&self.vtl0_bus_control),
+                            tracing::info!(
+                                vtl2_vfid,
+                                %vtl0_bus_control,
+                                "Ignoring VTL0 device request from guest"
+                            );
                        }


The else branch has a malformed tracing::info! invocation (a second tracing::info! starts before the first is closed), which will not compile. Collapse this into a single tracing::info! call that includes all desired fields (vtl2_vfid, vtl0_vfid, %vtl0_bus_control) and the message, and ensure the macro invocation is properly closed.

Copilot · 2026-04-07T18:33:52Z

vm/devices/net/netvsp/src/lib.rs

+            self.coordinator
+                .task_mut()
+                .endpoint
+                .stop()
+                .instrument(tracing::info_span!(
+                    "stopping coordinator endpoint",
+                    instance_id = %self.instance_id,
+                ))
+                .await;


This uses .instrument(...), which requires the tracing::Instrument trait to be in scope for method resolution. If this module doesn’t already use tracing::Instrument;, the build will fail—add that import alongside the other use statements (or use UFCS).

Copilot · 2026-04-07T18:33:52Z

openhcl/underhill_core/src/emuplat/netvsp.rs

+        if let Err(err) = self
+            .send_vf_state_change_notifications()
+            .instrument(tracing::info_span!(
+                "sending VTL0 VF removal notice",
+                vtl2_vfid,
+                vtl0_bus = %bus_control))
+            .await


This uses .instrument(...), which requires the tracing::Instrument trait to be in scope for method resolution. If this module doesn’t already use tracing::Instrument;, compilation will fail—add the import near the top-level use list (or switch to UFCS to avoid relying on the trait being in scope).

Ben Lewis added 4 commits March 30, 2026 17:24

Initial instrumenting.

b3e0b15

Adding instrumentation to the cleanup paths for ManaQueues.

44abf18

Added tracing for netvsp & mana communication between VTLs & host.

bbca85e

Last minor additions for now.

58458f1

ben-zen requested a review from a team as a code owner April 1, 2026 00:20

Copilot AI review requested due to automatic review settings April 1, 2026 00:20

Copilot started reviewing on behalf of ben-zen April 1, 2026 00:21 View session

Copilot AI reviewed Apr 1, 2026

View reviewed changes

erfrimod reviewed Apr 2, 2026

View reviewed changes

vm/devices/net/netvsp/src/lib.rs Show resolved Hide resolved

erfrimod reviewed Apr 2, 2026

View reviewed changes

benhillis added the enhancement New feature or request label Apr 6, 2026

Brian-Perkins reviewed Apr 7, 2026

View reviewed changes

openhcl/underhill_core/src/emuplat/netvsp.rs Outdated Show resolved Hide resolved

Update openhcl/underhill_core/src/emuplat/netvsp.rs

8b2abcf

take Brian's better comment. Co-authored-by: Brian Perkins <Brian-Perkins@users.noreply.github.com>

Copilot AI review requested due to automatic review settings April 7, 2026 18:27

Copilot AI reviewed Apr 7, 2026

View reviewed changes

Copilot started reviewing on behalf of ben-zen April 7, 2026 18:55 View session

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

netvsp & net_mana - instrument VTL0 & host calls at risk of hanging#3170

netvsp & net_mana - instrument VTL0 & host calls at risk of hanging#3170
ben-zen wants to merge 5 commits intomicrosoft:mainfrom
ben-zen:netvsp-span-guest-calls

ben-zen commented Apr 1, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

erfrimod left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 7, 2026

Uh oh!

Copilot AI Apr 7, 2026

Uh oh!

Copilot AI Apr 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

ben-zen commented Apr 1, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

erfrimod left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants