Emanuele Tonelli Posted September 1, 2022 Share Posted September 1, 2022 On our 8.2 LTSR hypervisor, some running VM (all of them are Windows 2012R2) cannot recover the space used for deleted snapshots (our linux VMs never show this problem). They get a message like this one: SMGC: [29036] Iteration: 6 -- Initial size 172425728 --> Final size 212349440 SMGC: [29036] Unexpected bump in size, compared to minimum acheived What does it mean? How can the size increase (from 172425728 to 212349440) instead of decreasing? Is this the reason for the Leaf-coalesce failure? If the VM is shut down when the Garbage Collection starts, then no error is received and the space is reclaimed correctly. Why the difference between a running VM and a stopped VM? SMGC: [29036] No progress, attempt: 1 SMGC: [29036] Aborted coalesce SMGC: [29036] Iteration: 1 -- Initial size 3811787264 --> Final size 1122189824 SMGC: [29036] Iteration: 2 -- Initial size 1122189824 --> Final size 338424320 SMGC: [29036] Iteration: 3 -- Initial size 338424320 --> Final size 309006848 SMGC: [29036] Iteration: 4 -- Initial size 309006848 --> Final size 222855680 SMGC: [29036] Iteration: 5 -- Initial size 222855680 --> Final size 172425728SMGC: [29036] Iteration: 6 -- Initial size 172425728 --> Final size 212349440SMGC: [29036] Unexpected bump in size, compared to minimum acheived SMGC: [29036] Starting size was 3811787264 SMGC: [29036] Final size was 212349440 SMGC: [29036] Minimum size acheived was 172425728 SMGC: [29036] Removed leaf-coalesce from 51054e79[VHD](101.000G/202.512M/101.203G|a) SMGC: [29036] *~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~* SMGC: [29036] *********************** SMGC: [29036] * E X C E P T I O N * SMGC: [29036] *********************** SMGC: [29036] leaf-coalesce: EXCEPTION <class 'util.SMException'>, VDI 51054e79-6cb0-4a9e-8603-2c1f813aff34 could not be coalesced SMGC: [29036] File "/opt/xensource/sm/cleanup.py", line 1644, in coalesceLeaf SMGC: [29036] self._coalesceLeaf(vdi) SMGC: [29036] File "/opt/xensource/sm/cleanup.py", line 1919, in _coalesceLeaf SMGC: [29036] .format(uuid=vdi.uuid)) SMGC: [29036] SMGC: [29036] *~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~* SMGC: [29036] Leaf-coalesce failed on 51054e79[VHD](101.000G/202.512M/101.203G|a), skipping Link to comment
0 Tobias Kreidl Posted September 1, 2022 Share Posted September 1, 2022 How full is the SR on which those VMs have their storage? I take it these VMs have XenTools all installed OK, right? Link to comment
0 Emanuele Tonelli Posted September 2, 2022 Author Share Posted September 2, 2022 We have 4 TB used and 6 TB free.... Anyway the problem is the 9.3.0 tools on Windows 2012 R2. With the 9.2.3 it works perfectly... Link to comment
0 Tobias Kreidl Posted September 5, 2022 Share Posted September 5, 2022 Perhaps try to uninstall and re-install XenTools. Hard to understand why thin provisioning would not let to coalesce unless it's something like 90% full. Worst case, if you have the space, you could storage-migrate the VMs to different SRs and start from scratch with an empty SR. Of course, storage migration will probably not work if the VMs do not have the tools installed properly. -=Tobias Link to comment
0 Mark Syms Posted September 5, 2022 Share Posted September 5, 2022 This means that the VM is continuously writing data to its virtual disk at a rate faster than the garbage collection process can consolidate the data into the parent node in the VHD tree. As you can see the GC has several attempts and then gives up marking the VDI as not leaf coalescable. As the GC process has to read data from one VHD node and write to another it is unfortunately quite common that a VM with a moderately high IO throughput will win the race as that only has to write data to the disk and not perform the secondary read. Link to comment
0 Tobias Kreidl Posted September 5, 2022 Share Posted September 5, 2022 Try to do a powerstate reset on the VM then? -=Tobias Link to comment
0 Mark Syms Posted September 6, 2022 Share Posted September 6, 2022 Unlikely to help doing a powerstate reset. Shutting the VM down for a period and running a scan of the SR while it is stopped may address the issue, or reducing the amount of data being written. Link to comment
0 Tobias Kreidl Posted September 6, 2022 Share Posted September 6, 2022 @Mark, if the VM was hung, is what I meant, but yes, best to shut it down if possible. -=Tobias Link to comment
0 Mark Syms Posted September 6, 2022 Share Posted September 6, 2022 If the VM were hung it would not be producing Write IOs at a rate faster than what the GC process can consolidate them. That it is gives a pretty good indication that the VM is live and active. Link to comment
0 Tobias Kreidl Posted September 6, 2022 Share Posted September 6, 2022 Good point! -=Tobias Link to comment
Question
Emanuele Tonelli
On our 8.2 LTSR hypervisor, some running VM (all of them are Windows 2012R2) cannot recover the space used for deleted snapshots (our linux VMs never show this problem).
They get a message like this one:
SMGC: [29036] Iteration: 6 -- Initial size 172425728 --> Final size 212349440
SMGC: [29036] Unexpected bump in size, compared to minimum acheived
What does it mean?
How can the size increase (from 172425728 to 212349440) instead of decreasing?
Is this the reason for the Leaf-coalesce failure?
If the VM is shut down when the Garbage Collection starts, then no error is received and the space is reclaimed correctly.
Why the difference between a running VM and a stopped VM?
SMGC: [29036] No progress, attempt: 1
SMGC: [29036] Aborted coalesce
SMGC: [29036] Iteration: 1 -- Initial size 3811787264 --> Final size 1122189824
SMGC: [29036] Iteration: 2 -- Initial size 1122189824 --> Final size 338424320
SMGC: [29036] Iteration: 3 -- Initial size 338424320 --> Final size 309006848
SMGC: [29036] Iteration: 4 -- Initial size 309006848 --> Final size 222855680
SMGC: [29036] Iteration: 5 -- Initial size 222855680 --> Final size 172425728
SMGC: [29036] Iteration: 6 -- Initial size 172425728 --> Final size 212349440
SMGC: [29036] Unexpected bump in size, compared to minimum acheived
SMGC: [29036] Starting size was 3811787264
SMGC: [29036] Final size was 212349440
SMGC: [29036] Minimum size acheived was 172425728
SMGC: [29036] Removed leaf-coalesce from 51054e79[VHD](101.000G/202.512M/101.203G|a)
SMGC: [29036] *~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*
SMGC: [29036] ***********************
SMGC: [29036] * E X C E P T I O N *
SMGC: [29036] ***********************
SMGC: [29036] leaf-coalesce: EXCEPTION <class 'util.SMException'>, VDI 51054e79-6cb0-4a9e-8603-2c1f813aff34 could not be coalesced
SMGC: [29036] File "/opt/xensource/sm/cleanup.py", line 1644, in coalesceLeaf
SMGC: [29036] self._coalesceLeaf(vdi)
SMGC: [29036] File "/opt/xensource/sm/cleanup.py", line 1919, in _coalesceLeaf
SMGC: [29036] .format(uuid=vdi.uuid))
SMGC: [29036]
SMGC: [29036] *~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*~*
SMGC: [29036] Leaf-coalesce failed on 51054e79[VHD](101.000G/202.512M/101.203G|a), skipping
Link to comment
9 answers to this question
Recommended Posts
Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
Register a new accountSign in
Already have an account? Sign in here.
Sign In Now