Bugzilla – Bug 1206814
[Build 20230104-1] openQA test fails in snapper_cleanup, 'snapper cleanup number' gets error 'Failure (error.something)'
Last modified: 2024-05-07 08:48:26 UTC
## Observation

openQA test in scenario opensuse-15.4-DVD-Updates-x86_64-extra_tests_filesystem@64bit fails in [snapper_cleanup](https://openqa.opensuse.org/tests/3010736/modules/snapper_cleanup/steps/65)

## Test suite description

Maintainer: QE Core

Filesystem related tests, for example snapper and btrfs features.

## Reproducible

Fails since (at least) Build [20230102-3](https://openqa.opensuse.org/tests/3006470)

## Expected result

Last good: [20230102-2](https://openqa.opensuse.org/tests/3005830) (or more recent)

## Further details

Always latest result in this scenario: [latest](https://openqa.opensuse.org/tests/latest?arch=x86_64&distri=opensuse&flavor=DVD-Updates&machine=64bit&test=extra_tests_filesystem&version=15.4)
I tried to collect some logs manually. Please see the attached files.
Created attachment 863819: snapper.log
Created attachment 863820: strace snapperd
Created attachment 863821: strace snapper cleanup number
susetest:/tmp # rpm -qf /usr/bin/snapper
snapper-0.8.16-1.1.x86_64
susetest:/tmp # uname -r
5.14.21-150400.24.38-default
susetest:/tmp # cat /etc/*release
NAME="openSUSE Leap"
VERSION="15.4"
ID="opensuse-leap"
ID_LIKE="suse opensuse"
VERSION_ID="15.4"
PRETTY_NAME="openSUSE Leap 15.4"
ANSI_COLOR="0;32"
CPE_NAME="cpe:/o:opensuse:leap:15.4"
BUG_REPORT_URL="https://bugs.opensuse.org"
HOME_URL="https://www.opensuse.org/"
DOCUMENTATION_URL="https://en.opensuse.org/Portal:Leap"
LOGO="distributor-logo-Leap"
Looking at openQA it seems as if the test sometimes succeeds and sometimes fails. Also, there hasn't been a change to snapper for more than a year (since GA) in Leap 15.4. So likely some other component is causing trouble.
Anyway, I could not reproduce the problem. The newer snapper version at https://build.opensuse.org/project/show/filesystems:snapper has improved error logging. Running the test with that version might help find the problem. Access to a machine with the problem would also be good.
(In reply to Arvin Schnell from comment #7)
> Anyway, I could not reproduce the problem. The newer snapper version at
> https://build.opensuse.org/project/show/filesystems:snapper has improved
> error logging. Running the test with that version might help find the
> problem. Access to a machine with the problem would also be good.

A VM is ready for you. I will send the access info via mail.
(In reply to Richard Fan from comment #8)
> (In reply to Arvin Schnell from comment #7)
> > Anyway, I could not reproduce the problem. The newer snapper version at
> > https://build.opensuse.org/project/show/filesystems:snapper has improved
> > error logging. Running the test with that version might help find the
> > problem. Access to a machine with the problem would also be good.
>
> A vm machine is ready for you, I will send the access info via mail

Bad news: I can't reproduce the issue after the VM reset. I will try to find a new VM.
QA has now provided a machine and I can reproduce the problem there.
A btrfs quota rescan has been stuck on the machine for more than half an hour. Snapper does not handle this situation well. I will improve that, but the change will cause snapper to block until the btrfs rescan is done.
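For anyone who wants to check for this condition before running `snapper cleanup number`, here is a minimal Python sketch that polls `btrfs quota rescan -s <path>` with a timeout. It is not what snapper does internally. The function names are hypothetical, and the check relies on the assumption that btrfs-progs prints "rescan operation running" while a rescan is active, which may differ between versions.

```python
import subprocess
import time
from typing import Callable


def rescan_in_progress(status_output: str) -> bool:
    """Return True if the status text reports an active quota rescan.

    Assumption: btrfs-progs prints "rescan operation running ..." while a
    rescan is active; verify the wording against your btrfs-progs version.
    """
    return "rescan operation running" in status_output


def query_rescan_status(path: str) -> str:
    """Run `btrfs quota rescan -s <path>` and return its combined output."""
    result = subprocess.run(
        ["btrfs", "quota", "rescan", "-s", path],
        capture_output=True,
        text=True,
        check=False,
    )
    return result.stdout + result.stderr


def wait_for_rescan(
    path: str = "/",
    timeout: float = 1800.0,
    interval: float = 5.0,
    query: Callable[[str], str] = query_rescan_status,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll until no rescan is reported. Return False if the timeout expires."""
    deadline = time.monotonic() + timeout
    while True:
        if not rescan_in_progress(query(path)):
            return True
        if time.monotonic() >= deadline:
            return False
        sleep(interval)


# Example (needs root and a btrfs filesystem with quotas enabled):
#   if not wait_for_rescan("/", timeout=600):
#       print("quota rescan still running, snapper cleanup may fail or block")
```

The `query` and `sleep` parameters exist only so the polling loop can be exercised without a real btrfs filesystem.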
This fails reproducibly in openQA still. cat /proc/(pid of btrfs quota rescan kworker)/task/*/stack shows that it's idle/stuck in rescuer_thread.
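The stack check above can also be scripted, for example to attach the output to an openQA failure. The following is a minimal Python sketch; the function names and the `proc_root` parameter are hypothetical, and the PID of the rescan kworker still has to be found separately. Reading `/proc/<pid>/task/*/stack` normally requires root.

```python
from pathlib import Path
from typing import Dict, List


def read_task_stacks(pid: int, proc_root: str = "/proc") -> Dict[int, str]:
    """Read the kernel stack of every thread of `pid`.

    Equivalent in spirit to `cat /proc/<pid>/task/*/stack`. The result maps
    thread ID to the stack text; unreadable entries map to an empty string.
    """
    stacks: Dict[int, str] = {}
    task_dir = Path(proc_root, str(pid), "task")
    for stack_file in task_dir.glob("*/stack"):
        tid = int(stack_file.parent.name)
        try:
            stacks[tid] = stack_file.read_text()
        except OSError:
            # The thread may have exited, or we lack permission to read it.
            stacks[tid] = ""
    return stacks


def tasks_in_function(stacks: Dict[int, str], function_name: str) -> List[int]:
    """Return the sorted thread IDs whose stack mentions `function_name`."""
    return sorted(tid for tid, text in stacks.items() if function_name in text)


# Example (as root, with the kworker PID filled in by hand):
#   stacks = read_task_stacks(1234)
#   print(tasks_in_function(stacks, "rescuer_thread"))
```

A thread listed for `rescuer_thread` matches the idle/stuck state described in this comment.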
(In reply to Fabian Vogt from comment #12)
> This fails reproducibly in openQA still.

Edit: On 15.5 now: https://openqa.opensuse.org/tests/3438103

Both 15.4 and 15.5 are affected, so I'll set the version to the more recent one.