Bug 1034646 - stuck jobs: show more before killing
Summary: stuck jobs: show more before killing
Status: NEW
Alias: None
Product: openSUSE.org
Classification: openSUSE
Component: BuildService
Version: unspecified
Hardware: Other Other
Importance: P5 - None : Enhancement
Target Milestone: ---
Assignee: Adrian Schröter
QA Contact: Adrian Schröter
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-04-18 12:54 UTC by Stanislav Brabec
Modified: 2017-04-21 09:31 UTC
0 users

See Also:
Found By: ---
Services Priority:
Business Priority:
Blocker: ---
Marketing QA Status: ---
IT Deployment: ---


Description Stanislav Brabec 2017-04-18 12:54:22 UTC
When a job gets stuck, OBS kills the virtual machine after a defined period of inactivity.

It would be nice to show some diagnostic information before killing the machine. For example, the output of ps axww and/or a backtrace of the stalled command would be useful for debugging. (Alternatively, add support for a user-definable command.)
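
To illustrate the request, here is a minimal sketch of a helper that runs a list of diagnostic commands and collects their output before the machine is killed. It is not OBS code: the function name collect_diagnostics, its parameters, and the report layout are all hypothetical, and where such a hook would run inside the build VM is left open.

```python
import subprocess


def collect_diagnostics(commands, timeout=30):
    """Run each diagnostic command and return one combined text report.

    `commands` is a list of argv lists, e.g. [["ps", "axww"]].
    This is a hypothetical helper, not part of OBS.
    """
    sections = []
    for argv in commands:
        header = "### " + " ".join(argv)
        try:
            result = subprocess.run(
                argv,
                stdout=subprocess.PIPE,
                stderr=subprocess.STDOUT,  # keep error output in the report
                universal_newlines=True,
                timeout=timeout,  # a diagnostic must not hang as well
            )
            body = result.stdout
        except (OSError, subprocess.TimeoutExpired) as exc:
            # A missing or hanging tool should not block the kill.
            body = "(could not run: {})\n".format(exc)
        sections.append(header + "\n" + body)
    return "\n".join(sections)
```

Calling collect_diagnostics([["ps", "axww"]]) would cover the first suggestion; a user-definable command could simply be appended to the list.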

Use case: the util-linux test suite sometimes unexpectedly fails to finish. These cases are very interesting for debugging.

[  188s]        logger: errors                                        ... OK (all 11 sub-tests PASSED)
[28990s] qemu-system-s390x: terminating on signal 15 from pid 35473


Job seems to be stuck here, killed. (after 28800 seconds of inactivity)
[28990s] ### VM INTERACTION END ###
[28990s] No buildstatus set, either the base system is broken (kernel/initrd/udev/glibc/bash/perl)
[28990s] or the build host has a kernel or hardware problem...
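
In the excerpt above, the last test output is at 188s and the kill at 28990s, a gap of 28802 seconds, which matches the 28800-second (8-hour) inactivity limit. As a rough sketch, assuming every relevant log line starts with a "[ NNNs]" timestamp as in this excerpt, the stall point can be located like this (largest_gap is a hypothetical helper name):

```python
import re

# Matches the leading "[  188s]" style timestamp of a build log line.
TIMESTAMP = re.compile(r"^\[\s*(\d+)s\]")


def largest_gap(log_lines):
    """Return (gap_seconds, last_line_before_gap) for the longest silence."""
    best_gap, best_line = 0, None
    prev_time, prev_line = None, None
    for line in log_lines:
        match = TIMESTAMP.match(line)
        if not match:
            continue  # lines without a timestamp carry no timing information
        now = int(match.group(1))
        if prev_time is not None and now - prev_time > best_gap:
            best_gap, best_line = now - prev_time, prev_line
        prev_time, prev_line = now, line
    return best_gap, best_line
```

For the log in this report, the line returned is the "logger: errors" test result, so the stuck command is whatever the test suite ran next.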