|
Bugzilla – Full Text Bug Listing |
| Summary: | Reiserfs easily deadlocks on large machines under stress | ||
|---|---|---|---|
| Product: | [openSUSE] SUSE LINUX 10.0 | Reporter: | Andreas Kleen <ak> |
| Component: | Kernel | Assignee: | Chris L Mason <mason> |
| Status: | RESOLVED FIXED | QA Contact: | E-mail List <qa-bugs> |
| Severity: | Critical | ||
| Priority: | P5 - None | CC: | aj, jeffm, sitnikov |
| Version: | RC 2 | ||
| Target Milestone: | --- | ||
| Hardware: | x86-64 | ||
| OS: | All | ||
| Whiteboard: | released:kernel:sles8 affected:sles8 applied:sles8 | ||
| Found By: | Other | Services Priority: | |
| Business Priority: | Blocker: | --- | |
| Marketing QA Status: | --- | IT Deployment: | --- |
|
Description
Andreas Kleen
2005-09-12 15:38:49 UTC
Our current theory is this is due to a lock inversion problem between the BKL and the inode semaphore. I'm building a kernel on bellini now to try. If this doesn't work, I'll use Andi's patch to get a stack trace of all cpus, it should lead us quickly to the deadlock. If you wait a bit the current kernel has the patch applied and reaim is running. So eventually it should deadlock and you should get a full backtrace. Andi, is this still an issue? Probably not. I let avitrano run reaim tests for a few days on reiserfs and there weren't any deadlocks (well it eventually hung, but that was probably a different problem) Let's close this one for now. I assume it's already fixed. I should add avitrano had only half as many cores as the 16 core system where I saw the problem. Perhaps it would be a good idea to run some testing on a larger core system again. But I assume that will happen during SLES10 testing anyways. when i use stress (http://weather.ou.edu/%7Eapw/projects/stress/, running as: stress -v -c 8 -i 4 -m 32 -d 16) it crash my server (2x Dual Core Opteron, 16G RAM) after 2-6 hours with reiserfs. with one of result: - 100% disk usage - io precesses died in "Uninterruptible sleep" - fs crash if i use xfs server "stable" about 4 days (stress test stoped manualy) P.S. i used some version 2.6.16 and 2.6.15 (from factory) Andrew - This bug has been marked resolved and the lock inversion problem has been addressed. If you're seeing ReiserFS problems with a recent release kernel, please open a separate bug. setting whiteboard Just for the record: Patch: patches.common/reiserfs-bkl-inversion included, enabled, and released in: SLES-8 SP4 kernel update 2.4.21-325 dated Oct 04, 2007 & released Oct 12, 2007. Setting Whiteboard Status for SLES8 --> released |