Bug 135825

Summary: failure to upgrade from 9.2 to 10.0: the Brouwer story
Product: [openSUSE] SUSE LINUX 10.0 Reporter: Martin Vidner <mvidner>
Component: Update ProblemsAssignee: Martin Vidner <mvidner>
Status: RESOLVED INVALID QA Contact: Klaus Kämpf <kkaempf>
Severity: Normal    
Priority: P5 - None CC: jsrain
Version: Final   
Target Milestone: ---   
Hardware: x86   
OS: Other   
Whiteboard:
Found By: Customer Services Priority:
Business Priority: Blocker: ---
Marketing QA Status: --- IT Deployment: ---

Description Martin Vidner 2005-11-29 12:48:04 UTC
Mvidner: This has been reported by Andries Brouwer. I will Cc him when he overcomes our registration form.
Let's identify the individual problems here and make smaller bugs out of it.


Installation of SuSE does not become simpler. A time consuming business.

Starting point: Intel PC, 400 MHz, 256 MB, several disks.
On hdb2 a SuSE 9.2 installation
Plan: upgrade to SuSE 10.0.

[Sneak preview: complete failure]

I boot from CD1. Get message "Cannot boot from this CD. Please use CD2".
I boot from CD2. Insert CD1. Ask for "Installation".

[First remark here: There have been times that "Installation"
would overwrite the disk. I choose such an option with fear.
Much more reassuring would be a choice "Installation or Upgrade".]

Rebuilding database takes a long time. I made coffee and drank coffee
and read the newspaper, and it was still rebuilding the database.
(About 1600 packages, I think.) The amount of information is fairly
small. Clearly inferior technology is used here - perhaps many
linear searches, or quadratic algorithms. Rebuilding a list
of packages should not take longer than compiling a kernel.

The upgrade was from 9.2 to 10.0 and it concluded:
"Packages: cannot solve all conflicts - manual intervention required".
Hmm. Pity. Must I really have an opinion about lots of packages and
libraries with names I have never seen before?
It wanted to install KDE but I do not use KDE, and moreover was
afraid that space might be tight, so chose for "minimum graphical,
without Gnome or KDE".
A warning: Do you really want to reset your detailed selection?
Well, I had not selected anything yet, but perhaps the detailed
selection was my installed set of packages. OK.
Now a manual resolution of conflicts - I mostly asked for not installing
anything that conflicted.

Selected "no backups" - mostly because space was tight, and I had
already made a backup of /etc myself.

Now the installation started.

I have two CD readers, one 4x and one 50x. The BIOS can boot only
from the 4x. Earlier installs could use the 50x as installation medium,
but this time that didnt work, it wanted to read from the same device
it had booted from. Pity.
The result was that the estimated time was more than 10 hours,
and the estimated time for the first CD more than 2 hours.
I watched the install for half an hour, but after that the expected
time for the first CD was still more than two hours.
Went to bed. Next morning the screen contains the message
"Cannot boot from this CD. Please use CD2."

Hmm. Probably CD1 finished, and the install procedure autorebooted,
and forgot that reboot from CD1 does not work, since it also failed
last time.

OK. I insert CD2, then CD1, and the same things happen.
In fact the very same things. I had expected that the installation
program would see quickly that this was the second phase of an
installation. But it didnt.

Compared with the night before I noticed only two differences:
(i) the partition was now called a "SuSE 10.0" partition, and
(ii) the install procedure did not want to delete lots of packages.
The list of conflicts was short but nonempty (ial conflicts with fnfx).
Now precisely the same stuff as before is installed. Hmm.

Maybe it was a bug that it rebooted from CD?

After a few hours: again the message on the screen "Cannot boot from
this CD. Please use CD2". Doing that for the third time does not look
like a good idea. I remove the CD and boot from disk.
Get a Grub prompt. Ach.

[I have complained about this before.
It is *EXTREMELY STUPID* (yes, shouting) to change the boot setup
during an upgrade. The system was working, so the boot setup was fine.
Anything changed there can only make matters worse.
Indeed, will make matters worse, I have seen many examples.
Upgrade packages, but by default leave the MBR alone.]

Hmm. Grub. I am a LILO user myself. What does one say to grub?
The word Grub does not occur in the index of the Novell/SUSE
Linux 10.0 Start-Up book that came with these CDs.

grub> ?
Error 27: Unrecognized command.
grub> help
blocklist FILE
...
vbeprobe [MODE]
grub>

Hmm. I want to boot some kernel, so have to specify a device
and a file on that device. Maybe a partition is specified using
the root command. Try "root hdb2", "root hd1(1)", "root hd1,1",
"root hd1:1", "root (hd1,1)". Yes! Fifth attempt, not bad.

[It would really help if Grub answered something more helpful
instead of "Error 11: Unrecognized device string". It might add
"Use, e.g., (hd3,0)." It would be even better if Grub behaved
like a Linux program and used Linux names like hdb6.]

Now that we found the right syntax, the error message changes into
"Error 18: Selected cylinder exceeds maximum supported by BIOS".
Hmm. Did I select a cylinder? Is the start of hdb2 too far away?

grub> geometry (hd1)
drive 0x81: C/H/S=1024/16/63, 1032192 sectors
partition 0: type 82
partition 1: Error 18: Selected cylinder exceeds ...

It looks like it. Maybe grub cannot boot this system today,
or at least I do not know how to proceed. Find a rescue CD.

# mkdir /a
# mount /dev/hdb2 /a
# /a/sbin/lilo -r /a
# umount /a

Reboot into a vanilla 2.6.14. System boots fine,
lots of well-known and less well-known messages.
/etc/init.d/boot: line 283: /dev/tty: No such device or address
More boot stuff. When booting is almost done the screen
goes blank. CapsLock still works, but no reaction to keystrokes,
not even to Alt/F1 or Ctrl/Alt/F1 or Ctrl/Alt/Del.
No echo to a ping from outside. Waited a few minutes.

Rebooted the rescue CD.

# mv /etc /etc-suse
# mv /etc-backup /etc

Rebooted again into 2.6.14. The boot completes and I can send
this mail. There were some error messages. E.g.,
Suseconfig.fonts complains:
use of uninitialized value in pattern match (m//) at /usr/sbin/fonts-config

A somewhat working system, semiupgraded to SUSE 10.0 - no stuff
from CD2-5 was installed.

Since I prefer a well-defined setup and not this mess, I'll probably
do another install of some other distribution this evening.

So far for today,
Andries
Comment 2 Martin Vidner 2007-08-31 08:21:56 UTC
Somehow I thought that I would have the energy to slice this bug up, but obviously I didn't. If anyone wants to pick this up, please open individual bugs for the remaining problems.