October 2nd, 2006 • 1:54 pm
Regular Betalogue readers now that, for the past few months, I have been struggling with my G5 Quad computer, which has been experiencing freezes or kernel panics on a regular basis. I have made all kinds of attempts to identify the cause of the freezes, and have been unsuccessful so far.
Last time I wrote about this problem, I mentioned that I had been able to log in remotely from another machine even after the G5 Quad had become completely frozen. But it’s quite possible that this was just a one-off. Certainly when the computer does not freeze but has an actual kernel panic, it’s hard to image that there is any possibility that one could still log in remotely.
In any case, last week I also contacted the “local” authorized repair shop about the problem, after AppleCare told me to do so. Since getting there is a three-hour drive (to the provincial capital Halifax) for me and they are aware of this, they offered to order replacement parts for all the parts that might be involved (logic board, power supply, CPU, etc.) and call me once they had all the parts. (They can return unused parts to Apple afterwards.) They had all the parts on Thursday, and I went there on Friday.
After describing all the symptoms and all the troubleshooting steps I had taken over the past few months, I left the machine with them for a few hours. When I came back, the technician told me that, of course, they had not found anything wrong with the machine in their testing (I didn’t expect them to), but they did replace the logic board with the replacement one that they got.
They tried to also replace the CPU unit, but for some reason the replacement one that Apple had sent them didn’t work at all. In fact, the technician told me that it looked different, and he simply couldn’t get it to work. So they were unable to replace the CPU and he had to put the old CPU unit back in. It took another hour and it was about closing time, so he told me that there was one last step that needed to be done, which was thermal calibration of the CPU. It would take a little while and he didn’t have time to do it, so he said I would have to do it myself when I got home.
He put a CD-R of the software needed in the machine, and told me I just needed to boot from the CD and run the thermal calibration tool, which would take up to 45 minutes. I got home on Saturday morning and followed his instructions. The thermal calibration tool itself said that it would take between 10 and 30 minutes, and in actual fact it only took something like 20 minutes. At least I think it did. The tool doesn’t exactly have a user-friendly interface and does not tell you when it’s finished. It just reported in its log that both “CPU1” and “CPU2” had been calibrated successfully, so I quit when that was done, after waiting for a little longer and seeing nothing else happen. (I am assuming here that there are only two “CPU” units listed because the Quad actually contains two dual-core G5 processors, and not four separate CPU units.)
After that, I booted the machine normally. A few basic settings had been lost, no doubt due to the logic board swap, but other than that everything seemed to be normal… except that the machine froze again within a couple of hours. And again later in the afternoon I got a kernel panic—the old “ugly” kind of kernel panic with lines of white text over a black background on top of the frozen Mac OS X screen.
So obviously the logic board swap did nothing to fix the problem. The machine froze (or panicked—I can’t tell the difference when it happens with the monitors asleep) again once during the Saturday/Sunday night, but has not frozen since—which of course doesn’t mean anything. All it means is that the problem is still there, and we still have no idea what causes it.
The only other “new” thing I have noticed is that, since doing a clean install of Mac OS X a couple of weeks ago, after the freeze/panic and the hard reset, I now get the “Mac OS X has unexpectedly quit” dialog during the restart process, which I never used to get before. The dialog provides a log of the kernel panic. I have kept those logs, even though the technicians in Halifax told me that it wouldn’t help them in any way.
Typically those logs start with some error message regarding “cpu 1” and then provide a configuration list with the kernel version, machine model, memory modules installed, etc.
The error messages in the beginning of the logs are not all the same. One says:
panic(cpu 1 caller 0x0003EE58): wait queue deadlock - wq=0x50f9aa4, cpu=1
followed by the “
latest stack backtrace for cpu 1.”
Another one says:
panic(cpu 1 caller 0x000A5048): simple lock (0x0038BF50) deadlock detection, pc=0x0003C5EC
again followed by the “
latest stack backtrace for cpu 1.”
Yet another one says:
Unresolved kernel trap(cpu 1): 0x400 - Inst access DAR=0x00000000E0292000 PC=0x0000000000000000
this time followed by the “
latest crash info for cpu 1.”
I don’t know if this means anything. All the logs mention “
cpu 1” and not “
cpu 2,” but this could just be a coincidence.
It’s all very frustrating. I am going to have to see if I can get the repair shop to try and order a replacement CPU unit again and make sure that they get a working one this time, and then go back there to get it installed and see if that makes any difference.
In the mean time, I am going to try a couple more things, such as borrowing RAM modules from a colleague who has the same G5 Quad and has no problems with hers. (The only situation where I haven’t yet been able to get a freeze is when running the G5 with its original 512 MB of RAM only. But it could only be because I didn’t wait long enough. Running with only 512 MB of RAM is quite frustrating and I must admit that I haven’t been able to do it for more than 3-4 days. I do have work to do with this machine, after all.)
The only good piece of news that I got last week was that my employer is going to buy me a new Mac Pro. (I bought the G5 Quad with my own hard-earned money as a “personal” machine.) The last one that they bought me was the G4 MDD four years ago, so I was due for a new one.
Once I get the Mac Pro (the shipping estimate is the middle of October right now), I will be able to bring the G5 Quad to the repair shop in Halifax and leave it there with them and get them to fix it once and for all. Hopefully if they keep it for a while they will be able to reproduce the freezes/kernel panics themselves.
But the whole experience is still very frustrating, especially for a machine that was supposed to be top-of-the-line a year ago, and didn’t have any problems for the first eight months (except for a slightly defective video card). I have a three-year AppleCare warranty for the G5 Quad, so there is no concern of being stuck with a freezing machine out of warranty, but obviously I want to get it fixed and in working order, whether I decide to keep it for myself or resell it.