Fixing my b0rken WordPress installation

A little over a week ago, WordPress started acting weird. First, it just got dog slow. Then my site stats page started freezing until I scrolled down and then back up again. Then I started seeing a WordPress.com logon screen on my site stats page. I had to look that account up. Thank goodness for Gmail. Then my Akismet spam filter quit working. Then my stats page stopped working entirely.

I lived with it for a couple of days. I figured maybe WordPress and Akismet had changed something. Or maybe my Linux distribution had. And maybe some update messed things up, and some other update would come along and fix it. No such luck. Read more

Completely bizarre computer problems? Check your system date

PC Magazine’s editor in chief wrote a long column late last week talking about his weird computer problems and a Quixotic quest to fix them. Among other things, his antivirus wasn’t working, and Windows wanted to be activated and wouldn’t let him. He thought he had a virus, but all his scans came up clean.

It turned out his computer thought it was 2013. The date and time were right, but the computer was trying to live three years in the future. Read more

Operating System Not Found, Missing Operating System, and friends

So the PC that stored my resume got kicked (as in the foot of a passer-by hitting it) and died, and the backup that I thought I had… Well, it wasn’t where I thought it was.

Time for some amateur home data recovery. Here’s how I brought it back.This machine ran Windows 2000. The first trick to try on any machine running any flavor of Windows is to boot from a DOS boot disk containing FDISK.EXE and issue the command FDISK /MBR. This replaces the master boot record. A corrupt MBR is the most common malady that causes these dreaded error messages, and this is the easiest fix for it.

That didn’t work for me.

The second trick is to use MBRWork. Have it back up the first sector, then have it delete the boot record. Then it gives you an option to recover partitions. Run that, then run the option that installs the standard MBR code. I can’t tell you how many times this tool has made me look like I can walk on water.

No dice this time either.

Next I tried grabbing the Windows 2000 CD and doing a recovery install. This has brought systems back to life for me too. Not this time. As happens all too often, it couldn’t find the Windows 2000, so it couldn’t repair it.

The drive seemed to work, yet it couldn’t boot or anything. I could have and probably should have put it in another PC to make sure it was readable. But I didn’t have a suitable donor handy. Had there been such a system, I would have put the drive in, checked to see if it was readable, and probably would have run CHKDSK against it.

Lacking a suitable donor, instead I located an unused hard drive and put it in the system. I booted off the drive just to make sure it wasn’t a hardware problem. It wasn’t–an old copy of Windows 98 booted and dutifully spent 20 minutes installing device drivers for the new motherboard hardware. So I powered down, installed both drives, and broke out a copy of Ghost.

Ghost, as I have said before, doesn’t exactly copy data–what it does is better described as reinterpreting the data. This allows you to use Ghost to lay down an image on dissimilar hard drives. It also makes Ghost a fabulous data recovery tool. Ghost complained that the NTFS log needed to be flushed. Well, that requires booting into Windows (and I think that’s all that’s necessary), but I couldn’t do that. It offered to try the copy anyway, so I chose that. So it cranked for about 15 minutes. I exited Ghost, powered down, and disconnected the bad drive. I powered back up, and it booted. Fabulous.

Now I can use Ghost to copy the now-good drive back over to the drive that was bad in the first place. I’ll do that, but sending out the resume takes much higher priority.

Fixing a computer that shows the wrong partition size after resizing

So, I’ve got these Windows 2000 boxes that didn’t have enough space, so I resized some partitions. No error messages, no problems. I reboot, and the drives still show their old size, even though in Disk Administrator they show the right size.

What gives? Microsoft acknowledges this issue in Windows XP, but hasn’t released a fix yet. But these aren’t XP, they’re 2000.

I’ve got a crazy solution.

If you have a copy of Ghost by Symantec, take a Ghost image of the partition that’s sized wrong. Then, immediately after creating the image, write the image back to the partition you just Ghosted.

Makes no sense, right? Well, but Ghost doesn’t do a bit-by-bit copy. It makes sure it gets good copies of your files, but it saves an interpretation of the partition, rather than the partition itself. So when it writes it back, minor errors that were there before get wiped out.

Now, why there can’t be a disk utility that does this same thing to a partition without the imaging runaround, I don’t know.

I just know I’ve brought a lot of computers with weird disk problems back to life over the years by making Ghost images of them and then writing the image back. This one today is just the latest in that long line.

Advice on troubleshooting and buying printers

I gave some out-of-character advice this week when someone came calling looking for help troubleshooting an inkjet printer.

Essentially, I told him that unless the problem turned out to be a problem with his cabling (it was–his USB hub had gone bad), he’d be best off just buying a new printer.

Read more

Windows 98 CD-ROM drive not working? Try this

Windows 98 CD-ROM drive not working? Try this

Occasionally, a PC’s CD or DVD-ROM drive will stop responding for no known good reason. Sometimes the problem is hardware–a CD-ROM drive, being a mechanical component, can fail–but as often as not, it seems, the problem is software rather than hardware. Here’s what to do with a Windows 95 or Windows 98 CD-ROM drive not working when the same drive works just fine in another OS.

If Windows has both 16- and 32-bit CD-ROM drivers, it can get confused and disable the drive to protect itself. The solution is to remove the 16-bit driver, then delete the obscure NoIDE registry key to re-enable the 32-bit driver.

Read more

Spontaneous system reboots

Steve DeLassus asked me the other day what I would do to fix a PC that was rebooting itself periodically. It’s not him who’s having the problem, he says, it’s someone he knows. He must be trying to show up someone at work or on the Web or something.
So I gave him a few things I’d check, in order of likelihood.

Static electricity. A big static shock can send a system down faster than anything else I’ve seen. Keep a humidifier in the computer room to reduce static electricity. If you’re really paranoid, put a metal strip on your desk and connect it to ground (on your electrical outlet, not on your PC) and touch it before touching your PC. Some people metalize and ground part of their mouse pad. That’s a bit extreme but it works.

Power supply. This is the big one. A failing power supply can take out other components. And even if you have an expensive, big-brand box like a PCP&C or Enermax, they can fail. So I always keep a spare ATX power supply around for testing. It doesn’t have to be an expensive one–you just want something that can run the machine for a day or two to see if the problem goes away.
Overheating. Check all your fans to make sure they’re working. An overheated system can produce all sorts of weird behavior, including reboots. The computer we produced our school newspaper on back in 1996 tended to overheat and reboot about 8 hours into our marathon QuarkXPress sessions.

Memory. It’s extremely rare, but even Crucial produces the occasional defective module. And while bad memory is more likely to produce blue screens than reboots, it’s a possibility worth checking into. Download Memtest86 to exercise your memory.

CPU. If you’re overclocking and experiencing spontaneous reboots, cut it out and see what happens. Unfortunately, by the time these reboots become common, it may be too late. That turned out to be the case with that QuarkXPress-running PC I mentioned earlier. Had we replaced the fans with more powerful units right away, we might have been fine, but we ended up having to replace the CPU. (We weren’t overclocking, but this was an early Cyrix 6×86 CPU, a chip that was notorious for running hot.) Less likely today, but still possible.

Hard drive. I’m really reaching here. If you’re using a lot of virtual memory and you have bad sectors on your hard drive and the swapfile is using one or more of those bad sectors, a lot of unpredictable things can happen. A spontaneous reboot is probably the least of those. But theoretically it could happen.

Operating system. This is truly the last resort. People frequently try to run an OS that’s either too new or too old to be ideal on a PC of a particular vintage. If the system is failing but all the hardware seems to be OK, try loading the OS that was contemporary when the system was new. That means if it’s a Pentium-133, try Win95 on it. If it’s a P4, try Windows 2000 or Windows XP on it. When you try to run a five-year-old OS on a new system, or vice-versa, you can run into problems with poorly tested device drivers or a system strapped for resources.

Another good OS-related troubleshooting trick for failing hardware is to try to load Linux. Linux will often cause suspect hardware to fail, even if the hardware can run Windows successfully, because Linux pushes the hardware more than Microsoft systems do. So if the system fails to load Linux, start swapping components and try again. Once the system is capable of loading Linux successfully, it’s likely to work right in Windows too.

Troubleshooting advice: When you suspect a bad component, particularly a power supply, always swap in a known-good component, rather than trying out the suspect component in another system to see if the problem follows it. The risks of damaging the system are too great, particularly when you try a bad power supply in another system.

And, as always, you minimize the risks of these problems by buying high-quality components, but you never completely eliminate the risk. Even the best occasionally make a defective part.

Web browser troubleshooting

Over the last couple of workdays, in between viruses, I’ve had to diagnose some flaky Web browsers. The browser might work fine for me, but a user will swear it’s crashing. You can run the repair tool on Internet Explorer and/or take them up to Internet Explorer 5.5SP2, then you can hit a few sites and see what happens, but even a Microsoft product can usually survive 15 minutes of regular work (unless we’re talking about Word 2000 with one of my book chapters loaded, in which case you’re lucky to get five minutes’ stability, but I digress).
A Web search on “web browser stress test” turned up nothing useful. I found lots of tools for stress-testing Web servers, but I wanted to stress-test the other side. Finally I just went to BrowserTune and ran it. When IE passed all the tests, I declared the installation stable.

Then yesterday I grew weary of only having the Dillo and Konqueror browsers on my main Linux box. Dillo is lightning fast but it still won’t render sites like cnn.com and espn.com properly. It’s readable, but sometimes annoying. It’s the best occasional-use browser I’ll ever see, but I’m not comfortable using it all the time. Konqueror’s a decent browser, and has a lot of promise, but it doesn’t always render things properly either. And sometimes it’s annoyingly slow. I needed a well-known browser that most Web designers keep in mind when coding their pages, but of course I wanted to compile it myself… That leaves out Internet Exploder, of course. And I can’t compile Netscape myself… Or can I? I went and got Mozilla, and the current build is almost indistinguishable in appearance from Netscape Communicator 6.x. I found it renders well, and quickly, loads about as fast as Konqueror. And it includes an invaluable resource: Under the Debug menu, there’s an option called “choffman’s browser buster.” It’s just a link to a Mozilla site that repeatedly loads Web pages for as long as the browser stays running (so, until it crashes or you close a window). Great stuff. If part of your job is building Ghost images for new PCs, you’d do well to run one or more of these tests overnight (why not run several, in separate browser windows?) to make sure your browser installation took. If you ever have to troubleshoot an end-user’s Web browser, you’ll thank yourself for bookmarking that site.

Troubleshooting a flaky PC

After church on Wednesday, Pastor pointed at me. “I need to talk to you,” he mouthed. That’s usually not a good sign, but I hadn’t done anything too stupid lately, so I figured he probably didn’t want to talk about me. I was right. His computer was flaking out.
Having, at that point, nothing to go on, I suggested he run SFC.EXE, which scans system files for corruption. It flagged a few files, including user.exe. This isn’t always indicative of a problem, but it can be. I had him go ahead and replace the files it flagged. He called me and we walked through the process, then the computer just crashed. Bluescreen, in the kernel. Constant spontaneous crashes were the norm now. And the error messages were never the same, he said.

He asked what causes these things. I told him if I knew precisely, I’d be able to write a book that’d sell a whole lot more copies than my last one. Microsoft doesn’t even know why Windows does these things sometimes.

At one point he got an error message that said, in essence, to reinstall Windows. I told him I could walk him through it, but at that point I’d be more comfortable seeing it myself.

So that was what I did yesterday. The computer booted up fine, and I couldn’t find anything wrong. So we tried going online. That flaked out–the modem couldn’t connect and just gave false busy signals. I brought up HyperTerminal, tried dialing my own phone number with it, and the system crashed. Bluescreen. Kernel. OK, so I reinstalled Windows. It crashed during the installation.

Several things can cause that. I jumped straight to memory. Opening the case, I saw several possibilities. No-name Eastern Rim power supply. A microATX motherboard with the notorious SiS 530 chipset. I told him I’d run a diagnostic on it. Just to be on the safe side, I pulled the memory. It wasn’t commodity memory–it was an Apacer module. Apacer isn’t the best but it’s far from the worst. I cleaned the module’s contacts with a dollar bill, then put it back in and turned the system on to see if Windows could pick up where it left off. Note that I didn’t replace the case cover. An experienced tech never does that. Why not? I noticed something the minute I turned on the power. The CPU fan wasn’t turning. I found the problem. The dead CPU fan, incidentally, was a Vantec. No, not Antec–Vantec. It was a ball-bearing fan, and Vantec is fairly reputable, so this fan just died before its time. It happens sometimes.

So it appeared the crashes were due to overheating.

The local CompUSA didn’t have a single Pentium/K6/Mendocino Celeron CPU fan in stock. Bummer, because the marked price on an Antec was $13. So I went to Best Bait-n-Switch, where I found an Antec fan, made in Taiwan (good), for $15. This one had a drive connector, and I’d have preferred one with a motherboard connector, and I’d have preferred a slightly bigger heatsink, but it was all they had, it was bigger than the part it was replacing, and I wanted something right then and there. I bought it, brought it back, swapped it in, and the computer booted up fine, and finished Windows installation without a problem. I even ran Defrag afterward, since that’s pretty taxing on the whole system. I let it go for half an hour without a single hiccup. Problem solved, apparently.

It’s nice when you can tell someone it wasn’t anything they did, that it was a hardware problem. And a CPU fan is a pretty cheap item.

Troubleshooting intermittent PC problems

How to troubleshoot an intermittent PC problem. We’ve got an aging P2-233 at work that likes to bluescreen a lot under NT4–usually once every day or two. No one who looked at it was able to track it down. The first thing I noticed was that it still had the factory installation of NT, from about three years ago. Factory installations are bad news. The first thing you should do with any PC is install a fresh copy of Windows. If all you have are CAB files and no CD, don’t format the drive–just boot to DOS, go into that directory, run Setup, and install to a new directory other than C:Windows. With NT, it’s also possible to install from DOS though the syntax escapes me momentarily.

The first thing I suggested was to run RAM Stress Test, from www.ultra-x.com , over the course of a weekend to eliminate the possibility of bad memory. I followed that by formatting the drive FAT and running SpinRite. After six hours, SpinRite gave the disk a completely clean bill of health.

Knowing the memory and disk were good, I built up the system, installing NT, then installing SP5 128-bit, then installing IE 5.01SP1, then installing Diskeeper Lite, then installing Office 97 and Outlook 98 and WRQ Reflection, then running Windows Update to get all the critical updates and SP6a. I ran Diskeeper after each installation to keep the drive in pristine condition–I find I get better results that way than by installing everything and then running Diskeeper.

The system seemed pretty stable through all that. Then I went to configure networking and got a bluescreen. Cute. I rebooted and all was well and remained well for an hour or two.

How to see if the bluescreen was a fluke?

I devised the following batch file:

:loop
dir /w /s c:
goto loop

Who says command lines are useless and archaic? Definitely not me! I saved the file as stress.bat and ran 10 instances of it. Then I hit Ctrl-Alt-Del to bring up Task Manager. CPU usage was at 100%. Good.

The system bluescreened after a couple of hours.

How to track down the problem? Well, I knew the CD-ROM drive was bad. Can a bad CD-ROM cause massive system crashes? I’ve never heard of that, but I won’t write off anything. So I disconnected the CD-ROM drive. I’d already removed all unnecessary software from the equation, and I hadn’t installed any extraneous peripherals either. So with the CD-ROM drive eliminated, I ran 10 instances of the batch file again.

The system didn’t make it through the night.

OK. Memory’s good. Hard drive’s good. Bad CD-ROM drive out of equation. Fresh installation of OS with nothing extra. What next?

I called my boss. I figured maybe he’d have an idea, and if not, he and I would contact Micron to see what they had to suggest–three-year warranties and a helpful technical support staff from a manufacturer who understands the needs of a business client are most definitely a good thing.

My boss caught the obvious possibility I missed: heat.

All the fans worked fine, and the CPU had a big heatsink put on at the factory that isn’t going anywhere. Hopefully there was thermal compound in there, but if there wasn’t, I wouldn’t be getting in there to put any in, nor would I be replacing the heatsink with a heatsink/fan combo. So I pulled the P2-333 out of the PC I use–it was the only 66 MHz-bus P2 I had–and put it in the system. I’d forgotten those old P2s weren’t multiplier-locked, so the 333 ended up running at 233. That’s fine. I’ve never had overheating problems with that chip at its rated speed, so at 100 MHz less, I almost certainly wouldn’t run into problems.

With that CPU, the system happily ran 10 instances of my batch file for 30 hours straight without a hiccup. So I had my culprit: That P2-233 was overheating.

Now, ideally a stress test would tax more system memory than this one did and would force some floating-point operations as well. Prime95 is ideal.

If you have time and parts available, you can troubleshoot a recalcitrant PC by running such a real-world stress test, then replacing possible suspect parts (CPU, memory, hard drive, motherboard) one at a time until you isolate the problem.