LaslowNET Technical Rants, HOWTOs, and Writing

6Jul/100

An HTC DoA: Part 1

When Google first offered the Nexus One in Canada (by making a GSM version for AT&T, which is compatible with Rogers, Telus, and Bell in Canada), I jumped on. I had my order in the day the web store was updated, and in my hands by the end of the same week. Despite the cost, I consider it the best gadget purchase I've ever made.

Fast-forward to last week. With my wifes birthday coming up, I thought it would be nice to surprise her with a trip to the local Telus dealer (her carrier) for a shiny new phone. After looking at their offerings (the only two real considerations were the Motorola Milestone and the HTC Hero), she decided that she'd come down with a case of Phone Envy and wanted a Nexus One as well.

So, no big deal - we ordered the phone and waited. Unfortunately, due to the stat holiday on July 1st, the phone didn't arrive on Friday like expected and instead came in yesterday (the following Monday). I left the phone plugged in to charge, and when the light turned green to say all was well I fired it up.

And waited.

And waited some more.

Nexus One Boot Screen

The Boot Screen on my other, unlocked (and functional) Nexus One

After about five minutes of staring at the above picture (minus the lock on the bottom, as I hadn't gotten around to doing that yet), I got the feeling something was wrong.

I powered off the phone by pulling out the battery, then booted in to HBOOT mode (hold down the VOLUME DOWN key while pressing Power). That worked fine, so I tried to enter recovery. Unfortunately, I was met with the same screen. I tried several more times for both normal mode and recovery without success. Then I gave up and called Google.

At this point, I'd like to point out that I'm fairly certain it's just a software error. Unfortunately, the only way I could really do anything about that would be to Unlock the phone, which voids the warranty. As such, the call to support was warranted.

I called the number listed on the Google Phone Contact Support page (1-888-48-NEXUS) and after a few quick IVR choices I was greeted nearly immediately by a tech. He asked about the problem, what I had tried, and immediately agreed the phone was D0A and would need to be replaced. That's when things started to go downhill.

The problem is that I live in Canada, and HTC has a different policy for international orders than their standard 'send you a replacement and then send the old one back' policy for domestic ones. If you live in Canada and get a DoA phone, you have two options:

  1. Send it back via their normal repair process and get a refurbished phone (no option for a new one)
  2. Initiate a 'Buyers Remorse' return, get a refund for the device, and then order a new one.

The choice was fairly obvious - I ordered my wife a new phone, so that's what she was going to get. After gathering a little more information, the Google tech transfered me over to HTC support to process the return order.

As the new tech was going over the specifics, a thought occurred:

Me: Okay, so I do the return, and within 14 business days of you receiving the device and okay'ing everything, you credit the purchase price back, correct?

Tech: Yes, that's correct.

Me: So what about the duty charge that I had to pay CoD?

Tech: Oh, that. Hang on. --Pause-- Okay, I checked on that, and what happens is you'll get some paperwork from us once the refund has been processed. Once you have that paperwork, contact Customs and they'll be able to issue a refund.

So at this point, I may have to wait three weeks for credit from HTC (although apparently it usually doesn't take that long), and then I can file a claim to get my ~$70CAD that I paid for taxes back.

All was said and done, and I prompted received a confirmation email repeating what the agent said. However, as of posting I still haven't received a FedEx shipping label in my email as promised. I called HTC back, and they escalated the ticket and said they'd resend the label, so now I'm waiting on that. Once I've got the label in and the phone sent away, I'll order another N1 and see what happens.

The pain begins in Part 2!

31May/100

Rant: Microsoft Telephone Activation

I've blogged about Microsoft Telephone Action before, but now I'm off on a rant....

Click 'Continue Reading' for the wall-of-text!

7Apr/100

From the Search Results: Backing Up

To the person who found my blog by searching for "raid 1 backup", please note: RAID IS NOT AN ACCEPTABLE BACKUP SOLUTION!

Any method of RAID (other than RAID0) is good for protecting against failing hard drives. However, it is by no means a backup solution, as it doesn't save your data from things like accidental deletion, file system corruption, users, and malicious software. If you're looking for a backup solution, investigate external hard drives, SAN/NAS devices, tape backup units, etc....

Please. For the sake of your data, get a real backup solution.

25Feb/100

Support: I Want This Guy On Speed Dial

Although today has been a real Gong Show, my morning was brightened by, of all things, a chat with Dell Support:

10:04:03 AM System You are now being connected to an agent. Thank you for using Dell Chat
10:04:03 AM System Connected with ***_Oliver_******

10:04:08 AM ***_Oliver_******: Hi, thank you for contacting DELL Canada support, my name is Oliver, how may I help you?

10:05:01 AM Laslow: I have a system that is unable to POST. If I remove the video card and use the onboard video, it POSTs fine. If I take the video card from an identical unit and try that, it also works fine. It appears that the video card is defective, and I need a replacement sent out.

10:07:13 AM ***_Oliver_******: I'll do my best to assist you with this one.
10:07:48 AM ***_Oliver_******: I could see here that system is an ***** and warranty is active until *****.

10:07:55 AM Laslow: That is correct.

10:08:58 AM ***_Oliver_******: I'll replace this.
10:09:13 AM ***_Oliver_******: Would you like me to send an onsite tech ?

10:09:21 AM Laslow: No thank you - the part will be fine.

10:09:33 AM ***_Oliver_******: May I have the shipping address?

10:09:58 AM Laslow: ***** ***** ***** *****

10:11:23 AM ***_Oliver_******: By the way, may I know who's the registered owner of the system?

10:12:46 AM Laslow: ***** ***** ***** *****

10:13:28 AM ***_Oliver_******: Thanks. I'll go ahead and process the case and dispatch. This would take me about 5 minutes. Hold on.

10:13:34 AM Laslow: Thank you.

10:13:39 AM ***_Oliver_******: Would you like me to send the reference numbers thru email instead?

10:13:50 AM Laslow: That would be greatly appreciated.

10:14:49 AM ***_Oliver_******: Please wait for the part next business day. Return to us the old part not later than ten days once the replacement part is received. You could use the same box that came in with it.
10:14:54 AM ***_Oliver_******: Anything else I can help you with?

10:15:09 AM Laslow: No, that is fine. Thank you very much! You have been very helpfull!

10:15:19 AM ***_Oliver_******: Would you like me to contact you to followup on this?

10:15:36 AM Laslow: No, that will not be required.

10:17:39 AM ***_Oliver_******: Ok. Thank you! Thank you for choosing DELL, have a good day!

10:17:51 AM Laslow: Thanks again!

This is probably the first time that I've spoken with Dell where I haven't had to fight to get the part I wanted, but where the turn-around time was less than 15 minutes! I'm completely and utterly shocked! Also, that was the shortest 5 minutes I've ever had to wait for them to process the dispatch request.
Edit: Sorry for the poor formatting - WordPress also hates me today.
10Nov/090

Support: Those Pesky BIOS/Windows Video Drivers

Yesterday, a staff member told me one of the public-access computers was on the fritz - specifically, the screen was black and the fans were on full. Going over to investigate and try the basic troubleshooting, it looked like it had a dead motherboard, so I took it back to my work bench for further troubleshooting.

As the system was a Dell, it had a set of Diagnostic LEDs on the front to help diagnose the problem. Unfortunately, looking up the Optiplex 745 manual, it didn't list the particular code the lights were showing: 1 3 4.

Looking through the listed codes, most of the ones related to 1 or 3 4 were for either faulty RAM or a bad memory controller, so I started there. I tried removing one stick, then replacing it and removing the other, using a completely different stick, and then none at all. Each time I was met with the same diagnostic code, 1 3 4.

Convinced I had the problem pin-pointed, I tried to get on to Dell's Live Chat Hardware Support to get a replacement motherboard as the system was still covered by Next Business Day parts. The site, unfortunately, was down, so I was forced to call Dell instead.

Now, there's a reason that I typically use Live Chat instead of the phone. The main reason is on Live Chat, I can quickly bang-out the troubleshooting I've already done, and 9 times out of 10, the tech will just send me the part. Phone support is rarely that easy, and this time was no exception.

The wait time was average (maybe about 5 minutes), and once I got the agent on, I quickly rattled off my troubleshooting and told him the diagnostic code. He put me on hold for a few minutes and when he came back, told me that the diagnostic code 1 3 4 on an Optiplex 745 means the video card isn't working. Figuring it would be easy enough to test, I pulled out the PCI-E video card and switch to the onboard video. Sure enough, it worked.

At this point, I figured we'd be done. I didn't have another PCI-E card to make sure it wasn't the slot, but I supposed out loud that if I got the replacement card and it didn't work, I could always send it back, make another call, and get a motherboard sent out. The 'tech' had another idea, though.

Him: There's actually one more troubleshooting step I'd like to try.

Me: Okay, what's that?

Him: Well, leave the video card out, boot in to Windows, go to Device Manager and remove the device. Then turn the computer off, put the video card back in, and see if it work.

Him: I just want to see if the driver is the problem.

Me: ...what?

At this point, I nearly hung up the phone. Instead, I figured a little education would be in order.

Me: Okay, I think you might be a little confused. The problem is that, with the video card in, the system won't even POST properly. I fail to see how a Windows driver could possibly be the problem, because at this stage in the boot process, the computer has no clue what operating system is on the hard drive. It hasn't even looked to see if there's a hard drive there. How can a driver possibly be the problem.

Him: Sir, it's just part of my troubleshooting process.

Me: Okay, I'll humor you. I'm going to pretend to boot my computer.

Him: No, I need you to actually--

Me (interrupting): Now I'm going to pretend to open Device Manager. Oh, would you look at that! Because the video card isn't present, it's not showing in the list.*

Him: ...oh. Well, I guess we can send you a replacement video card....

Me: I thought so.

This morning, I received a call back from Dell asking me if I'd received the video card. I said no, and he told me that the card had been delivered and signed for. Figuring that maybe Purolator had delivered it to the wrong office, I put the agent on hold and checked around, but nobody had received anything today. Going back to the agent, I asked who signed for it, at which point he informed me that it hadn't been delivered yet and would likely arrive this afternoon. Gee, thanks for that.

*Note: I'm aware that you can make Device Manager show non-present device by following this KB article, however that would have only confused the poor agent and wouldn't have made a difference anyway.

30Sep/090

Gong Show: The Tail of an IBM x3400 Server (Part 4)

(Continued from Part 3)

So Tuesday afternoon rolled around. I ran a manual backup of the Exchange server before IBM Dude came around and did a test restore to make sure everything was working, much like I should have done last time. As soon as he arrived, we powered down the server and swapped out the board. After everything was back in place, we crossed our fingers and pressed the power button.

-Click- WHIIIIIIIRRRRRRRRRRRRR

As the server powered on, we noticed two things. One was that the server sounded like a hurricane. With most servers, be they IBM or Dell, when you first turn them on all of the fans will spin up to full power, then settle down. In this case, the fans spun up, then stayed up. We could barely hear each other. The other thing we noticed, however, was an error message on-screen:

1604 Machine type mismatch detected

Neither of us panicked, though - we still had to flash the BIOS so we could put in the correct Machine Type and Serial numbers. The fans were starting to get annoying, though.

After the machine booted off the update CD, I plugged in the right numbers, double-and-triple-checking them, then let it do it's thing. When it rebooted, the fans were as loud as ever, and, unfortunately, the error persisted.

1604 Machine type mismatch detected

Popping in to BIOS, I double-checked the Machine Type - it was set correctly. We both scratched our heads, and then noticed that the part number on the new board was different from the old one. In fact, after IBM Dude did a little searching, he found the new board was actually for an x3500, although it was supposedly a valid substitutable part. Regardless, and believing we'd found the problem, he ordered a new board of the correct part number and promised he'd be back Friday with the correct part. In the mean time, the server was still running, albeit a little slower and a lot louder, but at least now the power button was fixed and tape was no longer required.

14Sep/090

Gong Show: The Tail of an IBM x3400 Server (Part 3)

(Continued from Part 2)

Now for most Exchange administrators, there's not a lot worse than when one of your storage groups isn't mounting. Worse things would include the RAID array dying and the server catching fire (maybe one as a result of the other), or a user who decides that the server room doesn't need air conditioning when nobody's working in there and shuts it off over a long weekend.

Not that the last one has ever happened to anyone. *Cough*

Unfortunately, because I was an idiot and didn't copy the error messages at the time (I was more worried about getting the server back up and running), I can only summarize what happened.

  • Tried repeatedly to mount the database. As they say, if it doesn't work the first time, it probably won't work the seventh. Turns out, 'they' were right.
  • Ran 'chkdsk /r' on the RAID array containing the transaction logs, and then on the array with the .edb - no love, still no mounting
  • Tried every possible way to get eseutil /r to replay the transaction logs to the database, only to find that both were corrupt. Great.
  • Tried to restore the last backup using Backup EXEC. It didn't work.
  • Admitted defeat and ran eseutil /p on the database.

Here's the kicker: when running eseutil with the /p switch on a database that wasn't shutdown cleanly or had the /r switch run on it first, all of the data in the transaction logs gets discarded. However, when they're corrupt anyways, there's really not a lot to lose.

When eseutil finally finished it's repair after over an hour of grinding away, the database finally mounted. Heaving a sigh of relief I double-checked the tape and went home for night know I'd done all I could do. Surprisingly no one reported any missing emails the next morning, and I was able to grab a full backup of the server without issue.

When mid-afternoon rolled around, IBM Dude showed up with the 'front diagnostic panel', aka 'the switch assembly'. We powered down the server, he ripped things apart, pulled out the old part, popped in the new one, and turned on the server.

Or at least, tried to turn it on.

–Click– *WHIRRRRRRRrrrrrr* –Click–

Fantastic. It looked like the first replacement switch assembly had the same problem. Ripping things apart again, IBM Dude swapped the replacement with the freshly ordered spare. Crossing our fingers, he tried the button again.

–Click– *WHIRRRRRRRrrrrrr* –Click–

Crap. At this point, he cut his losses and called for help. The suggestion? Replace the system board.

IBM Dude ordered the part, I booted the server again, once more relying on scotch tape to do it's thing, and we made plans to have the board replaced the following Tuesday afternoon.

Will the gong show continue? Find out in Part 4.

11Sep/090

Backup EXEC, the Bane of my Existence

Okay, so maybe it isn't the bane, but it's certainly a bane. Probably in the top three right now.

Environment: Backup EXEC 12.5 SP2 (fully updated) running on Windows Server 2008 SP2, backing up, among other things, Exchange Server 2007 SP2 x64, running on Windows Server 2003 x64.

Setup: Due to horrible issues with Backup EXEC's (referred to as 'BE' from now on) handling of GRT (Granular Restore Technology) with tape backups, BE runs a B2D (Backup to Disk) job to backup the Exchange mailboxes, and then a second B2T (Backup to Tape) job does the standard backup, plus grabs the B2D directory.

Problem: The B2D job fails intermittently with the following error:

Cannot extract mailbox messages from the Exchange backup. V-79-57344-759. -1213 The database page size does not match the engine on the backup to disk job.

Troubleshooting: According to this article, Hotfix 319699 is supposed to resolve the issue, in most cases. According to the article, though, it only applies to 12.0 and Service Packs, 12.5, and 12.5 SP1. The issue is supposed to be resolved in SP2, which I'm currently running.

After spending an hour on the phone with Symantec support, as the article says to call them if the Hotfix doesn't resolve the issue (it didn't for me), I was basically told that the version of BERestore.exe in the C:\Program Files\Symantec\Backup Exec\RAWS directory was older than the version on the BE Server (12.5.2213.101 compared to 12.5.2213.144).

Resolution: So far, the fix seems to be to manually copy the C:\Program Files\Symantec\Backup Exec\Agents\RAWSX64\Updates directory to the Exchange server, then sort by date and apply RAWSx642213RSP2.msp, plus every update newer than that. Once the updates are complete (it took me about 10 minutes to run all 8 applicable patches), reboot the server.

After doing so, I ran the B2D job and it completed, however that isn't necessarily proof that this worked - as the problem is intermittent, it'll take another week of scheduled jobs to tell for sure whether that resolved the problem.

For the time being, my fingers are crossed.

9Sep/090

Gong Show: The Tail of an IBM x3400 Server (Part 2)

(Continued from Part 1)

Having determined that there was most definitely a hardware problem with the server, and now that it was back up and running (albeit with tape being the lynch-pin of the whole thing), I did a quick search for IBM's support number and gave them a call. Surprisingly, there was no wait, and I was passed to an agent. Also not surprising, after spending ten minutes giving them information, they finally determined that my company had never contacted them for support before, and I had to give them my contact information again while they created an account. Once that was done, and the agent was armed with a brief description of my problem, I was transferred to Hardware Support.

The agent I spoke with there quickly agreed that this was a serious problem, and was slightly mortified that I was using tape to keep the server running, despite the fact I had little other choice. He also agreed with me that the problem was likely with the micro-switch in the power button and ordered a replacement for the whole 'front panel diagnostic assembly'. That done, he informed me that a local technician would be contacting me shortly to confirm a time for a service call.

Apparently, despite the fact that replacing that particular part was nearly as easy as it gets, IBM, unlike Dell, insists on sending a tech. Whatever. I was just happy to get the problem fixed.

4Sep/090

IBM has a sense of humour?

A few weeks ago, my Exchange Server had a series of unfortunate problems, and in the end the planar board had to be replaced (I'll eventually get around to doing a blog post about that gong show). When the (first) replacement board arrived, I had to take a picture of the asset labeling on the box. In light of the whole situation, 'C0KZUP' was certainly appropriate.

The entry for 'Loc-emplacement', whatever that is.

The entry for 'Loc-emplacement', whatever that is.