Posts Tagged ‘Hardware’

VMware revamps HCL publications

December 11th, 2008

The timing of VMware’s latest update is uncanny.  I had just written the other day about the VMware HCL and product documentation.  Yesterday, VMware launched a new Hardware Compatibility Guide portal making it easier than ever to find out out if your system, peripheral, storage, thin client, etc. hardware is compatible with VMware Virtual Infrastructure or VMware View.  The portal replaces the VI HCL document library previously maintained here and in fact, all of the HCL documentation has been removed from that page and replaced with a link to the new portal.

Leary of varying query based search engine usefulness, I tried the portal out for myself this morning by searching on “dl585“.  I was pleasantly surprised by the results.  Instead of indexing the HCL by VMware product platform (as was the case with the previous .pdf documentation library where there was a separate HCL document for each major generation of VI), the portal returns a list of results indexed by my hardware query displaying all versions of ESX that the dl585 hardware is compatible with.  In my opinion, this is much more efficient.

I then ventured over to the VMware View tab and searched on “chip pc” and was presented with a good sized list of Chip PC thin clients compatible with VMware View.  Another search on “chip” produced the same query results, however a search on “chippc” produced no results.  The query engine could use some polishing to showcase a more Google-like web 2.0 friendliness (Did you mean chip pc?)

Adding to a documentation junkie’s pleasure (that would be me), the portal also allows us to download the full version of the compatibility guides in .pdf format from the right side menu of the portal web page.  If that’s your thing, you still have the option to maintain your own offline .pdf repository.  This is one of the habits I do follow and I hope that VMware continues to notify us via RSS feed when an HCL has been updated, providing me with a direct link to the .pdf in the RSS feed so I can easily right click and “save as” into my offline document repository.

VMware configuration maximums

December 9th, 2008

Configuration Maximums for Virtual Infrastructure 3 is by far one of my favorite VMware documents.  This is a useful document for the VMware evangelist and any VMware VI administrator to have tacked up on the wall of their office for use as a quick reference.  It’s also handy for identifying platform comparison points of discussion or decision.

The document answers most of the “How many…”, “How much…” type questions about the VMware Virtual Infrastructure capabilities (ESX hypervisor, VirtualCenter, guest VMs, etc.)  more than once I’ve used this document as the basis for interactive VMware trivia sessions at our local VMware User Group meetings.  This is one of the documents that will most often be updated as new releases of VMware VI are released so it’s a good one to keep tabs on.

The VI3 documentation page keeps us informed as to what date the document was last updated.  In addition, one of the RSS feeds I am subscribed to is VMware, Inc. This feed lets me know the moment any of the VI documents are updated (at which time I then download the updated document for my own document repository I maintain).  Hardware Compatibility List (HCL) documents seem to update almost weekly which is a good indicator that VMware Engineers are hard at work in their labs certifying compatible hardware thereby expanding the list of hardware we may run our VI on.

The virtualization hypervisors (I never thought about it but is this the correct plural for hypervisor?) and management tools are evolving rapidly.  VMware, by far the most innovative of all companies in the virtualization arena, must have teams of technical writers keeping product documentation up to date.  For me personally, accurate product documentation is of the utmost importance and I hope VMware stays on top of it.  Vendor documentation is the gospel for the products and it defines what’s supported and what is not.  Keep yourself informed by reading the vendor documentation once in a while.  Even if you’re not into reading, at least know where the documentation is located for reference purposes.  I promise you the VMware configuration maximums is an interesting/fun read.

ps.  For those paying close attention, the scheduled server maintenance has been completed this evening.  I am now going out to shovel the snow in the driveway for the 3rd time in 24 hours.

Maintenance tonight

December 9th, 2008

The blog, web, and Team Fortress 2 servers will be down briefly tonight for a little maintenance on the virtualized gateway router.  Duration should be about half an hour at the most.  I apologize in advance for any inconvenience.

Speaking of maintenance, I doubled my hosting bandwidth over the weekend from 5Mbps down/512Kbps up to 10Mbps down/1Mbps up.  I performed a little bandwidth speed testing last night and initially I wasn’t overly pleased the results.  Depending on the remote host I tested speed against, I wasn’t seeing the numbers I should be on the download side.  Eventually I did find a remote host that proved I had a 10Mbps down pipe (I don’t have bursting AFAIK).  On the up side (which is what really counts for hosting performance and you readers), I wasn’t able to find any remote hosts that showed I had upstream bandwidth beyond 512Kbps.  I’ll be performing more tests and I will contact my service provider if I am not completely satisfied.  For what I’m paying for business class broadband, I insist that I be consistently getting the 80% of the promised speeds which I believe is the SLA with my provider.

Trust me, I could go really hysterical with regards to my provider but you readers deserve better so I’ll keep it bottled up for now.  Thank your lucky stars for whatever provider you have because chances are they are much better than what I have to work with.

Toodles.

Update: Bandwidth is looking good.  Explanation in comments below.

12-9-2008 9-45-07 PM

LAN party!

December 3rd, 2008

Once or twice a year, I like to pack up the truck with computer equipment and head to a LAN party. Last weekend, a friend of mine at work hosted his annual “After Thanksgiving LAN Party”. We got together Saturday morning and set up. We played games and ate comfort food throughout the day, into the evening, and past midnight (this is the usual schedule). Games played were Team Fortress 2, Unreal Tournament 2004, and Battlefield II Special Forces. The Team Fortress 2 dedicated server was virtualized with VMware Workstation 6.5 – it ran flawlessly. Thank you Brad for putting on a great time as usual!

Pictures:

DSC00007 DSC00002 DSC00004 DSC00005 DSC00006

Get the most out of your laptop battery when traveling

November 22nd, 2008

I typically bring my laptop on the plane to use for working or watching a DVD. I don’t like to be interrupted by losing my battery mid flight. Here are some of my best practices for getting the most out of my laptop battery:

  1. Make sure the laptop batteries are charged before going to the airport. Don’t necessarily count on finding AC power at the airport to take care of this. You may also not have time at the airport to fully charge a dead battery. Trickle charges can take a few hours.
  2. Arrive early enough to find one of the few AC outlets at the gate. Top off the charge on your batteries before boarding the plane. AC outlets are often tucked away along the windows, in the floor, built into vertical support beams. If you can’t find an available outlet at your gate, go to the gate across the hall or to the next gate over and look for an AC outlet you can use there while still within earshot range of the boarding calls at your gate. I’ve also seen outlets in airport restaurants – relax, have some food, charge your laptop.
  3. If you’re going to use the laptop while waiting at the gate, make sure it’s on AC power instead of using batteries.
  4. Before boarding the plane, put the laptop in suspend mode instead of shutting it down. When you power the laptop back on when seated on the plane, the laptop comes up instantly in the OS rather than chewing up battery during a 5-10 minute boot up process
  5. Limit the use of USB devices, CD-ROM, and DVD-ROM. These peripherals chew up battery at a fast rate.
  6. Limit the use of disk intense applications such as defrag or virus scan. This also chews up batteries at a fast rate.
  7. Limit the use of processor intense applications which will cause the CPU and fans to draw more power.
  8. Run the display at the dimmest setting possible.
  9. Turn off the Wireless and Blue tooth radio.
  10. Bring an extra travel battery that is obviously already charged. The last couple of laptops I’ve had I had the extra travel battery installed along with the regular battery at the same time. It makes the laptop a little bulkier and heavier, but for me the extended battery time (5-6+ hours) is worth it.
  11. When you’re done using the laptop on board the plane, if you’re critically low on battery, shut down the OS or hibernate as opposed to putting it into suspend mode. Suspend mode still draws minimal amounts of power and critical bits may still be active in RAM. If you go into suspend and then the laptop battery dies completely, the plug/power will have been basically been pulled from the OS and you’ll lose any unsaved information you were actively working on. It’s just plain not good to yank the power from your computer.

Migrating from ESX to ESXi? Watch the HCL differences

November 16th, 2008

If you are an ESX shop contemplating a migration to ESXi and all that it has to offer, keep in mind the two products do not share an identical list of compatible hardware. Generally speaking, the ESXi Hardware Compatibility List (HCL) is more limited in terms of hardware than ESX. There is a chance the hardware you have running ESX in the datacenter right now may not be supported by VMware for running ESXi.

Here are some examples of hardware models that are supported for ESX but not for ESXi:

Dell

  • 1855
  • 2600
  • 2650
  • 6650
  • M805
  • R300
  • T300

HP

  • DL360G3/G4
  • DL380G3/G4
  • DL385
  • DL580G2/G3/G4
  • DL585
  • DL760G2
  • All BL p-class blades

IBM

  • HS20-8843
  • LS20-8850
  • 346-8840
  • 366
  • 445
  • 460-8872
  • x3400
  • x3500-7977
  • x3550
  • x3650
  • x3655
  • x3755
  • x3800
  • x3950

Keep in mind VMware updates the various HCLs on average about once per week so what is or is not on the HCL today, may change at any time. Typically, hardware is added to the HCL on a regular basis as it is certified by VMware.

The ESX and ESXi 3.5u3 HCL can be found here.

Live migration between CPU vendors demonstrated by AMD and Red Hat

November 11th, 2008

Live migration (VMotion in VMware speak) across AMD and Intel processors is a feature we don’t have today and a technology that many would describe as nearly impossible.

The capability could be in your datacenter sooner than you think. Last Thursday, the Inquirer published an article along with a video where Red Hat and AMD demonstrate the process (of course using streaming video and sound to drive home the point of no interruption) proving that it is possible and the technology to do so may not be so far off. The article goes on to explain that not only can live migration occur between CPU vendors, the same or similar technology can be used to live migrate between CPU architectures from the same vendor (ie. AMD Barcelona Opteron <–> AMD Shanghai Opteron).

Take a look at the video:

VMware adds additional thin client support for VDM 2.1

November 6th, 2008

Effective 11/6/08, VMware has added support for four addtional thin client devices for VMware Desktop Manager version 2.1

  • IGEL Compact 3210
  • Premium 5310
  • Smart 2110
  • Winestra 4210

I thought it would also be worthy to mention that the thin client devices being given away for free by Chip PC at VMworld 2008, the Chip PC Xtreme PC NG 6600, are also supported by VDM 2.1.  I received a unit and I am currently evaluating its capabilities as I have time.  Hopefully one day it will make a good blog post.

Source:  Thin Client Compatibility Guide For VMware Virtual Desktop Manager (VDM)

Microsoft Windows x64 (64-bit) and the VI

November 4th, 2008

32-bit computing is still very much alive, well, and very much supported today which may be one of the primary reasons you have not investigated 64-bit yet or invested the time it takes to migrate your software and/or servers to 64-bit architecture.  Part of the adaptation process is learning and understanding the underlying mechanics behind a technology to be sure it makes good sense from an economical, roadmap strategy, and business need standpoint.  I think 64-bit is one of those technologies that is so deep and covers so much territory that there is a chance for the spread of misinformation. 

As VMware Administrators, at one point or another our careers intersect with Microsoft Windows technologies.  For some like myself, the Windows experience is a daily tradition.  Everyone who is running VirtualCenter is using Microsoft Windows as both the server and client platform.  VMware Update Manager users are using Windows.  License Manager runs on Windows.  Even those without VirtualCenter are probably using the Virtual Infrastructure Client which runs on Windows.  My point is that although this is mainly a VMware virtualization centric blog, we can’t completely ignore Windows.  Understanding the benefits that 64-bit Windows technologies provides might help our virtual infrastructures run faster and more efficiently.  In the long term, I think it’s going to allow our VI to scale up.

Fortunately for those who have not yet rolled up their sleeves and gotten dirty with 64-bit, there’s an IT Architect by the name of Helge Klein who has written an absolutely fantastic seven part series entitled “Windows x64 – All the Same Yet Very Different” in terms that I think most of us can understand.  Even if you’re not a big fan of Windows, some of the content is universal and applies to many platforms.  If you maintain a 3-ring binder of good stuff you’ve found on the internet, I think this series would belong there.

N_Port ID Virtualization (NPIV) and VMware Virtual Infrastructure

October 28th, 2008

A few weeks ago, an associate got me curious about N_Port ID Virtualization (NPIV for short) and what could be done with it in VMware’s current Virtual Infrastructure offerings (VC 2.5u3, ESX 3.5u2).  Most of my SAN equipment is a little on the older side so I haven’t had much chance to play with NPIV or investigate its benefits.  I decided to head into the lab and kick the tires.

To the best of my knowledge, I thought NPIV was somewhat of a newer technolgoy so the first thing was to inventory my hardware for NPIV capability.

  • VMware Virtual Infrastructure 3.5 – check!
  • Compaq StorageWorks 4/8 SAN switch - bzzz!
  • Preferably 4Gb SFPs but 2Gb should work also - check!
  • QLogic 2Gb HBAs – bzzz!

Right off the bat, I’ve got some obstacles to overcome.  My SAN switch doesn’t support NPIV in the current firmware version but the fact that it’s a 4GB switch leads me to believe there may be hope in a newer firmware version.  The SAN switch needs to support NPIV in any NPIV implementation, VMware or otherwise.  The good news is that there’s newer firmware available for the SAN switch.  I upgraded the SAN switch firmware and see that I now have NPIV configuration options on my SAN switch.  One issue resolved.

To validate whether or not a Brocade switch port supports NPIV, check the Port Admin in the GUI console or run the following command from the switch CLI via telnet:

portcfgshow 1  (where 1 is the switch port number)

If NPIV is disabled, it can easily be enabled via the Port Admin GUI or by using the following command from the switch CLI via telnet:

portCfgNPIVPort 5 1  (where 5 is the port number and 1 is the mode 1=enable, 0=disable)

I don’t have a compatible HBA.  That’s a tough one.  VMware’s documentation explains “Currently, the following vendors and types of HBA provide [NPIV] support”

  • QLogic – any 4GB HBA
  • Emulex – 4GB HBAs that have NPIV-compatible firmware

A quick look online at Ebay reveals that 4Gb HBAs are outside of my lab’s budget range (most of the lab budget this year was reallocated for a new deck and sprinkler system for the house – funny how things at home tend to mimic the politics in the office).  Fortunately, there’s more than one way to skin a cat.  A few emails later and I have a 60 day demo HBA coming from Hewlett Packard (HP’s OEM part number: FC1243 4GB PCI-X 2.0 DC, QLogic’s part number QLA2462). 

To validate whether or not your current HBA supports NPIV, open up the ESX console and run the following command:

cat /proc/scsi/qla2300/1 |grep NPIV  (where qla2300 is the HBA type and 1 is the HBA number)

For Emulex, it’s going to be something like cat /proc/scsi/lpfc/1 |grep NPIV

Obviously, browse your /proc/scsi/ directory to see what HBAs are in use by ESX.

npiv4

In addition to the hardware issues, VMware sparsely distributes key NPIV information across several different documents in their library.   This is a pet peeve of mine.  Nonetheless, these are the VMware documents you need to pay attention to (but like me, you can choose to save the reading until AFTER you run into issues):

After a few days, the demo HBA from HP arrives.  I notice the firmware is from 2005 so I upgrade the firmware to current.  I then begin my testing.  I connected the fibre beteen the HBA and the SAN switch and powered on the ESX host.  Before allowing the ESX host to boot up, I entered the <CTRL + Q> BIOS configuration of the HBA to see if any new NPIV options had been added with the firmware upgrade.  None.  No mention of NPIV anywhere in the BIOS.  I proceeded to allow ESX to boot up.  Now that the fibre port is hot, I opened the management interface of the Brocade SAN switch and configured the port for the correct speed and NPIV support (this is configured on a port by port basis).  Unfortunately, I’m not seeing that NPIV is in use from the SAN switch point of view.  I decide to create a VM and see if I need to enable NPIV inside the VM first.  Another roadblock as shown below - the NPIV configuration is essentially all gray’d out and I see a hint at the bottom saying I need RDM storage.  I’m not sure why I need RDM.  Seems like an odd requirement, but I’ll find out why a little later.

npiv1

In the lab I have swiSCSI shared storage suitable enough for testing with RDMs.  A few mouse clicks later and I have myself a VM with an RDM.  I head back to the VM configuration and I’m greeted with the success of being able to add WWNs.  Although I could create the WWNs myself by editing the .vmx file by hand, it’s much easier to let ESX assign them for me.  ESX generates exactly five WWNs:  1x Node WWN and 4x Port WWNs (the Port WWNs are what you should zone to).  It goes without saying that once these WWNs are generated, they should remain static in zoned fabrics (you do zone your fabric don’t you?!).

npiv2     npiv3

The entries in the .vmx file look like this (really, that’s it):

wwn.node = “25bb000c29000ba5″
wwn.port = “25bb000c29000da5,25bb000c29000ca5,25bb000c29000ea5,25bb000c29000fa5″
wwn.type = “vc”
 
Two steps forward, one step back.  I power cycled the VM a few times and I’m still not seeing any sign of NPIV kicking in on the SAN switch.  I should be seeing the virtual WWNs coming online so that I can zone them to something.  Referring to the sparse VMware documentation on NPIV, I discovered how VMware’s implementation of NPIV (version 1.0) works and I also learned I was missing a critical hardware component:  a SAN.  This ties back to my previous questioning of why an RDM is required for NPIV.  So quickly, here’s how NPIV works on VMware Virtual Infrastructure when NPIV is enabled (I obtained access to a SAN to work all of this out):
  1. When the VM is powered on, before the virtual hardware POSTs, it scans the physical HBAs of the ESX host for the RDM mapping to SAN storage.  SAN storage connected to HBAs is a hard requirement.  If an HBA doesn’t support NPIV, it is skipped in the detection process.  If ESX cannot see the zoned RDM LUN through an NPIV aware HBA, the HBA is skipped in the detection process.
  2. If and when an RDM SAN LUN is discovered through the detection process via an NPIV aware HBA through an NPIV capable SAN switch, fireworks go off and magic happens.  One of the four virtual Port WWN s (in the order as they appear in the .vmx file) are assigned to the phsyical HBA and the NPIV virtual Port WWN is activated on the SAN switch.
  3. ESX will assign a maximum of four NPIV Port WWNs during the detection process.  What this means is that if you have four NPIV HBAs connected to four NPIV aware SAN switch ports which are in turn zoned to four SAN LUNs, all four will be NPIV activated.  If you have only one NPIV HBA, you’ll only use one of the virtual Port WWNs.  If you have six NPIV HBAs, only the first four will be activated with NPIV Port WWNs in the discovery process.
  4. Zoning and storage presentation.  Here’s the catch 22 in this contraption and it’s a big one.
    1. I can’t get the ESX generated NPIV Port WWNs to activate on the switch until the VM can see RDM SAN LUN storage targets!
    2. I can’t easily zone RDM SAN storage processors to NPIV Port WWNs until the SAN switch can see the NPIV Port WWNs come online (I use soft zoning by WWN, not hard zoning by physical switch port)!!
    3. I can’t configure selective storage presentation (easily) on the SAN until the SAN can see the NPIV Port WWNs!!!
    4. The detection process at VM POST literally takes less than five seconds total to be successful or to fail and one second or less per HBA scan so to coordinate the correct GUI screens in the SAN switch management console, the selective storage presentation SAN console, and the VM console to toggle power state, takes incredible hand/eye coordination and timing.  It’s literally lining up all the screens, powering on the VM and hitting the refresh button in each of the SAN management consoles to capture the NPIV Port WWN that briefly comes online during the detection process, then goes away after failing to find an RDM SAN LUN.
    5. The only way to make this all work easily in my favor is to disable zoning on the SAN switch and disable selective storage presentation on the SAN.
  5. At any time during the initial detection process or while the VM is already online in operation, should an NPIV hardware or zoning requirement fail to be met for the RDM raw storage on SAN, the VM will fall back to using the Port WWN of the physical HBA it was traversing through it’s NPIV Port WWN assignment.

Once I met all of the requirements above and got NPIV working, the result was rather anticlimactic for the amount of effort that was involved.  Here’s what NPIV looks like from Port Admin on a Brocade switch (blue is the physical HBA, green in is the NPIV Port WWN that VMware generated): 

 npiv5  (Zoom in using the Flickr toolbar)

 I asked myself the questions “Why would anyone even do this?  What are the benefits?”.  There aren’t many, at least not right now with this implementation.  By far, I think the largest benefit is going to be for the SAN administrator.  Maybe a SAN switch port or storage controller is running hot.  Without NPIV, we have many VMs communicating with back end SAN storage over a shared HBA which to the SAN administrator appears as a single Port WWN in his/her SAN admin tools.  However, with NPIV, the SAN admin tools can now monitor the individual virtualized streams of I/O traffic that tie back to individual VMs.  I liken it much to the unique channels in the Citrix ICA protocol that is carried over TCP/IP.  Each of those channels can be monitored and in some cases be throttled or given priority.  The same concept applies to virtualized channels of VM disk I/O traffic through a physical HBA.  Another analogy would be VLANs for disk I/O traffic, but in a very primatave stage.

Another thought for this is to provide a layer of security if we could zone a SAN storage controller solely to an NPIV Port WWN, however, right now this is impossible because as was explained in #5 above, any time the physical HBA is removed from the NPIV visibility chain, NPIV shuts down and falls back to the physical HBA for traffic, and at that point you’ve zoned out your phyiscal HBA and disk I/O traffic would quickly queue and then halt, sending your VM into obvious distress.

A few tips that I’ve personally come up with in this exploration process:

  1. Don’t remove and then readd NPIV WWNs in the VM once it has all initially been zoned because ESX will assign a completely new set of WWNs.
  2. If you’ve done the above, you can modify the WWNs by hand in the .vmx file.  Remove the VM from inventory first, then modify the .vmx, then readd the VM back to inventory because VirtualCenter (or the VIC) likes to hold on to the generated WWNs if you don’t.
  3. Adding or removing phsyical HBAs on the host or RDMs on the VM causes the discovery process to mismatch different NPIV Port WWNs with physical HBAs thus throwing off the zoning and causing the whole thing to bomb to the point all NPIV discovery fails.
  4. If the above happens, you can change the order of the NPIV Port WWN assignment discovery in the .vmx file.
  5. You can VMotion with NPIV, however, make sure the RDM file is located on the same datastore where the VM configuration file resides.  Storage VMotion or VMotion between datastores isn’t allowed with NPIV enabled.
  6. The location of the RDM metadata (pointer) file can be on SAN or local VMFS storage
  7. On an HP MSA SAN, the hosts and corresponding Port WWNs can manually be created in the CLI (or temporarily disable SSP to ease the zoning process)
  8. Removing/adding RDMs can throw off the NPIV Port WWN assignments which in turn throws off zoning
  9. Discovery order of NPIV Port WWNs is tied to physical HBAs.  Adding or removing HBAs throws off the NPIV Port WWN assignments which in turn throws off zoning

Conclusion:  This is version 1.0 of VMware NPIV and it functions as such.  We need much more flexibility in future versions from all facets:  discovery process, better interface for management, editing of the WWNs in the VIC, pinning of WWNs to physcial HBAs, monitoring of NPIV Port WWN disk I/O traffic in VIC performance graphs, guaranteed isolation for security, etc.

Connect a fibre attached tape device to a VM on ESX

October 27th, 2008

Have you ever considered virtualizing your tape backup server? Maybe you’ve thought about it in the past but reasoning produced drawbacks that were too compelling to go forth and virtualize. For instance, pinning a VM to a clustered ESX host which is connected with a SCSI cable attached tape device hanging off of it. Pinning a VM to a clustered host means you lose the benefit of a VM portability, you lose the flexibility of host maintenance during production cycles, and you lose the use of valuable dollars spent on VMotion, DRS, HA, shared storage, and FT (future).

What if you had the hardware to make it possible? Would you do it? If I had to purchase hardware to specifically make this happen, cost effectiveness would need to be researched. Everything else being equal, if I had the hardware infrastructure in place already, yes I would. I had access to the hardware, so I headed into my lab to give it a shot.

What’s required?

  • Hardware
    • One or more ESX hosts
    • At least one fibre HBA in each ESX host that supportes fibre tape devices (enabled in the HBA BIOS typically at POST)
    • A fibre attached tape device (the fibre HBA in a tape device is called an NSR or Network Storage Router)
    • At least one fibre SAN switch
      • If using more than one in a single fabric, be sure they are ISL’d together
      • If multipathing in separate fabrics, at least two HBAs per host will be required and at least two NSRs in the tape device (although this is really going overboard and would be quite expensive)
    • Fibre cable between ESX host and SAN switch
    • Fibre cable between NSR and SAN switch
    • Optional components: shared storage
  • Software
    • VMware ESX or ESXi
    • Virtual Infrastructure Client
    • Latest firmware on HBA(s), NSR(s), and SAN switch(es)
    • Appropriate zoning created and enabled on SAN switch for all ESX host HBAs and NSR
    • Optional components: VirtualCenter, VMotion, DRS, HA

The steps to setting this up aren’t incredibly difficult.

  1. Attach fibre cables between HBAs and SAN switch
  2. Attach fibre cable between NSR and SAN switch
  3. On the fibre SAN switch, zone the NSR to all HBAs in all ESX hosts that will participate. Be sure to enable the active zone configuration. On Brocade SAN switches this is a two step process.
  4. Perform a scan on the fibre HBA cards (on all ESX hosts) to discover the fibre tape device. In this case, I’ve got an HP MSL5026 autoloader containing a robotic library and two tape drives:
    fctape1
  5. Once each ESX host can “see” the tape device, add the tape device to the VM as a SCSI passthru device. In the drop down selection box, the two tape drives are seen as “tape” and the robotic library is seen as “media”. Take a look at the .vmx file and see how the SCSI passthru device maps back to VMHBA1:1:2 and ultimately the tape drive as a symbolic link:
    fctape2 fctape3 fctape9 fctape10 fctape11
  6. The VM can now see the tape device. Notice it is SCSI and not fibre. At this time, VMs only see SCSI devices. Fibre is not virtualized within the VMware virtual machines to the extent that a VM can see virtual fibre or a virtual HBA. The current implementation of NPIV support in VMware is something different and will be explored in an upcoming blog:
    fctape4

Good news! The fibre attached tape drive works perfectly using Windows ntbackup.exe. Effective throughput rate of many smaller files to tape is 389MB/minute or 6.5MB/second. As expected, running a second backup job with less files but larger sizes, I saw an increased throughput rate of 590MB/minute or nearly 10MB/second. These speeds are not bad:
fctape5 fctape6 fctape8

Now for the bad news. When trying to migrate the VM while it was powered on (VMotion) or powered off (cold migration), I ran into a snag. VMware sees the fibre tape device as a raw disk with an LSI Logic SCSI controller which is not supported for migration (I tried changing the LSI Logic bus to use Physical bus sharing, but that did not work):
fctape7 fctape9

The VM migration component of my test was a failure, but the fibre connectivity was a success. Perhaps we’ll have SCSI passthru migration ability in a future version of VMware Virtual Infrastructure. Maybe v-SCSI passthru is the answer (v-* seems to be the next generation answer to many datacenter needs). What this experiment all boils down to is that I can’t do much more with a fibre attached tape device than I can with a SCSI attached tape device. In addition, a VM with an attached SCSI passthru device remains pinned to an ESX host and therefore doesn’t belong on a clustered host.  However, I can think of a few potential advantages of a fibre attached tape device which may still be of interest:

  1. Fibre cabling offers better throughput speed and more bandwidth than SCSI.
  2. Fibre cabling offers much longer cable run distances in the datacenter.
  3. A failed SCSI card on the host often means a motherboard replacement. A failed HBA on the host means replacing an HBA.
  4. Fibre cabling allows multipathing while SCSI cabling along with the required bus termination does not.
  5. Fibre cabling leverages a SAN fabric infrastructure which can be used to gather detailed reports using native and robust SAN fabric tools such as SAN switch performance monitor, SANsurfer, HBAnywhere, etc.
  6. VMs with fibre attached tape can still be migrated to other zoned hosts by simply removing the tape device in virtual hardware, performing the migration, then re-adding the tape device, all without leaving my chair. A SCSI attached tape device would actually need to be re-cabled behind the rack.

The Lab – Then and Now

October 19th, 2008

I’ve completed a new blog page:

The Lab – Then and Now
A gallery documenting its evolution using VMware

To visit, just click the “Lab” menu item at the top of the page.