Many of my users out there run Microsoft Cluster Services on ESX.Β A great questions was asked of me today: have the rules changed with running MSCS on vSphere?Β The answer is: a little.
There are 3 scenarios of MSCS clusters and ESX: Cluster-in-a-box (both MSCS nodes are on the same physical host – great for testing), cross-host (where each of the MSCS node VMs resides on different ESX hosts), and physical-virtual (where one MSCS node is physical, one is virtual).Β The requirements for MSCS can change, even in the minor updates, so check the documentation often.Β Here’s my compiled list of requirements/tips for MSCS on ESX 4.0:
- You are still limited to two-node clusters with MSCS on ESX 4.
- From a storage perspective, you can use local storage (for cluster-in-a-box) or Fiber Channel (for cross-host or physical-virtual clusters).Β There is still no support for NFS or iSCSI (I personally think this is because FC and local storage have more predictable performance – although iSCSI is improving on this).
- If you are doing cross-host, both hosts must be running the same version of ESX (this just makes sense really).
- The MSCS node VMs cannot move as part of HA or DRS.Β (HA is being a little redundant for MSCS, DRS is because MSCS is so hyper-sensitive to network connectivity that even a ping loss could failover the MSCS cluster).
- You cannot use MSCS with Fault ToleranceΒ (i.e. FT VM’s can reside on the same physical ESX hosts, but MSCS node VMs cannot run as FT pairs)
- You cannot vMotion MSCS node VM’s.Β (Same reason as DRS).
- You cannot use N-Port ID Virtualization (NPIV)
- If you are using FC and using the native multipathing in ESX, you cannot use round robin as a path policy.
- You must use VM hardware version 7 with ESX/ESXi 4.0 (if you migrated the VMs from ESX 3.5 or before, make sure to upgrade your VM hardware version)
- Failover clustering with Windows Server 2008 is not supported with virtual compatibility mode RDM’s, for Win2008 use physical compatibility mode RDMs.
- You cannot use thin-provisioned disks for the Windows OS vmdk’s, they have to be thick.
- For Win2000 and Win2003 use LSI Logic Parallel as the controller type for the shared storage.Β For Win2008 use LSI Logic SAS.
- For physical-virtual MSCS clusters, use RDMs in physical compatibility mode (this just makes sense if you think about it)
- You cannot run storage multipathing software in the VMs or on ESX (i.e. no PowerPath VE).
- You cannot over-commit memory for the MSCS node VMs, set the Memory Reservation option for each of the nodes to the amount of memory assigned to the virtual machine.
- Set the disk I/O timeout to 60 sec. or more (HKLM\System\CurrentControlSet\Services\Disk\TimeOutValue) in the registry.
You can find all the details and steps walking you thru the setup of MSCS on ESX in this article .Β If you’re not on vSphere yet but you want to run MSCS nodes as VMs, you can find the proper docs for your version of ESX in a freshly updated KB article located here .
August 18th, 2009 at 6:06 am
disk time out is set to 60 by VMware tools by the way by default.
August 19th, 2009 at 12:47 am
Why can thin-provisioned disks not be used for the Windows OS vmdkβs??
August 20th, 2009 at 12:01 pm
You state to use physical compatibility mode for Win2008. I assume you should also use physical mode for win2003 cluster across a box?
August 28th, 2009 at 5:41 pm
Your first item on the list states that we are limited to 2-node clusters on ESX 4. What is the limitation? what would keep you from adding more nodes?
I am currenty running a 3-node Windows 2008/SQL 2008 cluster on 3 ESXi 4 hosts, so I don’t think your statement is valid…
September 18th, 2009 at 10:16 pm
I second Steven’s comment, why can’t a thin provisioned OS disk be used?
Thanks for the compilation of note BTW.
September 21st, 2009 at 7:56 am
You can run more than two nodes. Only two node MSCS clusters are supported from VMware. For thin provisioned disks, I believe it’s the hyper sensitivity of the clustering that’s the issue. If you read this: http://www.vmware.com/pdf/vsphere4/r40/vsp_40_mscs.pdf and do a search for eagerzeroedthick you’ll see that support mandates that disks are created this way. Extending of the thin disks can cause a pause in the storage writes while the extension happens. MSCS is too sensitive to allow for this pause.
October 2nd, 2009 at 11:52 am
ΠΡΠΈΠ²Π΅Ρ. ΠΠΎΠ΄ΡΠΊΠ°ΠΆΠΈΡΠ΅, ΠΊΠ°ΠΊ ΠΏΠ΅ΡΠ΅Π²Π΅ΡΡΠΈ Π±Π»ΠΎΠ³ Ρ Π±Π΅ΡΠΏΠ»Π°ΡΠ½ΠΎΠ³ΠΎ Ρ ΠΎΡΡΠΈΠ½Π³Π°, Π²ΠΎΡ ΠΌΠΎΠΉ Π±Π»ΠΎΠ³ ΠΆΠΈΠ·Π½Ρ
ΠΡΠΎΠ΄Π΅ ΠΊΠ°ΠΊ wordpress ΠΎΠ±ΡΠ·Π°Π½ Ρ Π»Π΅Π³ΠΊΠΎΡΡΡΡ Π΄Π²ΠΈΠ³Π°ΡΡΡΡ Π½Π° Π½ΠΎΠ²ΡΡ ΠΏΠ»ΠΎΡΠ°Π΄ΠΊΡ, Π½ΠΎ Ρ ΠΌΠ΅Π½Ρ Π²ΡΠ΅ Π²ΡΠ΅ΠΌΡ ΠΏΡΠΎΠΌΠ°Ρ ΠΈ Π² Π±Π°Π·Π΅ Π΄Π°Π½Π½ΡΡ . Π― Π³Π»ΡΠ½ΡΠ» ΡΠ°ΠΌ, Ρ ΠΎΡΡ Π² php Π½ΠΈΡΠ΅Π³ΠΎ Π½Π΅ ΠΏΠΎΠ½ΠΈΠΌΠ°Ρ Π²ΠΎΠΎΠ±ΡΠ΅ ΠΈ ΠΏΡΠΈΠ²Π»Π΅ΠΊΠ°ΡΡ ΡΡΠΎΡΠΎΠ½Π½ΠΈΡ ΡΠ°Π·ΡΠ°Π±ΠΎΡΡΠΈΠΊΠΎΠ² ΠΏΡΠΎΠ³ΡΠ°ΠΌΠΌΠ½ΠΎΠ³ΠΎ ΠΎΠ±Π΅ΡΠΏΠ΅ΡΠ΅Π½ΠΈΡ ΡΠΎ Π²Π΅Π΄Ρ Π½Π΅ Ρ ΠΎΡΠ΅ΡΡΡ. ΠΠΌΠ΅Π΅Ρ Π²ΠΎΠ·ΠΌΠΎΠΆΠ½ΠΎΡΡΡ ΠΏΠΎΠ΄ΡΠΊΠ°ΠΆΠΈΡΠ΅, ΠΊΠ°ΠΊ Π½Π΅Π±ΠΎΠ»Π΅Π·Π½Π΅Π½Π½ΠΎ ΠΏΠ΅ΡΠ΅Π½Π΅ΡΡΠΈ Π±Π»ΠΎΠ³?
October 2nd, 2009 at 12:54 pm
Huh!? I don’t read/speak Russian…
October 5th, 2009 at 3:29 pm
ΠΡΠΈΠ²Π΅Ρ. ΠΠΎΠ·Π½ΠΈΠΊΠ»Π° ΠΏΡΠΎΠ±Π»Π΅ΠΌΠ° – ΠΊΡΠΏΠΈΠ» Ρ ΡΠ»Π΅ΠΊΡΡΠΎΠ΄ΡΠ΅Π»Ρ Π½Π° ΠΌΠ°Π³Π°Π·ΠΈΠ½ ΡΡ
Π ΠΎΠ½Π° ΡΠ»ΠΎΠΌΠ°Π»Π°ΡΡ Ρ ΠΌΠ΅Π½Ρ Π² ΡΠΎΡ Π²Π΅Π΄Ρ Π΄Π΅Π½Ρ – Π³Π°ΡΠ°Π½ΡΠΈΠΈ ΠΏΡΠ°ΠΊΡΠΈΡΠ΅ΡΠΊΠΈ Π½ΠΈΠΊΠ°ΠΊΠΎΠΉ Π½Π΅ Π΄Π°Π»ΠΈ. ΠΡΠΎΡΡΠΎ ΠΏΠΎΡΡΠ° ΠΏΡΠΈΡΠ»Π° Ρ ΠΊΠΎΡΠΎΠ±ΠΊΠΎΠΉ, Π° ΡΠ°ΠΌ Π²ΡΠ΅ Π½Π° ΠΊΠΈΡΠ°ΠΉΡΠΊΠΎΠΌ. ΠΠ°ΠΏΠΈΡΠ°Π» Π² ΠΌΠ°Π³Π°Π·ΠΈΠ½ Π΄Π°Π½Π½ΡΠΉ, ΠΎΡΠ²Π΅ΡΠΈΠ»ΠΈ, ΡΡΠΎ ΠΎΠ±ΡΠ°ΡΠ°ΠΉΡΠ΅ΡΡ Π² ΡΠ΅ΡΠ²ΠΈΡ ΡΠ΅Π½ΡΡ, Ρ ΠΎΡΡ Ρ ΠΌΠ΅Π½Ρ Π½ΠΈ Π΄ΠΎΠΊΡΠΌΠ΅Π½ΡΠΎΠ² Π½Π° ΡΡΠΊΠ°Ρ Π½Π΅Ρ, Π½ΠΈΡΠ΅Π³ΠΎ. ΠΠ°ΠΊ Π²ΠΎΠ·ΠΌΠΎΠΆΠ½ΠΎ ΠΏΡΠΈΡΡΡΡΠ½ΠΈΡΡ Π΄Π°Π½Π½ΡΠΉ ΠΈΠ½ΡΠ΅ΡΠ½Π΅Ρ ΠΌΠ°Π³Π°Π·ΠΈΠ½? ΠΡΡΡ Π»ΠΈ Π½Π΅ΠΊΠΈΠ΅ ΠΊΠΎΠΌΠΏΠ΅ΡΠ΅Π½ΡΠ½ΡΠ΅ ΠΎΡΠ³Π°Π½Ρ, ΡΡΠΎ ΠΏΠΈΡΠ°ΡΡ. ΠΡΠ΄Π° ΠΏΠΈΡΠ°ΡΡ, ΠΊΠΎΠΌΡ ΠΏΠΈΡΠ°ΡΡ. ΠΡΠΈΠ±ΡΡ, Π΄Π°, Π½Π°Π΄ΠΎ Π±ΡΠ»ΠΎ Π² ΡΠΎΡΠ³ΠΎΠ²ΠΎΠΌ ΡΠ΅Π½ΡΡΠ΅ Π½ΠΎΡΠΌΠ°Π»ΡΠ½ΠΎΠΌ ΠΏΡΠΈΠΊΡΠΏΠΈΡΡ, Ρ ΠΎΡΡ Π½ΡΠΆΠ½ΠΎΠΉ ΠΌΠ½Π΅ ΠΌΠΎΠ΄Π΅Π»ΠΈ ΠΏΡΠΎΡΡΠΎ Π½Π΅ Π±ΡΠ»ΠΎ, Π΄Π° ΠΈ Π΄Π΅ΡΠ΅Π²Π»Π΅ Π² ΠΈΠ½ΡΠ΅ΡΠ½Π΅Ρ-ΠΌΠ°Π³Π°Π·ΠΈΠ½Π΅. Π Π²ΠΎΡ ΠΈ ΡΠΆΠ΅ ΡΠΈΠΆΡ Ρ ΡΠ°Π·Π±ΠΈΡΠΎΠ³ΠΎ ΠΊΠΎΡΡΡΠ°. ΠΠΎΠ΄ΡΠΊΠ°ΠΆΠΈΡΠ΅, ΡΡΠΎ Π΄Π΅Π»Π°ΡΡ.