Jump to content
Welcome to our new Citrix community!
  • How to Setup a Nutanix Protection Domain to Replicate VMs from One Cluster to Another.


    cugcblogs

    raydavis22rnd-1.jpg by Ray Davis, CTA

    This is a quick blog to show the power of a protection domain in an AHV setup. First, I wanted to explain that my lab is not complex yet. Meaning I don't have firewall rules, different VLANS, or any kind of segmentation. It's a flat Network with 2 Nutanix AHV Host. When I started writing a blog on how to setup a Nutanix protection domain, I was doing this on a production and DR site at a previous company. It worked like a treat, and I was able to move AHV virtual machines from different clusters to a DR cluster and back with no problem.  

    But since then, I  moved on to another company where they don't have Nutanix :(. I don't have access to the environment anymore. My lab is 100% AHV. Why? Well, because I got into Nutanix 2 years ago or more. I love what they offer and the Hypervisor's easy use and other software they provide. I am using the Nutanix CE version. However, the Nutanix CE version is behind compared to what is released now. I noticed in an AOS 6.0 setup that some of the options are in different areas compared to the CE version. The CE version is based on 5.18, from what I can tell. I am not sure when they will release a newer build for the CE edition either at this time. 

    This allowed me to maintain base skills set in the Nutanix realm. I know it's not 100% solution when compared to what many are running in a Prod setup.  But It's the same steps in a production environment. I am now rewriting this blog around my lab setup instead of what I originally had written up. Since I need to get my Virtual Machines off the other AHV host, I decided to write a little blog on protection domains VM replication. This blog shows you and gives you an idea of what it really can do. It is very straightforward. The biggest issue I had back when I was doing it in a Production setup was Firewalls. It is critical to make sure this is right. If not, you will struggle. Ask me how I know :). We all have to go through the firewall setups with Information Security or whoever manages your firewalls. Just make sure it's set up correctly.

    As I was saying above, my lab is simple. It is just for me to test things and keep up to date with my CVADS/CVAD journey.

    In my lab, I will be referring to names called:

    NTX-Cluster-03

    NTX-Cluster-01

    I created two Single Node Clusters, so I can have the option to do failovers and try to mirror a source and destination AHV cluster.

    davis060122-01.png.27ce9ab0d50e1c9acd67910bcb1f45ed.png

    I was in a situation where I needed to get another Lab server. So after searching online, I found a great site called Refurbished/Used Dell & HP Servers, Hard Disk Drives – TechMikeNY

    I ended up ordering this. This is all I needed for my second host at this time.

    Items QtySubtotal
     Dell PowerEdge R630 8-Bay 2.5" 1U ServerDELL_PE_R630_8B1$364.00
     Dell 0C34X6 2TB SSD SATA 2.5'' 6Gbps Solid State Drive2TB_SSD_SATA_SFF_6G2$408.50
     Intel Xeon E5-2698 v3 2.30GHz 16-Core LGA 2011 / Socket R-3 Processor SR1XE2-30Ghz_E5-2698_V3_16C2$245.60
     32GB PC4-2133P ECC-Registered Server Memory RAM32GB_PC4-2133P8$683.20
     Dell 1100W 80+ Platinum Power SupplyDELL_1100W_80-PLUS2$153.80
     Dell iDRAC8 Enterprise Remote Access LicenseiDRAC8_Enterprise1$67.20
     Dell HBA330 12Gbps SAS HBA Controller (NON-RAID) MiniCardHBA330_Mini1$62.70
     200GB SSD SATA 2.5'' 6Gbps Solid State Drive200GB_SSD_SATA_SFF_6G2$73.30
     Dell 0R1XFC I350 Quad-Port 1GBe Daughter CardDELL_0R1XFC1$21.00
     64GB SATA Disk-On-Module SATADOM SATA III 6Gbps Drive64GB_SATADOM_6G1$15.50
     Dell 2.5in R-Series CaddyR-Series_SFF_Caddies4$19.00
    Subtotal$2,113.80
    Discount-$0.00
    Shipping$55.60
    GST$0.00
    Grand total$2,169.40

     @TechMikeNY

    Here are the firewall Ports

    Ports and Protocols ANY - Disaster Recovery - Protection Domain (nutanix.com)

    Protection Domains (nutanix.com)

    You need to create a Protection domain to do this first. This is what tells Nutanix what to replication and how often.

    Let's start by setting up a protection domain on NTX-Cluster-01

    Drop down, select data protection.

    davis060122-02.png.334f7d3b5f04bc6e3973724524a2d8eb.png

    Click + protection domain

    davis060122-03.png.6e33549192841a026d473cf7d6340195.png

    Please give it a name. Give it something that makes sense to what you are working with.

    davis060122-04.png.e80e644936fe6514e574e2d6787197ae.png

    Now it will ask you for the VM names.

    davis060122-05.png.b0687b1880b819b450f133a4d6832bdd.pngdavis060122-06.png.2a4433c3df9626a1daef82813e4c9c01.pngdavis060122-07.png.931fa7d797198b47d079b61eba36bd87.png

    You can select what schedule fits you best in the setup. I set this up to run every day as an example. The 10 minutes in the screen show is to show you the schedule options.

    davis060122-08.png.394a7c4f2596cdedec49d4c6ed81edcf.png

    Now you need to add the Remote site (NTX-Cluster-03) to which you want it to send. You will do this on the source cluster and add the Remote location to the source cluster.

    NTX-Cluster-01 >>>>>>> NTX-Cluster-03

    davis060122-09.png.eae812af7ee73ecf113349c8913375a6.pngdavis060122-10.png.4dd83fe7aea539b62555f8509753f9e5.pngdavis060122-11.png.3da1a4f9a30c5669ceff9977d52cffe3.pngdavis060122-12.png.f20b53ee0044473b5f174fe3ec1cb4c2.png

    Once both sites are set up to talk to each other, we will need to come back here.

    davis060122-13.png.83126b8027dba237fad7efc793f21a40.png

    Go to the remote Site Cluster and Log in.

    Go to the Data Protection section.

    Then create the new remote site connection.

    NTX-Cluster-013 >>>>>>> NTX-Cluster-01 ( this is for reverse sync) Basically to replicate it back if needed. In my case, it's not. But in a Prod setup, you would want to send it back once your DR failover activities are completed.

    davis060122-14.png.27c873c07b6f788947c16108053b07fd.pngdavis060122-15.png.ced36037e567c333df31ecfbb8a10f58.pngdavis060122-16.png.fd38bf064aaa596f62d16edddb692eb5.png

    Again once we get the sites talking I will come back here to update the mappings.

    davis060122-17.png.8570a054a38550f5c4e0003041ddf48d.png

    Now to check the connections from NTX-Cluster-01 >>>>>>> NTX-Cluster-03

    Now, as you can see, 03 is talking to 01

    davis060122-18.png.16fbd1fc8b13a69e417c86db9dd2ca1a.png

    Then you can see that 01 is talking to 03

    NTX-Cluster-03 >>>>>>> NTX-Cluster-01

    davis060122-19.png.004618c4c711db854e57c7eea3607bca.png

    Now let's set the mappings. We can start on NTX-Cluster-01.

    Go back to the Remote Site and edit "Update" the Remote site settings.

    davis060122-20.png.78e620a6ebaa6423e6d85a5d81d6919c.png

    Click on Settings, and scroll down.

    Add the Network Mappings and vStore Name mappings. This is just setting up the Source and destination network and storage.

    davis060122-21.png.25ea328042cf35e67a52b5e4b78d538c.png

    On Cluster-01

    So AHV: Data-Cluster01 will send to AHV: Data-Cluster-03

    My network name is the same as the test sites. This would be different in your environment based on what you have the Network name when creating a base VLAN. I used the default storage location as well on both Clusters.

    davis060122-22.png.d2067ed9baa847fbfafa853ecc306431.png

    NTX-Cluster-01

    davis060122-23.png.a1f8035c24d6a00e3bac3ed5fbd0af7a.png

    Go back to the Remote Site (NTX-Cluster-03)  and do the same but in reverse.

    On NTX-Cluster-03

    So AHV: Data-Cluster03 will send to AHV: Data-Cluster-01

    My network name is the same as the test sites.

    Save settings

    davis060122-24.png.48da7ec72fd7ab9bbba465ef16d18f56.png

    NTX-Cluster-03 updated with the remote site info.

    davis060122-25.png.42b098be8508298333f0b6dc8ade7803.png

    Replication has started

    davis060122-27.png.2ed04cee313de4dccbe6f0bf6d3c703b.png

    On the remote tab, you can see the data completed along with start times, and then it shows outgoing.

    davis060122-28.png.2ccc1191e44ef1c9f2d0a29d26c0f83b.png

    Now, if I log into my "remote site" NTX-Cluster-03

    I should see incoming and some stats. You can see it listing NTX-Cluster-01 as the remote site. This is because we are logged into NTX-Cluster-03.

    davis060122-29.png.1e4008f3aacde020a34022239eddddfe.pngdavis060122-30.png.99a3eeb19338bacd767e4ad4b0f373e3.pngdavis060122-35.png.19d7b885fa514d32847de5bc56a320c9.png

    Now it's done:

    davis060122-34.png.7d31d1ee4ea417b50e4e364fea869e2e.png

    I would like to migrate the server from NTX-Cluster-01 to NTX-Cluster-03.

    Go to the source Prism Element, and click on the Async DR tab. Then click Migrate option.

    Now click on Migrate.

    davis060122-32.png.37d4549d6aeff5ff9d56b2b55fc7b81c.pngdavis060122-36.png.279dac83c56f2df59fc9990f9b72c750.png

    As you can see, it's gone from the Prod location

    davis060122-37.png.86c198fd1ee7510d282c4f5ed38db693.png

    Check the Remote location or DR NTX-Cluster-03, and you will see it now.

    davis060122-38.png.e1d7e43a8cde750022e3bbb7d5c34707.png

    Now let's power it one.

    davis060122-39.png.c3b8ecf0bd633488447e26f8cedf7e1d.pngdavis060122-40.png.02e4bfe185debd7d1956557709d541df.png

    Up and online

    davis060122-41.png.41615edb00849d299308028d514b69bc.png

    Now, in this case,  I am going to migrate it back.

    NTX-Cluster-03>>>>>>>NTX-Cluster-01

    davis060122-42.png.6b74ef59105aaf0be01ead0bd3e736e0.png

    Now it will, Snapshot the VM and send it back to NTX-Cluster-01.

    Now let's check NTX-Cluster-01 and it's back.

    davis060122-43.png.7bd7d798921ba28c45097d4f886806f6.pngdavis060122-44.png.4cd1f5cffdbe83dfd69c3a1307bf9e10.png

    In the next example, I need to move most of my lab VMs from NXT-Cluster-01 to NXT-Cluster-03 to free up resources on a host.

    davis060122-45.png.92ec9c741a23794f98a59d2be5931068.png

    I selected more this time.

    davis060122-46.png.f9d12d56074df67882978d6f3c17ecac.pngdavis060122-47.png.80aaf84fe6ecbf28d428c57645ace9b2.png

    This time around, I will let the schedule do what it needs to do. This will give you an idea of how it works.

    Starts at 5 pm, and now it's 4:53 pm.

    davis060122-48.png.7f1a8bf631a080dca2f00de8ff36f884.png

    At 5 pm we will see a local snapshot start, then it will start replicating to NTX-Cluster-03.

    Snapshot Started

    davis060122-49.png.fe1c56aa64cf75696a4176ed9a231a86.png

    Replication started.

    davis060122-50.png.da2edefc681067a0728773a4756a54e9.pngdavis060122-51.png.d8ef06c0c8810d8adadb9c852813a585.png

    Finished

    davis060122-52.png.9802da26c34bf956e3bc526c9cea29ae.png

    Now to fail them over.

    Click on Async DR,  The Protection Domain name, Entities.

    davis060122-53.png.63b1005953dbab3eb7c30e841ea738f2.png

    Click Migrate

    davis060122-54.png.cb90d7007d30660e0ef2918905a837d5.pngdavis060122-55.png.b638b68ed08b8e9774d1a79f7b707c05.pngdavis060122-56.png.76314849a28c661df0d5eb9d4d5d3684.png

    They're gone and now on NTX-Cluster-03.

    davis060122-57.png.da066c4426768fc261a55ab2e3eeda45.png

    I am powering them up now.

    They are all online.

    davis060122-58.png.9c7bdf0f5f9189fa01955c8739fe1405.png

    Working

    davis060122-59.png.fd7127520654b8dcbc58fdaeb46f7f89.png

    I hope you find this helpful if you want to play around with Protection domains. As I stated above, this is in my lab.  But I have used this in a production environment to move workloads around. I mainly used this to move resources from one cluster to another in preparation for a data center migration involving a CVAD setup. It worked like a charm and saved me big time in areas where this was needed.

    See more posts from Ray Davis.


    User Feedback

    Recommended Comments

    Do you know if it is possible to bring the servers up on the DR side, without taking them down on the prod side? We us a DR bubble for audit testing. We use Zerto to bring them online in DR during the middle of the day, and then just destroy them without committing changes back. Problem is, Zerto doesn't like MCS provisioned machines.
    Link to comment
    Share on other sites

    Hey Craig, Good question. I am not certain on that to be honest. But I will do some checking and find out. Give me some time :)
    Link to comment
    Share on other sites

    Hey Craig, I am not 100% on that. But Let me see what I can do on getting you an answer. I only used them to failover things and back. I understand what you are saying though, I am just not sure. But again, let me see what I can do for an answer.
    Link to comment
    Share on other sites

    Hey Craig, I reach out and found out that you can't do this with Protection domains. However, with DR orchestration (Leap) that will allow you to test failovers from Prism Central.

     

    https://portal.nutanix.com/page/documents/details?targetId=Leap-Xi-Leap-Admin-Guide-v2022_6:Leap-Xi-Leap-Admin-Guide-v2022_6

     

    https://www.vmwaremine.com/2019/05/02/nutanix-dr-orchestration-part-3/

    Link to comment
    Share on other sites



    Create an account or sign in to comment

    You need to be a member in order to leave a comment

    Create an account

    Sign up for a new account in our community. It's easy!

    Register a new account

    Sign in

    Already have an account? Sign in here.

    Sign In Now

×
×
  • Create New...