My First Nutanix AHV/AOS Upgrade


    cugcblogs

    by Ray Davis, CTA

    We recently made a good investment in our VDI CVAD space: we purchased 28 Nutanix nodes. As this was our first upgrade, I wanted to break down everything we did. The upgrade ran over two days in a production environment, and I captured all the screenshots and documentation I could. Nutanix does a fantastic job of documenting everything, and I have to admit this was one of the smoothest upgrades in my entire career. This doc walks through what Nutanix already has written up, as we applied it. The environment is purely AOS/AHV on Supermicro servers, with no VMware.

    [Screenshots 01-03]
    1. Support numbers, just in case you need help or get stuck:
      • 1-855-NUTANIX
      • 1-855-688-2649
      • Either reference an old case or give a serial number.
      • Serial number for support = ######## (I use this to open cases)
    1. Use the Nutanix Check Sheet in OneNote to check off items you have upgraded
    2. https://portal.nutanix.com/page/documents/details?targetId=Acropolis-Upgrade-Guide-v5_19:upg-upgrade-recommended-order-r.html
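If OneNote isn't handy, the same check sheet can be kept in a few lines of Python. This is only a sketch: the step names are shorthand for the sequence followed in this post, and `next_step` and `check_off` are hypothetical helpers, not Nutanix tooling. The recommended-order page linked above remains the authority.

```python
# Ordered checklist mirroring the sequence followed in this post.
# Step names are illustrative shorthand, not official Nutanix labels.
UPGRADE_ORDER = [
    "PC: LCM inventory (updates LCM framework)",
    "PC: NCC",
    "PC: Prism Central",
    "PE: LCM inventory (updates LCM framework)",
    "PE: NCC",
    "PE: Foundation",
    "PE: Nutanix Files",
    "PE: FSM",
    "PE: File Analytics",
    "PE: Cluster Maintenance",
    "PE: AOS",
    "PE: Firmware (BIOS/BMC/boot drive)",
    "PE: AHV",
    "PE: Final LCM inventory",
]


def next_step(done):
    """Return the first step not yet checked off, or None when finished."""
    for step in UPGRADE_ORDER:
        if step not in done:
            return step
    return None


def check_off(done, step):
    """Mark a step as done, refusing to skip ahead of the listed order."""
    expected = next_step(done)
    if step != expected:
        raise ValueError(f"Out of order: expected {expected!r}, got {step!r}")
    return done | {step}
```

Starting from `done = set()`, each call to `check_off` returns the updated set, and an out-of-order step raises an error instead of being silently accepted.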
    [Screenshots 04-05]
    1. How the upgrades work
    1. Upgrade Matrix (check compatibility): compare the version you are on with the version you are going to. It will show you whether the upgrade is supported.
    1. The Upgrade Matrix will also give you an upgrade path for the products you are updating.

     

    Example:

    [Screenshot 06]
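To make the idea of an upgrade path concrete, here is a minimal sketch that searches a table of supported one-step upgrades for the shortest chain from the current version to the target. The version labels (`v1` to `v4`) and the `SUPPORTED` table are invented for illustration. The real data comes from the Upgrade Matrix on the Nutanix portal.

```python
from collections import deque

# Hypothetical data: these version labels and supported one-step upgrades
# are made up. Look up the real paths in the Nutanix Upgrade Matrix.
SUPPORTED = {
    "v1": ["v2"],
    "v2": ["v3", "v4"],
    "v3": ["v4"],
    "v4": [],
}


def upgrade_path(current, target, supported=SUPPORTED):
    """Breadth-first search for the shortest chain of supported upgrades."""
    queue = deque([[current]])
    seen = {current}
    while queue:
        path = queue.popleft()
        if path[-1] == target:
            return path
        for nxt in supported.get(path[-1], []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(path + [nxt])
    return None  # no supported path between the two versions
```

With this table, `upgrade_path("v1", "v4")` goes through `v2` because there is no direct hop, which is the kind of intermediate step the matrix calls out.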
    1. We will be following the Nutanix upgrade procedure here: https://portal.nutanix.com/page/documents/details?targetId=Acropolis-Upgrade-Guide-v5_19:upg-upgrade-recommended-order-r.html
    2. Log in to Prism Central with the admin account. The URL is https://Priscenter.Domain.com:9440
    1. Check the Prism Central version you are on; this also shows you the NCC and LCM versions.
    [Screenshots 07-08]
    1. Prism Central (PC)
    2. Perform an LCM inventory, which also updates the LCM framework. Do not upgrade any other software component except LCM in this step.
    3. Upgrade and run Nutanix Cluster Check (NCC) on Prism Central.
    [Screenshots 09-11]
    1. After these downloads, you can now upgrade.
    [Screenshot 12]

    14. Yes

    [Screenshot 13]
    1. After about 15 minutes, it will be complete. Click the gear icon at the top right, and select "About Nutanix."
    [Screenshot 14]
    1. Upgrade Prism Central
    1. PC: Upgrade Prism Central.
    2. Check compatibility: https://portal.nutanix.com/page/documents/upgrade-paths
    [Screenshots 15-17]
    1. Downloading
    [Screenshot 18]
    1. Now click Pre-Upgrade to run the pre-upgrade checks as a test (the upgrade will run them again anyway).
    [Screenshot 19]
    1. Now upgrade PC.
    [Screenshot 20]
    1. This will take about 30 minutes.
    [Screenshots 21-22]
    1. This will occur as well.
    [Screenshot 23]
    1. After it's finished, let's check it.
    2. Looks good.
    [Screenshots 24-25]
    1. Prism Element (PE) clusters: upgrade LCM and NCC. You can use LCM > Software to perform the rest of the updates.
    2. Perform an LCM inventory, which also updates the LCM framework. Do not upgrade any other software component except LCM in this step.
    3. PE: Run and upgrade Life Cycle Manager (LCM).
    [Screenshots 26-28]
    1. Log in to PE and upgrade NCC and Foundation.
    [Screenshots 29-31]

    You will see it being downloaded.

     

    [Screenshot 32]

    Now click Upgrade.

    [Screenshots 33-37]

    PE: Upgrade Foundation.

    [Screenshots 38-43]

    PE: Run and upgrade Life Cycle Manager (LCM).

    [Screenshot 44]
    1. File Server (Nutanix Files) Software

    Installing (or Upgrading) Files

    What happens when I click "Upgrade Now"?

    • First, the pre-upgrade checks will run to make sure that the cluster is able to be upgraded. If any of the pre-upgrade checks fail, you will see information about this in Prism and the actual File Server upgrade will not start. Users will have to click "Back to Versions" and start the upgrade again after the issue reported by the pre-checks is resolved. To see the full list of pre-checks and their related KB articles, check out KB-6524.
    • Once the File Server upgrade begins, each File Server VM (FSVM) is upgraded one at a time to the new Nutanix Files version. While an FSVM is down for the upgrade, users connected to shares hosted by this node may experience a loss of connectivity for roughly 20-30 seconds. After this short period, another FSVM will pick up hosting those shares, and users will regain access to their files.
    • After each FSVM completes its reboot onto the new version of Nutanix Files, the upgrade will make sure that it can once again host shares before starting to upgrade the next FSVM.

    How long does it take? About 20 minutes per File Server VM.
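The one-at-a-time behavior described above can be sketched as a loop with a health gate. This is an illustration only: `upgrade` and `can_host_shares` are hypothetical callables standing in for what the Files upgrade does internally, and the 20-minute default is the per-FSVM figure quoted above.

```python
def rolling_upgrade(fsvms, upgrade, can_host_shares, minutes_per_fsvm=20):
    """Upgrade FSVMs one at a time and halt if an upgraded FSVM fails its check.

    `upgrade` and `can_host_shares` are caller-supplied stand-ins
    (hypothetical), not real Nutanix APIs.
    Returns (names_upgraded_successfully, estimated_minutes).
    """
    upgraded = []
    for vm in fsvms:
        upgrade(vm)  # this FSVM reboots onto the new Files version
        if not can_host_shares(vm):
            break  # do not touch the next FSVM until this one is healthy
        upgraded.append(vm)
    return upgraded, len(upgraded) * minutes_per_fsvm
```

For a three-FSVM file server where every check passes, the estimate comes to about 60 minutes. If the second FSVM fails its check, the loop stops there and the third is never touched.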

    [Screenshots 45-50]
    • Upgrade FSM: This will take 20-30 minutes.

    The File Server Module (FSM) manages the Files lifecycle and appears in LCM. The FSM includes the Files UI component but relies on AOS for the control plane.

    From <https://portal.nutanix.com/page/search/list?stq=FSM>

    [Screenshots 51-55]
    1. File Analytics
    [Screenshots 56-59]

    View all tasks to see the progress.

    [Screenshot 60]
    1. Cluster Maintenance Upgrade

    Go to LCM > Software > Cluster Maintenance.

    Cluster Maintenance would have been listed here, but we had already upgraded it, so I could not capture a screenshot of it.

    [Screenshots 61-62 and one unnamed image]
    1. AOS Software

    Upgrade Prerequisites

    What happens when I click "Upgrade Now"?

    • First, the pre-upgrade checks will run to make sure that the cluster is able to be upgraded. If any of the pre-upgrade checks fail, you will see information about this in Prism and the actual AOS upgrade will not start. Users will have to click "Back to Versions" and start the upgrade again after the issue reported by the pre-checks is resolved. To see the full list of pre-checks and their related KB articles, check out KB 6524.
    • Next, the AOS software is copied to each CVM (Controller VM) in the cluster.
    • In the last stage, the Controller VMs in the cluster reboot one-at-a-time onto the new AOS version. Storage traffic from User VMs will be redirected to a neighboring CVM while the local one is upgrading. During this short period (about 10 minutes) the local User VMs may experience a small amount of additional latency since they are receiving their storage I/O from a remote CVM.
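The pre-check gate in the first bullet can be pictured as follows. This is a sketch under stated assumptions: the check names and the dictionary describing the cluster are hypothetical, and the real pre-checks are the ones listed in KB 6524.

```python
def start_upgrade(cluster, prechecks):
    """Run every pre-check and report the upgrade as started only if all pass.

    `prechecks` is a list of (name, function) pairs. Both the names and the
    cluster dictionary are illustrative, not the real Nutanix checks.
    """
    failures = [name for name, check in prechecks if not check(cluster)]
    if failures:
        # Mirrors the behavior described above: the upgrade does not start
        # and the failed checks are reported back.
        return {"started": False, "failed_checks": failures}
    return {"started": True, "failed_checks": []}


# Hypothetical checks for illustration only.
EXAMPLE_PRECHECKS = [
    ("all_cvms_up", lambda c: c["cvms_up"] == c["cvms_total"]),
    ("enough_free_space", lambda c: c["free_space_gb"] >= 10),
]
```

A cluster with one CVM down would come back with `started` set to `False` and `all_cvms_up` in the failure list, which corresponds to fixing the issue and starting the upgrade again.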

    How long does it take? 15-20 minutes per node. The upgrade process in a two-node cluster will take longer than the usual process because of the additional step of syncing data while transitioning between single and two node state. Nevertheless, the cluster remains operational during upgrade.

    From <https://portal.nutanix.com/page/documents/kbs/details?targetId=kA00e000000LMgICAW>
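Since the nodes are upgraded one at a time, the 15-20 minutes per node figure gives a quick way to size the maintenance window. A minimal sketch of that arithmetic follows. The function name is made up, and the post does not say how the 28 nodes are split across clusters, so the node count is simply an input.

```python
def aos_upgrade_window(nodes, min_per_node=15, max_per_node=20):
    """Return (low, high) estimated hours for a one-node-at-a-time upgrade."""
    low_hours = nodes * min_per_node / 60
    high_hours = nodes * max_per_node / 60
    return low_hours, high_hours


low, high = aos_upgrade_window(28)
print(f"{low:.1f} to {high:.1f} hours")
```

This covers only the AOS stage. Firmware, the hypervisor, Files, and the other components each add their own time on top.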

    [Screenshots 64-75]
    • Perform available firmware updates (BIOS/BMC/Host boot drive or other critical firmware as recommended by LCM).

    After upgrading AOS and before upgrading your hypervisor on each cluster, perform a Life Cycle Manager (LCM) inventory, update LCM, and upgrade any recommended firmware. See the Life Cycle Manager documentation for more information.

    PE: Run and upgrade Life Cycle Manager (LCM):

    • Perform an LCM inventory (also updates LCM framework).
    • Upgrade SATA DOM firmware (for hardware using SATA DOMs) as recommended by LCM.
    • Upgrade all other firmware as recommended by LCM (BMC / BIOS / other).

    For release-specific information (all branches), see the Life Cycle Manager Release Notes.

    LCM performs two functions: taking inventory of the cluster and performing updates on the cluster.

    From <https://portal.nutanix.com/page/documents/details?targetId=Acropolis-Upgrade-Guide-v5_19:upg-firmware-upgrades-c.html>
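The two LCM functions can be pictured as an inventory step that records what is installed, followed by a comparison against what is available. This is a rough sketch only: the component names and version numbers are invented, and nothing here calls LCM itself.

```python
def parse(version):
    """Turn '1.10' into (1, 10) so versions compare numerically, not as text."""
    return tuple(int(part) for part in version.split("."))


def pending_updates(installed, available):
    """Return {component: (installed, available)} where a newer version exists."""
    return {
        name: (have, available[name])
        for name, have in installed.items()
        if name in available and parse(available[name]) > parse(have)
    }


# Invented inventory data for illustration.
installed = {"BIOS": "1.9", "BMC": "7.10"}
available = {"BIOS": "1.10", "BMC": "7.10"}
print(pending_updates(installed, available))
```

Comparing parsed tuples rather than raw strings matters here: as plain text, "1.10" sorts before "1.9", which would hide the BIOS update.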

    • AHV hypervisor upgrade
    [Screenshots 76-84]
    1. Upgrade FSM and FA. I don't have screenshots for this, but you can see both in LCM. Do FSM first and FA next. It's straightforward in the GUI.

    You will see this. Just ignore it because it's a part of FA.

    [Screenshot 85]

    36. Run another LCM inventory to make sure all upgrades are good.

    [Screenshot 86]

    That concludes the upgrade process that I went through. I didn't see any performance impact, and it took me about 27 hours straight. It would have been faster if I could have taken CVAD offline while doing this, but as many of you know, most of the time that is not ideal. Overall, I score Nutanix at 100% for the KISS method.

    [Screenshot 82]

    [Screenshot 83]


    User Feedback

    Recommended Comments

    Very useful article with screenshots. Appreciate your efforts for providing the screenshots which makes life easier to understand the flow.