Data Center Virtualization & Cloud Infrastructure Services Support

vRealize Operations Troubleshooting Webcast Q&A

Our monthly webcasts always generate excellent questions from the audience. A few weeks ago Jaikishan Tayal (one of our Senior Technical Training Specialists who trains VMware’s technical staff) delivered VMware vRealize Operations Manager 6.x Troubleshooting Tips & Tricks and the questions were great as always.

 

We pulled a few of them below, but you can read the entire list and listen to the recording on your schedule.

 

————————————————————————

Can you please show the basic install and config settings of vRealize Operations?

This link will walk you through the vRealize Operations installation, configuration, how to use it, etc.
I am assuming that there will be very few instances where we will need to check/modify any of these config files? And even if we did, it would be advised to contact GSS first?

Absolutely….it is not advised to make changes to the config files directly as it can cause problems if not done it the correct way. VMware recommends using the Admin UI and the Product UI to make changes to the nodes or services.

 

If you had multiple vCenters globally separated, do you only need remote collectors in the local datacenters?

Remote collectors are mainly used if there is security restriction and only some firewall ports can be opened. For more information, refer this article.

 

Is there any relation between vRealize Orchestration and Operation Manager?

Orchestrator is for automation to push the scripts. Operation manager is for monitoring the environment. There is no dependency on each one.

 

What are best practices to use vRealize Operations?

Here are couple of things to get started: the sizing guideline  and also best practices.

 

What are the recommended CPU/memory thresholds when checking TOP?

It not only depends on what is the threshold but for how long. Example for the first part CPU usage 80% and above, Memory usage 80% and above however vROPS SUSE is memory hungry (usage). Now for second part if the usage is for 10 to 30 seconds it’s ok. But if it stays for 5 minutes and more then it’s a problem (Threshold).

 

What is the best method to backup the databases?

The best way is to backup the entire vRops node. For more information, refer to this article.

 

What would cause AD or local users from logging in to the UI, it keeps saying “Authenticating…” and eventually it would say bad username/password

You may want to check the network traffic between vRealize Operations and your LDAP source. It could also be related to the overall load on the vRealize Operations node, may be the load is too high. You can reduce the load on the node, by adding more nodes to the cluster. “/usr/lib/vmware-vcops/user/log/web.log” log will have capture the information. You can search for the specific user and see what error is logged, which will give further clue.

 

When bringing a cluster back online does this restart the services as well or would these need to be started again manually?

Yes, bringing the cluster back will automatically start the node services, and when all the nodes services are up then the cluster state will change to online.

 

Why are there master-replica nodes?

Master Replica node is an exact copy of the Master node. Because Master is the most important node in vRealize Operations cluster, you want to make sure its services are highly available. You achieve this by enabling HA on the cluster and designating one data node as a replica.