Couchbase rebalance freeze issue

We came across a Couchbase bug during a rebalance while performing an online upgrade from 1.8.0 to 1.8.1.

Via the UI, we upgraded our first node, re-added it to the cluster, and then started the rebalance. It progressed normally, then stalled at around 48% on all nodes. The TAP and disk queues were quiet, and no servers were pending rebalance. The upgraded node was able to service requests, but it held only a small percentage of the items relative to the other nodes. The cluster as a whole did not suffer in performance during this issue, though there are some CPU spikes during any rebalance.

We decided to stop the rebalance, wait a few minutes, and start it again. It resumed and progressed beyond the point where it had stalled, then stopped again, this time at 75%. We let it sit for 7 minutes, then hit Stop Rebalance followed by Rebalance. This time it did not progress at all.

Couchbase support pointed to a bug in which rebalancing can hang if there are empty vbuckets. This is fixed in 2.0. The workaround is to populate each bucket with a minimum of 2048 items that have a short time to live (TTL >= (10 minutes per upgrade + (2 x rebalance_time)) x num_nodes), so that all vbuckets have something in them. We populated all buckets this way and were then able to restart the rebalance, which completed without problems.
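
To make the TTL bound concrete, here is a minimal sketch that just evaluates the formula above. The function name, parameter names, and the example figures (4 nodes, 30 minutes per rebalance) are assumptions for illustration and do not come from Couchbase or from our cluster.

```python
import math

def min_ttl_seconds(num_nodes, rebalance_minutes, upgrade_minutes=10):
    """Lower bound on the filler-item TTL, in seconds.

    Evaluates (upgrade_minutes + 2 * rebalance_minutes) * num_nodes,
    the bound quoted by support, and converts minutes to seconds.
    """
    if num_nodes < 1:
        raise ValueError("num_nodes must be at least 1")
    if rebalance_minutes < 0 or upgrade_minutes < 0:
        raise ValueError("durations must not be negative")
    total_minutes = (upgrade_minutes + 2 * rebalance_minutes) * num_nodes
    # Round up so the TTL never falls short of the bound.
    return math.ceil(total_minutes * 60)

# Hypothetical cluster: 4 nodes, each rebalance taking about 30 minutes.
# (10 + 2 * 30) * 4 = 280 minutes
ttl = min_ttl_seconds(num_nodes=4, rebalance_minutes=30)
print(ttl)  # 16800
```

The result is the smallest TTL to set on the filler items so that they outlive the whole rolling upgrade. Measure the rebalance time on your own cluster rather than reusing the figure above.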

Reference:

http://www.couchbase.com/docs/couchbase-manual-1.8/couchbase-getting-started-upgrade-online.html

Comments

So another workaround would be to drop the empty vbuckets? Cheers, Jan

Jan
Wed, 11/07/2012 - 06:28

This bug only affects the rebalance performed when upgrading from 1.8.0 to 1.8.1. Once on 1.8.1 you can rebalance all you want with empty vBuckets/clusters. So this does not affect users starting with Couchbase Server 1.8.1.

Frank
Fri, 09/28/2012 - 00:59

