[BUG] Fix Colocate table balance bug#4936

gengjun-git · 2020-11-20T10:44:43Z

Fix bug #4935

当前策略：

每个group中维护一个bucketId所在的be列表：backendSeq

线程每隔20s：

检测backendSeq中是否有be不可用，如果有，则选择可用的be将其在backendSeq中替换
检测group中的tablet是否与backendSeq相匹配，如果不匹配，将group设置为unstable，并且执行迁移任务
对处于stable状态的group进行均衡：根据backendSeq计算所有be中bucketId的数目，从bucketId占有高的be迁移到bucketId占有低的be。此处只更新backendSeq，实际执行迁移任务在第2步。

存在的问题：

如果在相同的时间down掉比较多的be，在第1步中，会将这些be从backendSeq中移除，并且第2步检测到backendSeq不匹配，将group标记为unstable，但是如果现有的be磁盘不能容纳down掉的be上的所有tablet，此时group会一直处于unstable状态，即使再加入新的be，也不能触发第3步，因为第3步只会在group是stable状态下才能执行。

策略更改：

将现有策略的1和3融合成一个过程：

首先检测backendSeq中是否存在不可用的be，均衡时，优先迁移不可用be的bucketId到buckedId占有低的be，其次再从bucketId占有高的be迁移到bucketId占有低的be。
同现有策略

kangkaisen

+1

fix

005de54

kangkaisen added kind/fix Categorizes issue or PR as related to a bug. area/balance Issues or PRs related to data balance labels Nov 21, 2020

kangkaisen approved these changes Nov 21, 2020

View reviewed changes

kangkaisen added the approved Indicates a PR has been approved by one committer. label Nov 21, 2020

morningman merged commit 37a6731 into apache:master Nov 22, 2020

This was referenced Nov 24, 2020

Colocate table stays unstable when a large number of be restarts frequently #4935

Closed

[BUG] Fix colocate balance bug when there is decommissioned be #4955

Merged

yangzhg mentioned this pull request Feb 9, 2021

Release Notes 0.14.0 #5374

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Fix Colocate table balance bug#4936

[BUG] Fix Colocate table balance bug#4936
morningman merged 1 commit intoapache:masterfrom
gengjun-git:fix_colocate_balance

gengjun-git commented Nov 20, 2020

Uh oh!

kangkaisen left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

gengjun-git commented Nov 20, 2020

Uh oh!

kangkaisen left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments