
DAOS-18705 rebuild: set rebuild flag before creating rebuild_pool_tls (#17941)#17985

Open
gnailzenh wants to merge 10 commits into release/2.8 from liang/b2_8_agg_epc


@gnailzenh
Collaborator

@gnailzenh gnailzenh commented Apr 12, 2026

  • Set the rebuild flag before creating rebuild_pool_tls; otherwise aggregation
    can advance to a higher epoch than the rebuild.
  • Aggregation no longer does a full scan after rebuild.
  • Fix an rpt refcount leak in rebuild_tgt_scan_handler().
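
The ordering issue in the first bullet can be illustrated with a minimal sketch. This is not the DAOS code: `struct pool_state`, `agg_refresh_epoch()`, `rebuild_set_flag()` and `rebuild_set_fence()` are hypothetical names, and `rebuild_set_fence()` only stands in for the step that records the rebuild epoch (which this PR associates with creating rebuild_pool_tls). The sketch assumes aggregation can run between the two rebuild steps and that it skips refreshing its epoch bound while the flag is set.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical, simplified per-pool state; not the real DAOS structures. */
struct pool_state {
	bool     rebuilding;    /* set while a rebuild is in flight */
	uint64_t rebuild_fence; /* epoch recorded for the rebuild, 0 if none */
	uint64_t agg_epoch;     /* upper bound aggregation may advance to */
};

/* Aggregation refreshes its upper bound only when no rebuild is flagged. */
void agg_refresh_epoch(struct pool_state *p, uint64_t now)
{
	assert(p != NULL);
	if (p->rebuilding)
		return; /* assumed behavior: stop refreshing while rebuilding */
	p->agg_epoch = now;
}

/* Step 1 in the order this PR describes: raise the flag first. */
void rebuild_set_flag(struct pool_state *p)
{
	assert(p != NULL);
	p->rebuilding = true;
}

/* Step 2: record the rebuild epoch (stand-in for creating the rebuild TLS). */
void rebuild_set_fence(struct pool_state *p, uint64_t epoch)
{
	assert(p != NULL);
	p->rebuild_fence = epoch;
}
```

If the two steps run in the opposite order and an aggregation refresh lands between them, `agg_epoch` can end up above `rebuild_fence`. With the flag raised first, the interleaved refresh is a no-op and the bound stays at or below the fence.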

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

gnailzenh and others added 6 commits April 12, 2026 18:25
- stop refreshing aggregation epoch while rebuilding
- set rebuilding flag before setting rebuild fence

Signed-off-by: Liang Zhen <gnailzenh@gmail.com>
Signed-off-by: Wang Shilong <shilong.wang@hpe.com>
Signed-off-by: Liang Zhen <gnailzenh@gmail.com>
Signed-off-by: Liang Zhen <gnailzenh@gmail.com>
Signed-off-by: Liang Zhen <gnailzenh@gmail.com>
Signed-off-by: Liang Zhen <gnailzenh@gmail.com>
@gnailzenh gnailzenh requested review from a team as code owners April 12, 2026 10:27
@github-actions

Ticket title is 'DAOS 2.6.5: Interrupt rebuild with reintegration and interrupt with exclude with active IO'
Status is 'In Progress'
Labels: 'test_2.6.5,testp1'
https://daosio.atlassian.net/browse/DAOS-18705

Signed-off-by: Liang Zhen <gnailzenh@gmail.com>
Signed-off-by: Liang Zhen <gnailzenh@gmail.com>
Signed-off-by: Liang Zhen <gnailzenh@gmail.com>
@daosbuild3
Collaborator

Test stage Functional Hardware Large MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17985/2/execution/node/1244/log

@daosbuild3
Collaborator

Test stage Functional Hardware Medium Verbs Provider MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17985/2/execution/node/1370/log

@daosbuild3
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17985/2/execution/node/1360/log

Signed-off-by: Liang Zhen <gnailzenh@gmail.com>

3 participants