refactor(query): refactor failed task retry control procedure. #33498

hjxilinx · 2025-11-07T02:11:07Z

Description

refactor(query): refactor failed task retry control procedure.

Checklist

Please check the items in the checklist if applicable.

Is the user manual updated?
Are the test cases passed and automated?
Is there no significant decrease in test coverage?

gemini-code-assist · 2025-11-07T02:11:25Z

Summary of Changes

Hello @hjxilinx, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request focuses on a significant refactoring of the scheduler's failed task retry control procedure. The changes aim to enhance the robustness and clarity of how tasks are retried upon failure, particularly in scenarios involving data-bound tasks and node redirects. By simplifying data structures, introducing dedicated helper functions for retry logic and node address retrieval, and refining retry parameter initialization, the PR makes the retry mechanism more efficient and easier to understand, ultimately improving the scheduler's overall reliability.

Highlights

Refactored Retry Control Procedure: The core logic for handling failed task retries has been significantly refactored to improve clarity and maintainability.
Simplified Redirect Context: The SSchRedirectCtx structure was streamlined by removing several fields related to retry counting, simplifying its state management.
New Retry and Node Address Functions: Introduced schFailedTaskNeedRetry to centralize the decision-making for task retries and schGetTaskCurrentNodeAddr to abstract the retrieval of the current node address for a task.
Enhanced Task Retry Initialization: The schInitTaskRetryInfo function (formerly schInitTaskRetryTimes) now calculates maxRetryTimes more dynamically, considering tsMaxRetryWaitTime, redirectDelayMs, and a REPLICA_FACTOR.
Centralized Endpoint Set Updates: A new function schUpdateCurrentEpset was added to encapsulate the logic for switching to the next endpoint in an endpoint set, improving modularity.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request is a solid refactoring of the failed task retry control procedure. The changes simplify the retry context SSchRedirectCtx and centralize retry logic into new helper functions like schFailedTaskNeedRetry, schUpdateCurrentEpset, and schGetTaskCurrentNodeAddr. This significantly improves code clarity and maintainability. The retry strategy is also updated, for instance, by making the retry delay fixed and recalculating max retry times. My review includes a few suggestions to further enhance readability and fix a potential logging issue.

source/libs/scheduler/src/schTask.c

gemini-code-assist · 2025-11-07T02:13:20Z

source/libs/scheduler/src/schTask.c

+  pTask->redirectCtx.redirectDelayMs = 5000;  // 5s by default
+
+  // 3 is the maximum replica factor in tsdb, so here multiply 3 to increase the retry chance
+  int32_t REPLICA_FACTOR = 3;


The constant REPLICA_FACTOR is defined as a local variable with a magic value of 3. To improve maintainability and code clarity, it's better to define this as a named constant at a more visible scope, for example, as a macro SCH_DEFAULT_REPLICA_FACTOR in source/libs/scheduler/inc/schInt.h. The comment explaining it is good, but a constant would make the code more self-documenting and easier to change in the future.

source/libs/scheduler/src/schTask.c

hjxilinx and others added 30 commits October 28, 2025 10:35

refactor(gpt): refactor the requirements.

c3c7f74

refactor: do some internal refactor.

f247248

refactor: do some internal refactor.

7a44afd

fix(gpt): update the requirements.

84aa5d7

Merge branch 'main' into fix/req

6e178af

fix(gpt): remove unused code in install script.

1b7d534

Merge branch 'main' into fix/req

9b227e0

fix(gpt): update the requirements.

d7bb423

fix(gpt): update the requirements.

8410369

Merge branch 'main' into fix/req

f707637

fix(gpt): update requirements.

7fa4b4f

fix(gpt): update requirements.

28b9308

Merge branch 'main' into fix/req

43d27db

fix(gpt): update requirements.

57bdcca

fix(gpt): update requirements.

23ad2ae

fix(gpt): update requirements.

ccd6432

fix(gpt): update the requirements.

d2f7784

fix(gpt): update the requirements.

160c09e

fix(gpt): update requirements.

c328c97

fix(gpt): update the requirements.

c6f8fc7

fix(gpt): update the requirements.

915371c

fix(gpt): update the requirements.

42fc30f

fix(gpt): update the requirements.

8ccc814

fix(gpt): update the requirements.

f2374d6

fix(gpt): suppress warning.

456acbe

fix: update test case.

2005a67

test: update test cases.

704f200

test: update test cases.

a710ba3

fix(gpt): fix error.

d083614

refactor(query): refactor query task redirect process.

dba1428

Merge branch 'main' into fix/req

bd115e3

hjxilinx requested review from dapan1121, guanshengliang and hzcheng as code owners November 7, 2025 02:11

taosdata-bot bot added the team engine label Nov 7, 2025

gemini-code-assist bot reviewed Nov 7, 2025

View reviewed changes

hjxilinx added 2 commits November 7, 2025 10:23

refactor: refactor some codes following the suggestions by Gemini.

9cd581f

fix(test): fix error in unit test.

ed918a9

dapan1121 approved these changes Nov 7, 2025

View reviewed changes

guanshengliang approved these changes Nov 7, 2025

View reviewed changes

guanshengliang merged commit 80e9b65 into main Nov 7, 2025
12 of 14 checks passed

hjxilinx deleted the fix/req branch November 15, 2025 07:45

hjxilinx restored the fix/req branch November 15, 2025 07:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor(query): refactor failed task retry control procedure. #33498

refactor(query): refactor failed task retry control procedure. #33498

Uh oh!

hjxilinx commented Nov 7, 2025

Uh oh!

gemini-code-assist bot commented Nov 7, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

gemini-code-assist bot Nov 7, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

refactor(query): refactor failed task retry control procedure. #33498

refactor(query): refactor failed task retry control procedure. #33498

Uh oh!

Conversation

hjxilinx commented Nov 7, 2025

Description

Checklist

Uh oh!

gemini-code-assist bot commented Nov 7, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

gemini-code-assist bot Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants