Fix batch index task naming convention to accommodate simultaneous tasks #8494

@a2l007

Description

Modify the batch indexing task ID generation logic from

<index_type_name>_<datasource>_<timestamp> to
<index_type_name>_<datasource>_<random_id>

This would impact native index tasks as well as Hadoop index tasks.
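
To illustrate the difference between the two schemes, here is a minimal sketch in Python. It is not Druid's actual implementation: the function names, the ISO-8601 timestamp format, and the use of a UUID for the random part are all assumptions made for the example.

```python
import uuid
from datetime import datetime, timezone


def timestamp_task_id(index_type, datasource, now):
    """Hypothetical helper for the existing scheme:
    <index_type_name>_<datasource>_<timestamp>.
    The ISO-8601 format is an assumption for illustration."""
    return f"{index_type}_{datasource}_{now.isoformat()}"


def random_task_id(index_type, datasource):
    """Hypothetical helper for the proposed scheme:
    <index_type_name>_<datasource>_<random_id>.
    A UUID4 hex string stands in for the random ID (an assumption)."""
    return f"{index_type}_{datasource}_{uuid.uuid4().hex}"


# Two tasks submitted at the same instant for the same datasource.
submitted_at = datetime(2019, 9, 9, 12, 0, 0, tzinfo=timezone.utc)

first = timestamp_task_id("index", "wikipedia", submitted_at)
second = timestamp_task_id("index", "wikipedia", submitted_at)
print(first == second)  # the IDs collide, so the second task would be rejected

first = random_task_id("index", "wikipedia")
second = random_task_id("index", "wikipedia")
print(first == second)  # the IDs differ, barring an astronomically unlikely UUID clash
```

With the timestamp scheme, uniqueness depends on the submission time; with a random suffix, it depends only on the quality of the random ID.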

Motivation

With the existing task naming convention, if multiple indexing tasks are submitted at the same time for a datasource, only one of them is accepted and the rest fail with Task[xxx] already exists. I'd like to switch to the naming convention described above so that simultaneous submissions no longer collide.
@jihoonson Any comments regarding this?
