Publish segment schema from MSQ engine #16463

Draft: wants to merge 7 commits into base: master

@findingrish findingrish commented May 16, 2024

  • Introduce a new POJO class that extends DataSegment to carry schema information. Workers send this object to the Controller.
  • Change SegmentGeneratorFrameProcessor to generate the segment schema.
  • Change ControllerImpl to publish the segment schema along with the segment metadata.
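
The first bullet can be pictured with a minimal sketch. This is not the code in this PR: `SegmentStub` is a hypothetical stand-in for Druid's `DataSegment` (which has many more fields), and `SegmentWithSchemaSketch`, its `columnTypes` map, and the accessor names are assumptions made purely for illustration.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Objects;

// Hypothetical stand-in for Druid's DataSegment; the real class has many more fields.
class SegmentStub
{
  private final String dataSource;
  private final String version;

  SegmentStub(String dataSource, String version)
  {
    this.dataSource = Objects.requireNonNull(dataSource, "dataSource");
    this.version = Objects.requireNonNull(version, "version");
  }

  String getDataSource() { return dataSource; }

  String getVersion() { return version; }

  @Override
  public boolean equals(Object o)
  {
    if (this == o) {
      return true;
    }
    if (o == null || getClass() != o.getClass()) {
      return false;
    }
    SegmentStub that = (SegmentStub) o;
    return dataSource.equals(that.dataSource) && version.equals(that.version);
  }

  @Override
  public int hashCode() { return Objects.hash(dataSource, version); }
}

// Assumed shape: a segment plus a column-name -> type-name map describing its schema.
class SegmentWithSchemaSketch extends SegmentStub
{
  private final Map<String, String> columnTypes;

  SegmentWithSchemaSketch(String dataSource, String version, Map<String, String> columnTypes)
  {
    super(dataSource, version);
    // Defensive copy so later changes to the caller's map do not leak in.
    this.columnTypes = Collections.unmodifiableMap(new LinkedHashMap<>(columnTypes));
  }

  Map<String, String> getColumnTypes() { return columnTypes; }

  @Override
  public boolean equals(Object o)
  {
    // super.equals already guarantees that o has exactly this class.
    if (!super.equals(o)) {
      return false;
    }
    return columnTypes.equals(((SegmentWithSchemaSketch) o).columnTypes);
  }

  @Override
  public int hashCode() { return Objects.hash(super.hashCode(), columnTypes); }
}
```

In this sketch the subclass overrides `equals` and `hashCode` so that two objects for the same segment but with different schemas do not compare equal. Whether the real class should behave that way is a design decision for the PR, not something this example settles.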

@github-actions github-actions bot added the labels "Area - Batch Ingestion" and "Area - MSQ" (For multi stage queries - https://github.com/apache/druid/issues/12262) on May 16, 2024
@findingrish findingrish changed the title from "Msq schema publish" to "Publish segment schema from MSQ engine" on May 16, 2024
import java.util.Map;
import java.util.Objects;

public class DataSegmentExtendedWithSchema extends DataSegment

There is already a class DataSegmentWithMetadata, why not reuse that?

@@ -57,7 +57,7 @@ public RemoteTaskActionClient(
   @Override
   public <RetType> RetType submit(TaskAction<RetType> taskAction) throws IOException
   {
-    log.debug("Performing action for task[%s]: %s", task.getId(), taskAction);
+    log.info("Performing action for task[%s]: %s", task.getId(), taskAction);

Is this really needed? This can cause a lot of noise in task logs.
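
To illustrate the concern, here is a small sketch that uses `java.util.logging` as a stand-in. Druid has its own logger wrapper, and nothing below is Druid API; `LogNoiseSketch`, `CapturingHandler`, and `countEmitted` are hypothetical names. With the logger configured at INFO, a per-action message logged at a debug-like level (`FINE`) is dropped, while the same message logged at `INFO` is emitted once per submitted action.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.logging.Handler;
import java.util.logging.Level;
import java.util.logging.LogRecord;
import java.util.logging.Logger;

class LogNoiseSketch
{
  // Collects published records in memory so they can be counted.
  static class CapturingHandler extends Handler
  {
    final List<LogRecord> records = new ArrayList<>();

    @Override
    public void publish(LogRecord record) { records.add(record); }

    @Override
    public void flush() {}

    @Override
    public void close() {}
  }

  // Logs one message per simulated task action and returns how many were actually emitted.
  static int countEmitted(Level messageLevel, int actions)
  {
    Logger logger = Logger.getAnonymousLogger();
    logger.setUseParentHandlers(false); // keep the sketch from printing to the console
    logger.setLevel(Level.INFO);        // assumed production log level

    CapturingHandler handler = new CapturingHandler();
    handler.setLevel(Level.ALL);
    logger.addHandler(handler);

    for (int i = 0; i < actions; i++) {
      logger.log(messageLevel, "Performing action for task[{0}]", "task-" + i);
    }
    return handler.records.size();
  }
}
```

Under these assumptions, `LogNoiseSketch.countEmitted(Level.FINE, 100)` returns 0 and `LogNoiseSketch.countEmitted(Level.INFO, 100)` returns 100, so the volume of the message scales directly with the number of task actions once it is raised to INFO.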

Labels: "Area - Batch Ingestion", "Area - Ingestion", "Area - MSQ" (For multi stage queries - https://github.com/apache/druid/issues/12262)

2 participants