Skip to content

Decoupling FirehoseFactory and InputRowParser #5584

@jihoonson

Description

@jihoonson

Original discussion is here.

Firehose is designed to support any file formats, but FirehoseFactory isn't: it should be associated to a specific type of InputRowParser at compile time. This means that we should implement different FirehoseFactorys for each various file formats which has a significant limitation on extensibility.

As intended from the first place, FirehoseFactory should specify how to make a new Firehose specifying how data comes to Druid (like downloading S3 objects from static s3 firehose), while InputRowParser specifies how Druid parses incoming data through Firehose.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions