
[Feature Request]: Support Stateful Processing in Spark Runner (Without Timer Functionality) #33237

@twosom

What would you like to happen?

I would like to add support for stateful processing in the Spark Runner, specifically:

  1. Implement state management functionality for streaming pipelines
  2. Add validation to reject pipelines that use timers (as timer support will be implemented later)
  3. Emit a clear error message when a pipeline attempts to use timers

This implementation will follow the same approach that was successful with the Dataflow runner: implement state management first, then add timer support in a future update.
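
To make items 2 and 3 concrete, here is a minimal sketch of the kind of check I have in mind. It is not the Spark Runner's actual code. `StateId` and `TimerId` below are hypothetical stand-ins for Beam's `DoFn.StateId` and `DoFn.TimerId` annotations, declared locally so the example compiles without Beam on the classpath. `StatefulDoFnValidator`, `CountPerKeyFn`, and `ExpiringCountFn` are also invented for illustration. A real implementation would more likely inspect the DoFn signature that Beam already computes than scan fields by reflection.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Field;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-ins for Beam's DoFn.StateId / DoFn.TimerId annotations,
// declared here only so the sketch is self-contained.
@Retention(RetentionPolicy.RUNTIME)
@Target(ElementType.FIELD)
@interface StateId {
  String value();
}

@Retention(RetentionPolicy.RUNTIME)
@Target(ElementType.FIELD)
@interface TimerId {
  String value();
}

// Hypothetical validator (assumption): illustrates the proposed behavior only.
final class StatefulDoFnValidator {
  private StatefulDoFnValidator() {}

  /** Returns the ids of all timer declarations found on the given class. */
  static List<String> timerIds(Class<?> fnClass) {
    List<String> ids = new ArrayList<>();
    for (Field field : fnClass.getDeclaredFields()) {
      TimerId timer = field.getAnnotation(TimerId.class);
      if (timer != null) {
        ids.add(timer.value());
      }
    }
    return ids;
  }

  /** Accepts state-only functions; rejects any function that declares a timer. */
  static void validate(Class<?> fnClass) {
    List<String> timers = timerIds(fnClass);
    if (!timers.isEmpty()) {
      throw new UnsupportedOperationException(
          fnClass.getSimpleName()
              + " declares timers "
              + timers
              + ", which this runner does not support yet; only state is supported.");
    }
  }
}

// Example function that uses state only: should pass validation.
final class CountPerKeyFn {
  @StateId("count")
  private final Object countSpec = new Object();
}

// Example function that uses state and a timer: should be rejected.
final class ExpiringCountFn {
  @StateId("count")
  private final Object countSpec = new Object();

  @TimerId("expiry")
  private final Object expirySpec = new Object();
}
```

With this sketch, `StatefulDoFnValidator.validate(CountPerKeyFn.class)` returns normally, while `StatefulDoFnValidator.validate(ExpiringCountFn.class)` throws an `UnsupportedOperationException` whose message names the offending timer id. Failing at validation time, before the pipeline starts, keeps the error close to the user's code instead of surfacing later inside a Spark task.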

mail context

Issue Priority

Priority: 2 (default / most feature requests should be filed as P2)

Issue Components

  • Component: Python SDK
  • Component: Java SDK
  • Component: Go SDK
  • Component: Typescript SDK
  • Component: IO connector
  • Component: Beam YAML
  • Component: Beam examples
  • Component: Beam playground
  • Component: Beam katas
  • Component: Website
  • Component: Infrastructure
  • Component: Spark Runner
  • Component: Flink Runner
  • Component: Samza Runner
  • Component: Twister2 Runner
  • Component: Hazelcast Jet Runner
  • Component: Google Cloud Dataflow Runner
