diff --git a/docs/_blog/2019-12-09-announcing-the-release-of-apache-samza--1.3.0.md b/docs/_blog/2019-12-09-announcing-the-release-of-apache-samza--1.3.0.md new file mode 100644 index 0000000000..7af7c2aeeb --- /dev/null +++ b/docs/_blog/2019-12-09-announcing-the-release-of-apache-samza--1.3.0.md @@ -0,0 +1,102 @@ +--- +layout: blog +title: Announcing the release of Apache Samza 1.3.0 +icon: git-pull-request +authors: + - name: Hai Lu + website: + image: +excerpt_separator: +--- + + + +# **Announcing the release of Apache Samza 1.3.0** + + + + +**IMPORTANT NOTE**: We may introduce a **backward incompatible changes regarding samza job submission** in the future 1.4 release. Details can be found on [SEP-23: Simplify Job Runner](https://cwiki.apache.org/confluence/display/SAMZA/SEP-23%3A+Simplify+Job+Runner) + +We’re thrilled to announce the release of Apache Samza 1.3.0. + +Today Samza forms the backbone of hundreds of real-time production applications across a multitude of companies, such as LinkedIn, VMWare, Slack, Redfin among many others. Samza provides leading support for large-scale stateful stream processing with: + +* First class support for local state (with RocksDB store). This allows a stateful application to scale up to 1.1 Million events/sec on a single machine with SSD. + +* Support for incremental checkpointing of state instead of full snapshots. This enables Samza to scale to applications with very large state. + +* A fully asynchronous programming model that makes parallelizing remote calls efficient and effortless. + +* High level API for expressing complex stream processing pipelines in a few lines of code. + +* Beam Samza Runner that marries Beam’s best in class support for EventTime based windowed processing and sophisticated triggering with Samza’s stable and scalable stateful processing model. + +* A fully pluggable model for input sources (e.g. Kafka, Kinesis, DynamoDB streams etc.) and output systems (HDFS, Kafka, ElastiCache etc.). + +* A Table API that provides a common abstraction for accessing remote or local databases and allowing developers are able to "join" an input event stream with such a Table. + +* Flexible deployment model for running the applications in any hosting environment and with cluster managers other than YARN. + +* Features like canaries, upgrades and rollbacks that support extremely large deployments with minimal downtime. + +### **New Features, Upgrades and Bug Fixes:** +This release brings the following features, upgrades, and capabilities (highlights): + +* Startpoint support improvement + +* Samza SQL improvement + +* Table API improvement + +* Miscellaneous bug fixes + +Full list of the jiras addressed in this release can be found [here](https://issues.apache.org/jira/browse/SAMZA-2354?jql=project%20%3D%20%22SAMZA%22%20and%20fixVersion%20in%20(1.3)). +### **Upgrading your application to Apache Samza 1.3.0** + +### Startpoint support improvement +[SAMZA-2201](https://issues.apache.org/jira/browse/SAMZA-2201) Startpoints - Integrate fan out with job coordinators + +[SAMZA-2215](https://issues.apache.org/jira/browse/SAMZA-2215) StartpointManager fix for previous CoordinatorStreamStore refactor + +[SAMZA-2220](https://issues.apache.org/jira/browse/SAMZA-2220) Startpoints - Fully encapsulate resolution of starting offsets in OffsetManager + +### Samza SQL improvement +[SAMZA-2234](https://issues.apache.org/jira/browse/SAMZA-2234) Samza SQL : Provide access to Samza context to the Sama SQL UDFs + +[SAMZA-2313](https://issues.apache.org/jira/browse/SAMZA-2313) Samza-sql: Add validation for Samza sql statements + +[SAMZA-2354](https://issues.apache.org/jira/browse/SAMZA-2354) Improve UDF discovery in samza-sql + +#### Table API improvement +[SAMZA-2191](https://issues.apache.org/jira/browse/SAMZA-2191) support batching for remote tables + +[SAMZA-2200](https://issues.apache.org/jira/browse/SAMZA-2200) update table sendTo() and join() operation to accept additional arguments + +[SAMZA-2219](https://issues.apache.org/jira/browse/SAMZA-2219) Add a dummy table read function + +[SAMZA-2309](https://issues.apache.org/jira/browse/SAMZA-2309) Remote table descriptor requires read function + +#### Miscellaneous bug fixing +[SAMZA-2198](https://issues.apache.org/jira/browse/SAMZA-2198) containers process always takes task.shutdown.ms to shut down + +[SAMZA-2293](https://issues.apache.org/jira/browse/SAMZA-2293) Propagate the watermark future to StreamOperatorTask correctly + +### Sources downloads +A source download of Samza 1.3.0 is available [here](https://dist.apache.org/repos/dist/release/samza/1.3.0/), and is also available in Apache’s Maven repository. See Samza’s download [page](https://samza.apache.org/startup/download/) for details and Samza’s feature preview for new features. + diff --git a/docs/_config.yml b/docs/_config.yml index 31edfbcda5..695e19217c 100644 --- a/docs/_config.yml +++ b/docs/_config.yml @@ -25,7 +25,7 @@ exclude: [_docs] baseurl: http://samza.apache.org version: latest # this is the version you will go if you click 'switch version' in "latest" pages. -latest-release: '1.2.0' +latest-release: '1.3.0' collections: menu: output: false diff --git a/docs/_releases/1.3.0.md b/docs/_releases/1.3.0.md new file mode 100644 index 0000000000..5d7fb80bb9 --- /dev/null +++ b/docs/_releases/1.3.0.md @@ -0,0 +1,93 @@ +--- +version: '1.3' +order: 130 +layout: page +menu_title: '1.3' +title: Apache Samza 1.3 [Docs] +--- + + +**IMPORTANT NOTE**: We may introduce a **backward incompatible changes regarding samza job submission** in the future 1.4 release. Details can be found on [SEP-23: Simplify Job Runner](https://cwiki.apache.org/confluence/display/SAMZA/SEP-23%3A+Simplify+Job+Runner) + +We’re thrilled to announce the release of Apache Samza 1.3.0. + +Today Samza forms the backbone of hundreds of real-time production applications across a multitude of companies, such as LinkedIn, VMWare, Slack, Redfin among many others. Samza provides leading support for large-scale stateful stream processing with: + +* First class support for local state (with RocksDB store). This allows a stateful application to scale up to 1.1 Million events/sec on a single machine with SSD. + +* Support for incremental checkpointing of state instead of full snapshots. This enables Samza to scale to applications with very large state. + +* A fully asynchronous programming model that makes parallelizing remote calls efficient and effortless. + +* High level API for expressing complex stream processing pipelines in a few lines of code. + +* Beam Samza Runner that marries Beam’s best in class support for EventTime based windowed processing and sophisticated triggering with Samza’s stable and scalable stateful processing model. + +* A fully pluggable model for input sources (e.g. Kafka, Kinesis, DynamoDB streams etc.) and output systems (HDFS, Kafka, ElastiCache etc.). + +* A Table API that provides a common abstraction for accessing remote or local databases and allowing developers are able to "join" an input event stream with such a Table. + +* Flexible deployment model for running the applications in any hosting environment and with cluster managers other than YARN. + +* Features like canaries, upgrades and rollbacks that support extremely large deployments with minimal downtime. + +### **New Features, Upgrades and Bug Fixes:** +This release brings the following features, upgrades, and capabilities (highlights): + +* Startpoint support improvement + +* Samza SQL improvement + +* Table API improvement + +* Miscellaneous bug fixes + +Full list of the jiras addressed in this release can be found [here](https://issues.apache.org/jira/browse/SAMZA-2354?jql=project%20%3D%20%22SAMZA%22%20and%20fixVersion%20in%20(1.3)). +### **Upgrading your application to Apache Samza 1.3.0** + +### Startpoint support improvement +[SAMZA-2201](https://issues.apache.org/jira/browse/SAMZA-2201) Startpoints - Integrate fan out with job coordinators + +[SAMZA-2215](https://issues.apache.org/jira/browse/SAMZA-2215) StartpointManager fix for previous CoordinatorStreamStore refactor + +[SAMZA-2220](https://issues.apache.org/jira/browse/SAMZA-2220) Startpoints - Fully encapsulate resolution of starting offsets in OffsetManager + +### Samza SQL improvement +[SAMZA-2234](https://issues.apache.org/jira/browse/SAMZA-2234) Samza SQL : Provide access to Samza context to the Sama SQL UDFs + +[SAMZA-2313](https://issues.apache.org/jira/browse/SAMZA-2313) Samza-sql: Add validation for Samza sql statements + +[SAMZA-2354](https://issues.apache.org/jira/browse/SAMZA-2354) Improve UDF discovery in samza-sql + +#### Table API improvement +[SAMZA-2191](https://issues.apache.org/jira/browse/SAMZA-2191) support batching for remote tables + +[SAMZA-2200](https://issues.apache.org/jira/browse/SAMZA-2200) update table sendTo() and join() operation to accept additional arguments + +[SAMZA-2219](https://issues.apache.org/jira/browse/SAMZA-2219) Add a dummy table read function + +[SAMZA-2309](https://issues.apache.org/jira/browse/SAMZA-2309) Remote table descriptor requires read function + +#### Miscellaneous bug fixing +[SAMZA-2198](https://issues.apache.org/jira/browse/SAMZA-2198) containers process always takes task.shutdown.ms to shut down + +[SAMZA-2293](https://issues.apache.org/jira/browse/SAMZA-2293) Propagate the watermark future to StreamOperatorTask correctly + +### Sources downloads +A source download of Samza 1.3.0 is available [here](https://dist.apache.org/repos/dist/release/samza/1.3.0/), and is also available in Apache’s Maven repository. See Samza’s download [page](https://samza.apache.org/startup/download/) for details and Samza’s feature preview for new features. + diff --git a/docs/archive/index.html b/docs/archive/index.html index bddc271b88..d83937fa50 100644 --- a/docs/archive/index.html +++ b/docs/archive/index.html @@ -27,6 +27,14 @@