PARQUET-2355: Deprecate parquet-thrift #1175
<exclude>org.apache.parquet.hadoop.ColumnChunkPageWriteStore</exclude>
<exclude>org.apache.parquet.hadoop.ParquetRecordWriter</exclude>
<!-- Already deprecated classes/methods/constants -->
<exclude>org.apache.parquet.hadoop.thrift.ThriftReadSupport#setProjectionPushdown(org.apache.hadoop.mapred.JobConf,java.lang.String)</exclude>
<exclude>org.apache.parquet.hadoop.thrift.ThriftReadSupport#THRIFT_COLUMN_FILTER_KEY</exclude>
<exclude>org.apache.parquet.thrift.projection.deprecated.DeprecatedFieldProjectionFilter</exclude>
pom.xml
Got this from:
org.apache.parquet.thrift.projection.FieldProjectionFilter[org.apache.parquet.thrift.projection.FieldProjectionFilter]:INTERFACE_REMOVED

But it didn't change, only the @Deprecated annotation was added
Maybe it is a bug, let's try to bump it: #1176
Nice, the bump fixed it :)
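For readers following this thread, here is a minimal sketch of why an INTERFACE_REMOVED report looked like a tool problem rather than a real change. It uses hypothetical stand-in types (`Filter`, `OldFilter`), not the actual parquet-thrift classes, and only assumes standard Java reflection: adding `@Deprecated` to a class leaves its implemented interfaces untouched.

```java
import java.util.Arrays;
import java.util.List;

public class DeprecationCheck {
    // Hypothetical stand-in for a projection filter interface.
    interface Filter {
        boolean keep(String path);
    }

    // Hypothetical stand-in for a class whose only change is the added annotation.
    @Deprecated
    static class OldFilter implements Filter {
        @Override
        public boolean keep(String path) {
            return true;
        }
    }

    // True if the class itself carries @Deprecated (the annotation has runtime retention).
    static boolean isDeprecated(Class<?> c) {
        return c.isAnnotationPresent(Deprecated.class);
    }

    // The interfaces directly implemented by the class, as seen through reflection.
    static List<Class<?>> interfacesOf(Class<?> c) {
        return Arrays.asList(c.getInterfaces());
    }

    public static void main(String[] args) {
        System.out.println("deprecated: " + isDeprecated(OldFilter.class));
        System.out.println("still implements Filter: "
                + interfacesOf(OldFilter.class).contains(Filter.class));
    }
}
```

Both checks hold for the stand-in class, which is consistent with the report disappearing after the bump in #1176.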
 * This is used for parsing values assigned to ThriftReadSupport.THRIFT_COLUMN_FILTER_KEY
 */
@Deprecated
public class DeprecatedFieldProjectionFilter implements FieldProjectionFilter {
These classes were marked as deprecated in 2015, and the deprecation was released in Parquet 1.8.0 :) Removing these was also suggested by @steveloughran in #979
 * Deprecated. Use {@link #STRICT_THRIFT_COLUMN_FILTER_KEY}
 * Accepts a ";" delimited list of globs in the syntax implemented by {@link DeprecatedFieldProjectionFilter}
 */
@Deprecated
These were marked as deprecated in 2015. I think removing these is a first step before removing the whole class.
Part of my attempts to move up to later Hadoop versions hit real problems here, and along with Thrift in general. I had to grab a copy of an x86 MBP for my Mac M1, but even there Homebrew had decided to upgrade it, so it was a backup in a corner I ended up with. At which point I discovered that Homebrew isn't a real package manager, as you can't roll back...
I'm also on an M1, and it takes quite a bit of effort to get Thrift compiled. I have a patch ready to update Thrift to the latest version. I'm not sure if we need to update parquet-format first, since the release is still on an older version. I'll give it a try in a moment.
That, or stick the old version into the repo...
Hi @Fokko, thanks for bringing this up. We at Pinterest still use Parquet thrift quite extensively. How urgent is the Thrift 0.19.0 upgrade, and how long can we wait before we absolutely need to upgrade Thrift? Also, since elephantbird is not maintained anymore, one option would be to just get rid of the elephantbird dependency completely, even if it means moving some code into parquet-mr. Thoughts?
@tlazaro, @ttim, @piyushnarang, @isnotinvain in case you all still use parquet-thrift, this discussion may be of interest to you as well.
Thanks so much for pinging. Though I'm no longer at Twitter, I will try to find someone who might be interested in the conversation.
@SinghAsDev and @tlazaro I think we can make it work with the latest version of Elephantbird. Let me close this for now.
parquet-thrift relies on Elephantbird: https://github.com/twitter/elephant-bird
The current version of elephant-bird that we use is still on Thrift 0.7.0, and this doesn't work with 0.19.0 that we're trying to upgrade to. Updating is non-trivial since a lot of the code that was used for testing has been made private: #1156
I would suggest removing this from the repository since it looks like no one is using it anymore: apache-thrift does not seem to be used anymore on Maven Central (0 usages since 1.12.2).