-
Notifications
You must be signed in to change notification settings - Fork 2k
StarRocks supports querying Hive tables with encrypted ORC file format #46809
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
let me have a review |
@@ -18,6 +18,8 @@ | |||
|
|||
#include "exec/exec_node.h" | |||
#include "exec/iceberg/iceberg_delete_builder.h" | |||
#include "exprs/base64.h" | |||
#include "formats/orc/apache-orc/c++/src/security/InMemoryKeystore.hh" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should put all the ORC-related header files which are exposed to users in apache-orc/c++/include/orc
@@ -40,6 +40,7 @@ | |||
|
|||
public class Config extends ConfigBase { | |||
|
|||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
rm it
@@ -168,6 +168,7 @@ HdfsTableDescriptor::HdfsTableDescriptor(const TTableDescriptor& tdesc, ObjectPo | |||
_hdfs_base_path = tdesc.hdfsTable.hdfs_base_dir; | |||
_columns = tdesc.hdfsTable.columns; | |||
_partition_columns = tdesc.hdfsTable.partition_columns; | |||
_properties = tdesc.hdfsTable.properties; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
need to check __isset.properties first
Ping . Is the CLA/DCO sign necessary or optional? |
Why I'm doing:
Spark 2.4.x uses the Java version of the Apache OCR project to enable encryption and decryption of ORC format Hive tables.
StarRocks uses the C++ version of the Apache OCR project for reading and writing OCR files, but the C++ version does not implement encryption and decryption functionality. When performing ad-hoc analysis on Hive tables with encrypted columns, StarRocks is unable to query the Hive table. Therefore, modifications need to be made to StarRocks' Apache OCR module to support reading encrypted Hive table data in ORC format.
What I'm doing:
Fixes #issue
StarRocks supports querying Hive tables with encrypted ORC file format #45777
What type of PR is this:
Does this PR entail a change in behavior?
If yes, please specify the type of change:
Checklist:
Bugfix cherry-pick branch check: