Databricks Releases Open Source Data Sharing Tool

Databricks has released “the industry’s first open protocol for secure data sharing” across organizations. According to the blog post, the Delta Sharing project simplifies cross-organization sharing and allows secure real-time exchange of large datasets.

Delta Sharing, which is included within the open source Delta Lake project, simplifies data sharing with other organizations “regardless of which storage or computing platform they use," says Matei Zaharia, Chief Technologist and Co-Founder of Databricks in the announcement.

According to the blog post, Delta Sharing was designed with the following goals:

  • Sharing live data directly without copying it, to simplify sharing existing data in real time.
  • Allowing recipients to directly consume data without installing a new platform.
  • Strong security, auditing, and governance to help you meet privacy and compliance requirements.
  • Scaling to massive datasets.

"The top challenge for data providers today is making their data easily and broadly consumable. … An open, interoperable standard for real-time data sharing will dramatically improve the experience for both data providers and data users," Zaharia says.

Comments