Tag: Delta Lake
Apache Hudi Is Not What You Think It Is
Vinoth Chandar, the creator of Apache Hudi, never set out to develop a table format, let alone be thrust into a three-way war with Apache Iceberg and Delta Lake for table format supremacy. So when Databricks recently ple Read more…
Putting Your Data On the Table
One of the big breakthroughs in data engineering over the past seven to eight years is the emergence of table formats. Typically layered atop column-oriented Parquet files, table formats like Apache Iceberg, Delta, and A Read more…
The Data Lakehouse Is On the Horizon, But It’s Not Smooth Sailing Yet
Data warehouses and data lakes serve clear and distinct purposes. Typically, data warehouses store structured data according to a predefined schema to generate fast query speeds for reporting purposes. Data lakes, on t Read more…
Databricks Puts Unified Data Format on the Table with Delta Lake 3.0
Databricks today rolled out a new open table format in Delta Lake 3.0 that it says will eliminate the possibility of picking the wrong one. Dubbed Universal Format, or UniForm, the new table format can read and write dat Read more…
Open Table Formats Square Off in Lakehouse Data Smackdown
If you’re constructing a data lakehouse today, you’ll need a table format to build on. But which open table format should you choose: Apache Iceberg, Databricks Delta Table, or Apache Hudi? A good place to start is i Read more…
Onehouse Announces $25M Series A, New Feature for Its Managed Lakehouse Platform
Managed data lakehouse firm Onehouse has announced a $25 million Series A funding round, bringing its total funding to $33 million. Additionally, the company announced a new feature of its platform called Onetable. On Read more…
A Dozen Questions for Databricks CTO Matei Zaharia
Matei Zaharia is a very busy man. When he’s not helping to shape the future of Databricks as its CTO, he is helping to shape the future of computer science as an assistant professor at Stanford University. He also fi Read more…
How Intuit Is Building AI, Analytics, and Streaming on One Lakehouse
With more than 100 million customers and revenues close to $10 billion, Intuit has enterprise-scale data processing needs, as well as enterprise-scale challenges. Faced with the prospect of building independent architect Read more…
Databricks Claims 30x Advantage in the Lakehouse, But Does It Hold Water?
Databricks CEO Ali Ghodsi turned some heads last week with a bold claim: Customers can get a 30x price-performance advantage over Snowflake when running SQL queries in a lakehouse setup. However, Snowflake waved off the st Read more…
Databricks Opens Up Its Delta Lakehouse at Data + AI Summit
Databricks, which had faced criticism for running a closed lakehouse, is open sourcing most of the technology behind Delta Lake, including its APIs, with the launch of Delta Lake 2.0. That was one of a number of announcem Read more…
Databricks Cranks Delta Lake Performance, Nabs Redash for SQL Viz
Today at its Spark + AI Summit, Databricks unveiled Delta Engine, a new layer in its Delta Lake cloud offering that uses several techniques to significantly accelerate the performance of SQL queries. The company also ann Read more…
Databricks, Partners, Open a Unified ‘Lakehouse’
Coalescing around an open source storage layer, Databricks is pitching a new data management framework billed as combining the best attributes of data lakes and warehouses into what the company dubs a “lakehouse.” Read more…
Databricks Snags $400M, Now Valued at $6.2B
Databricks, the commercial venture behind Apache Spark, has just completed a Series F round of funding worth $400 million. That brings the cloud analytics vendor's valuation to more than $6 billion, more than twice what Read more…
Databricks Donates Delta Code to Open Source
Databricks today announced that it's open sourcing the code behind Databricks Delta, the Apache Spark-based product it designed to help keep data neat and clean as it flows from sources into its cloud-based analytics env Read more…
How Databricks Keeps Data Quality High with Delta
Data lakes have sprung up everywhere as organizations look for ways to store all their data. But the quality of data in those lakes has posed a major barrier to getting a return on data lake investments. Now Databricks i Read more…