Tag: ORC
Tabular Seeks to Remake Cloud Data Lakes in Iceberg’s Image
The creators of the table format Apache Iceberg launched a new company this summer called Tabular that’s aiming to remake how companies store data in the cloud. If the company has its way, much of the minutiae of how da Read more…
Presto the Future of Open Data Analytics, Foundation Says
The openness of Presto, its adherence to standard SQL, and the ubiquity and performance of modern cloud storage have combined to put Presto in the driver’s seat of the big data analytics stack for the foreseeable futur Read more…
Data Headaches Targeted with a Dose of .BIG
Working with large numbers of files, and with large files, remains a roadblock to productivity for data professionals around the world. Now a software startup named Exponam says it has come up with a potential solution to the Read more…
Return of the Living Data
When Google published a paper on its proprietary BigQuery engine about nine years ago, the open source community reproduced the technology as best they could, just as they did with MapReduce and the Google File System, w Read more…
Celebrating Data Independence
Every company wants the independence to do what it wishes with its data. That's one of the first assumptions underlying this whole big data movement. But depending on where and how a business stores its data -- such as Read more…
Big Data File Formats Demystified
So you're filling your Hadoop cluster with reams of raw data, and your data analysts and scientists are champing at the bit to get started. Then the question hits you: How are you going to store all this data so they can Read more…