Follow BigDATAwire:

Tag: data lake

Snowflake Dips Into Agentic AI with Snowflake Intelligence

Snowflake made a blizzard of announcements today at its Build 2024 user conference, including a new generative AI-powered capability in Snowflake Cortex AI, dubbed Snowflake Intelligence, that allows customers to build a Read more…

Cloudian Partners with Lenovo for EPYC All-Flash ‘HyperStore’

Cloudian and Lenovo today announced they’re teaming up to deliver a new HyperStore cluster designed to run big data, AI, and HPC workloads. Each HyperStore cluster will be composed of six Lenovo ThinkSystem SR635 V3 Read more…

AtScale Claims Text-to-SQL Breakthrough with Semantic Layer

One of the bottlenecks in getting value out of generative AI is the difficulty in turning natural language into SQL queries. Without detailed contextual understanding of the data, the text is converted into SQL that does Read more…

LinkedIn Implements New Data Trigger Solution to Reduce Resource Usage For Data Lakes

With its vast user base and the numerous interactions that occur daily, LinkedIn generates an enormous amount of data every day. The billions of data points fuel various applications, from rankings to search. The additio Read more…

Data Observability in 2024: A Guide

In today's data-driven world, data observability is a critical concept for organizations aiming to effectively manage their data. Simply put, it means having the ability to constantly monitor and understand the status of Read more…

The Data Lakehouse Is On the Horizon, But It’s Not Smooth Sailing Yet

Data warehouses and data lakes serve clear and distinct purposes. Typically, data warehouses store structured data according to a predefined schema to generate fast query speeds for reporting purposes.  Data lakes, on t Read more…

Data Engineering in 2024: Predictions For Data Lakes and The Serving Layer

The data landscape experienced significant changes in 2023, presenting new opportunities (and potential challenges) for data engineering teams. I believe we will see the following this year in the areas of analytics, Read more…

Inside AWS’s Plans to Make S3 Faster and Better

As far as big data storage goes, Amazon S3 has won the war. Even among storage vendors whose initials are not A.W.S., S3 is the defacto standard for storing lots of data. But AWS isn’t resting on its laurels with S3, a Read more…

Unveiling Chaos LakeDB: First Lake Database for Live Search, SQL, and Gen AI Analytics

With the rapidly evolving digital landscape, the ascent of generative AI is not just a passing phase. Organizations that are able to harness the potential of gen AI are set to gain a substantial competitive advantage. Ye Read more…

There Are Many Paths to the Data Lakehouse. Choose Wisely

You don’t need a crystal ball to see that the data lakehouse is the future. At some point soon, it will be the default way of interacting with data, combining scale with cost-effectiveness. Also easy to predict is t Read more…

Oracle Announces GA of MySQL HeatWave Lakehouse

Oracle recently announced the general availability of MySQL HeatWave Lakehouse, a fully managed database service. The company previously debuted the service at its CloudWorld event last October. This lakehouse is the Read more…

A Truce in the Cloud Data Lake Vs. Data Warehouse War?

At the 2nd Annual Semantic Layer Summit, which took place April 26, AtScale founder and CTO Dave Mariani sat down with Bill Inmon, recognized by many as the father of the data warehouse, to discuss the evolution of moder Read more…

Cyberspooks Need Big Data Portability, Too

The problem of how to effectively move and manage large amounts of data is one that impacts all organizations of a certain size, including U.S. Government agencies working in cybersecurity. Now a new partnership between Read more…

Starburst Bolsters Trino Platform as Datanova Begins

Starburst today rolled out a host of enhancements to its Trino-based analytics platform for the cloud, called Galaxy, including support for Python, new caching and indexing features, and a new data catalog. The company u Read more…

Onehouse Announces $25M Series A, New Feature for Its Managed Lakehouse Platform

Managed data lakehouse firm Onehouse has announced a $25 million Series A funding round, bringing its total funding to $33 million. Additionally, the company announced a new feature of its platform called Onetable. On Read more…

Datanova- the coolest data conference of the year

This event is for those who seek to not just ‘do data’ incrementally better, but differently. For the leaders who want to help their companies become truly data-driven. For the analyst in all of us that seeks sim Read more…

Two Cancer-Fighting Startups Gain a Foothold in AWS

The cloud is a natural place for startups that have large computing and data storage needs but also have uncertain futures. For two startups aiming to stop cancer, including Lyell Immunopharma and Hurone AI, the public c Read more…

Mastering the Mesh: Finding Clarity in the Data Lake

Data lakes are great in theory, but their application in the real world often leaves the user wanting more. A data mesh is one approach to cleaning up chaos left by data lakes and the resulting swing back to data decentr Read more…

AWS Bolsters Glue ETL Tool with Data Observability, Ray Support

AWS has made a big push into data management during re:Invent this week, with the unveiling of DataZone and launch of zero-ETL capabilities in Redshift. But AWS also bolstered its ETL tool with the launch of Amazon Glue Read more…

BigDATAwire