Tag: performance
Are SSDs Required for Your Big Data Workflow? The Answer May Surprise You
Have you heard the buzz about the predicted death of hard disk drives (HDDs)? Some have gone all in on projections that the growth of SSD deployments will eliminate demand for HDDs within five years. Other industry analy Read more…
Fivetran Benchmarks Five Cloud Data Warehouses
ETL software maker Fivetran this week released results of a benchmark test it ran comparing the cost and performance of five cloud data warehouses, including BigQuery, Databricks, Redshift, Snowflake, and Synapse. The bi Read more…
Five Signs Your Cache-Based Database Architecture May Be Obsolete
The digital economy comprises business moments, critical fractions of seconds when lightning-fast chain reactions take place that transform data into insights and turn opportunities into business values. As data has incr Read more…
Benchmarking NoSQL Databases
Developers have a large number of databases to choose from today, particularly when it comes to newer NoSQL databases. Figuring out which databases excel in different areas can be tough, but the folks at Altoros aimed to Read more…
Dr. Elephant Leads the Performance Parade
I started working on big data infrastructure in 2009 when I joined Cloudera, which at the time was a small startup with about 10 engineers. It was a fun place to work. My colleagues and I got paid to work on open source Read more…
Big Performance Gains Seen Across SQL-on-Hadoop Engines
You can't really go wrong these days when it comes to picking a SQL-on-Hadoop engine. As long as you stick to the mainstream open source products like Hive, Impala, Spark SQL, and Presto, your SQL queries are likely runn Read more…
Unraveling Hadoop and Spark Performance Mysteries
What do you do when your Spark or Hive job runs like molasses? If you're like most end-users who lack in-depth technical skills, the answer is "not much." Now a startup named Unravel Data is working to show you what's ac Read more…
Does InfiniBand Have a Future on Hadoop?
Hadoop was created to run on cheap commodity computers connected by slow Ethernet networks. But as Hadoop clusters get bigger and organizations press the upper limits of performance, they're finding that specialized gear Read more…
The GPU “Sweet Spot” for Big Data
GPUs have stirred some vicious waves in the supercomputing community, and these same performance boosts are being explored for large-scale data mining by a number of enterprise users. During our conversation with NVIDIA's Tesla senior manager for high performance computing, Sumit Gupta, we explored how traditional data... Read more…
This Week’s Big Data Big Seven
We wrap up this week with news about a new high performance, data-intensive supercomputer from SGI, new Hadoop announcements, including those from Hortonworks, Datameer, and Karmasphere, some software enhancements for big data infrastructure from ScaleOut and some other startup goodness--all with an eye on next week's International.... Read more…
ParAccel Lifts Hood on CARFAX Overhaul
In the wake of their competitors being gobbled up by giants and their announcement today of a new connector to Hadoop, we step back and look at how analytical platforms like ParAccel's are handling large amounts of complex data for companies like CARFAX, which recently put the company in the driver's seat following limitations from their legacy.... Read more…
Analytics Power Solar Energy Boost
While solar energy pioneers are still seeking ways to tweak the price performance issue, others are focusing on less obvious potential solutions. One such group of solar cell researchers at MIT is addressing the problem of current solar technolo.... Read more…
Oracle Flags Down R Users with Analytics Option
Today Oracle announced a new piece to its enterprise analytics capabilities via the addition of an option to include the open source R statistics language within bundles of its enterprise R and data mining offerings that the company... Read more…