
Hortonworks Hocks Hadoop Upgrade
Apache Hadoop contributor Hortonworks announced Hortonworks Data Platform version 2. HDPv2 will be using the most recent version of Hadoop (0.23). According to the Apache Software Foundation, curators and cultivators of Hadoop, the newest release is enterprise ready.
The Hortonworks Data Platform, which is powered by Hadoop, is the company’s scalable open source platform for handling big enterprise and research data. As with the other Hadoop distros floating around out there, the key to the success of the platform is the ability to integrate data from just about any source imaginable and provide a more simplified way to make use of it.
The company describes how they differentiate themselves from others offering Hadoop simplification for the enterprise, noting:
“Unlike other Hadoop solutions that lock away management features within proprietary extensions, Hortonworks Data Platform includes Ambari, an open source installation and management system out of the box. Hortonworks Data Platform also includes HCatalog, a metadata management service for simplifying data sharing between Hadoop and other enterprise information systems, along with a complete set of open APIs, including WebHDFS and those for Ambari and HCatalog, to make it easier for ISVs to integrate and extend Apache Hadoop.”
On Jan.6th, when the Apache Software Foundation made news announcing Hadoop v1.0 after 6 years of development, a number of notable new features and enhancements were made. With the release of Hadoop version 0.23, improvements have been made to both HDFS and MapReduce including:
- NextGen MapReduce (also known as YARN)
- HDFS Federation, which allows Namenodes to act independently and without coordination with eachother
- Splitting MapReduce JobTracker into 2 components (resource management and life-cycle management)
- The Resource manager will now manage global assignment of compute resources for each application while ApplicationMaster will manage scheduling and coordination.
According to Eric Baldeschwieler, CEO of Hortonworks, “With more than three years of development and much anticipation, Apache Hadoop 0.23 delivers important advancements in scalability, performance, high availability and data integrit.
He continued, “Apache Hadoop 0.23 is currently being tested across hundreds of applications in the world’s largest Hadoop deployment. We are excited to make the technology advancements in Apache Hadoop 0.23 available through an easily consumable version via the Hortonworks Data Platform v2.”
HDP was created to extremely scalable and fully open-source platform for storage, processing, analysis of large scale data. Along with HDFS and MapReduce, Hortonworks Data Platform includes Pig, Hive, HBase and Zookeeper.
Hortonworks was created by Yahoo! and Benchmark Capital to facilitate Apache Hadoop development. They provide tech support, training and certifications for vendors, enterprises, service providers and systems integrators.
Related Stories
Hadoop Hits Primetime with Production Release
RainStor Brings Database to Hadoop
Karmasphere Ushers in New Hadoop Partner
February 14, 2025
- Clarifai Unveils Control Center for Enhanced AI Visibility and Decision-Making
- EDB Strengthens Partner Program to Accelerate Postgres and AI Adoption Worldwide
- Workday Introduces Agent System of Record for AI Workforce Management
- Fujitsu Unveils Generative AI Cloud Platform with Data Security Focus
- NTT DATA Highlights AI Responsibility Gap as Leadership Fails to Keep Pace
- Gurobi AI Modeling Empowers Users with Accessible Optimization Resources
February 13, 2025
- SingleStore Unveils No-Code Solution Designed to Cut Data Migration from Days to Hours
- Databricks Announces Launch of SAP Databricks
- SAP Debuts Business Data Cloud with Databricks to Turbocharge Business AI
- Data Science Salon Kickstarts 2025 with DSS ATX Conference, Featuring AI Startup Showcase
- Hydrolix Achieves Amazon CloudFront Ready Designation
- Astronomer Launches Astro Observe to Unify Data Observability and Orchestration
- EU Launches InvestAI Initiative to Build AI Gigafactories Across Europe
- HPE Announces Shipment of Its First NVIDIA Grace Blackwell System
- IDC Celebrates 60 Years of Tech Intelligence at Directions 2025
- Lucidity Gains $21M to Scale AI-Driven Cloud Storage Optimization
- Glean Launches Open Security and Governance Partner Program for Enterprise AI
February 12, 2025
- OpenTelemetry Is Too Complicated, VictoriaMetrics Says
- What Are Reasoning Models and Why You Should Care
- Three Ways Data Products Empower Internal Users
- Keeping Data Private and Secure with Agentic AI
- Memgraph Bolsters AI Development with GraphRAG Support
- Three Data Challenges Leaders Need To Overcome to Successfully Implement AI
- PayPal Feeds the DL Beast with Huge Vault of Fraud Data
- Top-Down or Bottom-Up Data Model Design: Which is Best?
- Nvidia CEO Touts a ‘Million X’ Speedup in AI
- Inside Nvidia’s New Desktop AI Box, ‘Project DIGITS’
- More Features…
- Meet MATA, an AI Research Assistant for Scientific Data
- AI Agent Claims 80% Reduction in Time to Complete Data Tasks
- DataRobot Expands AI Capabilities with Agnostiq Acquisition
- Collibra Bolsters Position in Fast-Moving AI Governance Field
- Snowflake Unleashes AI Agents to Unlock Enterprise Data
- Observo AI Raises $15M for Agentic AI-Powered Data Pipelines
- Anaconda’s Commercial Fee Is Paying Off, CEO Says
- Microsoft Open Sources Code Behind PostgreSQL-Based MongoDB Clone
- Confluent and Databricks Join Forces to Bridge AI’s Data Gap
- Mathematica Helps Crack Zodiac Killer’s Code
- More News In Brief…
- Informatica Reveals Surge in GenAI Investments as Nearly All Data Leaders Race Ahead
- Gartner Predicts 40% of Generative AI Solutions Will Be Multimodal By 2027
- PEAK:AIO Powers AI Data for University of Strathclyde’s MediForge Hub
- DataRobot Acquires Agnostiq to Accelerate Agentic AI Application Development
- TigerGraph Launches Savanna Cloud Platform to Scale Graph Analytics for AI
- EY and Microsoft Unveil AI Skills Passport to Bridge Workforce AI Training Gap
- Alluxio Enhances Enterprise AI with Version 3.5 for Faster Model Training
- DeepSeek-R1 models now available on AWS
- Lightning AI Brings DeepSeek to Private Enterprise Clouds with AI Hub
- Seagate Unveils IronWolf Pro 24TB Hard Drive for SMBs and Enterprises
- More This Just In…