DataStax just announced the general availability of its vector search capability in Astra DB, its DBaaS built on Apache Cassandra.
Vector search is a must-have capability for building generative AI applications. In machine learning, vector embeddings are the distilled representations of raw training data and act as a filter for running new data through during inference. Training a large language model results in potentially billions of vector embeddings.
Vector databases store these embeddings and perform a similarity search to find the best match between a user’s prompt and the vectorized training data. Instead of searching with keywords, embeddings allow users to conduct a search based on context and meaning to extract the most relevant data.
There are native databases specifically built to manage vector embeddings, but many relational and NoSQL databases (like Astra DB) have been modified to include vector capabilities due to the demand surrounding generative AI.
This demand is palpable: McKinsey estimates that generative AI could potentially add between $2.6 and $4.4 trillion in value to the global economy. DataStax CPO Ed Anuff noted in a release that databases capable of supporting vectors are crucial to tapping into the potential of generative AI as a sustainable business initiative.
“An enterprise will need trillions of vectors for generative AI so vector databases must deliver limitless horizontal scale. Astra DB is the only vector database on the market today that can support massive-scale AI projects, with enterprise-grade security, and on any cloud platform. And, it’s built on the open source technology that’s already been proven by AI leaders like Netflix and Uber,” he said.
DataStax says one advantage of vector search within Astra DB is that it can help reduce AI hallucinations. LLMs are prone to fabricating information, called hallucinating, which can be damaging to business. This vector search release includes Retrieval Augmented Generation (RAG), a capability that grounds search results within specific enterprise data so that the source of information can be easily pinpointed.
Data security is another factor to consider with generative AI deployment, as many AI use cases involve sensitive data. DataStax says Astra DB is PCI, SOC2, and HIPAA enabled so that companies like Skypoint Cloud Inc., which offers a data management platform for the senior living healthcare industry, can use Astra DB as a vector database for resident health data.
“Envision it as a ChatGPT equivalent for senior living enterprise data, maintaining full HIPAA compliance, and significantly improving healthcare for the elderly,” said Skypoint CEO Tisson Mathew in a statement.
To support this release, DataStax also created a Python library called CassIO aimed at accelerating vector search integration. The company says this software framework easily integrates with popular LLM software like LangChain and can maintain chat history, create prompt templates, and cache LLM responses.
The new vector search capability is available on Astra DB for Microsoft Azure, AWS, and Google Cloud. The company also says vector search will be available for customers running DataStax Enterprise, the on-premises, self-managed offering, within the month.
Matt Aslett of Ventana Research expects generative AI adoption to grow rapidly and says that through 2025, one-quarter of organizations will deploy generative AI embedded in one or more software applications.
“The ability to trust the output of generative AI models will be critical to adoption by enterprises. The addition of vector embeddings and vector search to existing data platforms enables organizations to augment generic models with enterprise information and data, reducing concerns about accuracy and trust,” he said.
Related Items:
Vector Databases Emerge to Fill Critical Role in AI
DataStax Bolsters Real-Time Machine Learning with Kaskada Buy
DataStax Nabs $115 Million to Help Build Real-Time Applications
November 25, 2024
- AI and Analytics Firm datakulture Embarks on Fresh Data Strategy to Drive Growth
- H2O.ai Unveils Agentic AI That Converges Generative and Predictive AI with Purpose-built SLMs
- Elastic Now Collaborates with AWS to Leverage Generative AI Capabilities
- Tokyo University of Science Unveils Synaptic Device for Low-Power Edge AI Sensors
- NetApp Joins the Vultr Cloud Alliance to Provide Scalable Data Management for Enterprise AI Workloads
- Anomalo Secures $10M to Advance Data Quality Solutions for Generative AI
November 22, 2024
- Telmai Partners with ZoomInfo to Enhance Data Quality Across Multi-Cloud Ecosystems
- BigID Redefines Privacy Operations with AI Automation
- HPE Advances Hybrid Cloud Leadership with Unified Virtualization Management, Fast Object Storage and Disconnected Private Clouds
- DataOps.live Achieves SOC 2 Type II Compliance
- LogicMonitor Gains $800M in Strategic Investment to Scale Global Operations
November 21, 2024
- Snowflake Agrees to Acquire Open Data Integration Platform, Datavolo
- Denodo Platform 9.1 Brings New Advanced AI Capabilities and Enhanced Data Lakehouse Performance
- Teradata AI Unlimited in Microsoft Fabric Public Preview Now Available Through Microsoft Fabric Workload Hub
- Zilliz Cloud Powers GenAI Readiness with Cost-Effective Enterprise-Grade Performance and Scalability
- Snowflake and Anthropic Team Up to Bring Claude Models Directly to the AI Data Cloud
- Duality AI Launches EDU Subscription to Empower Aspiring AI Developers with Digital Twin Simulation and Synthetic Data Skills
- Striim Offers Mirroring Solution for SQL Server to Fabric at Microsoft Ignite