NVIDIA - Latest News, Articles & Stories

Features

Vector Databases Emerge to Fill Critical Role in AI

Vector databases arrived on the scene a few years ago to help power a new breed of search engines that are based on neural networks as opposed to keywords. Companies like Home Depot dramatically improved the search exper Read more…

Milvus 2.3 Launches with Support for Nvidia GPUs

Zilliz has beta launched Milvus 2.3, the latest version of its open source vector database. Milvus 2.3 supports Nvidia GPUs which Zilliz says affords greater flexibility and improved real-time workload performance. Zilli Read more…

Bill Gates Says the Age of AI Has Begun, Bringing Opportunity and Responsibility

The latter half of March has been a whirlwind moment for artificial intelligence. OpenAI released GPT-4 last week, and this week, Nvidia announced a new cloud services platform for generative AI while Google initiated th Read more…

ChatGPT Puts AI At Inflection Point, Nvidia CEO Huang Says

It’s been 11 years since three AI researchers shocked the world with a breakthrough in computer vision, kickstarting the deep learning craze. But with the emergence of generative language models like ChatGPT over the past Read more…

Nvidia Unveils GPUs for Generative Inference Workloads like ChatGPT

Today at its GPU Technology Conference, Nvidia took the wraps off three new GPUs designed to accelerate inference workloads for generative AI applications, including generating text, images, and videos. It also launched Read more…

New PyTorch 2.0 Compiler Promises Big Speedup for AI Developers

Machine learning and AI developers are eager to get their hands on PyTorch 2.0, which was unveiled in late 2022 and is due to become available this month. Among the features greeting eager ML developers is a compiler as Read more…

OpenXLA Delivers Flexibility for ML Apps

Machine learning developers gained new abilities to develop and run their ML programs on the framework and hardware of their choice thanks to the OpenXLA Project, which today announced the availability of key open source Read more…

Peak:AIO Cranks the Storage Throughput for Affordable AI Data Serving

Organizations that want to keep their pricey GPUs fed with data for machine learning training purposes but don’t want to break the bank with a big parallel file system installation may be interested in a fast new NFS-b Read more…

Big Things Ahead for AI in 2023: Predictions

The AI train has been gaining steam for several years now, and nothing appears ready to stop it (except for bad data, that is). With momentum building, which direction will AI head in 2023? We leave that to the experts. Read more…

IBM Collaboration Looks to Bring Massive AI Models to Any Cloud

Training machine learning foundation models with sometimes billions of parameters demands serious computing power. For example, the largest version of GPT-3, the famous large language model behind OpenAI’s DALL-E 2, ha Read more…

This Just In

Cirrascale Powers AI and HPC Advancements with NVIDIA HGX H200 Server Integration

Oct 3, 2024

SAN DIEGO, Oct. 3, 2024 — Cirrascale Cloud Services, a leading provider of innovative cloud solutions for AI and high-performance computing (HPC) workloads, today announced the general availability of NVIDIA HGX H200 servers in its AI Innovation Cloud. Read more…

IBM Expands AI and HPC Offerings with NVIDIA H100 GPUs on IBM Cloud

Oct 1, 2024

Oct. 1, 2024 — IBM has announced that NVIDIA H100 Tensor Core GPU instances are now globally available on IBM Cloud. IBM is extending its high-performance computing (HPC) offerings, giving enterprises more power and versatility to carry out research, innovation and business transformation. Read more…

VAST Data Unveils VAST InsightEngine with NVIDIA to Unlock Enterprise Data Insights

Oct 1, 2024

Oct. 1, 2024 — VAST Data today announced VAST InsightEngine with NVIDIA, the world’s first solution to securely ingest, process, and retrieve all enterprise data (files, objects, tables, and streams) in real-time. Read more…

Salesforce and NVIDIA Forge Strategic Collaboration to Advance AI Agent Innovation

Sep 18, 2024

SAN FRANCISCO, Sept. 18, 2024 — Salesforce and NVIDIA have announced a strategic collaboration to develop advanced AI capabilities for the enterprise with autonomous agent and interactive avatar experiences. Read more…

NVIDIA and Oracle to Accelerate AI and Data Processing for Enterprises

Sep 13, 2024

Sept. 11, 2024 — Enterprises are looking for increasingly powerful compute to support their AI workloads and accelerate data processing. The efficiency gained can translate to better returns for their investments in AI training and fine-tuning, and improved user experiences for AI inference. Read more…

Oracle Unveils Cloud Computing Cluster with Up to 131,072 NVIDIA Blackwell GPUs

Sep 11, 2024

LAS VEGAS, Sept. 11, 2024 — Oracle today announced new cloud computing clusters accelerated by the NVIDIA Blackwell platform. Oracle Cloud Infrastructure (OCI) is now taking orders for the largest AI supercomputer in the cloud—available with up to 131,072 NVIDIA Blackwell GPUs. Read more…

Deloitte Partners with NVIDIA and Oracle to Accelerate GenAI Solutions with AI Factory

Sep 11, 2024

NEW YORK, Sept. 11, 2024 — Deloitte has announced the launch of AI Factory as a Service, a scalable, one-stop-shop suite of Generative AI (GenAI) capabilities built on the NVIDIA AI platform, including NVIDIA AI Enterprise software, NVIDIA NIM Agent Blueprints and accelerated computing, and leveraging Oracle’s enterprise AI technology. Read more…

NVIDIA Blackwell Joins MLPerf Inference with Upgraded Performance Metrics

Aug 28, 2024

Aug. 28, 2024 — In the latest round of MLPerf industry benchmarks, Inference v4.1, NVIDIA platforms delivered leading performance across all data center tests. The first-ever submission of the upcoming NVIDIA Blackwell platform revealed up to 4x more performance than the NVIDIA H100 Tensor Core GPU on MLPerf’s biggest LLM workload, Llama 2 70B, thanks to its use of a second-generation Transformer Engine and FP4 Tensor Cores. Read more…

NVIDIA and Partners Launch NIM Agent Blueprints for Enterprises to Make Their Own AI

Aug 27, 2024

SANTA CLARA, Calif., Aug. 27, 2024 — NVIDIA today announced NVIDIA NIM Agent Blueprints, a catalog of pretrained, customizable AI workflows that equip millions of enterprise developers with a full suite of software for building and deploying generative AI applications for canonical use cases, such as customer service avatars, retrieval-augmented generation and drug discovery virtual screening. Read more…

NVIDIA Launches New CUDA Libraries for Accelerated LLM Data Curation and Generation

Aug 26, 2024

Aug. 26, 2024 — NVIDIA has announced new libraries in accelerated computing to deliver order-of-magnitude speedups and reduce energy consumption and costs in data processing, generative AI, recommender systems, AI data curation, 6G research, AI-physics and more. Read more…
