Follow BigDATAwire:

Tag: synthetic data

Are We Running Out of Training Data for GenAI?

The advent of generative AI has supercharged the world’s appetite for data, especially high-quality data of known provenance. However, as large language models (LLMs) get bigger, experts are warning that we may be runn Read more…

Gretel Open Sources 100,000 Text-to-SQL Samples

Synthetic data generation company Gretel last week announced it has donated more than 100,000 examples of text-to-SQL conversions and parked them on Huggingface, providing enterprises with another free, open source resou Read more…

IBM Patents a Faster Method to Train LLMs for Enterprises

Deep learning AI models, such as GenAI chatbots, possess an insatiable appetite for data. These models need data for training purposes so they can be effective for real-world scenarios. It can be challenging, in term Read more…

SDV: A Generative Model for Creating Synthetic Data

Getting access to the right data in the right amounts remains a major obstacle for a range of digital endeavors, from developing AI models to testing software applications. If you find yourself short of valuable tabular Read more…

Why AI Alone Won’t Solve Drug Discovery

Verseon is an AI-powered drug company--in theory, anyway. The Bay Area startup does use machine learning to help predict which combination of proteins and drug-like molecules will yield novel compounds to test in the lab Read more…

The Key to Computer Vision-Driven AI Is a Robust Data Infrastructure

For infrastructure, the sign of true greatness is to go unnoticed. The better it is, the less we think about it. Mobile infrastructure, for example, only ever crosses our minds when we find ourselves struggling to unders Read more…

Gretel Keeps the Data Trail Hidden

When Alex Watson co-founded the security company Harvest.ai back in 2014, using machine learning to identify sensitive data to protect it seemed like a good idea--so good, in fact, that AWS bought the company. Fast forwa Read more…

Computer Vision Platform Datagen Raises $50M Series B

Datagen, a firm specializing in computer vision artificial intelligence, announced it has raised $50 million in a Series B round bringing its total financing to $70 million. Computer vision (CV) has quickly become ubi Read more…

Accenture Report Explores the ‘Unreal’ World of Synthetic Data and Generative AI

Accenture has released its Accenture Technology Vision 2022, a report examining key technologies under the theme “Meet Me in the Metaverse: The Continuum of Technology and Experience Reshaping Business.” The repor Read more…

10 NLP Predictions for 2022

Natural language processing (NLP) has been one of the hottest sectors in AI over the past two years. Will the string of big data breakthroughs continue into 2022? We checked in with industry experts to find out. There Read more…

Data Science and AI Predictions for 2022

The pace of technological change increased in 2021, and if history is any guide, will continue to accelerate in 2022. At the leading edge of high tech are data science and artificial intelligence, two disciplines that pr Read more…

2021 Big Data Year in Review: Part 2

There was a lot going on in 2021, but we’ve done our best to synthesize some of the top stories of the year. Here, we pick up where we left off in part one of this two-part piece. One of the most interesting develop Read more…

Fake Data Comes to the Forefront

The lack of data historically has been a limiting factor in the development of predictive models. But with the advent of automated methods to generate skads of synthetic data, or what some call “fake data,” the lack Read more…

Synthetic Data: Sometimes Better Than the Real Thing

Having a large stockpile of data is still a prerequisite for advanced analytics and AI. But companies building AI models increasingly are finding that artificially created data can be just as good as the real thing. And Read more…

Synthetic Data Market Gets Real

A growing list of data privacy regulations along with demand for better training data is spawning new AI-based approaches to managing “personally identifiable” information, including “synthetic” data sets that re Read more…

Five Reasons Synthetic Data Is the Electrolyte to Speed Up Your AI Initiatives

When taking on an artificial intelligence (AI) project, there is one ingredient above all that is essential  for success: clean, well organized, relevant data. In theory, data is everywhere (2.5 quintillion bytes of Read more…

BigDATAwire