

Features

AI Needs a Third-Party Benchmark. Will It Be Patronus AI?

Which is the more accurate language model, GPT-4 or Bard? How does Llama-2-7b stack up to Mistral 7B? Which models have the worst bias and hallucination rates? These are pressing questions for would-be AI model users, bu…

This Just In

Patronus AI Launches Small, High-Performance Judge Model for Fast and Explainable AI Evaluations

Dec 20, 2024

SAN FRANCISCO, Dec. 20, 2024 — Patronus AI has announced the release of GLIDER, its new 3.8B parameter model designed as a fast, flexible, and explainable judge for language models.

Patronus AI Launches Self-Serve API for AI Evaluation and Guardrails

Oct 31, 2024

SAN FRANCISCO, Oct. 31, 2024 — Patronus AI today announced the launch of the Patronus API, the first self-serve solution that empowers developers to reliably detect and prevent AI failures in production.

Patronus AI Releases Lynx for Real-Time Hallucination Detection in LLMs

Jul 12, 2024

SAN FRANCISCO, July 12, 2024 — Patronus AI announced the release of Lynx, a state-of-the-art hallucination detection model designed to address the challenge of hallucinations in large language models (LLMs).

Patronus AI Raises $17M to Detect LLM Mistakes at Scale

May 23, 2024

SAN FRANCISCO, May 23, 2024 — Patronus AI has announced it has raised a $17 million Series A round, bringing the total amount raised to $20 million.

Patronus AI Launches Industry-first Solution to Detect Copyrighted Content Generated by LLMs

Mar 7, 2024

NEW YORK, March 7, 2024 — Patronus AI has launched CopyrightCatcher, the industry’s first solution to detect when a large language model (LLM) outputs copyrighted content.

MongoDB and Patronus AI Partner to Boost Enterprise Confidence in Generative AI

Jan 11, 2024

NEW YORK, Jan. 11, 2024 — Patronus AI has announced it is partnering with MongoDB to bring automated LLM evaluation and testing to enterprise customers.

BigDATAwire