Beyond Benchmarks: How GPT-5 and OSS Are Redefining AI Evaluation (E.16)

In this episode of Free Form AI, Michael and Ben unpack the GPT-5 release, with a focus on what really matters: fewer hallucinations, smarter reasoning and why traditional benchmarks may no longer cut it.

Tune in as we explore open-source OSS, agentic systems and the growing challenge of evaluating models that might already be outsmarting us.
Beyond Benchmarks: How GPT-5 and OSS Are Redefining AI Evaluation (E.16)
Broadcast by