A PHD in Everything | Grok 4 Crushes Every Leading AI Model

My Account

Mijn Account

Mon Compte

Mein Konto

我的帐户

Inloggen

Connexion

Anmeldung

Acceso

A PHD in Everything | Grok 4 Crushes Every Leading AI Model

In this episode, I dive deep into the release of Grok 4 by XAI and its groundbreaking performance on various benchmarks.

We compare its capabilities with popular leading AI models like OpenAI's O3, Gemini 2.5, and Claude 4. Grok 4 tops the ARC AGI leaderboard and excels in complex tasks but also shows some limitations in nuanced queries.

I test its efficiency in real-world scenarios, from ranking global snack foods to evaluating image authenticity. Despite some challenges, Grok 4 showcases impressive advancements, and I discuss its potential impact on the AI landscape.

Stay tuned for more in-depth tests and community reactions in future videos!

00:00 - Introduction to Grok Four

00:23 - Benchmark Performance of Grok Four

01:33 - ARC AGI Benchmark Validation

02:50 - Humanity's Last Exam and Other Benchmarks

04:24 - New Features and Voice Mode

05:22 - Grok Four Heavy and Advanced Capabilities

06:43 - Coding and Real-World Applications

07:49 - Live Testing Grok Four

11:58 - Comparative Analysis with Other Models

16:06 - Image Analysis and Multimodal Capabilities

18:43 - Final Thoughts and Future Prospects

MattVidPro AI

ai talk

A PHD in Everything | Grok 4 Crushes Every Leading AI Model

Join the conversation 🎭

Wes Roth | Google is About to Bust the AI Bubble...

Sabine Hossenfelder | 5 Signs the AI Bubble is Bursting

FE2E | Qwen3 | Ernie 4.5 | Stable Audio 2.5 | HunyuanImage 2.1

Matt Wolfe | 30 AI Demos and News Headlines You Missed

NetworkChuck | You Need to Learn Model Context Protocol Right Now

Seedream 4.0 is Proof There’s No Stopping AI Advancement

ChatGPT Just Got Its Most Powerful Upgrade Yet

Aperture | The AI Takeover Is Closer Than You Think

Neil Patel | The Google Update No One Is Ready For

Sabine Hossenfelder | How Fast will AI Change Everything?

Google’s AI EmbeddingGemma is Breaking Records

Matthew Berman | Did OpenAI Just Solve Hallucinations?

Wes Roth | Grok 4.2 Sonoma Sky Will be Scary Good

New AI Beats NanoBanana | AI Minecraft | Dish-Washing Robots

AI Ode To Greta Thunberg | How Dare You?!

Anthropic to Pay $1.5 Billion to Settle Lawsuit With Book Authors

Upper Echelon | Real Life Skynet

OpenAI is about to Launch an AI Jobs Network

Economy Media | How AI Is Crushing Junior Developers

OpenAI is Making ChatGPT Into Something Way Bigger

Dr. Roman Yampolskiy | Only 5 Jobs will remain in 2030

Google Prepares to Unleash Jules | OpenAI Faces Billionaire Conspiracy

VibeVoice | The best Free AI Text to Speech Voice Cloner is Here

Wes Roth | How To Use Midjourney | Beginners Guide

Patrick Boyle | Is AI Slop Killing the Internet?

VoxHammer | CoMPaSS | Bytedance Waver | OmniHuman

Matt Wolfe | AI News: Adobe Just Gave in to Google’s NanoBanana

Hermes 4 Just Proved Open Source AI Can Beat OpenAI

Google’s Nano anana just killed Photoshop... Let’s Run It

Sabine Hossenfelder | I Tried Vibe Physics, This is What I Learned