Powered by RND
PodcastsTechnologyBuild Wiz AI Show

Build Wiz AI Show

Build Wiz AI
Build Wiz AI Show
Latest episode

Available Episodes

5 of 137
  • Training-Free Group Relative Policy Optimization for LLM Agents
    Are expensive Large Language Model (LLM) fine-tuning methods holding back your specialized agents, demanding massive computational resources and data? We dive into Training-Free Group Relative Policy Optimization (Training-Free GRPO), a novel non-parametric method that enhances LLM agent behavior by distilling semantic advantages from group rollouts into lightweight token priors, eliminating costly parameter updates. Discover how this highly efficient approach achieves significant performance gains in specialized domains like mathematical reasoning and web searching, often surpassing traditional fine-tuning while using only dozens of training samples.
    -------- ย 
    13:38
  • OpenAI's Vision: AGI, Sora, and Bottlenecks
    Join us for a deep dive with Greg Brockman on the future of AI, where he reveals the internal struggle ("pain and suffering") of managing compute scarcity and the immense physical infrastructure build required to scale systems like Sora 2. Brockman discusses the shift from viewing AGI as a destination to a continuous process, emphasizing that current scaling curves and algorithmic progress continue unabated. We also explore the inevitable move toward proactive AI agents and a fully generative web, predicting a major change to the social contract and web monetization.
    -------- ย 
    12:21
  • Agentic Context Engineering: Evolving Contexts for LLMs
    Tune in as we explore Agentic Context Engineering (ACE), a novel framework designed to overcome limitations like "brevity bias" and "context collapse" that plague traditional LLM context adaptation methods. ACE transforms model contexts into continuously evolving, structured "playbooks" by employing a modular process of generation, reflection, and curation. We discuss how this approach enables scalable, self-improving agents, yielding substantial performance gains on complex tasksโ€”such as +10.6% on agent benchmarksโ€”while significantly lowering adaptation latency and cost.
    -------- ย 
    16:35
  • Less is More: Recursive Reasoning with Tiny Networks
    This episode explores the Tiny Recursive Model (TRM), a novel approach that leverages a single, tiny network (as small as 7M parameters) to tackle hard puzzle tasks like Sudoku, Maze, and ARC-AGI. We investigate how this simplified, recursive reasoning strategy achieves significantly higher generalization and outperforms much larger models, including complex Large Language Models (LLMs) and the Hierarchical Reasoning Model (HRM). Discover why this "less is more" philosophy is leading to breakthroughs in parameter-efficient AI reasoning by simplifying complex mathematical theories and biological justifications.
    -------- ย 
    14:28
  • Understanding the 4 Main Approaches to LLM Evaluation - from Sebastian Raschka
    Demystify Large Language Model (LLM) evaluation, breaking down the four main methods used to compare models: multiple-choice benchmarks, verifiers, leaderboards, and LLM judges. We offer a clear mental map of these techniques, distinguishing between benchmark-based and judgment-based approaches to help you interpret performance scores and measure progress in your own AI development. Discover the pros and cons of each methodโ€”from MMLU accuracy checks to the dynamic Elo ranking systemโ€”and learn why combining them is key to holistic model assessment.Original blog post: https://magazine.sebastianraschka.com/p/llm-evaluation-4-approaches
    -------- ย 
    15:16

More Technology podcasts

About Build Wiz AI Show

> Building the future of products with AI-powered innovation. < Build Wiz AI Show is your go-to podcast for transforming the latest and most interesting papers, articles, and blogs about AI into an easy-to-digest audio format. Using NotebookLM, we break down complex ideas into engaging discussions, making AI knowledge more accessible. Have a resource youโ€™d love to hear in podcast form? Send us the link, and we might feature it in an upcoming episode! ๐Ÿš€๐ŸŽ™๏ธ
Podcast website

Listen to Build Wiz AI Show, Your Undivided Attention and many other podcasts from around the world with the radio.net app

Get the free radio.net app

  • Stations and podcasts to bookmark
  • Stream via Wi-Fi or Bluetooth
  • Supports Carplay & Android Auto
  • Many other app features
Social
v7.23.9 | ยฉ 2007-2025 radio.de GmbH
Generated: 10/13/2025 - 11:49:27 PM