Loading…

Stack Overflow Business

Stack Internal: the knowledge intelligence layer that powers enterprise AI.

Stack Data Licensing: decades of verified, technical knowledge to boost AI performance and trust.

Stack Ads: engage developers where it matters — in their daily workflow.

llm

Related Tags

AI agentic AI data hardware software development machine learning

Subscribe to the podcast

Get The Stack Overflow Podcast at your favorite listening service.

Apple Podcasts Overcast Pocket Casts Spotify RSS feed

July 7, 2026

Agent orchestration is so two-years ago

Ryan welcomes Saahil Jain, CTO of You.com, to discuss why building agents with a 2024 mindset is a mistake as modern models improve at long-horizon tasks, why heavy orchestration layers can hurt model performance more than help it, and why the 2026 competitive edge actually comes from information retrieval and unique data paired with end-to-end evaluation.

The Stack Overflow Podcast AI agentic AI architecture

June 30, 2026

Why intent prediction needs more than an LLM

Ryan sits down with Frank Portman, CTO at Yobi, to talk about why next-token prediction, though great for language, isn’t the right inductive bias for forecasting human behavior. They discuss how Yobi builds a “foundation model of behavior” using transformers and graph neural networks instead of chat-style LLMs, and what it takes to run millions of personalization decisions per second while keeping consumer data private.

The Stack Overflow Podcast agentic AI AI

April 28, 2026

Your LLM issues are really data issues

Ryan welcomes Harsha Chintalapani, co-founder and CTO at Collate and co-creator of Open Metadata, to the show to discuss why AI and LLMs struggle with real-time, structured production data.

The Stack Overflow Podcast data AI data quality

March 10, 2026

Even the chip makers are making LLMs

Ryan welcomes Kari Briski, NVIDIA’s VP of Generative AI Software for Enterprise, to the show to explore how a chip manufacturer got into the model development game.

The Stack Overflow Podcast AI chip nvidia

February 3, 2026

Generating text with diffusion (and ROI with LLMs)

Two guests for the price of one! This episode has two interviews recorded at AWS re:Invent back in December.

The Stack Overflow Podcast AI hardware

December 23, 2025

Settle down, nerds. AI is a normal technology

Ryan welcomes Anil Dash, writer and former Stack Overflow board member, back to the show to discuss how AI is not a magical technology, but rather the normal next step in computing’s evolution. They explore the importance of democratizing access to technology, the unique challenges that LLMs’ non-determinism poses, and how developers can keep Stack Overflow’s ethos of community alive in a world of AI.

The Stack Overflow Podcast software development AI coding community

December 19, 2025

Last week in AWS re:Invent with Corey Quinn

Ryan sits down with Corey Quinn, Chief Cloud Economist at Duckbill, at AWS re:Invent to get Corey’s patented snarky take on all the happenings from the conference.

The Stack Overflow Podcast aws cloud computing infrastructure management software development AI agentic AI

August 19, 2025

The server-side rendering equivalent for LLM inference workloads

Ryan is joined by Tuhin Srivastava, CEO and co-founder of Baseten, to explore the evolving landscape of AI infrastructure and inference workloads, how the shift from traditional machine learning models to large-scale neural networks has made GPU usage challenging, and the potential future of hardware-specific optimizations in AI.

July 8, 2025

Attention isn’t all we need; we need ownership too

Ryan welcomes Illia Polosukhin, co-author of the original "Attention Is All You Need" Transformers paper and co-founder of NEAR, on the show to talk about the development and impact of the Transformers model, his perspective on modern AI and machine learning as an early innovator of the tech, and the importance of decentralized, user-owned AI utilizing the blockchain.

blockchain AI machine learning AI governance agentic AI AI agents The Stack Overflow Podcast

June 11, 2025

Why you need diverse third-party data to deliver trusted AI solutions

Diverse, high-quality data is a prerequisite for reliable, effective, and ethical AI solutions.

David Gibson, Michael Geden

Business Hub data data quality data diversity AI responsible ai

May 30, 2025

Getting rid of the pain for developers on Shopify

Ryan welcomes Glen Coates, VP of Product at Shopify, to dive into the intricacies of managing a developer-focused product, the challenges of backwards compatibility, and the implications of AI and LLMs in Shopify's development environment.

The Stack Overflow Podcast dev tools shopify AI ai assistant developer tools ecommerce

May 2, 2025

Improving on a 30-year-old hardware architecture

At HumanX 2025, Ryan chatted with Rodrigo Liang, cofounder and CEO of SambaNova, about reimagining 30-year-old hardware architecture for the AI era.

The Stack Overflow Podcast generative AI AI hardware architecture software architecture humanx

April 28, 2025

How self-supervised learning revolutionized natural language processing and gen AI

Self-supervised learning is a key advancement that revolutionized natural language processing and generative AI. Here’s how it works and two examples of how it is used to train language models.

Cameron R. Wolfe, PhD

February 28, 2025

“Translation is the tip of the iceberg”: A deep dive into specialty models

Olga Beregovaya, VP of AI at Smartling, joins Ryan and Ben to explore the evolution and specialization of language models in AI.

The Stack Overflow Podcast AI generative AI machine learning

February 26, 2025

Variants of LoRA

Want to train a specialized LLM on your own data? The easiest way to do this is with low rank adaptation (LoRA), but many variants of LoRA exist.

Cameron R. Wolfe, PhD

February 24, 2025

Writing tests with AI, but not LLMs

How Diffblue leverages machine learning techniques to write effective unit tests.

The Stack Overflow Podcast software development software engineering AI generative AI autonomous agents automation unit tests testing java refactoring Productivity copilot ai coding dev tools developer tools

December 27, 2024

Breaking up is hard to do: Chunking in RAG applications

A look at some of the current thinking around chunking data for retrieval-augmented generation (RAG) systems.

retrieval augmented generation

December 5, 2024

Four approaches to creating a specialized LLM

Wondering how to go about creating an LLM that understands your custom data? Start here.

Cameron R. Wolfe, PhD

December 3, 2024

Even high-quality code can lead to tech debt

Ben talks with Eran Yahav, a former researcher on IBM Watson who’s now the CTO and cofounder of AI coding company Tabnine. Ben and Eran talk about the intersection of software development and AI, the evolution of program synthesis, and Eran’s path from IBM research to startup CTO. They also discuss how to balance the productivity and learning gains of AI coding tools (especially for junior devs) against very real concerns around quality, security, and tech debt.

The Stack Overflow Podcast AI generative AI software development tech debt ai assistant ai coding

November 26, 2024

Your docs are your infrastructure

Fabrizio Ferri-Benedetti, who spent many years as a technical writer for Splunk and New Relic, joins Ben and Ryan for a conversation about the evolving role of documentation in software development. They explore how documentation can (and should) be integrated with code, the importance of quality control, and the hurdles to maintaining up-to-date documentation. Plus: Why technical writers shouldn’t be afraid of LLMs.

The Stack Overflow Podcast AI generative AI documentation technical writing software development

November 12, 2024

A student of Geoff Hinton, Yann LeCun, and Jeff Dean explains where AI is headed

Ben and Ryan are joined by Matt Zeiler, founder and CEO of Clarifai, an AI workflow orchestration platform. They talk about how the transformer architecture supplanted convolutional neural networks in AI applications, the infrastructure required for AI implementation, the implications of regulating AI, and the value of synthetic data.

The Stack Overflow Podcast AI data training machine learning synthetic data

November 8, 2024

One of the world’s biggest web scrapers has some thoughts on data ownership

Or Lenchner, CEO of Bright Data, joins Ben and Ryan for a deep-dive conversation about the evolving landscape of web data. They talk through the challenges involved in data collection, the role of synthetic data in training large AI models, and how public data access is becoming more restrictive. Or also shares his thoughts on the importance of transparency in data practices, the likely future of data regulation, and the philosophical implications of more people using AI to innovate and solve problems.

The Stack Overflow Podcast AI data training data ethics data scraping

November 7, 2024

No code, only natural language: Q&A on prompt engineering with Professor Greg Benson

Will prompt engineering replace the coder’s art or will software engineers who understand code still have a place in future software lifecycles?

October 31, 2024

A brief summary of language model finetuning

Here's a (brief) summary of language model finetuning, the various approaches that exist, their purposes, and what we know about how they work.

Cameron R. Wolfe, PhD