llm

blockchain AI machine learning AI governance agentic AI AI agents

June 11, 2025

Why you need diverse third-party data to deliver trusted AI solutions

Diverse, high-quality data is a prerequisite for reliable, effective, and ethical AI solutions.

David Gibson, Michael Geden

Business Hub data data quality data diversity AI responsible ai

May 30, 2025

Getting rid of the pain for developers on Shopify

Ryan welcomes Glen Coates, VP of Product at Shopify, to dive into the intricacies of managing a developer-focused product, the challenges of backwards compatibility, and the implications of AI and LLMs in Shopify's development environment.

Phoebe Sajor

The Stack Overflow Podcast dev tools shopify AI ai assistant developer tools ecommerce

May 2, 2025

Improving on a 30-year-old hardware architecture

At HumanX 2025, Ryan chatted with Rodrigo Liang, cofounder and CEO of SambaNova, about reimagining 30-year-old hardware architecture for the AI era.

The Stack Overflow Podcast generative AI AI hardware architecture software architecture humanx

April 28, 2025

How self-supervised learning revolutionized natural language processing and gen AI

Self-supervised learning is a key advancement that revolutionized natural language processing and generative AI. Here’s how it works and two examples of how it is used to train language models.

8 comments

February 28, 2025

“Translation is the tip of the iceberg”: A deep dive into specialty models

Olga Beregovaya, VP of AI at Smartling, joins Ryan and Ben to explore the evolution and specialization of language models in AI.

The Stack Overflow Podcast AI generative AI machine learning

February 26, 2025

Variants of LoRA

Want to train a specialized LLM on your own data? The easiest way to do this is with low rank adaptation (LoRA), but many variants of LoRA exist.

February 24, 2025

Writing tests with AI, but not LLMs

How Diffblue leverages machine learning techniques to write effective unit tests.

The Stack Overflow Podcast software development software engineering AI generative AI autonomous agents automation unit tests testing java refactoring Productivity copilot ai coding dev tools developer tools

December 27, 2024

Breaking up is hard to do: Chunking in RAG applications

A look at some of the current thinking around chunking data for retrieval-augmented generation (RAG) systems.

Ryan Donovan

retrieval augmented generation

December 5, 2024

Four approaches to creating a specialized LLM

Wondering how to go about creating an LLM that understands your custom data? Start here.

December 3, 2024

Even high-quality code can lead to tech debt

Ben talks with Eran Yahav, a former researcher on IBM Watson who’s now the CTO and cofounder of AI coding company Tabnine. Ben and Eran talk about the intersection of software development and AI, the evolution of program synthesis, and Eran’s path from IBM research to startup CTO. They also discuss how to balance the productivity and learning gains of AI coding tools (especially for junior devs) against very real concerns around quality, security, and tech debt.

The Stack Overflow Podcast AI generative AI software development tech debt ai assistant ai coding

November 26, 2024

Your docs are your infrastructure

Fabrizio Ferri-Benedetti, who spent many years as a technical writer for Splunk and New Relic, joins Ben and Ryan for a conversation about the evolving role of documentation in software development. They explore how documentation can (and should) be integrated with code, the importance of quality control, and the hurdles to maintaining up-to-date documentation. Plus: Why technical writers shouldn’t be afraid of LLMs.

The Stack Overflow Podcast AI generative AI documentation technical writing software development

November 12, 2024

A student of Geoff Hinton, Yann LeCun, and Jeff Dean explains where AI is headed

Ben and Ryan are joined by Matt Zeiler, founder and CEO of Clarifai, an AI workflow orchestration platform. They talk about how the transformer architecture supplanted convolutional neural networks in AI applications, the infrastructure required for AI implementation, the implications of regulating AI, and the value of synthetic data.

The Stack Overflow Podcast AI data training machine learning synthetic data

November 8, 2024

One of the world’s biggest web scrapers has some thoughts on data ownership

Or Lenchner, CEO of Bright Data, joins Ben and Ryan for a deep-dive conversation about the evolving landscape of web data. They talk through the challenges involved in data collection, the role of synthetic data in training large AI models, and how public data access is becoming more restrictive. Or also shares his thoughts on the importance of transparency in data practices, the likely future of data regulation, and the philosophical implications of more people using AI to innovate and solve problems.

The Stack Overflow Podcast AI data training data ethics data scraping

November 7, 2024

No code, only natural language: Q&A on prompt engineering with Professor Greg Benson

Will prompt engineering replace the coder’s art or will software engineers who understand code still have a place in future software lifecycles?

Ryan Donovan

5 comments

October 31, 2024

A brief summary of language model finetuning

Here's a (brief) summary of language model finetuning, the various approaches that exist, their purposes, and what we know about how they work.

October 25, 2024

Tragedy of the (data) commons

Ben chats with Shayne Longpre and Robert Mahari of the Data Provenance Initiative about what GenAI means for the data commons. They discuss the decline of public datasets, the complexities of fair use in AI training, the challenges researchers face in accessing data, potential applications for synthetic data, and the evolving legal landscape surrounding AI and copyright.

The Stack Overflow Podcast AI data

September 26, 2024

Masked self-attention: How LLMs learn relationships between tokens

Masked self-attention is the key building block that allows LLMs to learn rich relationships and patterns between the words of a sentence. Let’s build it together from scratch.

September 20, 2024

Detecting errors in AI-generated code

Ben chats with Gias Uddin, an assistant professor at York University in Toronto, where he teaches software engineering, data science, and machine learning. His research focuses on designing intelligent tools for testing, debugging, and summarizing software and AI systems. He recently published a paper about detecting errors in code generated by LLMs. Gias and Ben discuss the concept of hallucinations in AI-generated code, the need for tools to detect and correct those hallucinations, and the potential for AI-powered tools to generate QA tests.

The Stack Overflow Podcast

September 13, 2024

The world’s largest open-source business has plans for enhancing LLMs

Ben and Ryan talk to Scott McCarty, Global Senior Principal Product Manager for Red Hat Enterprise Linux, about the intersection between LLMs (large language models) and open source. They discuss the challenges and benefits of open-source LLMs, the importance of attribution and transparency, and the revolutionary potential for LLM-driven applications. They also explore the role of LLMs in code generation, testing, and documentation.

The Stack Overflow Podcast AI Open Source

August 22, 2024

LLMs evolve quickly. Their underlying architecture, not so much.

The decoder-only transformer architecture is one of the most fundamental ideas in AI research.

August 15, 2024

Practical tips for retrieval-augmented generation (RAG)

Retrieval-augmented generation (RAG) is one of the best (and easiest) ways to specialize an LLM over your own data, but successfully applying RAG in practice involves more than just stitching together pretrained models.

retrieval augmented generation generative AI contributed

July 16, 2024

The framework helping devs build LLM apps

Ben and Eira talk with LlamaIndex CEO and cofounder Jerry Liu, along with venture capitalist Jerry Chen, about how the company is making it easier for developers to build LLM apps. They touch on the importance of high-quality training data to improve accuracy and relevance, the role of prompt engineering, the impact of larger context windows, and the challenges of setting up retrieval-augmented generation (RAG).

The Stack Overflow Podcast retrieval augmented generation generative AI

July 9, 2024

We chat search from both sides now

In this episode, Ben chats with Elastic software engineering director Paul Oremland along with Stack Overflow staff software engineer Steffi Grewenig and senior software developer Gregor Časar about vector databases and semantic search from both the vendor and customer perspectives.