Breaking up is hard to do: Chunking in RAG applications
A look at some of the current thinking around chunking data for retrieval-augmented generation (RAG) systems.
Ben Popper chats with Keith Babo, Head of Product at Solo.io, about how the API security landscape is changing in the era of GenAI. They talk through the role of governance in AI, the importance of data protection, and how API gateways can enhance security and functionality. Keith shares his insights on retrieval-augmented generation (RAG) systems, protecting PII, and the necessity of human-in-the-loop AI development.
Retrieval-augmented generation (RAG) is one of the best (and easiest) ways to specialize an LLM over your own data, but successfully applying RAG in practice involves more than just stitching together pretrained models.
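One of those practical details is how you split documents into chunks before indexing them for retrieval. As a minimal illustration, here's a fixed-size chunker with overlap in Python; this is a hypothetical sketch, and the chunk_size and overlap values are placeholders to tune for your own data, not recommendations:

```python
# Hypothetical sketch: the simplest chunking strategy a RAG pipeline
# might start with -- fixed-size chunks with a small overlap.

def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into chunks of ~chunk_size characters, each sharing
    `overlap` characters with its neighbor."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]

# Toy usage with small sizes so the overlap is visible:
sample = "RAG pipelines split documents into chunks before indexing them."
for chunk in chunk_text(sample, chunk_size=30, overlap=10):
    print(repr(chunk))
```

The overlap means a sentence cut at a chunk boundary still appears whole in at least one chunk, the simplest guard against losing context at the seams; fancier strategies split on sentence, paragraph, or semantic boundaries instead.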
Ben and Eira talk with LlamaIndex CEO and cofounder Jerry Liu, along with venture capitalist Jerry Chen, about how the company is making it easier for developers to build LLM apps. They touch on the importance of high-quality training data to improve accuracy and relevance, the role of prompt engineering, the impact of larger context windows, and the challenges of setting up retrieval-augmented generation (RAG).
The home team is joined by Michael Foree, Stack Overflow’s director of data science and data platform, and occasional cohost Cassidy Williams, CTO at Contenda, for a conversation about long context windows, retrieval-augmented generation, and how Databricks’ new open LLM could change the game for developers. Plus: How will FTX co-founder Sam Bankman-Fried’s sentence of 25 years in prison reverberate in the blockchain and crypto spaces?
The home team discusses the challenges (hardware and otherwise) of building AI models at scale, why major players like Meta are open-sourcing their AI projects, what Apple’s recent changes mean for developers in the EU, and Perplexity AI’s new approach to search.
Machine learning scientist, author, and LLM developer Maxime Labonne talks with Ben and Ryan about his role as lead machine learning scientist, his contributions to the open-source community, the value of retrieval-augmented generation (RAG), and the process of fine-tuning and unfreezing layers in LLMs. The team talks through various challenges and considerations in implementing GenAI, from data quality to integration.
This is part two of our conversation with Roie Schwaber-Cohen, Staff Developer Advocate at Pinecone, about retrieval-augmented generation (RAG) and why it’s crucial for the success of your AI initiatives.
On this episode: Roie Schwaber-Cohen, Staff Developer Advocate at Pinecone, joins Ben and Ryan to break down what retrieval-augmented generation (RAG) is and why the concept is central to the AI conversation. This is part one of our conversation, so tune in next time for the thrilling conclusion.
Retrieval-augmented generation (RAG) is a strategy that helps address both LLM hallucinations and out-of-date training data.
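To see how it does that, it helps to look at the shape of the flow itself. The sketch below is hypothetical: embed, vector_store, and llm stand in for whatever embedding model, vector index, and LLM your stack actually uses, and the point is the retrieve-augment-generate pattern rather than any particular library.

```python
# Hypothetical sketch of the retrieve-then-generate loop at the heart of RAG.
# The `embed`, `vector_store`, and `llm` arguments are placeholders.

def answer_with_rag(question: str, vector_store, embed, llm, k: int = 3) -> str:
    # 1. Retrieve: find the k stored chunks most similar to the question.
    query_vector = embed(question)
    context_chunks = vector_store.search(query_vector, top_k=k)

    # 2. Augment: ground the prompt in the retrieved text, so the answer
    #    draws on current data rather than the model's (possibly stale)
    #    training set.
    context = "\n\n".join(context_chunks)
    prompt = (
        "Answer using only the context below. If the answer isn't in the "
        f"context, say you don't know.\n\nContext:\n{context}\n\n"
        f"Question: {question}"
    )

    # 3. Generate: the instruction to stay inside the supplied context is
    #    what curbs hallucination.
    return llm(prompt)
```

The "say you don't know" instruction is doing real work here: it gives the model an escape hatch instead of forcing it to invent an answer when retrieval comes up empty.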