Shreya Shankar is a PhD student at UC Berkeley in the EECS department. This episode explores how Large Language Models (LLMs) are revolutionizing the processing of unstructured enterprise data like text documents and PDFs. It introduces DocETL, a framework using a MapReduce approach with LLMs for semantic extraction, thematic analysis, and summarization at scale.Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.Detailed show notes - with links to many references - can be found on The Data Exchange web site.
--------
27:46
--------
27:46
Building Production-Grade RAG at Scale
Douwe Kiela, Founder and CEO of Contextual AI, discusses why RAG isn’t obsolete despite massive context windows, explaining how RAG 2.0 represents a fundamental shift to treating retrieval-augmented generation as an end-to-end trainable system. Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.Detailed show notes - with links to many references - can be found on The Data Exchange web site.
--------
31:24
--------
31:24
Unlocking AI Superpowers in Your Terminal
Zach Lloyd, Founder/CEO of Warp, joins the podcast to discuss how Warp is revolutionizing the command-line terminal by integrating AI.Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.Detailed show notes - with links to many references - can be found on The Data Exchange web site.
--------
44:59
--------
44:59
From Vibe Coding to Autonomous Agents
Jackie Brosamer and Brad Axen from Block discuss codename goose (Goose), their open-source AI agent designed to automate complex engineering and knowledge work.Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.Detailed show notes - with links to many references - can be found on The Data Exchange web site.
--------
51:16
--------
51:16
How a Public-Benefit Startup Plans to Make Open Source the Default for Serious AI
Oumi Labs CEO Manos Koukoumidis lays out a vision for “unconditionally open” foundation models—where data, code, weights, and recipes are all transparent and reproducible—arguing this is the only path to production-grade, trustworthy AI. Subscribe to the Gradient Flow Newsletter 📩 https://gradientflow.substack.com/Subscribe: Apple · Spotify · Overcast · Pocket Casts · AntennaPod · Podcast Addict · Amazon · RSS.Detailed show notes - with links to many references - can be found on The Data Exchange web site.
A series of informal conversations with thought leaders, researchers, practitioners, and writers on a wide range of topics in technology, science, and of course big data, data science, artificial intelligence, and related applications. Anchored by Ben Lorica (@BigData), the Data Exchange also features a roundup of the most important stories from the worlds of data, machine learning and AI. Detailed show notes for each episode can be found on https://thedataexchange.media/ The Data Exchange podcast is a production of Gradient Flow [https://gradientflow.com/].