<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Justified Posteriors]]></title><description><![CDATA[Home of the Justified Posteriors podcast + essays on economics from Andrey Fradkin and Seth Benzell.]]></description><link>https://empiricrafting.substack.com</link><image><url>https://substackcdn.com/image/fetch/$s_!JrtW!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04c2b84-5e8f-43d1-b922-74edea8b528a_1280x1280.png</url><title>Justified Posteriors</title><link>https://empiricrafting.substack.com</link></image><generator>Substack</generator><lastBuildDate>Wed, 17 Jun 2026 12:53:30 GMT</lastBuildDate><atom:link href="https://empiricrafting.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Andrey Fradkin]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[sbenzell@gmail.com]]></webMaster><itunes:owner><itunes:email><![CDATA[sbenzell@gmail.com]]></itunes:email><itunes:name><![CDATA[Andrey Fradkin]]></itunes:name></itunes:owner><itunes:author><![CDATA[Andrey Fradkin]]></itunes:author><googleplay:owner><![CDATA[sbenzell@gmail.com]]></googleplay:owner><googleplay:email><![CDATA[sbenzell@gmail.com]]></googleplay:email><googleplay:author><![CDATA[Andrey Fradkin]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Ioana Marinescu on Insuring Workers for AI, Monopsony, and Philosophy]]></title><description><![CDATA[This week we&#8217;re joined by Ioana Marinescu, labor economist at the University of Pennsylvania&#8217;s School of Social Policy & Practice, former Principal Economist at the U.S.]]></description><link>https://empiricrafting.substack.com/p/ioana-marinescu-on-insuring-workers</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/ioana-marinescu-on-insuring-workers</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Mon, 15 Jun 2026 12:03:34 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/201759768/72c33e66d2cbc0f890b52e7b1437bd4f.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>This week we&#8217;re joined by <strong><a href="https://marinescu.eu/">Ioana Marinescu</a></strong>, labor economist at the University of Pennsylvania&#8217;s <a href="https://sp2.upenn.edu/person/ioana-e-marinescu/">School of Social Policy &amp; Practice</a>, former <strong>Principal Economist at the U.S. Department of Justice Antitrust Division</strong>, and a member of <strong><a href="https://www.anthropic.com/economic-index">Anthropic&#8217;s Economic Advisory Board</a></strong>. Ioana is one of the people who put labor-market monopsony on the antitrust map, and she&#8217;s now thinking hard about what the social safety net should look like if AI hits the labor market the way the optimists (and the doomers) say it might.</p><p>We start with her <em><a href="https://www.digitalistpapers.com/vol2/marinescu">Digitalist Papers</a></em><a href="https://www.digitalistpapers.com/vol2/marinescu"> essay</a>, which proposes a flexible, two-tier toolkit: <strong>AI Adjustment Insurance</strong> (extended unemployment benefits + retraining + wage insurance, modeled on Trade Adjustment Assistance) for the churn scenario, and a scalable <strong>Digital Dividend</strong> &#8212; a broad-based cash transfer funded by a small tax on the digital sector &#8212; for the world where the jobs don&#8217;t come back. Along the way: whether to make policy now or wait, what counts as the &#8220;status quo,&#8221; moral hazard in mass unemployment, the TAA wage-insurance result that <em>repaid its own subsidy</em>, and Andrey&#8217;s &#8220;we can&#8217;t afford UBI&#8221; pushback.</p><p>Then we get into her new model with Konrad Kording, <strong><a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6051694">(Artificial) Intelligence Saturation and the Future of Work&#8221;</a></strong>&#8212; why splitting the economy into an intelligence sector and a physical sector implies that output and wages <em>saturate</em> even as AI scales to infinity, the robots-vs-LLMs debate, and whether to just relabel &#8220;physical&#8221; as the <strong>non-automatable sector</strong>. We close with her DOJ years: defining monopsony, the <em>transmigrante</em> used-car collusion-and-murder case, the Penguin Random House&#8211;Simon &amp; Schuster merger (yes, Stephen King testified), antitrust and AI, and a lightning round on ikigai, Camus, and Rawls vs. Mill.</p><div><hr></div><h2>Links &amp; References</h2><p><strong>Ioana&#8217;s work</strong></p><ul><li><p><a href="https://marinescu.eu/">marinescu.eu</a> &#8212; Ioana&#8217;s website &#183; <a href="https://sp2.upenn.edu/person/ioana-e-marinescu/">Penn SP2 faculty page</a></p></li><li><p>Ioana Marinescu, <a href="https://www.digitalistpapers.com/vol2/marinescu">&#8220;Resilient by Design: Dual Safety Nets for Workers in the AI Economy&#8221;</a> &#8212; <em>The Digitalist Papers, Vol. 2: The Economics of Transformative AI</em> (<a href="https://www.digitalistpapers.com/volume2">volume</a>)</p></li><li><p>Konrad Kording &amp; Ioana Marinescu, <a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6051694">&#8220;(Artificial) Intelligence Saturation and the Future of Work&#8221;</a> &#8212; working paper (<a href="https://www.brookings.edu/articles/artificial-intelligence-saturation-and-the-future-of-work/">Brookings write-up &amp; interactive tool</a>). The model finds wage growth can <em>reverse</em> once roughly a third of intelligence tasks are automated.</p></li><li><p>Ioana Marinescu, <a href="https://www.nber.org/system/files/chapters/c15320/c15320.pdf">comments on Betsey Stevenson&#8217;s chapter</a> &#8212; NBER, <em>The Economics of Artificial Intelligence: An Agenda</em> (the ikigai discussion)</p></li></ul><p><strong>Concepts, papers &amp; people discussed</strong></p><ul><li><p><a href="https://en.wikipedia.org/wiki/Trade_Adjustment_Assistance">Trade Adjustment Assistance (TAA)</a> &#8212; the template for Ioana&#8217;s adjustment insurance; the wage-insurance component that got people back to work faster and was net fiscally positive</p></li><li><p>Betsey Stevenson, <a href="https://www.nber.org/system/files/chapters/c15319/c15319.pdf">&#8220;Artificial Intelligence, Income, Employment, and Meaning&#8221;</a> &#8212; the post-AGI meaning / ikigai argument Ioana was commenting on</p></li><li><p>&#8220;GPTs are GPTs&#8221; &#8212; Eloundou, Manning, Mishkin &amp; Rock, <em><a href="https://arxiv.org/abs/2303.10130">GPTs are GPTs: An Early Look at the Labor Market Impact Potential of LLMs</a></em> &#8212; the occupational LLM-exposure measure (&#8221;Eloundou et al. / Daniel Rock&#8221;) correlated with COVID-era telework</p></li><li><p>Pascual Restrepo &#8212; job-market work on <strong>skill mismatch and structural unemployment</strong> during automation waves</p></li><li><p>Daron Acemoglu &amp; Pascual Restrepo, <a href="https://www.journals.uchicago.edu/doi/10.1086/705716">&#8220;Robots and Jobs: Evidence from US Labor Markets&#8221;</a>. </p></li><li><p>Albert Camus, <em><a href="https://en.wikipedia.org/wiki/The_Myth_of_Sisyphus">The Myth of Sisyphus</a></em>; <strong>ikigai</strong> (Japanese: &#8220;reason for being&#8221;)</p></li><li><p><a href="https://en.wikipedia.org/wiki/Baumol_effect">Baumol&#8217;s cost disease</a> </p></li><li><p>John Rawls and John Stuart Mill (<em>Utilitarianism</em>)</p></li></ul><p><strong>Antitrust &amp; the DOJ</strong></p><ul><li><p>The <strong>DOJ Antitrust Division</strong>, monopsony in the labor market, and the <a href="https://www.justice.gov/atr/2023-merger-guidelines">2023 Merger Guidelines</a></p></li><li><p><a href="https://www.npr.org/2022/11/01/1133375227/federal-judge-blocks-penguin-random-house-from-buying-simon-schuster">Judge blocks the Penguin Random House&#8211;Simon &amp; Schuster merger</a> (2022) on a labor theory of harm to authors &#8212; Stephen King testified for the government</p></li><li><p>The <em><a href="https://www.justice.gov/opa/pr/eight-individuals-plead-guilty-wide-ranging-scheme-monopolize-transmigrante-forwarding">transmigrante</a></em> used-car export case &#8212; collusion (and worse) in the US-to-Latin America used-car trade</p></li><li><p>Anthropic&#8217;s <a href="https://www.anthropic.com/economic-index">Economic Index</a> and Economic Advisory Board</p></li><li><p>Leopold Aschenbrenner&#8217;s <em><a href="https://situational-awareness.ai/">Situational Awareness</a></em> &#8212; the &#8220;we&#8217;ll have to nationalize it&#8221; argument referenced on consolidation</p></li></ul><p><strong>Previously on Justified Posteriors</strong></p><ul><li><p>Our  episode on the Anthropic Economic Index.</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;1b53c661-1baf-4013-af5c-edc92197034c&quot;,&quot;caption&quot;:&quot;In this episode of Justified Posteriors, we dive into the paper \&quot;Which Economic Tasks Are Performed with AI: Evidence from Millions of Claude Conversations.\&quot; We analyze Anthropic's effort to categorize how people use their Claude AI assistant across different economic tasks and occupations, examining both the methodology and implications with a critical&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Claude Just Refereed the Anthropic Economic Index&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:191755003,&quot;name&quot;:&quot;Andrey Fradkin&quot;,&quot;bio&quot;:&quot;Professor writing about AI, digital technology, marketing, economics, and academia. Also, some personal introspection along the way.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!qqBF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb729e424-5fcf-4691-886d-a65500401344_1175x1177.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null},{&quot;id&quot;:3215096,&quot;name&quot;:&quot;Seth Benzell&quot;,&quot;bio&quot;:&quot;Co-Host of Justified Posteriors Podcast https://empiricrafting.substack.com/podcast&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F1351ec23-f5f1-4613-8844-04c8f814335b_1030x687.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2025-04-21T13:03:08.565Z&quot;,&quot;cover_image&quot;:&quot;https://substack-video.s3.amazonaws.com/video_upload/post/161752047/bea82dc5-c62d-4f06-8835-a553adc022a6/transcoded-00001.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://empiricrafting.substack.com/p/claude-just-refereed-the-anthropic&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:&quot;bea82dc5-c62d-4f06-8835-a553adc022a6&quot;,&quot;id&quot;:161752047,&quot;type&quot;:&quot;podcast&quot;,&quot;reaction_count&quot;:1,&quot;comment_count&quot;:0,&quot;publication_id&quot;:2684979,&quot;publication_name&quot;:&quot;Justified Posteriors&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!JrtW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04c2b84-5e8f-43d1-b922-74edea8b528a_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div></li></ul><p><strong>Our sponsor</strong></p><ul><li><p>This episode is brought to you by <a href="https://www.reveliolabs.com/">Revelio Labs</a>, the leading provider of labor-economics data, available to academics on <a href="https://wrds-www.wharton.upenn.edu/">WRDS</a>.</p></li></ul><div><hr></div><h2>Chapters</h2><ul><li><p>(00:00) Intro &amp; sponsor</p></li><li><p>(00:47) The <em>Digitalist Papers</em> proposal: a flexible safety net for the AI labor shock &#8212; and why make policy <em>now</em></p></li><li><p>(03:48) Why unemployment insurance isn&#8217;t enough, and the Trade Adjustment Assistance template</p></li><li><p>(05:51) What counts as the &#8220;status quo&#8221;? Banning AI vs. letting it run</p></li><li><p>(07:42) How much to insure: moral hazard, mass unemployment, and the three parts of AI Adjustment Insurance</p></li><li><p>(11:15) Skill mismatch (Restrepo), and how do you certify a layoff was &#8220;due to AI&#8221;?</p></li><li><p>(14:45) Did TAA buy social buy-in for free trade? Underfunding &#8212; and the wage insurance that repaid its own subsidy</p></li><li><p>(16:38) &#8220;Would Hillary be president?&#8221; General-equilibrium pushback and the ski-instructor problem</p></li><li><p>(19:28) Will the new jobs still be there in two years? The lump-of-labor fallacy</p></li><li><p>(22:09) Policy B: the <strong>Digital Dividend</strong> &#8212; unconditional, broad-based cash from a small digital-sector tax</p></li><li><p>(23:52) How to fund it: a sales tax, a sovereign-style fund, and deliberately slowing diffusion a little</p></li><li><p>(26:00) &#8220;We can&#8217;t afford UBI&#8221;: productivity growth, 0.5% vs. the deficit, and setting money aside <em>ex ante</em></p></li><li><p>(30:47) Taxing digital goods: VPNs, evasion, and land-value taxes</p></li><li><p>(34:23) The motte-and-bailey worry, and the other reasons to like UBI</p></li><li><p>(36:05) The new model: <strong>(Artificial) Intelligence Saturation</strong> &#8212; intelligence vs. physical sectors, and the telework &#215; AI-exposure correlation</p></li><li><p>(40:14) Gross complements: why output and wages saturate even with infinite intelligence</p></li><li><p>(42:23) Won&#8217;t enough intelligence just automate the physical world? Robots vs. LLMs</p></li><li><p>(45:52) &#8220;15% by 2030&#8221;: humanoid robots, cost, and bespoke vs. general-purpose machines</p></li><li><p>(47:58) Baumol, the &#8220;humanness sector,&#8221; and relabeling physical as the non-automatable sector</p></li><li><p>(48:52) The capital-share / profit-share puzzle: if they&#8217;re complements, why has the intelligence share risen?</p></li><li><p>(50:25) The DOJ years: monopsony, and what the Antitrust Division actually does <em>(mid-roll sponsor at 51:29)</em></p></li><li><p>(54:52) &#8220;Assassinating rival CEOs&#8221;: the <em>transmigrante</em> collusion-and-murder case</p></li><li><p>(58:12) Favorite cases: Stephen King, the publisher merger, and the chicken-farmer monopsony settlement</p></li><li><p>(1:01:30) Antitrust and AI: foundation models, consolidation, and the natural-monopoly question</p></li><li><p>(1:06:05) Slowing AI by allowing market power; Leopold, nationalization, and diminishing returns vs. the singularity</p></li><li><p>(1:09:27) Substitutability, the AK economy, and short-run vs. long-run wages</p></li><li><p>(1:10:59) Lightning round: ikigai, Camus, and the myth of Sisyphus</p></li><li><p>(1:12:44) Can we build market-like mechanisms for ikigai? Loneliness and coordination costs</p></li><li><p>(1:14:13) The Anthropic Economic Advisory Board and the Economic Index</p></li><li><p>(1:15:21) What&#8217;s next: monopsony and industrial policy</p></li><li><p>(1:17:59) Favorite philosopher: Rawls vs. John Stuart Mill</p></li><li><p>(1:19:45) Sign-off</p></li></ul><div><hr></div><p><em>Justified Posteriors is the podcast that updates its beliefs about the economics of AI and technology, hosted by <a href="https://www.andreyfradkin.com/">Andrey Fradkin</a> and <a href="https://www.sethbenzell.com/">Seth Benzell</a>. If we changed your priors, <a href="https://empiricrafting.substack.com/">subscribe</a>, share it with a friend, and keep your posteriors justified.</em></p><div><hr></div><h2>Intro &amp; Sponsor [00:00 &#8211; 00:47]</h2><p>[00:00:06] <strong>Seth:</strong> Welcome to Justified Posteriors, the podcast that updates beliefs about the economics of AI and technology. I&#8217;m Seth Benzell, excited to learn about what AI is other than what my bubbe says after I spill hot water on her, coming to you from Chapman University in sunny Southern California.</p><p><strong>Andrey:</strong> And I&#8217;m Andrey Fradkin, coming to you from San Francisco, California. We&#8217;re very thankful to our sponsors at Revelio Labs, purveyors of fine data products. And we&#8217;re very excited to have Ioana Marinescu join us today. Welcome to the show, Ioana.</p><p><strong>Ioana:</strong> Thank you. I&#8217;m so glad to be here.</p><div><hr></div><h2>Make Policy Now: A Flexible Safety Net [00:47 &#8211; 05:51]</h2><p>[00:00:47] <strong>Andrey:</strong> To get started &#8212; you have this very provocative, interesting piece in the Digitalist Papers about various social policy solutions for transformative AI scenarios. Could you tell us about the piece?</p><p><strong>Ioana:</strong> Absolutely. As part of doing this Digitalist piece, I was thinking, as somebody who has worked a lot on the social safety net: what do we do if AI leads to a lot of job loss, like many people are saying it would? We&#8217;ll talk later about the various scenarios, but assuming that&#8217;s at least a possibility we have to acknowledge, what would you want to have from a policy perspective?</p><p>And so I was really thinking hard about devising a flexible policy toolkit that will be able to address issues in the labor market no matter how big the shock is. That was the overarching theme of the policy design I&#8217;m proposing &#8212; just to start a discussion. I&#8217;ve tried to propose some helpful options, but it&#8217;s really with the idea of, let&#8217;s talk about doing something like this, what are the pros and cons.</p><p>[00:02:10] <strong>Andrey:</strong> So what are the options on the menu for &#8212; let&#8217;s say AI comes along, a lot of people lose their jobs. The first thing we should get started with: do you think we should be making policy today, or should we wait until something happens and then make policy?</p><p><strong>Ioana:</strong> I think it&#8217;s very important to make policy today, but in a flexible way &#8212; meaning the policy cannot depend on some very specific detail of exactly how AI is going to impact the labor market, because we don&#8217;t know exactly what&#8217;s going to happen. It&#8217;s important to put the policy in place today because the political process is very long, so it may not be able to come online quickly enough when we really need it. That&#8217;s one reason.</p><p>The other is &#8212; and I work a lot on social insurance &#8212; for workers, they want to and should feel insured. &#8220;Whatever happens, we the government have got you covered.&#8221; If we don&#8217;t have that, and we&#8217;re just waiting for bad stuff to happen, that defeats the purpose of having a social safety net. That&#8217;s a core reason I think it&#8217;s good to have something in place sooner rather than later, even before all the effects of AI on the labor market have materialized.</p><p><strong>Seth:</strong> Something that will automatically kick in.</p><p><strong>Ioana:</strong> Exactly.</p><p><strong>Andrey:</strong> And why is &#8212; we do have some programs like that, like unemployment insurance. Why is unemployment insurance not enough in its current form?</p><p><strong>Ioana:</strong> Unemployment insurance is incredibly valuable, but if we have a big shock like AI, it&#8217;s going to affect a lot of people who will not necessarily lose their job forever, but simply have to change jobs &#8212; and that&#8217;s very costly. The whole purpose of the social safety net is to help people through those transitions.</p><p>The thing is, we have AI, and the way it&#8217;s being deployed is a policy choice. We could say we&#8217;re going to try to stop AI, but we&#8217;re not doing that &#8212; and I&#8217;m not saying we should or shouldn&#8217;t, just that it&#8217;s a policy choice. We&#8217;re saying we&#8217;re not going to stop AI, we&#8217;re going to let it be. But then some people are going to get hurt, at least in the short run, and we need to do something so those people have something to fall back on. Just like with trade: we decided to have free trade, we knew some people were going to get hurt and lose their wages, and we put in place policies like trade adjustment assistance &#8212; which inspired some of my proposals &#8212; to make sure the policy we&#8217;d chosen wouldn&#8217;t leave people on the side of the road. That policy hasn&#8217;t completely worked, because it was underfunded, but the big point is: the technology is exciting and has a lot of benefits, we&#8217;ve decided to deploy it quickly, and some people are going to bear a cost. We just want to make sure we help those people.</p><div><hr></div><h2>What Counts as the &#8220;Status Quo&#8221;? [05:51 &#8211; 07:42]</h2><p>[00:05:51] <strong>Seth:</strong> I&#8217;m really excited to hear your specific ideas, but I&#8217;m curious about this framing of what counts as the status quo and what counts as the policy shock. In the case of trade, you might say the status quo is protectionism and the policy shock is allowing trade &#8212; so it makes sense to frame the social insurance relative to that: you shouldn&#8217;t be worse off relative to introducing trade. But with AI it&#8217;s not obvious. It seems like the policy shock would be to <em>ban</em> AI &#8212; AI would happen without the shock. So why not say, &#8220;If you banned AI, there should be social insurance to help the people who would have been better off if we&#8217;d allowed AI to go full throttle&#8221;? How do you think about what the status quo is here?</p><p><strong>Ioana:</strong> I don&#8217;t know that the status quo is necessarily the correct reference point &#8212; that&#8217;s something you could debate. My point is rather that a lot of these technologies have a lot of ramifications, and there&#8217;s a decision we make about how we want to control it. Collectively, we&#8217;ve made the decision that we&#8217;re not going to try to control it too much in its effect on the labor market. And therefore we need to deal with the consequences of helping people who might get hurt, at least in the short run &#8212; even though, hopefully, in the long run it will be great for everybody, including them. So we can reassure people that it&#8217;s going to be fine. That&#8217;s part of the goal of the policy.</p><div><hr></div><h2>AI Adjustment Insurance: Moral Hazard and Three Components [07:42 &#8211; 14:45]</h2><p>[00:07:42] <strong>Andrey:</strong> How do you know how much to insure? Full insurance would be very expensive, but it would also create a lot of moral hazard &#8212; we <em>do</em> want people making choices in anticipation of AI. If everyone just puts their head in the sand and pretends it&#8217;s not happening...</p><p><strong>Seth:</strong> &#8220;The automation insurance is too good. I <em>want</em> to get automated &#8212; give me that automation insurance.&#8221;</p><p><strong>Ioana:</strong> So the insurance is going to be incomplete. And I&#8217;ll talk in a moment about what I&#8217;m proposing, which is modeled on trade adjustment assistance. But it&#8217;s also important to know that some of the moral hazard issues in unemployment insurance &#8212; which is something I&#8217;ve studied a lot &#8212; are much less economically important during times of high unemployment.</p><p>If what happens is huge amounts of unemployment &#8212; not necessarily forever, but a lot of people needing to change jobs &#8212; then typically there are too many people looking for jobs relative to the number of vacancies. In that case, the fact that some people might put in less effort to find a job, sending fewer applications, is actually fine, because collectively they&#8217;re sending a lot of applications. They&#8217;re shooting themselves in the foot by competing too aggressively for the limited number of jobs.</p><p>[00:09:25] <strong>Ioana:</strong> So even if more generous unemployment insurance seems like it&#8217;s desensitizing people from looking hard for a job, the end effect doesn&#8217;t necessarily reduce the number of jobs found, because there are too many unemployed people relative to jobs. My work has shown that in prior situations like COVID. So in a situation like that, we should be far less worried about the disincentive effects of more generous unemployment benefits.</p><p>And maybe now I&#8217;ll come to the policy &#8212; I call it AI Adjustment Insurance. It includes more generous unemployment benefits, meaning they last longer (again modeled on trade adjustment assistance); additional training; and a third component, wage insurance. Wage insurance means that if you find a job with a lower wage than your prior job, the policy covers part of that gap. That actually encourages you to take a new job even if the wage is a little lower &#8212; so it&#8217;s directly pro-reallocation. The training and wage insurance push reallocation, which counteracts the concern that longer unemployment benefits might discourage job search.</p><p><strong>Seth:</strong> Maybe one of your sources is Pascual Restrepo&#8217;s job-market paper &#8212; this idea that during an automation-driven unemployment wave you get a skill mismatch. Lots of people are applying, but they have the wrong skills, and that increases friction in the market. From an efficiency standpoint, it might not be the worst thing if some of the people with out-of-date skills aren&#8217;t looking for jobs.</p><p><strong>Ioana:</strong> Exactly right. That would increase the friction, and this policy, if well-implemented, has the ability to manage that friction better.</p><p>[00:11:52] <strong>Seth:</strong> There are many reasons a person might lose their job. They could be losing it because they&#8217;re doing a bad job, or because of macroeconomic things that have nothing to do with AI &#8212; or it could literally be AI. Recently Coinbase claimed to fire a bunch of people because of AI, and a cynic might say their stock price was going down and crypto was struggling. Do you care to identify that? Is that an important part of the policy?</p><p><strong>Ioana:</strong> Of course you need to identify it, and there are going to be inclusion and exclusion errors &#8212; it&#8217;s not foolproof. Some people who should be eligible won&#8217;t be deemed eligible, and vice versa. But I believe we can put in place a process with reasonable accuracy. That was the case with trade adjustment assistance: the company had to certify that the job was lost due to trade. You can question it, but there was a process. In the case of AI, similarly &#8212; and I haven&#8217;t fully thought this through; if someone wants to do it, let&#8217;s do it. People in these offices know a lot about how to do this.</p><p>An example: if you&#8217;re in an occupation that&#8217;s highly exposed, and there&#8217;s recently been investment in AI at the firm &#8212; buying new software that uses AI to do business services &#8212; that might plausibly amount to a layoff due to AI. It won&#8217;t be 100% accurate. You could do it at a micro level, trying to figure out whether this particular job got automated, or you could do a macro counterfactual simulation &#8212; in a different universe there would have been 30,000 more taxi-driver jobs, so you attribute some percentage of that to your loss. But you can&#8217;t do that if you need to decide <em>right now</em> whether this person gets the service. That&#8217;s interesting from a research perspective, but operationally we&#8217;d have to determine eligibility &#8212; maybe just being in an exposed occupation, though that might be too crude.</p><div><hr></div><h2>Did Trade Adjustment Assistance Work? [14:45 &#8211; 19:28]</h2><p>[00:14:45] <strong>Seth:</strong> Before we move to your other policy idea &#8212; this trade adjustment policy that was supposed to get big societal buy-in for free trade was a glowing success, right? Everybody loves free trade...</p><p><strong>Ioana:</strong> The policy didn&#8217;t do well because it was underfunded. The amount of funding is a strict cap decided by Congress, so they just couldn&#8217;t spend more. The number of people who received it at all is very small relative to the number exposed to trade. <em>However</em>, for those who did receive it &#8212; and remember, I&#8217;d argue it wasn&#8217;t enough &#8212; it worked well. There&#8217;s a really cool paper looking at the wage-insurance dimension.</p><p>So: I lost my job due to trade, I take a new job that pays less, and the wage insurance covers part of that gap for two years. What happened, which is fascinating, is that this component helped people return to work quicker &#8212; and it was fiscally net beneficial from the government&#8217;s point of view, because they returned to jobs that were no lower-paid than they&#8217;d otherwise have taken, started paying payroll taxes again, and essentially repaid their own subsidy over time. So at least based on that experience, it&#8217;s a highly effective way to support people through the transition. While the scale of TAA was too small, for those who got it, they benefited a good bit, and it was effective even from a fiscal perspective.</p><p><strong>Seth:</strong> If TAA was more generous, would Hillary Clinton be president?</p><p><strong>Ioana:</strong> Who knows? But I&#8217;m going to push back on the argument. Earlier you were making the case that with unemployment insurance, moral hazard isn&#8217;t an issue because in general equilibrium there aren&#8217;t enough jobs. But if we expanded the size of TAA, the general-equilibrium effects could also swamp the benefits &#8212; maybe only a small share of those people could effectively have found jobs, and if you gave the benefits to many more of them, they wouldn&#8217;t be able to take advantage.</p><p><strong>Seth:</strong> You mean because the fiscal cost would become significant?</p><p><strong>Ioana:</strong> No &#8212; they wouldn&#8217;t be able to find the jobs.</p><p><strong>Seth:</strong> Right. If a small number of people get wage insurance and there are some jobs they can take, they take them. But if you give wage insurance to everyone, many wouldn&#8217;t be able to find jobs &#8212; or they&#8217;d cannibalize jobs from people who would have gotten them anyway. Say I worked at a factory, and now I decide to become a ski instructor, and you give me wage insurance for that. There have to be equilibrium effects.</p><p><strong>Ioana:</strong> This is close to my heart, because I&#8217;m a big skier. For sure this increases competition for jobs wherever people decide to go. But from a micro perspective, wage insurance helps because people are now willing to expand to jobs that pay a little less. And mind you, it&#8217;s only for two years, so you need some commitment that the job is reasonable &#8212; and by that time you can increase your wage through returns to experience.</p><p>It&#8217;s similar to research on job-search assistance: you put people on benefits and help them &#8212; or even require them &#8212; to apply to more jobs. What the research shows is that if you do that for a lot of people within a given labor market, it stops being effective, because they&#8217;re trampling on each other&#8217;s toes.</p><div><hr></div><h2>Will the Jobs Come Back? The Lump-of-Labor Fallacy [19:28 &#8211; 22:09]</h2><p>[00:19:28] <strong>Andrey:</strong> Is there an underlying assumption in your proposal that the occupations people move into won&#8217;t go away within those two years? This is one of the big challenges &#8212; we have enormous uncertainty about exactly which labor markets are going to be affected negatively, and maybe some positively. What do we do about that?</p><p><strong>Ioana:</strong> I don&#8217;t think there&#8217;s any guarantee that in two years the places they go will be safe. But if we still have the policy and AI continues to provoke churn, they still have it to rely on and can find another job. Also &#8212; and this is less true for non-economists, but a lot of economists imagine there&#8217;s a fixed number of jobs, so if we lose a lot, there are only so few left and everyone&#8217;s fighting over them. That&#8217;s not how it works. Everything is connected, and especially if the technology improves production, there are positive spillover effects that make other jobs more productive. So there will also be a lot of job creation.</p><p>At the same time, we don&#8217;t know exactly what those jobs will be, and it could take several rounds of adjustment. I don&#8217;t want to rule out that it could be very bad, but I don&#8217;t think a massive <em>net</em> loss in the number of jobs is the most likely scenario. It could be very bad in the sense that a lot of people have to change jobs, which is difficult. But as someone who&#8217;s prudent and wants to be flexible &#8212; that&#8217;s what my second policy is designed to address. What if a lot of jobs are lost forever? Then we need something to fall back on.</p><div><hr></div><h2>The Digital Dividend [22:09 &#8211; 26:00]</h2><p>[00:22:09] <strong>Andrey:</strong> Do you want to tell us about that one?</p><p><strong>Ioana:</strong> The second policy addresses a situation where there&#8217;s durable mass unemployment &#8212; not just people needing to find a different job while other sectors grow, but a lot of jobs disappearing forever, not replaced by new ones, so some people are structurally, permanently unemployed. What do we do with them? This is especially important in the US, where most of the social safety net depends on you either having a job or looking for one. Even food stamps: as a so-called able-bodied adult, you can&#8217;t get food stamps unless you have a job or are looking. If there are just no jobs for a large number of people, there&#8217;s not much to fall back on.</p><p>So that&#8217;s what policy B &#8212; the <strong>Digital Dividend</strong> &#8212; is meant to address. The idea is a cash transfer that&#8217;s unconditional and broad-based. In the specific proposal I give it to everyone, but you could make it broad-based so a lot of people benefit. You might fund it with a tax on the digital sector. I don&#8217;t want to tax AI specifically, but all sectors that can immediately benefit from it &#8212; a broad-based tax, so it&#8217;s harder to avoid.</p><p><strong>Seth:</strong> A profit tax? A consumption tax? An income tax?</p><p><strong>Ioana:</strong> I was thinking a sales tax, just to make it easier &#8212; but this is something we can talk about.</p><p><strong>Andrey:</strong> No sales tax on GPUs, or...?</p><p><strong>Ioana:</strong> Just a sales tax on all digital companies. We can talk about other options &#8212; this is a beginning. The point is it would be very small, and the broader you make it, the smaller it can be while still creating revenue. You&#8217;d invest it in a fund, and the returns come back to people as cash.</p><p>Why this structure? The tax side can slightly slow the diffusion of the technology &#8212; and there are theory papers showing that if there are labor-market frictions and credit constraints, reallocation is painful for workers, so it can be optimal to slow diffusion a little. We&#8217;re not banning anything. And the revenue lets you pay people the cash benefit if we end up in the no-jobs situation. This policy can be expanded &#8212; that&#8217;s the flexibility. You can start very small, almost zero tax, but if we have mass unemployment you scale it up and grow the base to the whole economy.</p><div><hr></div><h2>Can We Afford UBI? [26:00 &#8211; 34:23]</h2><p>[00:26:00] <strong>Andrey:</strong> All of us have had conversations with technologists who jump straight to UBI as the solution to all issues with AI, and this is one version of it. What I always tell them is that we can&#8217;t afford it &#8212; and we <em>deeply</em> can&#8217;t afford it. There&#8217;s some future where productivity gains are so large that the numbers pencil out, but I&#8217;ve yet to see a tractable, plausible version. If we did it today, the amount per person would be trivial. And for UBI to truly work &#8212; as something that lets people not work &#8212; it needs to be a massive transfer. It can&#8217;t be even a thousand dollars a month.</p><p><strong>Ioana:</strong> I hear you. But where the technologists are consistent with themselves is that they often think AI is going to revolutionize productivity. If that&#8217;s true, then it will be possible to have a reasonably high UBI. And that&#8217;s the whole point of my proposal &#8212; it&#8217;s conditional, we scale it up as needed. I could even envision it not being completely universal, but it should be broad-based, so people have an income to fall back on if technology is deleting a lot of jobs.</p><p><strong>Andrey:</strong> Let&#8217;s say a plausible scenario that a lot of economists believe: AI increases per-capita GDP growth by about 0.5 percentage points per year relative to baseline. We&#8217;re five years down the road, and in the limit it&#8217;ll be big enough to support everyone &#8212; but we&#8217;re not there yet, and a lot of people are out of jobs by that point. We might not get the super-productive world until well after we have economic displacement massive enough that wage subsidies won&#8217;t work.</p><p><strong>Seth:</strong> And we need that 0.5 percentage points just to deal with the current deficits &#8212; we&#8217;re already counting on it. But here&#8217;s what&#8217;s natural to me: you start today and make plausible projections &#8212; what&#8217;s a world with another percentage point of GDP growth a year worth, what&#8217;s a world with two? &#8212; and you set aside a fraction of that in advance for a UBI or digital dividend. You make the policy <em>now</em>, rather than after the crazy thing happens. It has that automatic logic I really like.</p><p><strong>Ioana:</strong> That&#8217;s exactly the point. I&#8217;m a little concerned that after a lot of people lose their jobs and the situation looks grim, it might be more difficult to say, &#8220;Now let&#8217;s have a big reshuffling of money to help these people,&#8221; especially when some people have made a lot of money. Whereas if we can commit more ex ante to putting money aside, it&#8217;s a bit of a veil-of-ignorance situation &#8212; we don&#8217;t know for sure who the winners and losers will be. So it can be socially easier to agree to put a parachute in place now, before you know whether you&#8217;re a winner or a loser. It&#8217;s a political-economy argument, but I think it&#8217;s important, because I really worry we get there without enough to support people, and the winners say, &#8220;Ah, too bad.&#8221;</p><p>[00:30:47] <strong>Seth:</strong> My questions are more on the tax side than the spending side. We&#8217;ve seen many efforts to tax digital goods, and they&#8217;ve had a lot of problems. Where was a Netflix video watched? Where was an ad viewed? People use VPNs; these companies have no physical locations and move easily. How convinced are you that we could actually raise significant revenue from a digital tax when my VPN is in the Cayman Islands, so I&#8217;m not paying a sales tax in America?</p><p><strong>Ioana:</strong> That definitely needs to be worked out. With tax policy you always have to think about incidence and evasion &#8212; not necessarily illegal evasion, just ways around it. I haven&#8217;t done a detailed implementation calculation, but it&#8217;s probably feasible to find a version that works. You won&#8217;t eliminate evasion &#8212; that&#8217;s always true with taxes &#8212; but you want to think ahead of the incentives. That&#8217;s why we&#8217;re economists. It&#8217;s not like you slap on a tax and the money comes in; there are behavioral adjustments you need to foresee with a coherent design. In principle, it should be possible to raise a good chunk of money if we wanted to.</p><p><strong>Seth:</strong> Related to that &#8212; it&#8217;s not crazy to think all the rents go to energy producers or even landowners of energy resources. And if we&#8217;re taxing them, we might disincentivize energy production, which raises the cost of living. Land-value tax solves all problems forever, of course &#8212; I do like land-value taxation.</p><p><strong>Ioana:</strong> In the long run you want to tax the inelastic input, and land is the ultimate inelastic input &#8212; that&#8217;s something to think about in the long run. But the reason a digital tax can make sense in the short run is this idea of a small, moderate slowdown &#8212; not massive &#8212; that has the benefit of accumulating some capital to help people with later. As I said in my piece, I&#8217;d definitely expand the tax base at some later point, once we think the transition has happened.</p><div><hr></div><h2>The Motte-and-Bailey Worry [34:23 &#8211; 36:05]</h2><p>[00:34:23] <strong>Seth:</strong> Are you worried about a motte-and-bailey situation? Your proposal is very modest, but I could see politicians using that logic to implement a massive tax-and-transfer scheme today under the pretense that it&#8217;s about the AI future &#8212; and I really worry we can&#8217;t afford it.</p><p><strong>Ioana:</strong> That gets into the social welfare function &#8212; you could call it politics, or simply what we want &#8212; and as economists, it&#8217;s not really our job; as citizens we can have views. Different politicians legitimately have different ideas about what&#8217;s important. Within my piece, I was proposing the digital dividend as a solution to AI unemployment, but there are other reasons to like UBI. I&#8217;ve written about UBI before and discussed some of them. So maybe you also like UBI for those reasons &#8212; that&#8217;s a tenable view, and it&#8217;s also fine to disagree. There could be a push to go big right now, and if that&#8217;s ultimately what people want and they convince the rest of us, that&#8217;s just how the democratic process works.</p><div><hr></div><h2>Intelligence Saturation: The Model [36:05 &#8211; 42:23]</h2><p>[00:36:05] <strong>Seth:</strong> Maybe this is a good time to transition into the new macroeconomic model you came out with, which is informing your beliefs about how radical the changes might be &#8212; &#8220;(Artificial) Intelligence Saturation and the Future of Work.&#8221; I love a title with parentheses in it. Just to lay it out for listeners: it&#8217;s a very neoclassical way of thinking about automation, but with new twists &#8212; a nested constant-elasticity-of-substitution model with an intelligence sector and a physical sector. Why is it important to think about an intelligence sector and a physical sector as complementary, rather than one thing?</p><p><strong>Ioana:</strong> It&#8217;s important to distinguish them because of an empirical fact &#8212; it&#8217;s not yet in the paper, but I&#8217;ll add it in the next version. If you look across occupations at which were most teleworked during COVID versus their exposure to AI &#8212; the Eloundou et al. measure of LLM exposure &#8212; there&#8217;s a very strong correlation. The more an occupation was teleworked during COVID, the higher its exposure to AI, and conversely. By &#8220;physical&#8221; I mean an in-person job, where you need a physical human body. That doesn&#8217;t necessarily mean working with your hands &#8212; teaching in person is physical in my definition but not manual. It&#8217;s just an in-person job.</p><p><strong>Seth:</strong> I was curious about those examples, because you gave the example of an in-person lawyer giving oral arguments &#8212; but didn&#8217;t we do those online during COVID?</p><p><strong>Ioana:</strong> Some of it we did online, but for the economics, the important thing is how substitutable these things are. Online teaching is its own thing &#8212; it has a place and a function, but it&#8217;s not the same as in-person teaching. They&#8217;re differentiated products; you can&#8217;t easily replace one with the other. So based on the fact that AI exposure and the ability to do remote work are highly correlated, that justifies the distinction between physical and intelligence. There are also fundamental limitations of the physical world that are much more stringent than the limitations you meet scaling the virtual world. It&#8217;s much more difficult to expand human bodies and physical capital &#8212; that&#8217;s very slow &#8212; whereas you can scale up AI incredibly fast. It&#8217;s still not costless &#8212; data centers and so on &#8212; but you can do a lot, really quickly. This distinction is critical to understanding how AI could affect the economy.</p><p>[00:40:14] <strong>Seth:</strong> So you&#8217;ve got an intelligence sector growing really fast and a physical sector that maybe doesn&#8217;t grow as fast. Let&#8217;s roll with the assumption that the two are gross complements &#8212; you need both to have a lot of output; you can&#8217;t just have the peanut butter or the jelly. What are the conclusions of the model?</p><p><strong>Ioana:</strong> The core conclusion is that if physical and intelligence are complements, then AI can grow the intelligence side incredibly &#8212; to infinite intelligence &#8212; but as long as the physical sector stays fixed (or grows much slower), the impact of growing AI <em>saturates</em> on both output and wages. By saturating, I mean output goes to a finite limit, a ceiling. Even with infinite intelligence and infinite AI, output is strictly bounded. Wages increase too, because they go together with output, but they hit a ceiling. That&#8217;s what we call intelligence saturation.</p><p>This is super important because a lot of technologists see the progress of AI and imagine the whole economy could expand at a similar rate. This makes the strong point that here&#8217;s a scenario I think is quite plausible: you could expand like crazy in AI and still only hit a ceiling in output.</p><div><hr></div><h2>Robots vs. LLMs [42:23 &#8211; 48:52]</h2><p>[00:42:23] <strong>Andrey:</strong> I understand the thought experiment, but saying intelligence goes to infinity while the physical is a constraint is a little strange &#8212; it&#8217;s pretty clear that with enough intelligence we&#8217;ll figure out how to make robots work, even self-replicating, learning systems.</p><p><strong>Seth:</strong> There are two different parameters in the model, right? You&#8217;ve got the output of the intelligence sector, and then the automatability of the physical sector. So Andrey&#8217;s intuition is: if we had a gazillion intelligence, don&#8217;t we fully automate the physical sector?</p><p><strong>Ioana:</strong> I want to distinguish two things. One is whether humans can be replaced with robots. Robots are improving, but relatively slowly compared to AI. So comparatively, it&#8217;s much more advantageous to replace people on the AI-replaceable side than in applications where you need to be there in person. That&#8217;s not to say there&#8217;s no progress in robotics.</p><p><strong>Andrey:</strong> I&#8217;m just representing the technologist viewpoint &#8212; that this is true <em>for now</em>.</p><p><strong>Ioana:</strong> The argument I&#8217;m going to make is based on history &#8212; the history of technology specifically, and you can think history won&#8217;t be the same. We&#8217;ve had physical robots for the longest time. With the Industrial Revolution we created many more and improved them a lot, and even before LLMs they improved tremendously with machine learning and semi-autonomous systems. These are very real improvements, and they have replaced some jobs in manufacturing &#8212; Daron Acemoglu has papers on that. But it wasn&#8217;t like, &#8220;wow,&#8221; thanks to all that intelligence in the system, it still takes a lot to get there.</p><p>I&#8217;m totally willing to think this new technology can make them even better &#8212; but I&#8217;m skeptical it gets a <em>lot, lot, lot</em> better. It&#8217;s the saturation argument: there&#8217;s a fundamental limit. The cost of robots is fundamental &#8212; you have to use materials to make them and maintain them. We&#8217;ve tried for centuries to improve robots, and they have improved, but I don&#8217;t know how much more you can improve them with this new technology.</p><p><strong>Andrey:</strong> A humanoid robot that&#8217;s smarter than a human seems like a pretty big improvement that&#8217;s plausible.</p><p><strong>Seth:</strong> 15% chance, according to a recent survey of economists and AI researchers. By 2030.</p><p><strong>Ioana:</strong> The question is partially the cost &#8212; not that it&#8217;s technically impossible, but what&#8217;s the cost of the whole thing relative to a human. At least in the medium run, I think it&#8217;s not very plausible you&#8217;ll bring that cost down a great deal. Also, a lot of these robots aren&#8217;t very versatile, unlike AI. That&#8217;s the cool thing with AI &#8212; it&#8217;s super general-purpose; it can use all the tools we had before. But most industrial robots are bespoke, meant for a particular application &#8212; that&#8217;s how you make them cheap and effective.</p><p>I feel somewhat confident that in the short-to-medium run it will be very difficult to make it cost-effective to have robots replace people in most jobs. Harder to tell the further out you go. However, I believe everything you could do on a computer could get automated in the medium term &#8212; possibly even the research I&#8217;m doing. That&#8217;s a different matter, because it&#8217;s all based on computers.</p><p><strong>Andrey:</strong> Even if what I&#8217;m saying is true, the Baumol-style logic would still hold, right? If we still want human teachers even when robots are available...</p><p><strong>Seth:</strong> Maybe don&#8217;t call it the physical sector &#8212; call it the <em>humanness</em> sector.</p><p><strong>Ioana:</strong> You could relabel it. I think &#8220;physical&#8221; is relevant at least in the medium run, but to make it more future-proof, you could relabel the physical sector as the <strong>non-automatable sector</strong>, whatever that turns out to be. As long as there exists a non-automatable sector, that&#8217;s where people will work, and the mechanics of the model are identical.</p><div><hr></div><h2>The Capital-Share Puzzle [48:52 &#8211; 50:35]</h2><p>[00:48:52] <strong>Seth:</strong> I love playing around with these neoclassical models &#8212; automatable part, non-automatable part. I&#8217;ve been doing it for a decade, and one challenge that pushes historically in a different direction: if intelligence and physical stuff are gross complements, you&#8217;d expect that as you get more intelligent stuff, its share of national income would go <em>down</em>. But over the last 50 years we&#8217;ve seen a huge explosion in education in the US, and yet the educated share of income has been going <em>up</em>. So how do I think about it actually looking more like gross substitutes?</p><p><strong>Ioana:</strong> But we&#8217;ve also had the share of capital going up. Part of the last 30 years or so is the ICT revolution, which is somewhat similar to the prior version &#8212; really you could say it&#8217;s the same thing, just different stages of an AI revolution that&#8217;s at an early stage. During that, the share of capital has been going up, and research suggests&#8212;</p><p><strong>Seth:</strong> Is it the share of capital, or the profit share? That&#8217;s an important question we&#8217;ll come to in a minute.</p><div><hr></div><h2>Inside the DOJ Antitrust Division [50:35 &#8211; 58:12]</h2><p>[00:50:35] <strong>Andrey:</strong> You did this stint at the DOJ &#8212; you were on leave from being a professor. Could you tell us what led you to work there, and a bit about your work?</p><p><strong>Ioana:</strong> At the time I was doing work on monopsony power in the labor market &#8212; the difference between wages and the marginal productivity of labor. The Biden administration commissioned a report on labor-market monopsony, and the people doing that called me up; that&#8217;s how I learned about the job. I thought, &#8220;That sounds really interesting &#8212; to do antitrust enforcement as an economist.&#8221; I said yes, and I was lucky enough to get the job.</p><p>[00:51:29] <strong>Seth:</strong> <em>[Sponsor break]</em> For those of you playing along at home, now is your chance to think about how this conversation has changed your priors. This chance to contemplate your posteriors is sponsored by Revelio Labs &#8212; a leading provider of labor-economics data and data services for companies, academics, and independent researchers. Revelio combines comprehensive micro-level data on employee profiles, job postings, and sentiment with standardizations, mappings, and enrichments, all to make the data useful without making your modeling decisions for you. It can be aggregated to company, market, or industry, and used to study everything from career trajectories to occupational transformation to the impact of AI on labor demand. Revelio data is available on WRDS &#8212; so if you&#8217;re an academic with a good library, go see if you have access already. And if not, reach out to their excellent economics team.</p><p>[00:52:43] <strong>Ioana:</strong> It was an incredible experience. What I did there is &#8212; I was the principal economist&#8212;</p><p><strong>Andrey:</strong> Can you pause and define monopsony for our listeners?</p><p><strong>Ioana:</strong> Monopsony power is the idea that employers are able to pay workers less than their marginal productivity. Under perfect competition, the wage exactly equals the marginal productivity of labor &#8212; whatever value the worker brings, the company pays them for it. With monopsony power, workers get paid less than what they bring to the company. One of the key reasons is the lack of competition among employers. Intuitively, in the extreme &#8212; a literal monopsony, only one employer &#8212; that employer doesn&#8217;t need to pay much to keep you. Whereas if there are many employers, they bid up the price of labor by competing for you, and in the competitive extreme you get paid your marginal product, because if someone underpays you, a neighboring employer recruits you away.</p><p><strong>Andrey:</strong> Great &#8212; so you can continue.</p><p><strong>Ioana:</strong> So when I was at the DOJ, I was the principal economist. In the antitrust division, the job is to enforce the antitrust laws, and they do two broad things. One is examining mergers and potentially blocking them if they lead to anti-competitive effects. The other is so-called conduct, which can include criminal conduct like literal collusion&#8212;</p><p><strong>Seth:</strong> Assassinating rival CEOs.</p><p><strong>Ioana:</strong> A situation kind of like that &#8212; which is unbelievable. You think it only happens in the movies, but it happens in real life. Economists don&#8217;t get too involved in those cases, because it&#8217;s more of a whodunit.</p><p><strong>Andrey:</strong> No fun. Let us in &#8212; we want to be detectives.</p><p><strong>Ioana:</strong> If you want to look this up, the keyword is <em>transmigrante</em> &#8212; a trade in used cars from the US being traded toward Latin America. There was unbelievable murder among the companies involved, around collusion. If you were a traitor to the scheme... yes. Anyway, that&#8217;s the monopsony power.</p><p><strong>Andrey:</strong> That&#8217;s one way to get monopsony power.</p><p><strong>Ioana:</strong> This is where the FBI goes &#8212; not really the province of economists. The other category is a bunch of behaviors by firms that hinder competition, the big one being monopolization &#8212; trying to remain or become a monopoly by kneecapping rivals. In my role I oversaw the expert analysis group, a team of about 50 PhDs, mostly economists and data scientists. Whenever we had a case against a company, there&#8217;d be data gathering and economic analysis to support arguments about why a behavior hinders competition and might, for example, increase prices &#8212; or, in a labor-market case, how employers hinder their employees&#8217; ability to find another job, and so can pay them less.</p><p>For someone who worked on monopsony, being able to think about how mergers should be blocked if they lead to greater monopsony power was incredibly rewarding. Intuitively, if two employers merge, that reduces competition for workers and can lower wages or degrade non-wage dimensions of jobs. That&#8217;s now officially in the merger guidelines &#8212; which was unbelievable. How often do you do research and then get to implement the thing?</p><div><hr></div><h2>Favorite Cases, and Antitrust Meets AI [58:12 &#8211; 1:09:27]</h2><p>[00:58:12] <strong>Andrey:</strong> Is there a monopsony case you worked on that you&#8217;re particularly excited about?</p><p><strong>Ioana:</strong> There were a number. They&#8217;re described in papers we write every year reporting on finished cases, in the <em>Review of Industrial Organization</em>. One &#8212; I only caught the tail end &#8212; was a publisher merger. Big publishers were trying to merge, and the argument was that if they merged, authors trying to sell their books would get lower payments. The judge found it very convincing. We even had Stephen King testify about how the merger would reduce what he could get paid.</p><p><strong>Andrey:</strong> He couldn&#8217;t afford all the cocaine he needed.</p><p><strong>Ioana:</strong> Ultimately the merger was blocked, and they decided not to appeal. That was the first merger in the US blocked exclusively on a labor theory of harm &#8212; that it would lead to lower payment for authors.</p><p>The other is an interesting case around small farmers who raise chickens. They work for a processor as subcontractors &#8212; small farmers, not workers, but worker-like. They raise chicken and sell it to a big integrator. There was a contractual term that if you left to work for a different company, you&#8217;d have to pay a big chunk of cash to leave. We argued this significantly restricted competition for these farmers&#8217; services and lowered their pay, because it&#8217;s hard to leave if you have to pay to do so. We won in the sense that there was a settlement &#8212; the company said, &#8220;Fine, we&#8217;re not doing it anymore.&#8221;</p><p>I did a lot of work on agriculture, because farming is often an area with few opportunities to sell labor or goods, so monopsony is prevalent there. This tells listeners that monopsony &#8212; whether there&#8217;s competition among buyers &#8212; isn&#8217;t just about workers. Workers are a big application, but it can also be a more B2B situation, where many small businesses or independent contractors sell to big buyers with market power.</p><p>[01:01:30] <strong>Andrey:</strong> Shifting slightly &#8212; something people are beginning to think about is antitrust and AI. Have you thought about that? Do you have opinions?</p><p><strong>Ioana:</strong> It&#8217;s really important to stay vigilant in AI and antitrust. We&#8217;ve had prior tech giants the government has gone after &#8212; Microsoft, Google &#8212; and in these industries there can be an opportunity to monopolize. We&#8217;re not there right now, but that&#8217;s why we have watchdogs like the Antitrust Division. There&#8217;s been a lot of partnership and financing deals, which might ultimately lead to consolidation, and that should be watched in the ordinary course of antitrust enforcement.</p><p>Why does this matter? We want to maintain low prices and high-quality services for consumers and businesses &#8212; and a big part of AI is used by businesses. If you want this technology to lead to greater productivity through adoption, you want to keep it cheap and good. What usually happens with consolidation is the product gets worse and prices get higher than they&#8217;d be in a more competitive industry. It&#8217;s natural for companies to try to monopolize &#8212; that&#8217;s why we have the Antitrust Division and the FTC, and also so that companies considering certain steps recognize some might not be lawful and stay away from them, preserving competition.</p><p><strong>Seth:</strong> That&#8217;s generally the argument, but sometimes we have natural monopolies. Some argue these big foundation-model builders &#8212; OpenAI, Anthropic &#8212; pouring giant amounts into training runs might be natural monopolies. Maybe we just want one company doing the one giant training run, and the right way to regulate isn&#8217;t competition policy but tax policy or some other government control. What do you think?</p><p><strong>Ioana:</strong> It could be, but it&#8217;s not clear yet, because there are still many foundation models, including outside the US &#8212; the data is out there. You need data and power to train, but you can do it multiple times if you have the resources. There&#8217;s also a difference: in the US there&#8217;s a big focus on the biggest, fastest foundation models, but in places like China it&#8217;s much more focused on applications, and there you see a lot of competition. Some company might want to have it all &#8212; &#8220;Why don&#8217;t I have <em>all</em> the applications?&#8221; &#8212; and that&#8217;s why we have the antitrust authority to stop that. In some situations a utilities-type regulation can make sense, but I think we&#8217;re not there yet. For now I&#8217;d take the position that we should promote competition, and we&#8217;ll see where the dust settles. If you prematurely favor monopoly, that can actually hinder the development of the technology. So I&#8217;ll err on the side of competition.</p><p><strong>Andrey:</strong> One interesting thing here: if you&#8217;re the person who thinks we should slow things down, then you should be rooting for <em>more</em> market power. Leopold famously argued the technology is so powerful we&#8217;re going to have to nationalize it &#8212; which again goes to an argument for more market power rather than less. I&#8217;m not saying I support this, just throwing it out as a slightly unusual difference from most industries.</p><p><strong>Seth:</strong> Energy might be similar &#8212; or nuclear power would be a different analogy.</p><p><strong>Ioana:</strong> The thing is, you can still use a model from elsewhere &#8212; whereas with energy there are huge costs of transmission lines, so it&#8217;s more limited geographically. And let me make a point related to my saturation paper that&#8217;s highly relevant. Assume there are decreasing returns &#8212; you add more intelligence, it&#8217;s helpful, but less and less helpful at the margin. If that&#8217;s the case, then the whole competition between countries is less&#8212;</p><p><strong>Seth:</strong> Then America loses, because China is better at physical stuff.</p><p><strong>Ioana:</strong> The point is rather that being first is not that important if you have diminishing returns. It&#8217;s important, but less so, because if you&#8217;re second, you&#8217;re just a little bit worse and can still do a lot of things almost as well. Whereas if it&#8217;s &#8220;the singularity, boom,&#8221; and then you&#8217;re far ahead of everyone, being there first really matters. So whether you think you&#8217;ll reach a point of explosion versus diminishing returns completely changes how you think about competition between countries &#8212; and even between different models.</p><p><strong>Andrey:</strong> There&#8217;s a subtle point: you could have diminishing returns, but the nature of military conflict is a contest &#8212; you just need the max. So it&#8217;s very different from the economy.</p><p><strong>Ioana:</strong> I feel less of an expert on military. I was talking more about economic might &#8212; if there are diminishing returns, it&#8217;s nice to be first, but it&#8217;s only&#8212;</p><p><strong>Seth:</strong> Let&#8217;s talk about economic might, because your argument is even stronger than the one you&#8217;re making. In a universe where American innovations in AI spill over into Chinese innovation &#8212; which they do, with distillation and publicly written papers &#8212; if China has the advantage in the physical, we&#8217;d want a world less constrained by the physical. We&#8217;d want slower progress.</p><p><strong>Ioana:</strong> That gets to things we discuss in our paper &#8212; how much substitutability there is between physical and intelligence from the workers&#8217; point of view. The paper is about what happens to workers and equilibrium wages during automation versus after. <em>During</em> automation &#8212; assume you&#8217;re automating all intelligence tasks &#8212; low substitutability is a form of insurance for workers; it avoids some of the worst wage outcomes, especially at high levels of automation. But <em>after</em>, once we all work in the physical sector, more substitutability promotes higher wages and growth in the very long run. So the game is different depending on whether you&#8217;re in the short run, during automation, or after.</p><p><strong>Seth:</strong> In the very long run, you want an AK economy.</p><p><strong>Ioana:</strong> Exactly. In the long run it&#8217;s good, but in the short run it could be better not to have it, in terms of avoiding a wage decline.</p><div><hr></div><h2>Lightning Round [1:10:59 &#8211; 1:19:45]</h2><p>[01:10:59] <strong>Seth:</strong> Lightning round. What&#8217;s the meaning of life? In your discussion of a Betsey Stevenson paper at a recent NBER session, you said that after automation takes all our jobs, something called <em>ikigai</em> will be more important. What is that?</p><p><strong>Ioana:</strong> It&#8217;s the idea of having a sense of the inherent meaning of your everyday activities. This is something Betsey Stevenson proposed, and I was commenting on it and thinking about examples from philosophy. Ikigai is a Japanese concept, but we have other examples in Western philosophy &#8212; the French writer Camus, and the myth of Sisyphus. You imagine Sisyphus pushing a boulder up the hill; it rolls down, and he pushes it up again. It seems pointless. However, the myth says you have to imagine Sisyphus happy: he finds satisfaction in the repetition and transcends the fact that it&#8217;s repetitive, making sense of his life by becoming absorbed in it and seeing his freedom in embracing it. It&#8217;s a very inspiring way of thinking, because in the current regime we&#8217;re so obsessed &#8212; especially in economics &#8212; with making more stuff, versus paying attention to what we already have.</p><p><strong>Seth:</strong> But then you immediately economist-brain it, because you have this amazing quote: &#8220;If there are no market-like mechanisms to encourage people to pursue their ikigai, just as wages incentivize people to work, a world without transformative AI&#8221;&#8212; sorry, a world <em>with</em> transformative AI but without work could undermine wellbeing. So how do we incentivize ikigai?</p><p><strong>Ioana:</strong> That&#8217;s something I want to work on &#8212; so if anybody&#8217;s listening and wants to embark on this quest, I&#8217;m all for it. Maybe you&#8217;ve read about the epidemic of loneliness. Why aren&#8217;t people getting out and doing social activities? As economists, we think in terms of cost &#8212; there must be friction costs, coordination costs. The question is how you engineer a world where those costs are lower and people actually get out, meet friends, do their gardening or their rock-pushing, instead of sitting there contemplating how they&#8217;re doing.</p><p><strong>Andrey:</strong> You climb the rocks rather than push them.</p><p><strong>Seth:</strong> We&#8217;ve got a rock climber in the room. The social planner will assign you the rock, and you will experience ikigai pushing the rock.</p><p><strong>Ioana:</strong> No, no &#8212; it&#8217;s going to be a <em>market-like</em> mechanism.</p><p>[01:14:13] <strong>Seth:</strong> Do you want to say anything about your work on the Anthropic Economic Advisory Board?</p><p><strong>Ioana:</strong> I&#8217;m a member of Anthropic&#8217;s economic advisory board, advising them on the economic impact of AI. As you probably know, Anthropic releases data products publicly that measure how Claude is being used. Part of my role is to give feedback on what data would be helpful to release, what checks to do, how to show people what the data means and what its representativeness looks like. It&#8217;s been exciting to collaborate with one of the biggest AI companies and play this advisory role.</p><p><strong>Andrey:</strong> And we&#8217;ve covered the Economic Index on this podcast &#8212; we had an entire episode about it.</p><p><strong>Ioana:</strong> Oh, really? Nice.</p><p><strong>Seth:</strong> What are you working on these days? What can we expect next?</p><p><strong>Ioana:</strong> Right now I&#8217;m working on a project, back to monopsony &#8212; monopsony and industrial policy. Industrial policy is fashionable right now. If you subsidize a sector &#8212; the government pays to create more jobs there &#8212; one effect is that it raises wages in the other sector that competes for those workers. You increase employment here, and wages increase there. We want to demonstrate under what conditions you get a bigger or smaller spillover, and therefore why industrial policy can sometimes be justified through this argument &#8212; you&#8217;re paying to get more competition for jobs.</p><p><strong>Seth:</strong> Although there would be a negative spillover from the taxes or regulation needed to support it.</p><p><strong>Ioana:</strong> Absolutely. You put some cost in for the industrial policy, and one benefit is increasing wages in the non-subsidized sector. It&#8217;s also a way to redistribute between wages and profits &#8212; you decrease profits and increase wages in the non-subsidized sector. And profits are pretty hard to tax, so it&#8217;s an interesting instrument. We&#8217;re developing the theory &#8212; how much monopsony power yields what optimal size of subsidized sector &#8212; and thinking about applications. The big-picture point is that the public has often lost confidence in the government&#8217;s ability to redistribute effectively through tax-and-transfer. So if we can provide more jobs while also increasing wages in the other sector, that could be an interesting policy instrument &#8212; one that comes with its own costs, but worth understanding better.</p><p><strong>Andrey:</strong> Final question: who&#8217;s your favorite philosopher?</p><p><strong>Seth:</strong> And we love the way you pronounce Camus, so say it with that beautiful accent.</p><p><strong>Ioana:</strong> Who&#8217;s my favorite philosopher? That&#8217;s surprisingly difficult. I might go with Rawls &#8212; John Rawls.</p><p><strong>Seth:</strong> Beloved of liberals.</p><p><strong>Ioana:</strong> He&#8217;s done incredible work, because he incorporated considerations from utilitarianism. Whether you agree with his take or not, he clarified a lot about the different theoretical frameworks for thinking about social justice. When I was growing up intellectually, it was very helpful. Actually, I learned about utilitarianism first &#8212; I read Mill. Oh, I love John Stuart Mill. Maybe <em>he&#8217;s</em> my favorite, actually &#8212; because the writing is amazing, and he has such a nuanced view of the world. His book on utilitarianism is amazing. They speak very nicely to each other &#8212; so maybe I have to say John Stuart Mill, which is fitting for an economist.</p><p><strong>Andrey:</strong> Tyler Cowen thinks he&#8217;s the best economist ever. So you&#8217;re in good company.</p><p>[01:19:45] <strong>Seth:</strong> It&#8217;s been an absolute pleasure to have you on the podcast &#8212; I had so much fun.</p><p><strong>Ioana:</strong> Thanks so much. It was great.</p><p><strong>Andrey:</strong> This was awesome. Thank you.</p><p><strong>Seth:</strong> All right, everyone out there &#8212; please like, share, subscribe, and keep your posteriors justified.</p>]]></content:encoded></item><item><title><![CDATA[Kevin Bryan on Bottlenecks, AI in China, and What Economists Should Actually Be Working On]]></title><description><![CDATA[This week we to with Kevin Bryan, Associate Professor of Strategy at the University of Toronto&#8217;s Rotman School, author of the legendary economics blog A Fine Theorem, co-founder of the ed-tech startup All Day TA, and the man behind one of the most-discussed Twitter/X feeds in econ,]]></description><link>https://empiricrafting.substack.com/p/keven-bryan-on-bottlenecks-ai-in</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/keven-bryan-on-bottlenecks-ai-in</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Mon, 01 Jun 2026 13:30:55 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/199826371/3b10acf9f6a634edca1bf316d49f86ef.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>This week we to with <strong><a href="https://www.kevinbryanecon.com/about.html">Kevin Bryan</a></strong>, Associate Professor of Strategy at the University of Toronto&#8217;s <a href="https://www.rotman.utoronto.ca/the-rotman-experience/our-community/people/bryan-kevin/">Rotman School</a>, author of the legendary economics blog <strong><a href="https://afinetheorem.wordpress.com/">A Fine Theorem</a></strong>, co-founder of the ed-tech startup <strong><a href="https://alldayta.com">All Day TA</a></strong>, and the man behind one of the most-discussed Twitter/X feeds in econ, <strong><a href="https://x.com/afinetheorem">@Afinetheorem</a></strong>.</p><p>Kevin recently published a multi-book review of the economics of AI in the <em>Journal of Economic Literature</em>, and that&#8217;s where we start. Along the way we get into the gap between AI&#8217;s technical capability and its actual diffusion, the stages of how organizations adopt new technology, why the binding constraint on AI value is organizational integration (not prediction vs. judgment), what an AI-for-science research agenda should look like, the coffee test and the fence-post test, what forecasting surveys reveal about how economists and lab researchers actually differ, a dispatch from Kevin&#8217;s recent trip to China (spoiler: they are <em>not</em> AGI-pilled), the future of the academic paper, and a lightning round on comparative advantage in the age of AI.</p><p>A wide-ranging, opinionated, very fun conversation. Grab your Chinese peptides and settle in.</p><div><hr></div><h2>Links &amp; References</h2><p><strong>Kevin&#8217;s work</strong></p><ul><li><p>Kevin Bryan, <a href="https://www.kevinbryanecon.com/BryanAIBookReview.pdf">&#8220;The Economic Impacts of Artificial Intelligence: A Multidisciplinary, Multi-book Review&#8221;</a> &#8212; <em>Journal of Economic Literature</em>, 64(1), 2026.</p></li><li><p><a href="https://afinetheorem.wordpress.com/">A Fine Theorem</a> &#8212; Kevin&#8217;s research blog</p></li><li><p><a href="https://alldayta.com">All Day TA</a> &#8212; turn course content into a custom AI teaching assistant</p></li><li><p><a href="https://creativedestructionlab.com/">Creative Destruction Lab</a> &#8212; the accelerator Kevin helps run (first AI accelerator in the world, 2016)</p></li></ul><p><strong>Books &amp; essays discussed</strong></p><ul><li><p>Leopold Aschenbrenner, <em><a href="https://situational-awareness.ai/">Situational Awareness</a></em> &#8212; the essay Kevin gives all his students (&#8221;read chapter one, believe chapter one&#8221;)</p></li><li><p>Erik Brynjolfsson &amp; Andrew McAfee, <em><a href="https://wwnorton.com/books/the-second-machine-age">The Second Machine Age</a></em></p></li><li><p>Ajay Agrawal, Joshua Gans &amp; Avi Goldfarb, <em><a href="https://www.predictionmachines.ai/">Prediction Machines</a></em> and the follow-up <em><a href="https://www.predictionmachines.ai/power-and-prediction">Power and Prediction</a></em></p></li><li><p>Joel Mokyr, <em><a href="https://press.princeton.edu/books/paperback/9780691120133/the-gifts-of-athena">The Gifts of Athena</a></em> and <em><a href="https://press.princeton.edu/books/hardcover/9780691168883/a-culture-of-growth">A Culture of Growth</a></em> &#8212; Kevin&#8217;s PhD advisor, &#8220;the Michael Jordan of progress world&#8221;</p></li></ul><p><strong>People &amp; projects mentioned</strong></p><ul><li><p><a href="https://www.unjournal.org/">The Unjournal</a> and <a href="https://worksinprogress.co/">Works in Progress</a> &#8212; models for the &#8220;new journal&#8221; </p></li><li><p>Chad Jones, <a href="https://web.stanford.edu/~chadj/">Stanford GSB</a> &#8212; growth theorist read seriously by people in industry</p></li><li><p>Phil Trammell, <a href="https://philiptrammell.com/">GPI / Oxford</a> &#8212; &#8220;Phil World,&#8221; the rapid-growth scenario</p></li><li><p>The coffee test (attributed to Steve Wozniak) and Kevin&#8217;s own <strong>fence-post test</strong> as benchmarks for embodied AGI</p></li></ul><p><strong>Previously on Justified Posteriors</strong></p><ul><li><p><a href="https://empiricrafting.substack.com/p/avi-goldfarb-on-prediction-machines">Avi Goldfarb &#8212; Prediction Machines, O-Ring Tasks, and How AI is Reshaping Economics</a></p></li><li><p><a href="https://empiricrafting.substack.com/p/alex-imas-demand-collapse-bargaining">Alex Imas &#8212; Demand Collapse, Bargaining with Machines, and Behavioral AI Economics</a></p></li></ul><p><strong>Our sponsor</strong></p><ul><li><p>This episode is brought to you by <a href="https://www.reveliolabs.com/">Revelio Labs</a>, the leading provider of labor-economics data, available to academics on <a href="https://wrds-www.wharton.upenn.edu/">WRDS</a>.</p></li></ul><div><hr></div><h2>Chapters</h2><ul><li><p>(00:00) Intro &amp; sponsor</p></li><li><p>(00:39) The JEL book review: what the economics-of-AI canon got right &#8212; and what the older books still beat the new ones on</p></li><li><p>(03:19) Prediction vs. judgment, and the <em>real</em> bottleneck: organizational integration</p></li><li><p>(05:52) Too pessimistic on the tech, too optimistic on diffusion &#8212; Waymo, Pearl Street, and the COVID vaccine</p></li><li><p>(12:34) The four stages of how organizations actually adopt a new technology</p></li><li><p>(15:42) Status-quo bias, banning Anthropic, and treating frontier AI like nuclear material</p></li><li><p>(20:16) Why <em>Situational Awareness</em> beat the economists, and the book Kevin actually wants: AI for science</p></li><li><p>(26:53) Forecasting AI: the surveys, and where economists and lab researchers do (and don&#8217;t) diverge</p></li><li><p>(28:20) Benchmarks, the coffee test, and the fence-post test</p></li><li><p>(35:53) Rapid-growth scenarios, labor-force participation, and &#8220;Phil World&#8221;</p></li><li><p>(41:40) Scaling regularities: what economists should defer to technologists on &#8212; and what they shouldn&#8217;t</p></li><li><p>(43:34) Why forecasts matter for policy and capital allocation</p></li><li><p>(45:50) Dispatch from China: not AGI-pilled, &#8220;involution,&#8221; broken capital markets, EVs and self-driving</p></li><li><p>(1:01:40) War, nationalization, the end of open source &#8212; and why everyone in China uses Claude</p></li><li><p>(1:06:06) A Fine Theorem, the economics of blogging, and the rising value of taste</p></li><li><p>(1:17:48) The economist as plumber: comparative advantage, RCTs, and what grad students should do</p></li><li><p>(1:24:07) What the academic paper looks like in two years</p></li><li><p>(1:28:22) San Francisco, ambition, and the permission structure for growth</p></li><li><p>(1:32:56) Lightning round: favorite economists, All Day TA, and advice for econ grad students</p></li></ul><div><hr></div><h2>Open &amp; Intro [00:00 - 00:39]</h2><p>[00:00:12] <strong>Seth:</strong> Welcome to the Justified Posteriors Podcast, the podcast that updates beliefs about the economics of AI and technology. I&#8217;m Seth Benzell, finally able to meet one of my theoretical heroes, coming to you from Chapman University in sunny Southern California.</p><p><strong>Andrey:</strong> And I&#8217;m Andrey Fradkin, coming to you from San Francisco. Excited to have Kevin Bryan as our guest today. Kevin, welcome.</p><p><strong>Kevin:</strong> Thanks for having me. Very excited.</p><p><strong>Andrey:</strong> Kevin is a leading thinker in the field of progress, and in AI economics. He also has his own startup, All Day TA, and is prolific on Twitter &#8212; at times.</p><p><strong>Kevin:</strong> At times.</p><div><hr></div><h2>The JEL Book Review: What the AI-Econ Canon Got Right [00:39 - 03:19]</h2><p><strong>Andrey:</strong> Kevin, you wrote an article reviewing several prominent books on AI. Why did you do this, and what did you learn from the exercise?</p><p>[00:01:13] <strong>Kevin:</strong> It&#8217;s pretty interesting. Economics of AI is not that new of a field &#8212; some of the canonical books on how economics thinks about AI go back to before large language models existed. Books like <em>The Second Machine Age</em> by Brynjolfsson and McAfee, and <em>Prediction Machines</em> by Agrawal, Gans, and Goldfarb. These are pre-LLM &#8212; written before the attention paper. So it&#8217;s interesting to look at what of the core ideas in the economics of AI have changed given the technological improvements.</p><p>On the technology side, I don&#8217;t think there have been massive surprises for people who were paying attention. At least since the scaling law paper, if you&#8217;d drawn the line on the graph, you&#8217;d have more or less predicted everything that happened. I remember reading Kurzweil &#8212; <em>The Age of Intelligent Machines</em>, <em>The Age of Spiritual Machines</em> &#8212; back in college, and those are just drawing different lines on the graph, in that case based on compute, and we&#8217;re getting very close to what actually happened.</p><p>Likewise on the economic side: given that the technological trajectory hasn&#8217;t changed much, I don&#8217;t think the underlying economics has changed as much as people might think. Where things might be bottlenecked, how technology improvements map into growth, the effects on labor markets &#8212; the fundamental microeconomics of AI&#8217;s predictions hold up pretty well. I found it interesting how few of the 2023, 2024, 2025 books had really advanced my understanding of the economics of AI compared to the older ones.</p><div><hr></div><h2>Prediction vs. Judgment, and the Real Bottleneck [03:19 - 05:52]</h2><p>[00:03:19] <strong>Seth:</strong> Lots to unpack. We just had Avi Goldfarb on the podcast and pressed him on his <em>Prediction Machines</em> approach, where he distinguishes the AI that&#8217;s good at predicting from the human that&#8217;s good at judging. If any of these books would have changed after gen AI, it&#8217;d be that one. Don&#8217;t you think that book maybe gets something wrong?</p><p><strong>Kevin:</strong> I think they&#8217;d agree &#8212; they wrote a follow-up in <em>Power and Prediction</em>. But the disagreement isn&#8217;t about the prediction-versus-judgment distinction. Even in the original book &#8212; and I remember talking to them about this in 2016, 2017 &#8212; judgment is a sliding scale. Take the umbrella example: I know my utility function on an umbrella, I know how much I dislike rain. I give the AI data, it looks at my face, sees light rain, heavy rain, and it can predict my utility function &#8212; in which case judgment is taken over by AI. Everyone understands that.</p><p>That said, on the scale of how easy it is to figure out the underlying utility function from data versus the predictions that go into it, I don&#8217;t think that&#8217;s changed. None of the major language models technologically can &#8212; or even attempt to &#8212; modify how they operate for me versus you. They store a little memory and RAG their way into remembering what you&#8217;re like, but there&#8217;s no attempt to fine-tune the model. We&#8217;d like to use continual learning, but we can&#8217;t yet. So the judgment aspect is still pretty binding even today.</p><p>Where I think there&#8217;s a difference &#8212; and where Ajay, Avi, and Josh would say they were wrong &#8212; is that the fundamental problem for AI&#8217;s creation of value isn&#8217;t prediction versus judgment. It&#8217;s the organizational integration problem. There&#8217;s overlap between the two, but we&#8217;d take the organizational and architectural bottlenecks more seriously now, partly because we&#8217;re applying AI to more complex tasks where those bottlenecks start to bite.</p><div><hr></div><h2>Too Pessimistic on Tech, Too Optimistic on Diffusion [05:52 - 12:34]</h2><p>[00:05:52] <strong>Seth:</strong> You point this out with <em>The Second Machine Age</em> &#8212; Andy and Eric&#8217;s world-historical automated car ride.</p><p><strong>Andrey:</strong> It&#8217;s weird to think that in some ways they&#8217;re a little too pessimistic about the technology, but a little too optimistic about social diffusion. The driverless cars going down the highway in California are a perfect example.</p><p><strong>Kevin:</strong> Such a good example. We all talk to different audiences. When I talk to policy people, I tell them: &#8220;Whatever you think the capabilities of AI will be in the future &#8212; more than that.&#8221; This isn&#8217;t a sales pitch. Every single person inside the lab agrees. You have people high up in government who think about AI as the AI of today plus epsilon. And you want to ask: what did you see in the past 10 years that makes you think this is a good way to plan for the future?</p><p>[00:07:01] On the other hand, out in California they wildly underrate diffusion friction. I give the Waymo example: if diffusion is so easy, how come we rode in a Waymo 10 years ago? I&#8217;m in Toronto &#8212; Jeff Hinton&#8217;s city &#8212; and there&#8217;s not a single one. Clearly there&#8217;s some friction.</p><p>I remember a couple of years ago, Andrey was with us at one of the labs with a few other economists. They brought in a bunch of computer scientists and asked, &#8220;What&#8217;s the effect on GDP productivity in the short run?&#8221; And we said, &#8220;Through 2030, maybe 1% per year.&#8221; Actually we said less. And to be fair, from 2024 to 2026, we&#8217;ve been right so far. They said, &#8220;But why?&#8221; And we said, &#8220;We agree with you technologically.&#8221; At the time we saw technology that hadn&#8217;t come out yet that everyone now thinks is amazing. But on the production side you&#8217;ve got bottlenecks &#8212; you&#8217;re combining complements in some CES or Cobb-Douglas function, and you don&#8217;t need many bottlenecks for growth to stall quickly. Then there are social diffusion factors, regulatory factors, organizational architecture factors, like in Kim Clark and Rebecca Henderson&#8217;s work. Add all these up across every technology ever, and I just don&#8217;t see fast takeoff.</p><p>Honestly, I think it&#8217;s bad. I think we&#8217;ll be able to make personalized medicine very cheaply much more quickly than regulators will allow you to sell it. That&#8217;s a problem.</p><p><strong>Andrey:</strong> The standard retort is one of two things. One: it&#8217;ll be so self-evidently good that people will find a way to take it &#8212; like a cure for cancer. There are already sub-treatments where rich people take them without FDA approval and claim they work. Two: we have autonomous zones where we let AI do whatever it wants, and they out-produce the rest of society.</p><p><strong>Kevin:</strong> Great arguments both. Out here in San Francisco you&#8217;ve probably got a bunch of Chinese peptides in your fridge.</p><p>[00:10:20] But here&#8217;s the thing &#8212; the Chinese peptides are self-evidently really good at the single most costly part of the medical system. And yet can you legally buy them? Anywhere? You can&#8217;t even legally buy them in China. That self-evidence did not change the regulatory process. We just went through COVID. We had the vaccine in January 2020. Everything from then until it diffused was government. What more important thing to diffuse quickly could there be? Waymos &#8212; you sit in one for one minute and it&#8217;s obvious it&#8217;s driving more safely.</p><p><strong>Andrey:</strong> People start crying. It&#8217;s so beautiful.</p><p><strong>Kevin:</strong> And yet. We saw this historically too. If you want self-evidently useful: Pearl Street, 1882, Edison flips the switch &#8212; let there be light. And yet look at the diffusion rate for electric street lighting, especially after Chicago burns down.</p><div><hr></div><h2>The Four Stages of Organizational Adoption [12:34 - 15:42]</h2><p>[00:11:49] <strong>Andrey:</strong> Let me retort. Diffusion of technology has accelerated over time. The smartphone diffused extraordinarily quickly. Aspects of AI have diffused even quicker &#8212; by standard adoption measures, almost everyone has used AI at least once.</p><p><strong>Seth:</strong> And how much do regulations matter for the diffusion rate? If you ask my students, regulation against using AI hasn&#8217;t slowed their adoption at all.</p><p>[00:12:34] <strong>Kevin:</strong> We have an answer to this &#8212; it&#8217;s actually easy. This is why I like starting with a little organizational economics. Every technology I&#8217;m aware of, ever: first we adopt it when individuals can do existing tasks with the new thing more efficiently. That&#8217;s easy &#8212; that&#8217;s your student cheating on their exam, the coder using it for coding, me brainstorming with GPT to prep for a meeting.</p><p>The next step is a group or an organization doing an existing task using the new technology. That&#8217;s tougher. To give you a sense &#8212; how many organizations have changed their IT procurement policies given the change in how we make software? Find one.</p><p>The third stage: a new task that&#8217;s now efficient given the new technology, inside my organization. I haven&#8217;t run into a single large incumbent organization that has reached this level for anything important. The fourth is the hard one: a new task that&#8217;s only efficient because of the new technology, and that requires something on the outside &#8212; partners in the supply chain, regulators, someone to change. That&#8217;s Waymo. That&#8217;s containerized shipping. That&#8217;s UPC codes. The fundamental barrier there isn&#8217;t information or firm growth rates. It&#8217;s that the institutions are built around existing skills, promotion policies, and so on.</p><p>If AI made the optimal university 50% capital and 50% labor &#8212; bringing people in and out instead of having a tenure system &#8212; what year do you think we get that? You can maybe out-compete the incumbents, but I don&#8217;t think Harvard&#8217;s reputation goes away that quickly. And if we&#8217;re talking about governments, you can&#8217;t even out-compete them.</p><div><hr></div><h2>Status-Quo Bias, Banning Anthropic, AI as Nuclear Material [15:42 - 20:16]</h2><p>[00:15:03] <strong>Seth:</strong> Sometimes you can out-compete governments &#8212; it depends how crazy a takeoff we&#8217;re talking about. I talk to Phil Trammell about scenarios where the world is too decadent and we don&#8217;t save enough, so we don&#8217;t get growth from AI. His comeback is always: then one country will accumulate and overwhelm all the others eventually.</p><p><strong>Andrey:</strong> There&#8217;s also a timeline over which universities get out-competed &#8212; maybe not Harvard. Harvard&#8217;s a luxury good, and luxury goods have different economics. But if the ROI to college falls drastically, I don&#8217;t see college education remaining anything other than a luxury or niche thing.</p><p><strong>Kevin:</strong> We see this in the X-inefficiency papers, or the steel mini-mill papers &#8212; quicker organizational change in response to existential threats. It&#8217;s almost worse when the organization has rents to share, because then who wants to be the manager who&#8217;s the jackass firing people? My favorite example: Blockbuster could have bought Netflix for tens of billions. If they had, you&#8217;d have never heard of Netflix &#8212; I don&#8217;t think the transition to streaming happens if retail-location experts are running it, and the relational contracts with the studios have to change wildly. Someone would have out-competed them eventually, but it would have taken longer.</p><p>Something like self-driving cars &#8212; let&#8217;s make a bet. Of the top 500 cities in the world in 2035, how many differentially regulate self-driving cars on safety in a substantial way compared to human drivers?</p><p><strong>Andrey:</strong> All of them.</p><p><strong>Kevin:</strong> All of them, of course. Outright <em>ban</em> self-driving cars &#8212; I wouldn&#8217;t be surprised if that&#8217;s double digits.</p><p><strong>Andrey:</strong> Boston almost did it, as far as I can tell. Very close. Although it&#8217;s one of those things &#8212; kind of like Uber, which entered as a banned entity and got so much consumer goodwill that politicians had to allow it. I&#8217;m not sure that happens with self-driving by 2035, but it&#8217;s not obvious when it tips.</p><p>[00:18:21] <strong>Kevin:</strong> It&#8217;s also not obvious we don&#8217;t get differential regulation &#8212; say, regulation that makes self-driving cars subsidize the insurance rates of traditional cars. The world is status-quo biased. Institutions exist because they won the Darwinian struggle to survive, so they&#8217;re well-fit for the environment they operate in, which makes them inherently conservative. That&#8217;s not crazy &#8212; but at a time of big disruption like AI, you have to take it seriously.</p><p>How many high-up people in Silicon Valley thought the US government would ban Anthropic? I agree it&#8217;d be insane to do. Nonetheless, I&#8217;m not surprised. If you think any government is going to allow open sales of AI at the frontier in two years, you&#8217;re deluded &#8212; they&#8217;re going to treat it like nuclear material. If you don&#8217;t believe that, your vision of the world is way too technological and not nearly organizational enough.</p><div><hr></div><h2>Why <em>Situational Awareness</em> Beat the Economists; AI for Science [20:16 - 26:00]</h2><p>[00:20:16] <strong>Seth:</strong> Let&#8217;s wrap up the JEL article. In some ways you review <em>Situational Awareness</em> in a positive light compared to what the economists wrote. But it&#8217;s a narrative essay, not an economics book. What&#8217;s the economics book you want to read, and why are economists stuck?</p><p><strong>Kevin:</strong> Getting <em>Situational Awareness</em> into the book review took a little persuading &#8212; for one, it&#8217;s not a book. But if an economist asked me for the one book chapter that best explains what&#8217;s happened in the last few years, I&#8217;d say chapter one of <em>Situational Awareness</em>. I give it to all my students.</p><p>What I want to read &#8212; both as a book and as research &#8212; is the endogenous impact of AI on science, including via robotics and via self-improvement. That&#8217;s the whole game.</p><p><strong>Seth:</strong> You point out <em>The Second Machine Age</em> misses this. It says &#8220;imagine a billion researchers,&#8221; and they imagine Africa getting the internet, but they don&#8217;t actually model it.</p><p><strong>Kevin:</strong> Exactly. Think how many papers go: &#8220;Here&#8217;s 2025 AI. I run an experiment where I tell you to do something that takes 10 minutes of work with AI, and I measure the treatment effect.&#8221; Who cares? Nobody&#8217;s reading that paper in five years. What people will care about is: are we getting self-automated science? Is blue-collar work being affected? If AI can do most of the research on the next AI...</p><p>I always ask high-up people in the labs: what year do you think a Chinchilla-law-level result &#8212; in terms of its importance to developing the next model &#8212; comes from AI? The answers are between 2027 and 2029. I&#8217;ve never gotten 2030. That&#8217;s not all research done by AI, but it&#8217;s a substantial speed-up beyond just writing the code faster.</p><p><strong>Andrey:</strong> We already see that in math &#8212; it&#8217;s proving things humans weren&#8217;t proving.</p><p><strong>Kevin:</strong> If something like Navier-Stokes is proven, I&#8217;d put that in the set of a Chinchilla-law-level result done autonomously. And if it can do that, presumably it can do research on sensors, actuators, batteries &#8212; and then the robots improve more quickly, and we get automated labs. That&#8217;s the takeoff question. Everything I&#8217;ve said doesn&#8217;t actually <em>imply</em> a takeoff. It depends what the bottlenecks are. Measuring those bottlenecks &#8212; the production function for specific areas of science and robotics &#8212; is incredibly high value for knowing where to allocate resources.</p><p><strong>Andrey:</strong> And if we correctly predict them, maybe they won&#8217;t exist. It&#8217;s a feedback loop.</p><p><strong>Seth:</strong> Ooh, I love this.</p><p><strong>Kevin:</strong> Think of a production function &#8212; it tells you how much capital and labor to maximize production. I want to know what tools.</p><p><strong>Seth:</strong> Kevin, I just did it &#8212; it turns out it&#8217;s energy. I looked into the future, it&#8217;s energy. Are you saying the best book about AI economics is just a book about energy?</p><p>[00:24:48] <strong>Kevin:</strong> It&#8217;s plausible. I help run Creative Destruction Lab &#8212; we were the first AI accelerator in the world in 2016, and we also run, I think, the biggest space accelerator. I was just down in Texas with astronauts for the Artemis launch. When you hear Elon talk about AI and space, it&#8217;s on the one hand crazy, on the other hand basically unregulated, effectively unlimited energy &#8212; and for training, who cares about latency? It&#8217;s not totally crazy that one way we get around the energy bottleneck is solar sails and ideas like that. In which case we face other bottlenecks. But this is an empirical question, and one where you&#8217;d want energy economists and energy experts, not just labor economists.</p><div><hr></div><h2>Forecasting AI: Surveys, Economists vs. Labs [26:00 - 28:20]</h2><p><strong>Andrey:</strong> One thing where I feel very stupid: about six months ago people around here kept saying &#8220;energy shortage, energy shortage,&#8221; and I thought they were probably right but didn&#8217;t trade on it.</p><p>You&#8217;re also involved in a project &#8212; we discussed it a bit with Avi Goldfarb &#8212; figuring out what economists are forecasting about the future of the economy under different scenarios. Tell us about it.</p><p>[00:27:06] <strong>Kevin:</strong> There are actually two projects &#8212; one I&#8217;ve been involved with, one I&#8217;m an academic advisor on. They both also ask AI-lab researchers, superforecasters, and the general public. The most interesting thing, as far as I&#8217;m concerned: on technical projections, there&#8217;s really no gap between the economists and the people inside the labs. And on economic projections the gap is also pretty small.</p><p>If you go from Acemoglu to Dario Amodei in our sample, Acemoglu is like the 1st percentile and Dario&#8217;s like the 99th &#8212; and neither is really representative of economists or AI researchers. It&#8217;s important to put these projections on paper and see how we did. Some surveys are now old enough to check. The projections of everyone &#8212; economists and non-economists &#8212; on frontier math were <em>low</em>. We were thought to be crazy with some of these projections, and we still underestimated the rate of improvement on certain benchmarks.</p><p>People say &#8220;it&#8217;s a benchmark, they trained to it.&#8221; The problem: I wrote benchmarks for one of the big labs. You know how hard it is to write a benchmark the AIs can&#8217;t solve? I did some in March. I&#8217;m running out of questions I can ask them.</p><p><strong>Seth:</strong> They know how many R&#8217;s there are in strawberry now.</p><div><hr></div><h2>Benchmarks: The Coffee Test &amp; the Fence-Post Test [28:20 - 35:53]</h2><p>[00:28:43] <strong>Kevin:</strong> I have some tricks, but who knows how long they&#8217;ll last. Honestly you need benchmarks that look like the coffee test &#8212; or my favorite, the fence-post test.</p><p>The fence-post test is mine: I can buy a general-purpose embodied AI that I can tell on a Saturday morning, when I want to sleep in, &#8220;Go to my backyard and dig that fence post.&#8221; Not a specific machine &#8212; a general one. Every human could in principle do it. I think we&#8217;re quite a ways from AI doing it at cost.</p><p>The coffee test &#8212; I think this comes from Wozniak &#8212; is that an embodied AI walks into three random houses it&#8217;s never seen, finds the ingredients and the mug, and makes a cup of coffee. Well within the capability of any normal person. Whoever came up with it said the year it&#8217;s possible is &#8220;never.&#8221; We&#8217;ve done surveys &#8212; the modal answer from researchers now is the early 2030s. I think that&#8217;s the kind of benchmark you need, because anything on paper or on a computer &#8212; what Shane Legg calls minimal AGI &#8212; the goose is cooked. We can&#8217;t write tests I&#8217;m confident AI won&#8217;t pass in that domain.</p><p><strong>Seth:</strong> Andrey just wrote a test the AI was very bad at.</p><p><strong>Andrey:</strong> It&#8217;s really bad at predicting how many tokens it&#8217;ll use for a given task and whether it can actually do it. It&#8217;s poorly calibrated.</p><p><strong>Kevin:</strong> That&#8217;s a well-known one, included in some benchmarks on AI&#8217;s ability to self-reflect. But it&#8217;s in the set of things where if I think about it a bit, I don&#8217;t know any reason I can&#8217;t hill-climb to answering it &#8212; ergo it&#8217;ll get solved.</p><p><strong>Andrey:</strong> To be clear, our paper&#8217;s call wasn&#8217;t &#8220;we need these for economic activity, so please RL on them.&#8221;</p><p><strong>Kevin:</strong> That&#8217;s essentially what you need, though. Anything obvious you can hill-climb on is cooked. Anything non-obvious but complementary to you hill-climbing on it is cooked. I need something outside that set.</p><p><strong>Seth:</strong> But it has to be at the intersection of hill-climbability and being economically valuable to hill-climb. Or do you think we&#8217;ll saturate everything even if it&#8217;s not valuable?</p><p>[00:31:46] <strong>Kevin:</strong> I don&#8217;t think there&#8217;s a difference. Once I use AI in adversarial or competitive settings, making a mistake 0.1% of the time screws you. Edge cases are really bad in adversarial settings, and lots of economic activity has that flavor.</p><p><strong>Seth:</strong> There&#8217;s no such thing as an economically unimportant question.</p><p><strong>Kevin:</strong> Right &#8212; if you give me the economically unimportant question, I&#8217;ll design the economic interaction to screw you on it. Before LLMs we had GANs &#8212; you could put a sticker on a stop sign that fools any model but looks identical to a human. There are good statistical reasons we&#8217;ll never fully solve that. My favorite one AI has trouble with: they took an outline map of Europe, filled in part of the Bay of Biscay as if it were land, put an arrow on it, and asked &#8220;what&#8217;s here?&#8221; If you know your geography you say, &#8220;that&#8217;s the Bay of Biscay, oddly colored like land.&#8221; It&#8217;s just a weird thing for the training data to see. Economically it&#8217;s not per se valuable &#8212; most maps you see are the real map. But if I were using AI in a financial system, I&#8217;d be super concerned about my inability to solve that.</p><p><strong>Seth:</strong> It&#8217;s very important to be able to draw pictures of wine filled all the way to the brim.</p><p><strong>Kevin:</strong> Especially for these evening podcasts, Seth.</p><p>[00:34:14] <strong>Andrey:</strong> Back to the forecast &#8212; one question about the composition of people. I&#8217;m a participant in your surveys. I wonder if the economists are all our friends and not the skeptics, like Acemoglu.</p><p><strong>Seth:</strong> We&#8217;re gonna get him on.</p><p><strong>Kevin:</strong> It&#8217;s not just our friends. The selection mechanisms differ between the two, but you have to have published something related to AI at some point &#8212; and plenty of people who&#8217;ve published on AI are quite skeptical. It&#8217;s not snowball sociology; the selection mechanism is completely public.</p><p><strong>Andrey:</strong> But there&#8217;s self-selection into participating &#8212; I do it because I&#8217;m very interested in AI economics; some might not.</p><p><strong>Kevin:</strong> For sure, that&#8217;s an issue. Forget econ &#8212; I was just at a faculty association meeting talking to the humanities people. It&#8217;s amazing: the AI is simultaneously destroying society <em>and</em> can&#8217;t do anything. Very hard to hold both at once.</p><p><strong>Seth:</strong> A very prestigious combination.</p><div><hr></div><h2>Rapid-Growth Scenarios, Labor Force, and &#8220;Phil World&#8221; [35:53 - 41:40]</h2><p>[00:35:53] <strong>Seth:</strong> You said there&#8217;s not much difference between economists and non-economists on economic predictions, but my recollection is there are substantive differences &#8212; like the fast-progress scenario, a percentage point of GDP growth per year difference. That&#8217;s sizable.</p><p><strong>Kevin:</strong> That&#8217;s where the biggest difference is &#8212; the rapid-growth scenario: widespread inexpensive robots that can do basically everything. Call it &#8220;Phil World,&#8221; since we talked about Phil Trammell &#8212; friend of the podcast. In that world in 2050, saying 1% growth per year is a little crazy. It&#8217;s hard to write down a model with bottlenecks that strong.</p><p><strong>Seth:</strong> Or there could be dis-saving &#8212; people taking their labor out of the economy. We asked about labor-force participation, and even there the gaps were off-trend by five or six points. Not enormous.</p><p><strong>Kevin:</strong> That seems small for that magnitude of change. But the description of &#8220;rapid AI&#8221; was technological capability, not diffusion. One explanation: it&#8217;s possible to do this, and we ban robots.</p><p><strong>Seth:</strong> For my prediction I included increased chance of war as something that reduces growth.</p><p><strong>Kevin:</strong> We had a couple of respondents say zero GDP because we&#8217;re all turned into goo. We won&#8217;t say which of our friends. For the rapid scenario, the 25&#8211;75 bounds are stupidly high. But for the AI all three of us would expect, the error bounds aren&#8217;t enormous &#8212; people were generally on the same page across groups. Most of the difference was within-group until you get to 2050 and rapid AI.</p><p>[00:39:10] <strong>Seth:</strong> Give the listeners some numbers for the median scenario.</p><p><strong>Kevin:</strong> The best comparison is something like CBO or IMF projections &#8212; on the order of one percentage point more productivity, one point more growth per year. Which adds up to a lot &#8212; let&#8217;s say it adds up to the single most important invention in human history. On labor-force participation, about half a percentage point more per year in the drop &#8212; substantial out to 2030, not quite as big by 2050. Big effects, but...</p><p><strong>Seth:</strong> It&#8217;s not the singularity. One percentage point additional growth a year for 20 years is the difference between two high-income countries &#8212; not the difference between the Flintstones and the Jetsons.</p><p><strong>Kevin:</strong> I understand the objection: you read <em>Situational Awareness</em>, and from 2001 to 2026 the AI-pilled people were right and everyone else was wrong, so don&#8217;t bet against their projections. Fine &#8212; on technical grounds, say they were right. My response: I&#8217;ll literally take, as my technical projection for 2030, whatever the modal response from researchers inside the labs is. On what grounds would I disagree? But how that maps into labor-force participation &#8212; there I wish some of these people would close their mouths.</p><div><hr></div><h2>Scaling Regularities &amp; What Economists Should Defer On [41:40 - 43:34]</h2><p>[00:41:14] <strong>Seth:</strong> Let me ask about a techno-social prediction. We have this regularity, the scaling law &#8212; which you said should really be called a scaling regularity, because we&#8217;re not sure it&#8217;s a law of nature. The relationship between error rate and number of parameters seems technical. But then there&#8217;s the sociotechnological leap &#8212; that scaling leads to scaling capability. Should economists defer to technical experts on that, or is it a socioeconomic prediction we should have an opinion about?</p><p><strong>Kevin:</strong> That one&#8217;s in the middle &#8212; related to AI for science. The scaling regularities &#8212; let&#8217;s say four of them &#8212; we just take from the computer scientists. But what&#8217;s the production function of medicine? How important are improvements in predicting protein structure to making a new drug? That&#8217;s not economics in the sense that we don&#8217;t have the field expertise, but it&#8217;s also not biology and not computer science. We&#8217;re in the middle.</p><p><strong>Seth:</strong> In Aschenbrenner there&#8217;s a figure: right now it&#8217;s high-schooler level, in a year college, then professor. Is that the first kind of prediction or the second?</p><p><strong>Kevin:</strong> That&#8217;s the first. I take that from the computer scientists. I want field experts and economists to estimate the production function, and social scientists to work out the implications on other parts of the economy.</p><div><hr></div><h2>Why Forecasts Matter for Policy &amp; Capital [43:34 - 45:50]</h2><p>[00:43:34] <strong>Andrey:</strong> Let me retort. People are interested in forecasts, but I don&#8217;t think economists are very good at forecasting. And it&#8217;s not clear how useful the whole exercise is. I could build my own custom macro model to answer these surveys &#8212; how much value to society would there be? Or is this more an exercise in social consensus, to bring to policymakers and say &#8220;here&#8217;s the range of expected outcomes,&#8221; without caring about the specific forecasts?</p><p><strong>Kevin:</strong> A bit of both. Take chapter three of Aschenbrenner. If I believe that forecast, the government should borrow literally everything it can and plow it into chip production &#8212; because if your growth rate is 10% a year, who cares? So it matters a lot for policy. On a micro level: I&#8217;m an executive at Google deciding whether to put money into AI math solvers or into bio &#8212; Anthropic just put Novartis&#8217;s CEO on their board. Which improvements lead to value more quickly? And at the organizational level, if I&#8217;m a university, I need to know which bottlenecks are in my control and where I can just free-ride and wait.</p><p>When you talk about China, I&#8217;ll tell you something interesting I learned there: they&#8217;re not AGI-pilled. I think that&#8217;s going to cause problems &#8212; but we&#8217;ll get to that.</p><p>[00:45:56] <strong>Andrey: </strong>The final thing: yes, Anthropic is going into bio, but you don&#8217;t need forecasts for that. Just look at the share of GDP in different sectors. Economists are valuable, smart people &#8212; but using AI for medicine is the most obvious thing in the world; I don&#8217;t need an economist to tell me that. </p><p><strong>Kevin: </strong>The marginal value comes elsewhere: if I spend $10 million figuring out how to allocate $10 billion of capital, that&#8217;s really high value. And on policy &#8212; listen to how policymakers talk. Bad predictions about the labor market coming out of some labs are going to cause regulation. States are going to ban data centers. We&#8217;re going to tax all the compute before we get the cancer drug.<br><br>I was working on a theory problem this week: I care about the wage bill &#8212; I want AI to be as productive as possible without harming wages. So you take something like Chamley-Judd, add a wage-bill constraint, add informational constraints for the planner about which capital is AI, let it substitute and complement in various ways, and solve. The result on taxation looks nothing like anything being proposed right now. To know that&#8217;s the right way to think about it, you can&#8217;t just say &#8220;AI will be useful in the future.&#8221; No &#8212; they&#8217;re going to ban it.</p><p><strong>Andrey:</strong> This political economy of AI is something I&#8217;m tracking very seriously now. It&#8217;s obvious we&#8217;ll have bans and regulations long before AI actually has effects. People already think AI is causing mass unemployment.</p><p><strong>Kevin:</strong> They&#8217;re immune to the data. &#8220;Block laid off 40% of their workforce.&#8221; It&#8217;s a bad media environment, too. A reasonable hypothesis: the sector most harmed by digitization and then AI is journalists &#8212; so young journalists, especially culture journalists, are incredibly hostile to AI, and the world they influence ends up asking &#8220;unemployment&#8217;s 4.5%, why is everyone talking about this?&#8221; There was an article this week about young people who don&#8217;t want kids because it&#8217;s too expensive &#8212; and the first couple they showed were 25, owned a 2,000-square-foot house, and the husband&#8217;s hobby was golfing in Utah. It&#8217;s a bad epistemic environment, and it&#8217;s bad for AI because it makes people hostile to change &#8212; they feel they have to protect what they have, even though the economy roared.</p><div><hr></div><h2>China Trip: Not AGI-Pilled, Involution, Capital Markets [45:50 - 1:01:40]</h2><p>[00:50:29] <strong>Andrey:</strong> Let&#8217;s get your take on your China trip. What was the occasion?</p><p><strong>Seth:</strong> Is China AGI-pilled? Why or why not?</p><p><strong>Kevin:</strong> We need that one for the clip at the start of the video. I studied diplomacy &#8212; my goal when I was younger was to join the Foreign Service. I worked in China briefly at the embassy in &#8216;05, around WTO session time, and I&#8217;m back there quite a bit. After COVID, the number of foreigners in China dropped, so the information flow is bad. A colleague calls it the G2 when it comes to AI: two countries, plus Google&#8217;s London outpost. Nothing else really matters for AI. So not knowing what&#8217;s going on in China is really important.</p><p>This year I brought a group &#8212; economists, a guy from Epoch AI, a trade lawyer. I wanted to understand robotics, especially in traditional industries. AI&#8217;s effect on most of the market won&#8217;t come through San Francisco or Hangzhou. We met Zhipu&#8217;s COO, journalists who work on AI policy, startup founders, cloud providers, the biggest angel fund. We also went to Dongbei, the northeast &#8212; the fastest-falling population region in the entire world, losing about 1% a year, maybe 100 million people. We went to the one city that&#8217;s hanging on.</p><p>[00:53:34] First thing: nobody we talked to was AGI-pilled. When you ask what the AI is for, it&#8217;s completely about process engineering of existing industry. That&#8217;s it. Why open source? Process engineering. Why build your own non-frontier stack? Process engineering. And they actually use it in industry &#8212; some examples looked better than what we see in the West. But no one talks like the San Francisco or London DeepMind folks: &#8220;in 2029 my robot flies through the air and shoots the robber and delivers my peptides.&#8221; It honestly felt like talking to government people &#8212; &#8220;AI&#8217;s capabilities in 2026 plus epsilon.&#8221;</p><p>Part of what&#8217;s going on is a word in Chinese they translate as &#8220;involution&#8221; &#8212; I always tell them it&#8217;s not a word in English; it actually comes from Clifford Geertz, the anthropologist. It means extreme competition. It&#8217;s very hard to make a profit in certain industries &#8212; a hundred entrants immediately when you start making money. So high-fixed-cost, payoff-in-the-future investments are really hard. You only see it from things like DeepSeek, where it&#8217;s a hedge fund and the guy spends his own money. Even companies that seem to be doing great &#8212; the independent AI producers, not the Alibabas and Tencents &#8212; are in massive financial trouble, because it&#8217;s too competitive.</p><p><strong>Seth:</strong> Part of that&#8217;s the interest rate and capital-market environment, right? American AI companies can lose money for a long time &#8212; why can&#8217;t they access money for more runway?</p><p><strong>Kevin:</strong> China&#8217;s biggest advantages are energy costs about half of ours, and a much stronger hardware ecosystem &#8212; your ability to experiment and prototype blows away North America&#8217;s. It&#8217;s probably not even worth running a battery or robotics-hardware company here; you&#8217;ll get swamped.</p><p><strong>Seth:</strong> Unless you&#8217;ve got a government contract.</p><p><strong>Kevin:</strong> True &#8212; we should probably build our own drones. But things that require big fixed costs and have long payoffs need deep capital markets that reallocate capital quickly, and China doesn&#8217;t have that. The VC market is worse than a decade ago &#8212; foreign VC basically left. Most companies get investment from state-linked banks or rich people out of pocket. DeepSeek is trying to raise $20 billion; if they were in San Francisco they&#8217;d start at ten times that.</p><p>[00:57:06] <strong>Andrey:</strong> Let me play devil&#8217;s advocate. So far most of the rewards go to frontier models &#8212; you can&#8217;t charge enough for non-frontier tokens. So DeepSeek doesn&#8217;t make sense unless it&#8217;s a government-funded national champion.</p><p><strong>Kevin:</strong> If DeepSeek weren&#8217;t in China, with their leadership and computer scientists, they could have attracted the Chinese equivalent of Alec Radford and Ilya Sutskever and been in the race for the frontier. They can&#8217;t, because of the capital markets. This isn&#8217;t just AI &#8212; all sorts of industries face it. They can move quickly when the design already exists, but for &#8220;I&#8217;m doing something genuinely new,&#8221; they&#8217;re behind. Self-driving &#8212; they&#8217;re behind Tesla. Not Waymo, <em>Tesla</em> &#8212; even the frontier Chinese car companies.</p><p><strong>Andrey:</strong> That&#8217;s crazy to me. I&#8217;d have thought they&#8217;d have a separate, more generous government lane.</p><p><strong>Kevin:</strong> Look at what Google had to spend to build Waymo &#8212; no one else in North America pulled it off, because you needed to lose tens of billions and there wasn&#8217;t enough capital.</p><p><strong>Andrey:</strong> In China labor is cheap, so the economics of an autonomous-vehicle service are worse there. But modern neural networks made AVs a lot easier &#8212; Google couldn&#8217;t really have done it before 2022.</p><p><strong>Kevin:</strong> An executive at one of the new Chinese car companies told me that in China, Elon&#8217;s strategy is seen as smarter than Waymo&#8217;s &#8212; they think Waymo&#8217;s approach is out of date: LiDAR is cheap now, don&#8217;t map the roads. Maybe they&#8217;re right and catch up. On the cars themselves they&#8217;ve caught up &#8212; if their cars were sold in North America they&#8217;d take the market. And it&#8217;s not the traditional four &#8212; the Ford and GM of China are also screwed; the architectural shift to electric was too hard. It&#8217;s the new companies that would crush us. But they still haven&#8217;t caught up on self-driving.</p><p>[01:00:11] <strong>Seth:</strong> Follow-up on capital deployment &#8212; bringing Leopold back. He thinks the big frontier labs end up as nationalized projects. China can deploy a lot of capital toward national projects. Do you see this disadvantage reversing if we get one big national lab per country?</p><p><strong>Kevin:</strong> Good question. Hasn&#8217;t happened yet. I think they&#8217;d have the same problem &#8212; China has hippies now. They have words like <em>tang ping</em>, &#8220;lie flat&#8221; &#8212; I&#8217;m not joining the rat race. They have guys like the people at Anthropic wearing sandals and reading the Whole Earth Catalog. Those people, in the US and China, aren&#8217;t going to work for some state-backed project. You can maybe state-back the energy rollout, but it wouldn&#8217;t attract some types of talent.</p><div><hr></div><h2>War, Nationalization, the End of Open Source &#8212; and Claude [1:01:40 - 1:06:06]</h2><p><strong>Kevin:</strong> The part of <em>Situational Awareness</em> that seems like it must happen &#8212; I wrote my PhD dissertation on early nuclear. Back then you literally weren&#8217;t allowed to publish your patents &#8212; state secret. We&#8217;re very close to wars where AI plays a major role. At that point, who&#8217;s going to let this stuff be independent? The government doesn&#8217;t let you sell missiles &#8212; they&#8217;ll let you sell to partners they approve, and that&#8217;s it.</p><p><strong>Seth:</strong> Does that mean the end of open-source models above a certain size? Some sort of IAEA for AI? Turing police monitoring frontier labs under UN auspices?</p><p><strong>Kevin:</strong> When people talk about UN regulation of AI &#8212; take a foreign-policy class. Neither China nor the US cares one whit what the UN says. There&#8217;s going to be an organization called the G2: the US president and the Chinese premier talking to each other. That&#8217;s how it&#8217;ll work.</p><p>Open source is interesting &#8212; it&#8217;s a little bit dying in China. The most well-known researchers at Alibaba quit. A couple of other well-known model makers are going to go bankrupt &#8212; it&#8217;s not obvious how you make money making open-source LLMs as an independent. I suspect Llama is the last big one Meta makes. Someone will make them &#8212; NVIDIA&#8217;s pretty clearly going to try, because it&#8217;s such an obvious complement. But you can imagine a world where open source becomes much less common.</p><p>[01:03:32] One interesting thing: talking to people in AI in China &#8212; not political people &#8212; every single person thinks Claude is the best model, and they all use Claude. Not domestic models. Even though it&#8217;s very hard to do that from China, on both the government and the Anthropic side. There&#8217;s no opinion that China is catching up on AI. The view is that not only is Claude ahead, but the one place they a bit believe the AGI pill is that inside OpenAI, Anthropic, and DeepMind they&#8217;re using these models to speed up product deployment &#8212; and China doesn&#8217;t have the same access to frontier models, which makes it tough.</p><p>I&#8217;m doing a thing for NBER on what chip bans would do to endogenous innovation in China &#8212; how to even model that isn&#8217;t obvious. The cynical answer is it&#8217;s whatever the marginal cost of buying chips from Kazakhstan is &#8212; one more plane flight.</p><p><strong>Seth:</strong> I was reacting to the Jensen interview. We&#8217;re half a beard away from you being at that point.</p><p><strong>Kevin:</strong> I should have worn the leather jacket &#8212; that&#8217;s the look now. Actually, the real move is the T-shirt from our machine-learning accelerator, before we called it AI, back in 2016. That&#8217;s the one you flex with.</p><div><hr></div><h2>A Fine Theorem, Blogging, and the Value of Taste [1:06:06 - 1:17:48]</h2><p>[01:06:06] <em>(For those playing along at home, now&#8217;s your chance to think about how this conversation has changed your priors &#8212; sponsored by Revelio Labs.)</em></p><p><strong>Seth:</strong> Revelio Labs is a leading provider of labor-economics data and data services for companies, academics, and independent researchers. They combine comprehensive micro-level data on employee profiles, job postings, and sentiment with standardizations, mappings, and enrichments &#8212; flexibly aggregated to company, market, or industry &#8212; to study everything from career trajectories to occupational transformation to the impact of AI on labor demand. Their data is available on WRDS, so if you&#8217;re an academic with a good library, check whether you already have access. If not, reach out to their economics team.</p><p>[01:07:21] <strong>Seth:</strong> One thing you didn&#8217;t mention at the top: the reason you&#8217;re so close to my heart is your famous blog from the glory days of econ blogging, <em>A Fine Theorem</em>. When I started my PhD in 2012, getting excited about the big questions in economics and how theory can contribute, I found it so inspiring. So much of how you publish in econ now is: find a cute IV for one of a limited list of subjects, or &#8212; God willing &#8212; J-PAL backs you and you do an RCT. That may be useful, but it&#8217;s not what excited me about economics. Your blog was my north star for how technical theory can and should be communicated. So, snaps for how cool that was.</p><p><strong>Kevin:</strong> Hold on &#8212; who do you think the Gen X is in this conversation?</p><p><strong>Seth:</strong> Are you an elder millennial? Did I just mess up?</p><p><strong>Kevin:</strong> I thought I looked young for my age &#8212; I&#8217;ve got the dimples.</p><p><strong>Seth:</strong> As a generational-conflict theorist, the thing that struck me about the Dwarkesh&#8211;Jensen interview was Gen X shape-rotator Jensen and millennial wordcel Dwarkesh. So it wasn&#8217;t surprising you had the leather-jacket option.</p><p><strong>Kevin:</strong> I&#8217;ll say the Gen X has excellent taste in music. I went to the Oasis reunion concert &#8212; probably the youngest person there. It was great.</p><p>[01:09:42] So, <em>A Fine Theorem</em>. It&#8217;s related to AI development, believe it or not &#8212; one important way new technologies diffuse is the development of complements. That site started as my PhD notes on the papers I was reading; it was just easier to keep them in a WordPress setup. Some people found it through RSS &#8212; that&#8217;s how you found things on the internet then. Now people find things through gated social media, group chats, podcasts. It was good timing for me. I was never that interested in running a podcast &#8212; someone asked me to do one on the economics of science years ago &#8212; writing just matches my background better.</p><p>It got a bit wild. I&#8217;d write about maybe a hundred papers a year, plus Clark Medals and Nobel Prizes. I had a reputation as the guy who reads everything across fields and isn&#8217;t shy about his opinions. The three craziest emails I&#8217;ve ever gotten: I proposed a reform to the NBA and the president of an NBA team emailed me to talk about it. And two different Nobel laureates read my notes after they won and wrote asking me to read through their Nobel speeches. That&#8217;s the coolest thing ever.</p><p><strong>Seth:</strong> &#8220;Explain to me why my work was important.&#8221;</p><p><strong>Kevin:</strong> It makes sense &#8212; you know your work, but not always how people see it or how it influences them. I go to conferences and students will say &#8220;I&#8217;m extending your paper from 15 years ago this way,&#8221; and I&#8217;ve completely forgotten about that corollary. They know more about it than I do. Tyler Cowen liking it led a lot of people to read it. At one point it got, I don&#8217;t know, a million views &#8212; crazy for a microeconomic-theory blog.</p><p>[01:12:51] <strong>Seth:</strong> I do think you&#8217;re quite good at writing it for an educated reader, not just as a paper. There&#8217;s a big latent market for this &#8212; previous guest Noah Smith works the same lane. We love Noah, but you can&#8217;t compare Noah to Kevin in terms of gravitas and depth.</p><p><strong>Kevin:</strong> A lot of academics think their job is research and teaching &#8212; writing papers for other academics and maybe policy folks. But now I know who&#8217;s reading that stuff. I was writing about epistemic game theory, and serious people read it. My work on progress studies &#8212; I teach a class on progress with serious research behind it &#8212; there&#8217;s huge interest. I was at a conference with Chad Jones, the growth theorist, and there are people in industry reading Chad Jones papers seriously. The world is much more interested in serious work that answers serious questions than academics think. If they understood that, they&#8217;d be more careful with their work and would choose different topics &#8212; instead of &#8220;I&#8217;m writing this because journal editor X just got promoted.&#8221;</p><div><hr></div><h2>The Economist as Plumber: Comparative Advantage &amp; RCTs [1:17:48 - 1:24:07]</h2><p>[01:14:24] <strong>Seth:</strong> Let me ask about the <em>how</em> and <em>who</em> you write for. One theory behind this podcast: as the marginal cost of writing papers goes down, the marginal product of reading them can go up. Do you see AI increasing the relative importance of digesting and synthesizing research?</p><p><strong>Kevin:</strong> The one-sentence version you hear &#8212; which I think is true &#8212; is that the marginal value of taste has gone up.</p><p><strong>Seth:</strong> But what&#8217;s taste?</p><p><strong>Kevin:</strong> There&#8217;s stuff that&#8217;s fun to consume &#8212; I watch YouTube golf like everyone my age, but I know I&#8217;m not learning anything; I should be watching topology videos. Taste is understanding <em>why a thing matters</em>. Show me 20 things written about chemistry and I can tell which is better written, but not which one matters. To have taste &#8212; in music, literature, economics, anything &#8212; you need a really strong epistemic base. AI can point out &#8220;this is a good paper,&#8221; but not &#8220;this is a good paper in line with your individual interests.&#8221; Maybe in a world with continual learning, where your AI is your assistant &#8212; but we&#8217;re not there.</p><p><strong>Seth:</strong> But it was beyond what was interesting to you &#8212; somehow it was also inspiring to people like me in grad school.</p><p><strong>Kevin:</strong> Right. I&#8217;m illiterate about music &#8212; play me some Bach and I barely know the difference. But once in a while a really good critic writes &#8220;listen to this part and you&#8217;ll hear this,&#8221; and suddenly I do hear it and understand why it&#8217;s interesting. That person couldn&#8217;t have just listened to that one piece or read one book &#8212; they need to understand the history of music. People have different areas where they can have taste. Mine is probably the intersection of theory, history, and history of thought &#8212; and that mixture isn&#8217;t very common.</p><p>[01:17:48] <strong>Seth:</strong> Let me pull out something you may have a distaste for &#8212; a quote from your review of the Banerjee&#8211;Duflo&#8211;Kremer prize. &#8220;The economist as plumber, famously popularized by Duflo, who rigorously diagnoses small problems and proposes solutions, is a fine job for a World Bank staffer, but a crazy use of the intelligence of our otherwise leading scholars.&#8221; React to that in the age of AI, where the market is flooded with &#8220;we estimated the productivity impact of AI adopted here on this date&#8221; papers. What should those people do instead?</p><p><strong>Kevin:</strong> You&#8217;ll be surprised &#8212; because I believe in comparative advantage. I literally mean it&#8217;s good work for a World Bank economist; people should do that. I just don&#8217;t think Banerjee and Duflo should have been doing it. Same way Stantcheva&#8217;s taxation work was unbelievable, Clark-Medal-winning &#8212; and then she wrote a bunch of papers basically running a survey firm. The papers are interesting, but it&#8217;s not her comparative advantage; many people have more expertise in that area, and it&#8217;s not that complementary with the rest of her work.</p><p><strong>Andrey:</strong> I&#8217;ll disagree. Both survey research and experiments required elite permission to do this type of work. There&#8217;s no objective, agreed-upon standard in social science for what we should work on. Having an MIT or Harvard economist legitimize it in a top-five journal lets a bunch of other people &#8212; for whom it <em>is</em> their comparative advantage &#8212; work on it. On the margin maybe they work too much on it. In marketing, where I sit, there was a perception that survey research with stated preferences was something we shouldn&#8217;t do &#8212; and now if a top economist says it&#8217;s okay, maybe we can.</p><p><strong>Kevin:</strong> For sure &#8212; same way J-PAL was useful, and they won a Nobel for it, so they were rewarded. I agree on the permission structure. The question is what we do now with AI. In a sense it&#8217;s not great for me &#8212; being the smart-ass kid who&#8217;s really good at algebra is worthless now. I worked on a paper recently I&#8217;d been stumped on for years &#8212; a proof I couldn&#8217;t figure out. GPT-5.4 Pro was also stumped, but in its write-up it gave me a polytope-theory result I hadn&#8217;t seen, and I used it to prove the thing. I felt like a dad beating his teenage kid in basketball &#8212; super happy, but I know it might be the last time.</p><p>[01:21:56] If you&#8217;re a PhD student now and your specialty is being really good at solving models, you&#8217;re just not going to have a job &#8212; you&#8217;re not as good as the AI. But some things are incredible complements to AI: within-firm field experiments done with much higher ambition than now. Those will be very popular and not susceptible to replacement for a while.</p><p><strong>Andrey:</strong> But didn&#8217;t you say we shouldn&#8217;t be working on this?</p><p><strong>Kevin:</strong> I said we shouldn&#8217;t be doing RCTs &#8212; but I believe in comparative advantage, and we&#8217;ve changed the price of the factors. If we&#8217;re going to do this, what&#8217;s a bad idea is doing it atheoretically and ahistorically. Two things you need as a PhD student: your work has to be a complement to what AI can do, and your work has to have taste &#8212; you need to know what matters and why. A field experiment estimating a treatment effect no one cares about shows a lack of taste. Thinking you&#8217;ll get a job solving a model any AI can solve shows a lack of understanding of comparative advantage. High-paced managerial types are going to do better in academia than they used to, and some folks who were high-status will find nobody cares.</p><p><strong>Andrey:</strong> I see how the human advantage is running RCTs versus writing macro models. But what&#8217;s the right approach for writing that AI book you want us to write?</p><p><strong>Kevin:</strong> I still think you should write the macro model &#8212; your contribution just isn&#8217;t <em>solving</em> it. And your empirical paper needs to draw on and understand the macro models you&#8217;re building on. You should spend more time reading papers, not less, to develop taste.</p><p><strong>Andrey:</strong> Or you shouldn&#8217;t read papers &#8212; you should talk to the AI about the papers. Or listen to this podcast.</p><p><strong>Kevin:</strong> You should be listening to Justified Posteriors, brought to you by Jane Street.</p><p><strong>Andrey:</strong> We&#8217;re manifesting Jane Street.</p><div><hr></div><h2>The Future of the Academic Paper [1:24:07 - 1:28:22]</h2><p>[01:24:19] <strong>Kevin:</strong> Academic papers are an unbelievably entrenched system, but here&#8217;s where I&#8217;m trying to go &#8212; and I edit an AEA journal, so I talk to them about how we handle AI. In a couple of years, a paper is: all the lab notes, code, and data, open and in a format AI can read &#8212; that&#8217;s already in progress. Then a paper that ranges from the 40-page version to the five-page version to a &#8220;talk to the AI&#8221; version. If you go to my website, my papers already have a built-in Gemini Flash interface, because I assume people want to talk about the paper while reading it. It&#8217;s not just a PDF.</p><p>So every paper will have a partially AI-generated hundred-page version with all the information for the AI, the 40-page version, the five-page, the three-page, the interactive version &#8212; because the cost of writing the paper is so high relative to the cost of those manipulations. The idea of a paper as a fixed set of words is over. If that&#8217;s all it is, everyone&#8217;s going to talk to GPT about it anyway &#8212; we can do better.</p><p><strong>Andrey:</strong> Does that mean writing goes down in importance? Someone like Chad Jones is such a crisp writer &#8212; that&#8217;s a key reason people read him.</p><p><strong>Kevin:</strong> How AGI-pilled am I? The best academic writer in our profession is not Hemingway &#8212; let&#8217;s not be deluded; the average writer is terrible. People outside academia may not realize how much editing for readability happens in an academic article: the answer is zero. Maybe one or two sentences you&#8217;ll be asked to crisp up. It&#8217;s not The New Yorker &#8212; there&#8217;s no editor rewriting your paper for readability. The only reason people try to write well is that on the margin it raises your acceptance probability. Otherwise they write like a lawyer.</p><p><strong>Andrey:</strong> It&#8217;s taste. It&#8217;s for themselves.</p><p>[01:27:05] <strong>Kevin:</strong> I had an idea &#8212; maybe we do it for AI. I wanted an innovation journal; there&#8217;s no good one, and innovation is very interdisciplinary. But no one will send a paper to a new journal, for tenure reasons. So how do I free-ride on the system? Create a journal that any <em>already-published</em> paper is eligible for. Have a board of 30 great innovation and AI economists; as soon as three say &#8220;if this were my field, top field journal, I&#8217;d have taken it,&#8221; it&#8217;s in the journal. We link to the working-paper version and hire a professional to write a 1,500-word, Quanta-Magazine-style article about why the paper matters.</p><p><strong>Seth:</strong> Have you heard of the Unjournal, an EA project? It has some of these ideas.</p><p><strong>Kevin:</strong> Yeah, the Unjournal&#8217;s a good one.</p><p><strong>Andrey:</strong> Works in Progress is doing some of this too &#8212; taking academic research and making a great article about it.</p><p><strong>Kevin:</strong> That&#8217;s why the innovation-econ world and the progress world have a lot in common &#8212; we&#8217;re all friends. This year I felt like a progress-world celebrity, because one of my PhD advisors was Joel Mokyr, and they love Mokyr in progress world &#8212; he&#8217;s like Michael Jordan.</p><p><strong>Andrey:</strong> I tried reading <em>A Culture of Growth</em> and it&#8217;s unreadable. I&#8217;ll just put it out there.</p><p><strong>Kevin:</strong> <em>The Gifts of Athena</em> is the one I recommend &#8212; though you have to work through 50 pages of prescriptive-versus-propositional knowledge with lambdas and sigmas. He&#8217;s still a better writer than the average economist &#8212; low bar.</p><div><hr></div><h2>San Francisco, Ambition &amp; the Permission Structure [1:28:22 - 1:32:56]</h2><p>[01:28:46] <strong>Kevin:</strong> My favorite thing about what&#8217;s going on in California &#8212; other than the incredible ambition &#8212; for folks who aren&#8217;t here, there&#8217;s all sorts of craziness; they make Seth and his EA beard look normal.</p><p><strong>Seth:</strong> It hosts insects and shrimp that are having a lot of utility.</p><p><strong>Kevin:</strong> You can save a little dinner for later up in the mustache. But the level of ambition &#8212; your average 21-year-old asking &#8220;what should I do with my life&#8221; aims <em>this</em> high. That&#8217;s not normal in most places, where the very smart people are type-A, &#8220;follow this rule and this process.&#8221; In academia we know tons of those people. It&#8217;s super refreshing. In progress world, every random person is like, &#8220;should I make money, or start a biohacking magazine that four people buy but I like doing?&#8221; &#8212; biohacking magazine. And I love it.</p><p>We have a guy in Toronto, Ben Perry, who runs a sort of &#8220;Toronto society,&#8221; also in progress world &#8212; he holds talks on what makes a beautiful city. He asked me to give one related to my progress course, on idiosyncratic factors that lead to progress &#8212; a pretty out-there talk. I show up and we&#8217;ve sold out a concert hall. People paid 30 bucks a ticket, there was music beforehand, and afterward people are in the hallway chatting about what they&#8217;re building. These people are all over.</p><p>[01:31:05] The remaining secret sauce of Silicon Valley is that everyone &#8212; all the way up and down the permission and capital structure &#8212; agrees the most ambitious people should have the power and the capital. That&#8217;s rare. During COVID, my university was closed and it was driving me crazy, so I went to teach in Senegal &#8212; the best university in French West Africa, teaching high-growth entrepreneurship. Great students. I asked what they wanted to do when they graduated, and they all wanted to work for the government. &#8220;You don&#8217;t want to start a company?&#8221; &#8220;If it doesn&#8217;t work out and I go bankrupt, I&#8217;m living on the street, and no one gives a 22-year-old money to start a company.&#8221; And that&#8217;s reasonable. But that societal structure makes growth impossible. That&#8217;s the thing you have to get right.</p><div><hr></div><h2>Lightning Round [1:32:56 - 1:38:08]</h2><p>[01:32:56] <strong>Seth:</strong> How are we on time? Want to do a lightning round? And give yourself a chance to talk about All Day TA.</p><p><strong>Kevin:</strong> Let&#8217;s do All Day TA as part of the lightning round, so I don&#8217;t feel like a sales call.</p><p><strong>Seth:</strong> Lightning round, beginning. Favorite economist, living or dead?</p><p><strong>Kevin:</strong> Dead: Paul Samuelson &#8212; awesome work. Living: Bengt Holmstr&#246;m, because when I walk around on the street his ideas are in my head all day.</p><p><strong>Seth:</strong> All Day TA &#8212; what did you learn from being an entrepreneur?</p><p><strong>Kevin:</strong> This is my company &#8212; we sell ed-tech to universities, a hundred-plus now, all over the world. I learned that for AI diffusion, institutional sales is so hard in traditional industries, and so unrelated to product quality, that the people who already own the gates into big institutions &#8212; the Salesforces, the Microsofts &#8212; are going to clean up in the AI world. People who think they&#8217;ll sell a great product and get around those gates are deluding themselves.</p><p><strong>Seth:</strong> Did you learn a trick for selling to universities?</p><p><strong>Kevin:</strong> Did it a hundred times. The technical stuff matters very little. You have to figure out who has the decision rights &#8212; often the head of IT &#8212; and they often have some idiosyncratic thing they want. Going through the professor has no power. With any institutional sale, the secret is knowing who can write the check and getting to that person quickly.</p><p><strong>Seth:</strong> If you had to burn all of Kremer&#8217;s RCT work or his O-ring paper, which would you destroy?</p><p><strong>Kevin:</strong> I love the O-ring paper. But if your experimental papers probably saved a million lives, I have to let you keep those. So we burn the O-ring. Also, we probably could have figured out the O-ring without Kremer.</p><p><strong>Seth:</strong> But would it have been written that beautifully?</p><p><strong>Kevin:</strong> No. It&#8217;s such a nice paper.</p><p><strong>Seth:</strong> What advice do you have for folks in economics grad school today?</p><p><strong>Kevin:</strong> You&#8217;re five years out &#8212; read <em>Situational Awareness</em> chapter one, and believe it. Whatever you think you&#8217;re doing in your job-market paper, ask: is that consistent with creating value in the world of <em>Situational Awareness</em> chapter one? If not, literally do anything else.</p><p><strong>Seth:</strong> If you had a choice of joining a lab or going to econ grad school, what should someone choose?</p><p><strong>Kevin:</strong> I don&#8217;t think there&#8217;s necessarily a conflict. But when my most ambitious 22-year-old students ask what to do, I say: it&#8217;s like being a writer in 1920 &#8212; get on a boat and go to Paris and don&#8217;t be stupid. You&#8217;re an ambitious 22-year-old: get in a van, drive to San Francisco, and don&#8217;t be stupid.</p><p><strong>Seth:</strong> Seth, any more lightning rounds? No, I think we&#8217;ve covered it. Kevin, this was a completing-the-circle experience for me &#8212; your blog was so inspirational on my economic journey, and getting to talk to you and be treated as an equal was a very special moment.</p><p><strong>Kevin:</strong> It&#8217;s nice you got to talk to me before I reached my full senescence &#8212; given whatever age you think I am. It&#8217;s really ruining my self-image.</p><p><strong>Seth:</strong> I always think it&#8217;s so beautiful when millennials can get along with Gen Xers. It&#8217;s a special thing.</p><p><strong>Kevin:</strong> You know the irony? The Gen Xer wouldn&#8217;t have cared &#8212; &#8220;who cares, man, don&#8217;t worry about it.&#8221; Only the millennial complains about being called the wrong generation.</p><p><strong>Seth:</strong> That&#8217;s true. Thanks a lot, guys &#8212; keep up the good work on the podcast. I&#8217;m looking forward to the next guests.</p><p>[01:37:53] <strong>Seth:</strong> And to listeners at home &#8212; keep your posteriors justified.</p>]]></content:encoded></item><item><title><![CDATA[We need well-capitalized prediction markets for AI impacts]]></title><description><![CDATA[And how to create them]]></description><link>https://empiricrafting.substack.com/p/we-need-well-capitalized-prediction</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/we-need-well-capitalized-prediction</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Tue, 19 May 2026 23:34:18 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!_6FF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>As our readers have surely noticed, there&#8217;s enormous disagreement about the likely economic effects of AI. Some AI forecasters, especially those emphasizing recursive self-improvement, expect AI capabilities to soon surpass that of most human workers. As a result, they expect large-scale labor market displacement, beginning with white-collar work and spreading to all of labor in the long run. The<a href="https://ai-2027.com/"> AI 2027 scenario</a> lays out a year-by-year path to transformative AI and implies large-scale labor displacement. Other economists, in contrast, are more skeptical of such extreme rapid take-offs, emphasizing adoption frictions, organizational bottlenecks, complementary investments, and the slow diffusion of new production processes.</p><p>The<a href="https://forecastingresearch.substack.com/p/forecasting-the-economic-effects-of-ai"> Forecasting Research Institute&#8217;s recent survey</a> of leading economists, AI experts, and superforecasters quantifies several dimensions of disagreement. The figures below show that there is enormous disagreement among experts and that AI experts tend to forecast higher economic growth rates than economists. Tom Cunningham nicely describes a variety of forecasts in his <a href="https://tecunningham.github.io/posts/2025-10-19-forecasts-of-AI-growth.html">blog</a> post.<br><br>Some disagreement reflects genuinely different beliefs about AI progress and the transition path of economic adjustment. Disagreement is healthy, and reflects diversity in economists&#8217; world models. But a deeper and more fundamental problem is that few people have strong incentives to produce granular, decision-relevant forecasts. Few have done the detailed work of trying to predict what will happen to the macroeconomy. Even fewer have worked at the granular level required to predict which industries will be affected and when, which jobs will be diminished and which will grow, and what will happen to wages in those jobs. The financial incentives are too weak for these experts to invest the effort. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Justified Posteriors is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_6FF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_6FF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png 424w, https://substackcdn.com/image/fetch/$s_!_6FF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png 848w, https://substackcdn.com/image/fetch/$s_!_6FF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png 1272w, https://substackcdn.com/image/fetch/$s_!_6FF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_6FF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png" width="639" height="466.96153846153845" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1064,&quot;width&quot;:1456,&quot;resizeWidth&quot;:639,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_6FF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png 424w, https://substackcdn.com/image/fetch/$s_!_6FF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png 848w, https://substackcdn.com/image/fetch/$s_!_6FF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png 1272w, https://substackcdn.com/image/fetch/$s_!_6FF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb83307-d29b-4985-9279-44904e3527bb_1560x1140.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OfyZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OfyZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png 424w, https://substackcdn.com/image/fetch/$s_!OfyZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png 848w, https://substackcdn.com/image/fetch/$s_!OfyZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png 1272w, https://substackcdn.com/image/fetch/$s_!OfyZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OfyZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png" width="646" height="393.989010989011" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:888,&quot;width&quot;:1456,&quot;resizeWidth&quot;:646,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!OfyZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png 424w, https://substackcdn.com/image/fetch/$s_!OfyZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png 848w, https://substackcdn.com/image/fetch/$s_!OfyZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png 1272w, https://substackcdn.com/image/fetch/$s_!OfyZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3909fe55-2837-4dbf-98e2-13e5600f99c2_1826x1114.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Even structured modeling efforts illustrate how wide the uncertainty is.<a href="https://epoch.ai/gate"> Epoch&#8217;s GATE model</a> is a macro model of AI&#8217;s economic impact, built around an automation feedback loop between compute investment, AI capabilities, and task automation. But its outputs are highly sensitive to a handful of parameters whose true values are deeply uncertain. The model can produce qualitatively different trajectories depending on plausible parameter choices.</p><p>So we&#8217;re operating in a world of great uncertainty. This uncertainty is high-dimensional and will never be fully resolved. But we can, in principle, make good predictions about <em>slices</em> of the state space -- specific events that are both <em>learnable</em>: we can go out into the world and collect information about what will happen; and <em>decision-relevant</em>: the information is valuable for making better decisions. We think our best bet for doing so will be through prediction markets that, despite their shortcomings, are powerful technologies for incentivizing information acquisition and aggregating dispersed information.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>  This is a modern rendition of a classic Hayekian case for prices.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><h3>The Proposal</h3><p>We propose that AI labs, Big Tech companies, and philanthropic donors contribute funding to create well-capitalized prediction markets to understand the likely impacts of AI. We need this information because individuals, firms, and policymakers are already making decisions right now &#8211; with or without this information.</p><p>Consider a college graduate deciding which industry to enter, or even a highschooler entering college deciding what to major in. They probably want some information about which jobs are likely to exist in the future and which jobs are likely to be less common. It would be very useful to have, based on the most recent up-to-date information, some signal for how to make this decision. Firms are in a similar position, when deciding which occupations to hire and at what level of seniority.</p><p>Policymakers face an even harder version of this problem. Decisions about retraining programs, extended unemployment insurance, and sector-specific support require knowing not just whether AI will displace workers, but which workers, in which industries, and how quickly. Retraining programs in particular are highly sensitive to these forecasts: training a displaced bookkeeper takes years, and the value of that investment depends on which destination occupations will still exist when the training is complete.</p><p>The FRI survey found that economists ranked job retraining as their most-supported policy response, with a forecasted boost to labor force participation in a rapid AI progress scenario, but designing such a program requires answering granular questions that current forecasts simply don&#8217;t address. Extended UI faces a similar problem: the case for sector-specific extensions depends on whether displacement is temporary (workers transitioning between jobs) or structural (occupations disappearing), and the appropriate duration depends on expected re-employment rates that vary enormously by industry.</p><h3>How Prediction Markets Would Help and Why They Don&#8217;t Yet Exist</h3><p>For prediction markets, we envision a variety of contracts. Start with the labor market: aggregate labor-force participation, and occupation-level employment and wages, are measured in official BLS data. For more detail on how these contracts could resolve, see the appendix on contract resolution using labor-market data.  We would like to have a forecast of the future labor force participation rate and of the employment rate and wages across different occupations.</p><p>Suppose that we had prediction markets at different time horizons (1, 2, 5 years) for the total labor force size of different occupations in the United States, and analogous contracts on wages and employment rates. With an appropriate and liquid contract, we should have a good signal about the future labor market. If this exists, when a new graduate is choosing a career, they could use this information to think about which occupations have continued demand. Note that these contracts don&#8217;t forecast the impact of AI directly, but to the extent that AI is an important force in determining future employment rates, its impact will be a part of the forecast.</p><p>But labor outcomes are downstream of AI capabilities, and we&#8217;d also want contracts directly on capabilities themselves. The <a href="https://polymarket.com/event/which-company-has-best-ai-model-end-of-june">questions</a> currently trading on Polymarket are coarse, such as &#8220;Which company has the best AI model by the end of June?&#8221;, and don&#8217;t track the underlying technical progress in a way that&#8217;s useful for forecasting economic impact. We&#8217;d want contracts on benchmarks (e.g., ARC-AGI-3) or capability thresholds (e.g., passing Steve Wozniak&#8217;s coffee test) at different horizons. These capability forecasts are valuable on their own and would also help calibrate the labor market contracts: if traders price in rapid capability gains but slow labor market changes, that discrepancy is, itself, informative about the market&#8217;s belief about passthrough from capabilities to labor market impacts.</p><p>The next question is: what will it take for these prediction markets to have meaningful information aggregation, and why don&#8217;t they exist yet?<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p>There are two reasons people might participate in prediction markets: speculation and hedging. So far, firms haven&#8217;t created enough demand to hedge for AI&#8217;s labor market impacts. Furthermore, for contracts that are longer-term, traders need to be compensated at least at the risk-free interest rate, meaning that incentives to trade are diminished for contracts further in the future. As a result, the chicken-and-egg problem hasn&#8217;t been broken, and we lack prediction markets on these questions. So the question becomes how to bootstrap them (see Andy Hall&#8217;s <a href="https://freesystems.substack.com/p/building-the-truth-machine">essay</a> on some ideas).</p><h3>What It Would Take</h3><p>Prediction markets need to be seeded. The reason is that prediction markets have increasing returns to participation. With a thin market, prices are noisy, spreads are wide, and traders cannot enter and exit positions without moving the market. But without traders, the market never becomes informative. Seeding solves this coordination problem by paying for early liquidity, narrowing spreads, and making it worthwhile for those best-positioned to acquire information to do so.</p><p>There are two ways to seed prediction markets. The direct way is to provide liquidity by staking a market with capital on both sides of contracts, so that other participants have something to trade against. This is how most prediction markets historically got started, but it requires the seeder to take on inventory risk.</p><p>A more natural way is sponsorship. The way this works is that a principal (the entity that wants the information) deposits money into a contract. Sponsorship adds a way to bootstrap the market: the sponsor pays participants who improve liquidity, for example by posting orders that narrow spreads or add depth. If the sponsorship is large enough, then traders have incentives to acquire information and to participate in the market. Once there is already active trading volume, the market may sustain itself even without additional sponsorship funding. Sponsorship is already a feature on Polymarket and may be available soon on Kalshi. Technically, sponsorship is one way to subsidize liquidity provision. This connects to the market-design literature on automated market makers and scoring rules, especially Hanson&#8217;s logarithmic market scoring rule and Chen and Pennock&#8217;s bounded-loss market-maker framework, which formalizes the tradeoff between liquidity provision and worst-case loss (Hanson 2003, 2007; Chen and Pennock 2007).</p><p>It&#8217;s worth being clear about what sponsorship does and doesn&#8217;t do, because the FRI survey is instructive here. FRI paid economists to participate, and got back the most comprehensive elicitation of expert beliefs about AI&#8217;s economic impact that we have. But the economists essentially had no skin in the game. Sponsorship has a different structure. The sponsor pays for the market to exist, but the traders who actually set prices are putting their own capital at risk.</p><p>The natural candidates to sponsor these markets are large players, either the AI labs themselves, or firms with significant exposure to AI that are interested in getting better information. They have the budget, and they have the strongest reason to want this information to exist. We don&#8217;t have a good sense of how much sponsorship volume is necessary for good information aggregation. That&#8217;s an open question, and one we&#8217;re interested in experimenting with. That said, we can&#8217;t do this alone. A big player needs to participate, or at least fund research on how to create these contracts.<br><br><em>Thanks to Tom Cunningham, Andy Hall, John Horton, and Scott Kominers for comments.</em></p><h4>Appendix: What Makes a Good Contract</h4><p>Not every interesting question can be turned into a useful prediction market. A workable contract needs four things:</p><p>Verifiability. The resolution criterion has to be objectively checkable. &#8220;By 2030, will AI have caused Cheerios sales to double?&#8221; fails immediately because &#8220;caused&#8221; isn&#8217;t observable. You can&#8217;t separate the AI-driven counterfactual from everything else moving in the world. &#8220;By 2030, will the BLS-reported employment in occupation X be below Y?&#8221; is verifiable: the number gets published, and you read it.</p><p>Stability. The underlying data series has to keep being published and measured the same way over the life of the contract. Niche or proprietary statistics fail this. BLS occupation employment, unemployment rates, and labor force participation are highly stable. Anything that depends on a single company&#8217;s voluntary disclosure is not.</p><p>Robustness to gaming. The bigger the underlying quantity, the harder it is for any trader to move it. Someone with a few million dollars and an axe to grind could plausibly distort sales of a single product or the headcount of a small firm. They cannot plausibly distort BLS employment statistics for a major occupational category. Sector-level labor market contracts are robust for the same reason: the sector is too large to move by hiring or firing at the margin.</p><p>Attention. Even a verifiable, stable, robust contract is useless if no one cares enough to trade it. Cheerios sales fail this test too. Unemployment contracts have so far failed, but we think that this is for reasons that are fixable.</p><h4>Appendix: Contract Resolution Using Labor-Market Data</h4><p>For labor-market contracts, the natural resolution sources are large, official, regularly published data series, and not tailor-made AI-impact metrics. The Occupational Employment and Wage Statistics (<a href="https://www.bls.gov/oes/">OEWS</a>) program under the Bureau of Labor Statistics  (BLS) provides annual estimates on employment and wages for approximately 830 occupations at the national level, state, metropolitan, and nonmetropolitan areas, as well as for specified industries. This makes OEWS a natural source for occupation-level employment and wage contracts. Such contracts would not identify the causal effect of AI. Instead, they would rely on a set of observable labor-market outcomes &#8212; employment, wages, or labor-force quantities, which directly influence students, firms, and policymakers.  For example, a contract could be settled on whether employment reported by BLS in a specified SOC occupation falls below a pre-specified threshold in a given release year.</p><h4><strong>References</strong></h4><p>Hayek, F. A. 1945. &#8220;The Use of Knowledge in Society.&#8221; <em>American Economic Review</em>.</p><p>Wolfers, Justin, and Eric Zitzewitz. 2004. &#8220;Prediction Markets.&#8221; <em>Journal of Economic Perspectives</em>.</p><p>Arrow, Kenneth J., et al. 2008. &#8220;The Promise of Prediction Markets.&#8221; <em>Science</em>.</p><p>Hanson, Robin. 2003. &#8220;Combinatorial Information Market Design.&#8221; <em>Information Systems Frontiers</em> 5 (1): 107&#8211;119.</p><p>Hanson, Robin. 2007. &#8220;Logarithmic Market Scoring Rules for Modular Combinatorial Information Aggregation.&#8221; <em>Journal of Prediction Markets</em> 1 (1): 3&#8211;15.</p><p>Chen, Yiling, and David M. Pennock. 2007. &#8220;A Utility Framework for Bounded-Loss Market Makers.&#8221; In <em>Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence (UAI 2007)</em>, 49&#8211;56</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>For example, if traders are risk averse and use prediction markets for hedging then the price may be a biased estimate of the expected outcome. Alternatively, if retail meme traders herd on the contract, then its value could become disjointed from the underlying value.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p> See Hayek (1945); Wolfers and Zitzewitz (2004); Arrow et al. (2008); Hanson (2003, 2007).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>There are <a href="https://www.metaculus.com/labor-hub/">prediction contests</a> on Metaculus relating to the labor force participation rate and other outcomes, but these have few participants and have small financial incentives to be correct.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Seb Krier on AGI, the Coasean Singularity, and EDM]]></title><description><![CDATA[Justified Posteriors talk to the AGI Policy Dev Lead for Google DeepMind]]></description><link>https://empiricrafting.substack.com/p/seb-krier-on-agi-the-coasean-singularity</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/seb-krier-on-agi-the-coasean-singularity</guid><dc:creator><![CDATA[Seth Benzell]]></dc:creator><pubDate>Tue, 19 May 2026 02:21:43 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/198338577/aa0ec838c56b9e2573b03f96504e66a1.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p><strong>Seb Krier on AGI, Scaffolding, and Coasean Bargaining at Scale</strong></p><p>In this episode of <em>Justified Posteriors</em>, we welcome <strong><a href="https://x.com/sebkrier">Seb Krier</a></strong> &#8212; policy lead for AGI at Google DeepMind and excellent Twitter poster. Speaking in his personal capacity, Seb walks us through his understanding of AGI, why AI alignment has gone better than expected, the potential and limitations of a world where agents constantly barter on our behalf, and &#8212; of course &#8212; electronic music. </p><p>We also cover AI in London vs. New York, how Seb went from reading <em>Marginal Revolution</em> for 15 years to becoming a recurring character on it, and Seb&#8217;s side-splitting humor on mediocre AI conferences.</p><p><strong>Related Links</strong></p><ul><li><p>Seb Krier on X: <a href="https://x.com/sebkrier">@sebkrier</a></p></li><li><p>Seb&#8217;s Substack, <a href="https://technologik.substack.com/">Technologik</a></p></li><li><p><a href="https://blog.cosmos-institute.org/p/coasean-bargaining-at-scale">&#8220;Coasean Bargaining at Scale&#8221;</a> &#8212; Seb&#8217;s essay at the Cosmos Institute (also <a href="https://www.aipolicyperspectives.com/p/coasean-bargaining-at-scale">republished here</a>)</p></li><li><p><a href="https://technologik.substack.com/p/musings-on-recursive-self-improvement">&#8220;Musings on Recursive Self-Improvement&#8221;</a> &#8212; Seb&#8217;s essay separating model-side RSI from societal-side</p></li><li><p><a href="https://aleximas.substack.com/p/the-cyborg-era-what-ai-means-for">&#8220;The Cyborg Era: What AI Means for Jobs&#8221;</a> &#8212; Seb&#8217;s guest essay on Alex Imas&#8217;s Substack, defending the scaffolding view</p></li><li><p><a href="https://www.anthropic.com/features/project-deal">Anthropic&#8217;s Project Deal</a> &#8212; the agent-bargaining experiment among Anthropic employees </p></li><li><p><a href="https://andreyfradkin.com/assets/marketbench.pdf">Fradkin &amp; Krishnan, &#8220;MarketBench&#8221;</a> &#8212; Andrey and Rohit experiment of LLMs bidding in procurement auctions as an investigation of the future of AI marketplaces and the companion writeup: <a href="https://www.strangeloopcanon.com/p/agent-know-thyself-and-bid-accordingly">Rohit Krishnan, &#8220;Agent, Know Thyself! (and bid accordingly)&#8221;</a> </p></li><li><p><a href="https://www.edgeesmeralda.com/">Edge Esmeralda</a> &#8212; Devon Zuegel&#8217;s pop-up village in Healdsburg, CA</p></li><li><p><a href="https://www.matsprogram.org/">MATS</a> &#8212; for junior economists looking to skill up on AI safety/governance</p></li><li><p><a href="https://cosmos-institute.org/">Cosmos Institute</a> and <a href="https://www.thefire.org/">FIRE</a></p></li><li><p><a href="https://bianjie.systems/">bianjie.systems</a> &#8212; the art platform Seb is co-organizing a dinner with in NY (<a href="https://x.com/sebkrier/status/2054941198406602861">Seb&#8217;s announcement</a>)</p></li><li><p><a href="https://en.wikipedia.org/wiki/Drexciya">Drexciya</a> &#8212; James Stinson, Gerald Donald, and the Detroit electro-afrofuturism canon</p></li></ul><p><strong>Timestamps</strong></p><p>(00:00) Intro <br>(01:16) What is AGI? <br>(07:30) In defense of scaffolding &#8212; Hayek, division of labor, and why one giant model won&#8217;t do it <br>(13:00) Markets for cognition: will agents bid in procurement auctions? <br>(18:40) Recursive self-improvement &#8212; separating the model side from the societal side (24:44) Alignment has gone better than 2017-Seb expected; prefer &#8220;intent following&#8221; (31:14) What economists should actually work on to inform AI labs<br>(33:32) What does a DeepMind policy lead&#8217;s day look like? <br>(38:20) AI Conferences<br>(41:52) Coasean bargaining at scale &#8212; the positive vision<br>(55:00) Inequality, property rights, and who gets the initial allocation <br>(01:03:00) The <em>Helldivers 2</em> &#8220;Managed Democracy&#8221; dystopia as Coasean bargaining gone wrong <br>(01:09:00) Sponsor: Revelio Labs <br>(01:09:30) Lightning round</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Justified Posteriors is a reader-supported publication. To receive new posts and support our work, consider becoming a free or paid subscriber. You&#8217;re also invited to our discord community at: https://discord.gg/b8VpPbBUt</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><br><br><strong>Transcript</strong></p><div><hr></div><p>00:00:00,100 --&gt; 00:00:20,480 [Seth]</p><p>[upbeat music] Welcome to the Justified Posterior&#8217;s podcast, the podcast that updates beliefs about the economics of AI and technology. I&#8217;m Seth Benzell, the number two biggest fan, after Tyler Cowen, in the Seb Krier fan club. </p><p>00:00:20,480 --&gt; 00:00:20,740 [Andrey]</p><p>[laughs] </p><p>00:00:20,740 --&gt; 00:00:24,660 [Seth]</p><p>Coming to you from Chapman University in sunny southern California. </p><p>00:00:24,660 --&gt; 00:00:34,120 [Andrey]</p><p>And I&#8217;m Andrey Fradkin, coming to you from San Francisco, California. And Justified Posterior&#8217;s is sponsored by the fine folks at Revelio Labs. </p><p>00:00:35,560 --&gt; 00:00:45,600 [Andrey]</p><p>We&#8217;re very excited to have Seb Krier here with us today. He is the policy lead for AGI at Google DeepMind, and is, </p><p>00:00:46,840 --&gt; 00:00:52,400 [Andrey]</p><p>dare I say, a thought leader in this space. Welcome to the show, Seb. </p><p>00:00:52,400 --&gt; 00:00:54,200 [Seb Krier]</p><p>Thank you very much. It&#8217;s great to be here. </p><p>00:00:55,380 --&gt; 00:00:58,160 [Seb Krier]</p><p>Yeah, I&#8217;m Seb, calling in from New York. </p><p>00:00:58,160 --&gt; 00:01:00,320 [Andrey]</p><p>And we should remind our listeners that </p><p>00:01:01,340 --&gt; 00:01:08,410 [Andrey]</p><p>Seb is, during this podcast, expressing his personal opinions, and is not speaking on behalf of DeepMind. All right. </p><p>00:01:08,410 --&gt; 00:01:09,740 [Seb Krier]</p><p>Indeed. [laughs] </p><p>00:01:09,740 --&gt; 00:01:11,060 [Andrey]</p><p>[laughs] </p><p>00:01:12,780 --&gt; 00:01:13,900 [Andrey]</p><p>The usual caveat. </p><p>00:01:15,260 --&gt; 00:01:16,760 [Andrey]</p><p>Seb, what is AGI? </p><p>00:01:18,080 --&gt; 00:01:19,450 [Seb Krier]</p><p>What is AGI? [laughs] </p><p>00:01:19,450 --&gt; 00:01:19,570 [Andrey]</p><p>[laughs] </p><p>00:01:19,570 --&gt; 00:01:19,580 [Seth]</p><p>[laughs] </p><p>00:01:19,580 --&gt; 00:01:19,780 [Seb Krier]</p><p>Great question. </p><p>00:01:19,780 --&gt; 00:01:21,900 [Andrey]</p><p>We&#8217;re going to start with the big questions. </p><p>00:01:21,900 --&gt; 00:01:22,880 [Seb Krier]</p><p>Yeah, might as well. </p><p>00:01:24,259 --&gt; 00:01:54,840 [Seb Krier]</p><p>[sighs] I think there&#8217;s so many definitions out there of what AGI is, and I think most of them are kind of unsatisfactory in one way or another. I&#8217;ve seen stuff like many definitions are indexed on the societal transformations or economic impacts of the technology, which I don&#8217;t really like very much because it makes it very dependent on external factors whether or not we have AGI. If it&#8217;s banned, we don&#8217;t have AGI, and if it&#8217;s not banned, we have AGI. Is it? </p><p>00:01:54,840 --&gt; 00:01:55,480 [Andrey]</p><p>[laughs] </p><p>00:01:55,480 --&gt; 00:02:04,670 [Seb Krier]</p><p>And there are other tests, like if an AI makes $1 million or something, which I find is very weird because most humans do not make $1 million in the first place. </p><p>00:02:04,670 --&gt; 00:02:05,080 [Andrey]</p><p>[laughs] </p><p>00:02:05,080 --&gt; 00:02:11,359 [Seb Krier]</p><p>So the one I kind of like is actually Shane Legg&#8217;s definition- </p><p>00:02:11,360 --&gt; 00:02:11,620 [Andrey]</p><p>Mm </p><p>00:02:11,620 --&gt; 00:02:12,420 [Seb Krier]</p><p>... who&#8217;s at Deep Mind, who is </p><p>00:02:13,640 --&gt; 00:02:16,980 [Seb Krier]</p><p>more of a capability-based definition, which is something along the lines of </p><p>00:02:18,420 --&gt; 00:02:20,960 [Seb Krier]</p><p>an AI or a system that does most </p><p>00:02:22,380 --&gt; 00:02:30,360 [Seb Krier]</p><p>standard cognitive tasks that people typically do. [lips smack] So it&#8217;s kind of the bar isn&#8217;t too low, and it&#8217;s also not too high either. </p><p>00:02:32,220 --&gt; 00:02:35,480 [Seb Krier]</p><p>And so I think he&#8217;s got this definition of a minimal AGI, </p><p>00:02:36,580 --&gt; 00:02:43,020 [Seb Krier]</p><p>and I think that we&#8217;re not exactly there yet. I would disagree with people saying that we have AGI today because I think </p><p>00:02:44,220 --&gt; 00:02:48,900 [Seb Krier]</p><p>a lot of the systems we have, there&#8217;s many things that a human can do that they don&#8217;t really do very well. </p><p>00:02:48,900 --&gt; 00:02:50,360 [Seth]</p><p>What&#8217;s the biggest gap that we&#8217;re missing? </p><p>00:02:52,020 --&gt; 00:03:47,740 [Seb Krier]</p><p>I&#8217;d say there&#8217;s a few. One of them might be continual learning, or at least the ability to adapt and learn over time, and in different contexts and situations, just kind of update your own world model or whatever. If I think of a new joiner in a company, they&#8217;re not super useful the first day, but their value goes up over time because they learn all sorts of things. And so [lips smack] that might be one of them. A lot of the systems we have today, I think, are not very good at software, and you&#8217;re using graphical user interfaces and software and whatnot. If I ask an agent right now to go and use a music production software and make a track, I think they&#8217;d generally struggle. That doesn&#8217;t mean it&#8217;s impossible to solve or anything like that, but I think, in many respects, they&#8217;re not as general as you&#8217;d want them to be. And then the other bit also is, [lips smack] and of course they still make some silly mistakes here and there, but I think that&#8217;s getting it fixed. But the creativity point is one that I&#8217;m really interested in as well, in that I think they&#8217;re really good at kind of </p><p>00:03:48,780 --&gt; 00:04:02,700 [Seb Krier]</p><p>exploiting maybe an existing paradigm or an existing knowledge and so on, and recombining knowledge and whatnot. But I think really coming up with new concepts and abstractions entirely is something I think humans can do, but I don&#8217;t see our current systems really doing either. </p><p>00:04:02,700 --&gt; 00:04:10,060 [Andrey]</p><p>How do you measure whether humans can do creative tasks? One of the things that </p><p>00:04:11,200 --&gt; 00:04:15,940 [Andrey]</p><p>strikes me as a bit of an unfair test in that, </p><p>00:04:17,060 --&gt; 00:04:23,290 [Andrey]</p><p>let&#8217;s say you ask an LLM to write a poem or to write a story. It&#8217;s very- </p><p>00:04:23,290 --&gt; 00:04:23,290 [Seth]</p><p>[laughs] </p><p>00:04:23,290 --&gt; 00:04:32,050 [Andrey]</p><p>... times more entertaining than what a random human would write. So, do you have a benchmark for creativity? </p><p>00:04:32,050 --&gt; 00:04:35,390 [Seth]</p><p>This is the meme where the robot asks Will Smith if he can compose an opera. </p><p>00:04:35,390 --&gt; 00:05:14,700 [Seb Krier]</p><p>[laughs] Can you? Yeah, exactly. It depends, and you&#8217;re right. Obviously, most people aren&#8217;t creating new abstraction and concepts on a day-to-day level. But I imagine there&#8217;s still something qualitative about that kind of creativity that I think does get applied in everyone&#8217;s day-to-day life in various kind of ways. Maybe they&#8217;re not as big or significant as creating a symphony. But I don&#8217;t really have a strong test. There&#8217;s actually an interesting podcast that had Ben Goertzel and Yoshua, I think a few years ago, where they were saying something like, if you had a model that was trained knowing only classical music and West African drumming, could it come up with jazz in the first place, or recreate jazz? </p><p>00:05:16,460 --&gt; 00:05:27,880 [Seb Krier]</p><p>And I quite like that test. And in principle, I can imagine it being possible. You could kind of decompose all sorts of different kind of elements and variables here and just get something jazz-like. But it still feels a bit... </p><p>00:05:29,580 --&gt; 00:05:40,580 [Seb Krier]</p><p>It&#8217;s not the same as just coming up with the idea of jazz in the first place and saying, oh, I&#8217;m going to try these things out. And for whatever reason, I&#8217;m going to stick to that. And I don&#8217;t know. It&#8217;s- </p><p>00:05:40,580 --&gt; 00:05:53,190 [Seth]</p><p>Recombination versus paradigm shifting. I&#8217;ve also heard one test people would want for AGI is, can you train the model on the 1900s corpus and it comes up with Einsteinian physics? </p><p>00:05:53,190 --&gt; 00:05:53,200 [Seb Krier]</p><p>Yeah. </p><p>00:05:53,200 --&gt; 00:05:54,720 [Seth]</p><p>That would be really impressive. </p><p>00:05:54,720 --&gt; 00:06:36,151 [Seb Krier]</p><p>Yeah, I think actually Demis uses that test sometimes, or I think Pele Gritzer as well mentioned it before. And there are some people, I think David Duvenour and Nick Levine, I think, had this recent kind of language model talky that was trained up in, I think, the 1930s or something. And I tried to play around with it a lot. It was like, let&#8217;s try to get it to create something new, and it&#8217;s pretty tricky. Although they have apparently recently, some people kind of fine-tuned it on a very few examples of coding and gotten it to be good at coding. But for some reason, that doesn&#8217;t impress me maybe as much as other things I would&#8217;ve expected. It&#8217;s like [laughs] there&#8217;s the-I agree that the goalposts also kind of move a little bit over time, and it&#8217;s also maybe unfair of me. It&#8217;s like, oh, well, can it create a new programming language from scratch or something? </p><p>00:06:37,272 --&gt; 00:06:43,052 [Seb Krier]</p><p>So it&#8217;s a tricky one to kind of square off, but it does still feel like there&#8217;s a lack of that kind of true creativity, at least in my </p><p>00:06:44,212 --&gt; 00:06:45,072 [Seb Krier]</p><p>interactions with them. </p><p>00:06:46,392 --&gt; 00:06:57,342 [Andrey]</p><p>I am really worried that it is a goalpost moving exercise here. We don&#8217;t have a benchmark for creativity and therefore, </p><p>00:06:58,432 --&gt; 00:07:03,211 [Andrey]</p><p>all these claims are not quantitative in a way that I&#8217;d like. And let- </p><p>00:07:03,212 --&gt; 00:07:10,612 [Seth]</p><p>Right. What about all those IS papers we see where one of the axes is creativity and we instrument for something? [laughs] </p><p>00:07:10,612 --&gt; 00:07:11,032 [Andrey]</p><p>Yes. </p><p>00:07:13,132 --&gt; 00:07:13,592 [Seth]</p><p>There&#8217;s a lot of bad measures of creativity. </p><p>00:07:13,592 --&gt; 00:07:19,762 [Andrey]</p><p>Those are not creative, to be clear. I&#8217;m sure I&#8217;ve offended a ton of people. Sorry. </p><p>00:07:19,762 --&gt; 00:07:20,992 [Seth]</p><p>It&#8217;s okay. </p><p>00:07:20,992 --&gt; 00:07:56,432 [Seb Krier]</p><p>I think it&#8217;s fair. I agree that it&#8217;s a bit like... But I still feel like there&#8217;s, at least if part of the reason you&#8217;re going to create these systems is to come up with kind of also new sorts of theories and so on. And I think you can probably get that through good search and a lot of inference compute and trying out lots of different things. And I think there are many low-hanging fruits there, to be clear. So it&#8217;s not like I think, oh, we&#8217;ve hit some sort of wall or something. And I think there&#8217;s a lot that you can kind of get in terms of new knowledge and new creative knowledge from that. But I feel like there&#8217;s maybe something more needed. It&#8217;s maybe not that kind of magical or anything, right? Maybe you just need better scaffolding or better multi-agent systems. But </p><p>00:07:58,992 --&gt; 00:08:02,072 [Seb Krier]</p><p>yeah, at least so far, I would say that I see a bit more creativity, say, in </p><p>00:08:03,652 --&gt; 00:08:11,612 [Seb Krier]</p><p>humans so far as a collective. And maybe that&#8217;s, again, an unfair comparison. You don&#8217;t have a culture of AIs and AGIs to compare that against. So- </p><p>00:08:11,612 --&gt; 00:08:11,682 [Andrey]</p><p>Yeah </p><p>00:08:11,682 --&gt; 00:08:15,092 [Seb Krier]</p><p>... the right comparison is also a hard one to do. </p><p>00:08:15,092 --&gt; 00:08:52,772 [Andrey]</p><p>So, you mentioned scaffolding, and I guess a question, you recently wrote about a defense of scaffolding, and I think just to frame things, some people you talk with, especially very AGI-pilled people, are like, &#8220;Scaffolding, it&#8217;s an epiphenomenon. It doesn&#8217;t matter. In the end, we are going to train a smarter model with more parameters and more training data, and it&#8217;s just going to do it out of the box. And so all these scaffolding hacks are just very temporary.&#8221; And then other people like yourself, I guess, argue the opposite. So what do you think about scaffolding? </p><p>00:08:54,832 --&gt; 00:08:55,052 [Seb Krier]</p><p>Yeah. </p><p>00:08:56,572 --&gt; 00:08:59,372 [Seb Krier]</p><p>The first thing is I&#8217;m definitely not sure. This is kind of </p><p>00:09:00,532 --&gt; 00:09:39,672 [Seb Krier]</p><p>one of many hot takes, but I think, I guess there are a few reasons why I see it as, I think it&#8217;s going to stay over time. The first is that I think it&#8217;s plausible that as, I think scaling laws continue, I think you scale models and they get better over time and so on, but I think the inputs are expensive and grow over time. And I also think that it&#8217;s plausible that you might get more and more diminishing returns over time. And if that&#8217;s the case, I see the kind of utility of the scaffolding side and the harnesses as going up because you&#8217;re going to want to make more, you&#8217;ll want more bang for your buck kind of thing. You&#8217;re going to want to extract this intelligence and use this resource as efficiently as possible. </p><p>00:09:40,772 --&gt; 00:09:51,532 [Seb Krier]</p><p>So that&#8217;s maybe one reason. The other one is a bit more, I guess, Hayekian in nature or something, in that I see a lot of, I think there&#8217;s a lot of local knowledge, a lot of </p><p>00:09:53,212 --&gt; 00:10:18,592 [Seb Krier]</p><p>stuff that isn&#8217;t necessarily kind of codified. And I don&#8217;t really see one big giant AGI model now kind of perfectly guessing everything forever at infinite scales. And in a way, I see this as a little bit like a division of labor in that I think it&#8217;s actually more efficient to have this kind of integration layer that is closer to the local information or to the ground or to demand side that can better integrate this kind of cognitive resource </p><p>00:10:19,812 --&gt; 00:10:23,632 [Seb Krier]</p><p>to satisfy and create value and satisfy whatever consumers and businesses want. </p><p>00:10:25,552 --&gt; 00:10:31,352 [Seb Krier]</p><p>So to help with all the sorts of constraints and the context they&#8217;re dealing with, I think it&#8217;s very useful to have that. </p><p>00:10:33,712 --&gt; 00:10:39,112 [Seb Krier]</p><p>Of course, I don&#8217;t think this necessarily also implies or means that you&#8217;re going to get complete, full decentralization or something. </p><p>00:10:40,772 --&gt; 00:10:42,212 [Seb Krier]</p><p>Walmart gets huge </p><p>00:10:43,872 --&gt; 00:10:48,872 [Seb Krier]</p><p>returns from the scale that they have, and you don&#8217;t have loads of businesses downstream kind of reselling their stuff. </p><p>00:10:51,252 --&gt; 00:10:53,932 [Seb Krier]</p><p>But there&#8217;s two things. The first is that- </p><p>00:10:53,932 --&gt; 00:10:56,812 [Seth]</p><p>We have bodegas reselling stuff from Walmart on the corner. </p><p>00:10:56,812 --&gt; 00:11:18,992 [Seb Krier]</p><p>Actually, that&#8217;s a good point, yeah. And also, there are all sorts of other businesses kind of selling different things, right? If the task is generic and the demand is homogenous, then sure, maybe you can do more of that. But also, even Walmart relies on all sorts of kind of suppliers, local labor, compliance system, inventory systems, third parties, and whatnot, that help with this kind of integration and the delivery of these services. </p><p>00:11:18,992 --&gt; 00:11:25,862 [Seth]</p><p>So if I may summarize your answer, you&#8217;re very Hayek-pilled, but maybe not as Bitterlesson-pilled as most. </p><p>00:11:25,862 --&gt; 00:11:25,972 [Seb Krier]</p><p>Well, </p><p>00:11:27,212 --&gt; 00:11:31,052 [Seb Krier]</p><p>I think I&#8217;m definitely Bitterlesson-pilled in the sense that I don&#8217;t think you should </p><p>00:11:33,652 --&gt; 00:11:48,992 [Seb Krier]</p><p>try to kind of cement some sort of rules-based system you either devise or something and kind of hope that this just takes forever. If anything, I think the scaffold needs to be a lot more adaptive and evolve over time. In the same way as if you have a small startup and they have all sorts of kind of rules and, </p><p>00:11:50,332 --&gt; 00:12:02,772 [Seb Krier]</p><p>sorry, not rules, different functions. When the startup grows and gets more capabilities, they also kind of change from the inside. So I think that, of course, if you have some sort of light GPT-type wrapper that kind of makes your system a little bit better, whatever, yeah, that was not going to </p><p>00:12:03,812 --&gt; 00:12:23,652 [Seb Krier]</p><p>work out over time. But I think there are kind of scaffolds that help better integrate the wider environment, private data, deals with permissions or liability regimes or user preferences and whatnot. And also, at a somewhat higher level, kind of more coordination-type scaffolds maybe in terms of market interfaces, like clearing house equivalents or something.</p><p>00:12:24,516 --&gt; 00:12:33,536 [Seth]</p><p>The third example you gave is maybe it&#8217;s not the super frontier model that are going to these scaffolds, but simpler models that are still very useful and cheaper to run with a scaffold. </p><p>00:12:33,536 --&gt; 00:12:46,176 [Seb Krier]</p><p>Yeah, totally. Because I think you&#8217;re not going to need the enormous, super expensive brain for every single random task. And so it&#8217;ll make, for most kind of basic queries, people aren&#8217;t using Opus&#8217;s latent space or something as- </p><p>00:12:46,176 --&gt; 00:12:46,186 [Seth]</p><p>[laughing] </p><p>00:12:46,186 --&gt; 00:12:48,236 [Seb Krier]</p><p>... it&#8217;s a big waste in some sense. </p><p>00:12:48,236 --&gt; 00:12:50,036 [Seth]</p><p>What toothbrush should I buy? [chuckles] </p><p>00:12:50,036 --&gt; 00:12:51,196 [Seb Krier]</p><p>Yeah. Exactly. </p><p>00:12:51,196 --&gt; 00:12:53,896 [Andrey]</p><p>Wait. That is an important question, Seth. </p><p>00:12:53,896 --&gt; 00:12:54,516 [Seb Krier]</p><p>I mean- </p><p>00:12:54,516 --&gt; 00:12:56,536 [Andrey]</p><p>I would definitely use Opus for that. </p><p>00:12:56,536 --&gt; 00:12:57,385 [Seb Krier]</p><p>It&#8217;s funny because I&#8217;ve actually- </p><p>00:12:57,385 --&gt; 00:12:59,696 [Seth]</p><p>Use all the collective intelligence of reality. [chuckles] </p><p>00:12:59,696 --&gt; 00:13:02,266 [Seb Krier]</p><p>I have actually used Opus for that exact question not long ago- </p><p>00:13:02,266 --&gt; 00:13:02,626 [Seth]</p><p>[laughing] </p><p>00:13:02,626 --&gt; 00:13:06,256 [Seb Krier]</p><p>... in trying out this new electric toothbrush that I found out as a result. But, </p><p>00:13:07,636 --&gt; 00:13:22,076 [Seb Krier]</p><p>so yeah, I agree there&#8217;s that and also there&#8217;s all sorts of ways in which actually kind of using tools or specialized kind of tools is just more effective and more efficient. Why would you expect a large model or something to kind of calculate things innately or something when you can just access a calculator? It&#8217;s a much better use of tokens. </p><p>00:13:22,076 --&gt; 00:13:36,856 [Andrey]</p><p>But it should kind of know that the calculator is available and then use it when it&#8217;s there. So that&#8217;s the argument against scaffolding, or you&#8217;re giving it a general environment, but you&#8217;re not scaffolding it much. I think a curious thing is just, </p><p>00:13:38,376 --&gt; 00:13:40,356 [Andrey]</p><p>it seems like most people who are using </p><p>00:13:41,416 --&gt; 00:13:49,156 [Andrey]</p><p>scaffolded agents today are using them with essentially one of two scaffolds, with Cloud Code or Codex. And </p><p>00:13:50,236 --&gt; 00:14:00,475 [Andrey]</p><p>those seem to be good enough maybe. I guess, do we see a lot of people customizing, a lot of people, whatever, companies customizing their scaffolds? </p><p>00:14:00,476 --&gt; 00:14:03,856 [Seth]</p><p>CladBot, do the CladBots count as that, I guess? </p><p>00:14:03,856 --&gt; 00:14:04,236 [Andrey]</p><p>Yeah. </p><p>00:14:05,396 --&gt; 00:14:39,676 [Seb Krier]</p><p>They are a form of it. I don&#8217;t know. I think a lot of power users and people in our immediate communities use a lot of Cloud Code and Codex, and particularly software engineers. But I don&#8217;t think most legal departments and most kind of firms out there are necessarily using Cloud Code either. And it&#8217;s not clear to me that this is necessarily the optimal interface or, there may be better systems that are Cloud Code-like, or CLI-like perhaps in some way. But, so I don&#8217;t know, maybe they&#8217;re sufficient, but even these tools end up kind of calling on loads of other external APIs and tools and so on in how they </p><p>00:14:40,836 --&gt; 00:14:57,576 [Seb Krier]</p><p>function. So if anything, these are actually scaffolds. You&#8217;re not kind of calling the model directly. There&#8217;s all sorts of different sub-agents behind the scenes. It&#8217;s not just a one-shot call. There&#8217;s quite a lot going on, which is in fact this more, I don&#8217;t know, dynamic scaffolding thing I was mentioning earlier, I guess. </p><p>00:14:58,976 --&gt; 00:15:06,736 [Andrey]</p><p>Okay. The natural question here is, what is going to be the role of the market in coordinating- </p><p>00:15:06,736 --&gt; 00:15:07,375 [Seb Krier]</p><p>Mm </p><p>00:15:07,375 --&gt; 00:15:11,276 [Andrey]</p><p>... AI here? And I&#8217;ll just very shamelessly plug- </p><p>00:15:11,276 --&gt; 00:15:11,285 [Seb Krier]</p><p>[chuckles] </p><p>00:15:11,285 --&gt; 00:15:24,796 [Andrey]</p><p>... some recent work with Rohit Krishnan, where we&#8217;re kind of playing around with the idea of LLMs bidding in a procurement auction and seeing whether that results in more efficient use of AI. </p><p>00:15:26,696 --&gt; 00:15:29,655 [Seb Krier]</p><p>Well, first of all, I need to properly read that again. But the- </p><p>00:15:29,655 --&gt; 00:15:30,476 [Andrey]</p><p>[laughing] </p><p>00:15:30,476 --&gt; 00:15:31,016 [Seb Krier]</p><p>In terms of, </p><p>00:15:32,496 --&gt; 00:15:32,916 [Seb Krier]</p><p>I guess, </p><p>00:15:34,556 --&gt; 00:15:46,396 [Seb Krier]</p><p>at a very high level, markets are good at just coordinating in general, including AI. And so, assuming they function as intended in it, you&#8217;ve got the pricing mechanism to get... </p><p>00:15:47,556 --&gt; 00:15:49,396 [Seb Krier]</p><p>I don&#8217;t know. I expect that to kind of work as well with </p><p>00:15:50,476 --&gt; 00:15:52,616 [Seb Krier]</p><p>matching, I guess, supply and demand or something. </p><p>00:15:54,016 --&gt; 00:15:55,196 [Seb Krier]</p><p>The supply of this </p><p>00:15:56,216 --&gt; 00:16:00,036 [Seb Krier]</p><p>raw resource of cognition or something, and the demand of all sorts of different businesses and users. </p><p>00:16:01,696 --&gt; 00:16:05,516 [Seb Krier]</p><p>So maybe, at a very high level, I don&#8217;t know. What exactly do you mean by the role of the market or something here? </p><p>00:16:09,076 --&gt; 00:16:21,356 [Andrey]</p><p>Obviously the market is involved in many parts of the AI vertical supply chain, right? From competition in chips. There&#8217;s competition between models. There might be also competition between </p><p>00:16:22,516 --&gt; 00:16:28,576 [Andrey]</p><p>scaffolds, bundles of environments, scaffolds, and LLMs. </p><p>00:16:28,576 --&gt; 00:17:06,496 [Seth]</p><p>I guess maybe it would be useful to juxtapose this versus, so what Andrey, one of the things he&#8217;s imagining is, I have a job. I post it to some sort of Upwork-like future platform. Different companies that host different AI models bid to do that job. &#8220;Oh, I think I can do that job with $1 of electricity and tokens,&#8221; versus another model, and then we get efficient allocation of intellectual tasks to models, right? So do we think that that&#8217;s going to be important, or is it going to be more like I ask the super model what the best model is, and I just get allocated in a non-market way? Might be one version of this question. </p><p>00:17:08,156 --&gt; 00:17:18,836 [Seb Krier]</p><p>I guess intuitively, my mind goes to the former question. But, or there&#8217;s a little bit of both in some sense, because even in the former one, you&#8217;re going to be using the large model for some sort of </p><p>00:17:20,436 --&gt; 00:17:26,686 [Seb Krier]</p><p>cognitively demanding task or something. It kind of depends what kind of quality of output you also need and want. </p><p>00:17:26,686 --&gt; 00:17:26,706 [Seth]</p><p>[chuckles] </p><p>00:17:26,706 --&gt; 00:17:27,056 [Seb Krier]</p><p>But then </p><p>00:17:28,376 --&gt; 00:17:49,636 [Seb Krier]</p><p>you&#8217;re still going to be constrained by your own resources or something, and depending on what you have to spend, if you can get the output for cheaper by kind of relying on this kind of competitive marketplace of smaller models or something, not even smaller models, they might just be all be big and kind of just scaffolding different, you&#8217;re offering a slightly different thing. Why wouldn&#8217;t you go for that, and why wouldn&#8217;t that exist in the first place? Unless the very first- </p><p>00:17:49,636 --&gt; 00:17:52,216 [Andrey]</p><p>Doesn&#8217;t exist yet, just to be clear. </p><p>00:17:52,216 --&gt; 00:17:52,716 [Seb Krier]</p><p>Um- </p><p>00:17:52,716 --&gt; 00:17:58,416 [Seth]</p><p>A, it doesn&#8217;t exist yet, and as Andrey proves, at least current models are bad at understanding their own capabilities. </p><p>00:17:58,416 --&gt; 00:17:58,666 [Andrey]</p><p>Oh, yeah. </p><p>00:17:58,666 --&gt; 00:18:00,496 [Seth]</p><p>Now maybe that&#8217;s going to be fixed. </p><p>00:18:00,496 --&gt; 00:18:08,096 [Seb Krier]</p><p>Yeah. Oh, no, I agree. I think that we&#8217;re not there yet, right? I think, again, and that goes back to the earlier AGI question, is there&#8217;s all sorts of, then again, what&#8217;s the right comparator? But, </p><p>00:18:09,476 --&gt; 00:18:21,316 [Seb Krier]</p><p>yeah, I don&#8217;t think we&#8217;re exactly there. Yeah, I think a lot of this will have to be built as well. The kind of an ability for a model to just better kind of operate in a more multi-agent environment, kind of have a better sense of </p><p>00:18:22,596 --&gt; 00:18:32,556 [Seb Krier]</p><p>delegation. I think the kind of, yeah, industrial intelligence or something seems to be maybe more neglected, as opposed to just single-agent intelligence or something, if that makes sense. </p><p>00:18:32,556 --&gt; 00:18:34,776 [Seth]</p><p>Do we need to bring the word cybernetics back? </p><p>00:18:34,776 --&gt; 00:18:35,496 [Seb Krier]</p><p>Yeah. </p><p>00:18:35,496 --&gt; 00:18:36,116 [Andrey]</p><p>[laughs] </p><p>00:18:36,116 --&gt; 00:18:38,816 [Seb Krier]</p><p>Somewhat. [laughs]</p><p>00:18:40,756 --&gt; 00:18:51,256 [Andrey]</p><p>All right. A little change in subject, but I know this has been in the discourse, the topic of recursive self-improvement, RSI. </p><p>00:18:51,256 --&gt; 00:18:52,956 [Seth]</p><p>Ooh, very scary. </p><p>00:18:52,956 --&gt; 00:18:54,896 [Andrey]</p><p>Jack Clark recently had an essay about it. </p><p>00:18:56,376 --&gt; 00:18:58,876 [Andrey]</p><p>Seb, what is your take? </p><p>00:18:58,876 --&gt; 00:18:59,206 [Seb Krier]</p><p>[chuckles] </p><p>00:19:00,316 --&gt; 00:19:07,896 [Seb Krier]</p><p>What is my take? I don&#8217;t know. I think it depends what exactly we mean by recursive self-improvement. </p><p>00:19:09,096 --&gt; 00:19:50,336 [Seb Krier]</p><p>I had a blog post not long ago, I guess, when trying to disentangle a little bit what I have in mind when I think about this. On the one hand, there&#8217;s the model getting recursively better through the usage of more AI and whatnot. And on the other hand, there&#8217;s the more kind of societal side of things, the transformation side, which I think very often, these two worlds are a little bit blurred in the discourse. It&#8217;s like, oh, you get RSI, and then X, Y, Z about the world or something. Things go really fast or they don&#8217;t go fast. And, I think these should be separated very neatly because on the model side, of course, I expect, already there&#8217;s a lot of AI being used everywhere to kind of create models. And I expect that to continue. </p><p>00:19:52,536 --&gt; 00:19:55,976 [Seb Krier]</p><p>But it&#8217;s not clear to me that this necessarily now leads to a dynamic by which </p><p>00:19:57,156 --&gt; 00:20:16,596 [Seb Krier]</p><p>the model now gets extremely or exponentially intelligent in a very short amount of time. It&#8217;s still kind of bottlenecked by all sorts of resources. And as I was saying earlier, I still see them as better at kind of paradigm exploitation than kind of exploration, which I think is the thing you might need to get to the next step. But, first of all, what do I know? But secondly, </p><p>00:20:17,616 --&gt; 00:20:19,986 [Seb Krier]</p><p>the other thing is, yeah, on the societal side of things, </p><p>00:20:20,996 --&gt; 00:20:29,756 [Seb Krier]</p><p>people sometimes talk about foom or hard takeoffs and whatnot, and these have very clear kind of real-life implications. It&#8217;s not just kind of a model of getting better in a </p><p>00:20:31,216 --&gt; 00:20:34,576 [Seb Krier]</p><p>data center somewhere. And that side, I think, is where you have to think about </p><p>00:20:36,116 --&gt; 00:21:27,056 [Seb Krier]</p><p>[lip smack] all the kind of usual bottlenecks, adoption, deployment, diffusion, the kind of productive integration of all these systems at scale, both in terms of manufacturing and so on and so forth. And, I guess it&#8217;s not clear to me that the shift from GPT-2 to GPT-3 or coming up with kind of, we&#8217;re just very classic kind of software engineering, meat and potatoes type tasks that you can just easily just automate away. It&#8217;s maybe one of these things that&#8217;s maybe easy to say ex post, but, I&#8217;m not sure. And certainly, my expectation is you&#8217;re going to get loads of gains in the coming years of kind of automating part of that pipeline. But that seems good. You just get better models, and that&#8217;s just overall helpful for all sorts of other things, even if you&#8217;re doing safety work and kind of governance work and whatnot, we benefit a lot from that cognitive resource, I guess. </p><p>00:21:27,056 --&gt; 00:21:40,696 [Andrey]</p><p>What would happen in the world for you to change your mind? Is there any, let&#8217;s say that recursive self-improvement is actually kind of this much more profound change than you&#8217;re painting. </p><p>00:21:41,816 --&gt; 00:21:42,036 [Andrey]</p><p>What </p><p>00:21:44,136 --&gt; 00:21:45,696 [Andrey]</p><p>signs would there be, I guess? Yeah. </p><p>00:21:45,696 --&gt; 00:21:51,656 [Seb Krier]</p><p>But to be clear, I&#8217;m not claiming it&#8217;s just business as usual, nothing to see here or whatever, right? I&#8217;m </p><p>00:21:52,796 --&gt; 00:22:14,936 [Seb Krier]</p><p>kind of just claiming that some of the stronger versions of the claim aren&#8217;t kind of self-evident. And so I see a lot of this happening in some sense. Certainly, in 10 years, I expect to have larger kind of more, again, acceleration of economic growth and whatnot and kind of faster diffusion across the board. I certainly don&#8217;t expect diffusion to take the same amount of time as, say, electricity or these other technologies. </p><p>00:22:16,576 --&gt; 00:22:23,236 [Seb Krier]</p><p>So it depends what exactly you mean, because what specifically am I looking to change my mind on? </p><p>00:22:23,296 --&gt; 00:22:30,656 [Andrey]</p><p>Well, let&#8217;s say the scenarios of AI 2027, right? Presumably, </p><p>00:22:31,996 --&gt; 00:22:45,176 [Andrey]</p><p>in 2027, you&#8217;ll see something that&#8217;s like, &#8220;Oh, wow, I was wrong. This is not going to be so gradual. This is going to be this sudden foom,&#8221; that you&#8217;re criticizing. Yeah. </p><p>00:22:45,176 --&gt; 00:22:52,236 [Seb Krier]</p><p>The original foom or hard takeoff definition literally talks about this change happening within hours or days. </p><p>00:22:52,236 --&gt; 00:22:53,236 [Andrey]</p><p>[chuckles] </p><p>00:22:53,236 --&gt; 00:22:56,056 [Seb Krier]</p><p>Which is not even, it&#8217;s not what the 2027 scenario, I think, predicts. </p><p>00:22:56,056 --&gt; 00:22:56,296 [Andrey]</p><p>Yes. </p><p>00:22:57,556 --&gt; 00:23:00,446 [Seb Krier]</p><p>But the 2027 scenario, from what I remember, again, it&#8217;s been a bit of time now. </p><p>00:23:01,796 --&gt; 00:23:08,816 [Seb Krier]</p><p>One thing with the scenarios there is that there&#8217;s the kind of misalignment assumption, and which I&#8217;m kind of uncertain about. </p><p>00:23:08,816 --&gt; 00:23:09,255 [Andrey]</p><p>Mm. </p><p>00:23:09,256 --&gt; 00:23:17,296 [Seb Krier]</p><p>And it also talks about a lot of progress in robotics, which I think is a bit further away. I think it&#8217;s close. We&#8217;re getting there, too. </p><p>00:23:19,116 --&gt; 00:23:19,476 [Seb Krier]</p><p>But </p><p>00:23:21,156 --&gt; 00:23:25,916 [Seb Krier]</p><p>I don&#8217;t know. Probably kind of AI, if in 2030, we start seeing AI is making all sorts of crazy </p><p>00:23:26,956 --&gt; 00:24:06,196 [Seb Krier]</p><p>inventions, innovations in fields other than just kind of perhaps math and coding across the boards, and I&#8217;m like, okay, this is clearly-- And you get extremely fast adoption, too, right? You have entire businesses doing completely, it&#8217;s not business as usual, clearly, in the economy or something and wide adoption. But it&#8217;s hard to say because I expect all that to some degree, right? It&#8217;s not that I&#8217;m saying, &#8220;Oh, this is never going to happen.&#8221; I just think of it as a little bit more elongated and the implications of that being maybe not as like, we have Dyson spheres in five years or something like that, so. It&#8217;s more of a disagreement maybe on the extremes or the margins or something, but not so much at the core of the claim that yes, models are going to make models better and... </p><p>00:24:07,276 --&gt; 00:24:27,536 [Seb Krier]</p><p>But, again, even having-- In fact, actually, here would be a thing. If Anthropic or DeepMind or something in 2037 have fewer and fewer employees, fewer people kind of just doing AI research, engineers and so on, you&#8217;re clearly seeing kind of that profession. Because of course, I can imagine these jobs to change, right? Maybe you&#8217;re kind of managing more agents or something. That </p><p>00:24:28,616 --&gt; 00:24:35,966 [Seb Krier]</p><p>I expect. But the fact that you just need far fewer people to kind of do not only these large training runs, but the kind of </p><p>00:24:36,976 --&gt; 00:24:43,476 [Seb Krier]</p><p>large training runs that give you just much, much better systems, then I think I&#8217;d be like, okay, this is going a little bit faster than maybe expected or something.</p><p>00:24:44,656 --&gt; 00:24:51,676 [Andrey]</p><p>Okay. One thing you mentioned in that kind of hints at another hot take you have, which is about alignment. </p><p>00:24:51,676 --&gt; 00:24:52,026 [Seb Krier]</p><p>Uh-huh. </p><p>00:24:54,596 --&gt; 00:24:55,926 [Andrey]</p><p>What&#8217;s the deal with alignment? </p><p>00:24:57,196 --&gt; 00:24:58,086 [Andrey]</p><p>[laughs] </p><p>00:24:58,086 --&gt; 00:24:58,136 [Seb Krier]</p><p>[laughs] </p><p>00:24:58,136 --&gt; 00:25:02,136 [Seth]</p><p>Is it hard? Is it easy? Is it different than we would&#8217;ve expected going in? </p><p>00:25:02,136 --&gt; 00:25:19,646 [Seb Krier]</p><p>Yeah. It&#8217;s perhaps that. I think my take about alignment is something-- Well, first of all, I just don&#8217;t like the word. I think it&#8217;s a bit of an annoying word because it&#8217;s being used for all sorts of things. The AI says something that we just kind of don&#8217;t like, or you say, &#8220;Oh, it&#8217;s misaligned.&#8221; No one pre-registers what they expect the aligned behavior to be, and then just kind of tests. </p><p>00:25:19,646 --&gt; 00:25:20,116 [Andrey]</p><p>[laughs] </p><p>00:25:20,116 --&gt; 00:25:35,626 [Seb Krier]</p><p>But I think my general claim is maybe the fact that it&#8217;s been easier than we would&#8217;ve predicted a decade ago or so. Then when I first got into AI in 2017, that was partly as a result of reading things like &#8220;Superintelligence&#8221; by Bostrom. </p><p>00:25:35,626 --&gt; 00:25:36,236 [Andrey]</p><p>Mm-hmm. </p><p>00:25:36,236 --&gt; 00:25:48,496 [Seb Krier]</p><p>And you&#8217;d read these books, like Stuart Russell&#8217;s &#8220;Human Compatible&#8221; and others, that kind of had all these analogies like King Midas and you ask a system to optimize for goal X, and in pursuit of that goal, it does all sorts of other things that you don&#8217;t want it to do. </p><p>00:25:48,496 --&gt; 00:25:51,916 [Seth]</p><p>Right. The paperclip maximizer, and we seem to not have those. </p><p>00:25:51,916 --&gt; 00:25:57,476 [Seb Krier]</p><p>Yeah. It&#8217;s like one version of it or one variant of it. And certainly at the time you didn&#8217;t really have language models. A lot of these intuitions were kind of based off </p><p>00:25:58,596 --&gt; 00:26:48,236 [Seb Krier]</p><p>reinforcement learning systems in very basic kind of game scenarios where they were actually given a single goal to optimize for. And this is not actually what we do, I think, with models. And you had these kind of examples, even the value loading problem was something discussed at the time where actually specifying these complicated nuanced human values in mathematical terms would be extremely hard. So even if you managed to tell a robot to clean the room, it would then just pick up a baby and put it in the trash or something. And I think it turns out a lot of this stuff is actually much easier. You have problems. You&#8217;ve got things like reward hacking. You&#8217;ve got AIs behaving in weird ways that we were not always kind of anticipating because of the ways they were post-trained. So my claim is not like, oh, again, it&#8217;s all fine, and safety is a scam or whatever. It&#8217;s more that it&#8217;s certainly much easier than, or at least we&#8217;re in a much better track than I would&#8217;ve at least guessed perhaps a decade ago. And secondly, I think it </p><p>00:26:49,916 --&gt; 00:26:54,816 [Seb Krier]</p><p>just seems tractable. There&#8217;s a lot of progress in terms of chain-of-thought monitoring and all these other things. And </p><p>00:26:56,696 --&gt; 00:26:57,796 [Seb Krier]</p><p>I also think that the </p><p>00:26:59,016 --&gt; 00:27:05,825 [Seb Krier]</p><p>hard part is maybe more the kind of normative question of whose values and when, and what and everything. That&#8217;s the kind of thing that we&#8217;re looking into more. But </p><p>00:27:07,096 --&gt; 00:27:13,696 [Seb Krier]</p><p>yeah, I prefer the word actually instruction following or intent following or something instead of alignment. And I think by and large, they&#8217;re actually pretty good at that. </p><p>00:27:14,796 --&gt; 00:27:31,636 [Seb Krier]</p><p>So again, that doesn&#8217;t mean you have to dismiss all sorts of theories and all the kind of power optimization stuff. But I guess my immediate outcome is this goes rather well. Or if I am more concerned by other things like misuse, if you&#8217;d like, than kind of the AI&#8217;s being innately, inherently kind of internally misaligned. </p><p>00:27:31,636 --&gt; 00:28:03,676 [Seth]</p><p>This really seems related to your take that intelligence is not at odds with being a tool, right? So a lot of people have this intuition where if you had a super-duper intelligent genie or oracle, it would develop even implicitly some sort of value or goal that orthogonality thesis might have nothing to do with what we want. But you&#8217;re more optimistic about the idea that the LLM doesn&#8217;t want anything. It&#8217;s incorrect to take the intentional stance towards an LLM. </p><p>00:28:03,676 --&gt; 00:28:09,236 [Seb Krier]</p><p>Not incorrect. It&#8217;s actually kind of descriptively useful, even functionally sometimes to use that language. </p><p>00:28:10,796 --&gt; 00:28:18,836 [Seb Krier]</p><p>But that&#8217;s the thing, right? I think we kind of lack the language to properly delineate and differentiate when it&#8217;s useful to use that or appropriately descriptive and when it&#8217;s not. </p><p>00:28:20,076 --&gt; 00:28:41,496 [Seb Krier]</p><p>And so I agree that, of course, I think the take I had on this was something like, and I can imagine a tool being an agent and an agent being a tool. Or in principle, I can imagine something being hyper-capable and still being broadly instruction following rather than at a certain level of capability, aha, that&#8217;s when the goals change and things get... And it kind of depends on the type of system as well. I imagine not all </p><p>00:28:42,656 --&gt; 00:28:45,116 [Seb Krier]</p><p>paths lead to the same kind of outcome. But, </p><p>00:28:46,256 --&gt; 00:29:13,596 [Seb Krier]</p><p>so again, I can see plausible versions of the world where homo hundrio drives or something are a more salient feature of the way we kind of train models. Right now, it doesn&#8217;t seem to me very likely that this is a core feature that they have. But of course, it&#8217;s hard to kind of either prove or disprove, right? Because someone might just say, well, that&#8217;s because they&#8217;re very good at hiding this or something, or once they&#8217;re capable enough or whatever. So there&#8217;s always a bit of this kind of gotcha thing. It&#8217;s like deception. But </p><p>00:29:14,936 --&gt; 00:29:39,896 [Seb Krier]</p><p>yeah. So in principle, I guess I can totally conceive of at least a superintelligence that is controllable, that is benign, that is at least subservient to the goals of humanity or a user or principle or whatever. That could still be used to cause enormous harm, but it&#8217;s just I don&#8217;t necessarily think the analogies of, oh, I think Tegmark was thinking, look at the zoo where the monkey&#8217;s going. I think these are just not really </p><p>00:29:41,736 --&gt; 00:29:43,136 [Seb Krier]</p><p>helpful kind of analogies. </p><p>00:29:44,276 --&gt; 00:30:02,396 [Seth]</p><p>Monkey at the zoo, but you&#8217;ve also got the monkey&#8217;s paw, right? Maybe the reason some prefer alignment to instruction following is we all know the story of, be careful what you wish for. You wish for something, and it&#8217;s under-specified, and you get the bad version of it because the AI doesn&#8217;t understand the context. </p><p>00:30:02,396 --&gt; 00:30:08,336 [Seb Krier]</p><p>I think that&#8217;s why, yeah, I think maybe instruction following is maybe too... Intent following or something gets to it more. </p><p>00:30:09,936 --&gt; 00:30:18,316 [Seb Krier]</p><p>But of course, that problem doesn&#8217;t go, even if it follows intent or something, you could still have all the problems because your intent is nefarious or whatever. So </p><p>00:30:19,436 --&gt; 00:30:19,816 [Seb Krier]</p><p>I think the </p><p>00:30:21,356 --&gt; 00:31:06,756 [Seb Krier]</p><p>way you deal with that is all sorts of, I don&#8217;t know how to conceptualize it, but in fact scaffolds. It&#8217;s a bit more this outside of the model or something. I&#8217;m kind of almost indexing on a world that will indeed have agents that are trained to be bad or whatever, or someone going to be instructed to do bad things. But just like with humans, you come up with all sorts of kind of systems, rules, laws, norms, kind of protocols that either discourage the kind of bad behavior, or punishes it, or makes it just not worthwhile or something. But I&#8217;m not going to put all my bets on the, oh, it has to be pure-hearted, and that will be sufficient. And then you just scale it forever, and it&#8217;s going to be an amazing goal. I just think that the way of seeing or thinking about AI is that I just find kind of a bit </p><p>00:31:08,096 --&gt; 00:31:12,656 [Seb Krier]</p><p>too narrow, I guess. I think it&#8217;s important, it&#8217;s just insufficient, and it&#8217;s certainly not my main kind of a-- yeah.</p><p>00:31:14,946 --&gt; 00:31:15,206 [Andrey]</p><p>Okay. </p><p>00:31:16,666 --&gt; 00:31:20,086 [Andrey]</p><p>Our audience is very much composed of economists. </p><p>00:31:22,586 --&gt; 00:31:30,506 [Andrey]</p><p>If you&#8217;re an economist and you&#8217;re very interested in AI, what sort of work would you be trying to do? </p><p>00:31:30,506 --&gt; 00:31:32,146 [Seth]</p><p>Maybe to be useful to AI people- </p><p>00:31:32,146 --&gt; 00:31:32,216 [Andrey]</p><p>Yes </p><p>00:31:32,216 --&gt; 00:31:37,466 [Seth]</p><p>... in particular. What would you want, what did the DeepMind team want to read from economists? </p><p>00:31:37,466 --&gt; 00:32:20,766 [Seb Krier]</p><p>I think kind of engaging with their assumptions or something, right? If you assume, let&#8217;s say, an AG-- and I think some do, to be fair. I actually think there&#8217;s a lot more, I think, discourse now going on between economists and AI people, whatever. But assuming that you do have AI systems that are interchangeable or almost quasi-fully substitutable with humans, that come up with good ideas, that are parallelizable and whatnot, what does that change to your kind of growth function and so on? So, maybe that&#8217;s useful. Right now, in the short term, at least, there&#8217;s all sorts of questions around labor, there&#8217;s questions around productivity or adoption. Clearly, there&#8217;s useful work to be done there. But I think in terms of AGI specifically, given that a lot of the field just thinks you&#8217;re going to get to AGI in the next five to 10 years, </p><p>00:32:22,746 --&gt; 00:32:26,806 [Seb Krier]</p><p>what are the implications for taxation? What are the implications for </p><p>00:32:28,626 --&gt; 00:32:37,786 [Seb Krier]</p><p>how that&#8217;ll affect different states across the world? I think I&#8217;m probably more worried about a call center in Hyderabad than I am about the white-collar worker in North America or something. So, </p><p>00:32:39,066 --&gt; 00:32:57,306 [Seb Krier]</p><p>yeah. I think all these kind of questions, but just indexing more and making fewer, I guess, assumptions around the limits of capabilities. Because sometimes you see them kind of being implicitly snuck in somewhere or something of like, well, because AIs can&#8217;t do XYZ, therefore... And yeah, fine, but maybe they will do XYZ. And then what? How does that change your thinking? Yeah. </p><p>00:32:57,306 --&gt; 00:32:59,506 [Seth]</p><p>Maybe more scenario planning than, </p><p>00:33:00,526 --&gt; 00:33:04,746 [Seth]</p><p>here&#8217;s my median projection, or here is one projection I think is plausible. </p><p>00:33:04,746 --&gt; 00:33:22,846 [Seb Krier]</p><p>Yeah. And embedding the kind of thoughtful models and thinking that economists have within these scenarios and making them more salient to the kind of computer scientists, right? Even when I brought up competitive advantage, people will be like, &#8220;Oh, but what if the AI is cheaper and better?&#8221; It&#8217;s like, well, that&#8217;s not the point. The opportunity cost point of competitive advantage, there&#8217;s a difference. </p><p>00:33:22,846 --&gt; 00:33:23,286 [Andrey]</p><p>[laughs] </p><p>00:33:23,286 --&gt; 00:33:31,786 [Seb Krier]</p><p>And again, there are answers to that as well, but I think just kind of better translating, I think, some of these insights to the AI tribe, the thing is useful. </p><p>00:33:32,846 --&gt; 00:33:40,526 [Andrey]</p><p>So that&#8217;s very naturally leading us to this question about yourself. And you do lots of different things. </p><p>00:33:41,946 --&gt; 00:33:50,426 [Andrey]</p><p>You&#8217;re prolific on Twitter, for sure. But also, you&#8217;re doing internal work for DeepMind. How do you allocate your time? </p><p>00:33:52,066 --&gt; 00:33:52,166 [Seb Krier]</p><p>I don&#8217;t know. </p><p>00:33:52,166 --&gt; 00:33:53,266 [Seth]</p><p>What percentage is Twitter? </p><p>00:33:53,266 --&gt; 00:33:54,646 [Andrey]</p><p>Yeah. [laughs] </p><p>00:33:54,646 --&gt; 00:34:04,686 [Seb Krier]</p><p>Twitter is actually not that much today. It must be an hour max or something, an hour and a half, two hours, maybe, something. But that is maybe much by others&#8217; standards. But the- </p><p>00:34:04,686 --&gt; 00:34:06,476 [Andrey]</p><p>[laughs] What is the optimal amount of Twitter? [laughs] </p><p>00:34:06,476 --&gt; 00:34:29,866 [Seb Krier]</p><p>[laughs] Yeah. It&#8217;s the Pareto optimal. I guess, in my day-to-day work, it&#8217;s a mixture of proactive and reactive. Proactive in the sense that I think, oh, these questions of agents and cybersecurity and liability and whatnot, and biosecurity are kind of important things to look into, and therefore, there&#8217;s a lot of research that I do and colleagues do, and a lot of coordination across the org. </p><p>00:34:31,026 --&gt; 00:34:39,486 [Seb Krier]</p><p>But there&#8217;s also more reactive stuff because we&#8217;re a policy team, and so there&#8217;s things happening in the external world like CA 53, the preemption debates. </p><p>00:34:40,546 --&gt; 00:34:48,386 [Seb Krier]</p><p>So it&#8217;s a bit of a mix of that. And of course, all sorts of internal dynamics. But, yeah. I guess I&#8217;m curious about all sorts of other things, and so when I do have time, and I&#8217;ve kind of </p><p>00:34:50,006 --&gt; 00:34:58,106 [Seb Krier]</p><p>completed the main quests, I try to keep some time for other stuff I&#8217;m interested in. I work with some research teams and kind of look into what they&#8217;re into. I&#8217;ll </p><p>00:34:59,266 --&gt; 00:35:09,826 [Seb Krier]</p><p>find topics or themes that I think are maybe kind of neglected or underrated or I just don&#8217;t see out there as much, and like, &#8220;Oh, cool. We&#8217;re going to try to find out about this more.&#8221; But I think it&#8217;s just very kind of curiosity driven, and the allocation of time is </p><p>00:35:11,566 --&gt; 00:35:16,705 [Seb Krier]</p><p>not super thought out. It&#8217;s more like, oh, I think these things are interesting, and I&#8217;m going to get into that for a bit. [laughs] </p><p>00:35:16,706 --&gt; 00:35:22,306 [Andrey]</p><p>So it wasn&#8217;t a deliberate strategy of getting Tyler&#8217;s attention and adoration. [laughs] </p><p>00:35:22,306 --&gt; 00:35:25,126 [Seb Krier]</p><p>No, not at all. Not at all. But I&#8217;m very- </p><p>00:35:25,126 --&gt; 00:35:25,746 [Seth]</p><p>The long play </p><p>00:35:25,746 --&gt; 00:35:30,565 [Seb Krier]</p><p>... very grateful for his... [laughs] For the meme. But- </p><p>00:35:30,566 --&gt; 00:35:41,766 [Seth]</p><p>What kind of, but I know you can&#8217;t be specific, but for your sort of internal work, what does a work product look like? Are you participating in a meeting and giving hot takes? Are you writing internal memos? What is- </p><p>00:35:41,766 --&gt; 00:35:42,026 [Seb Krier]</p><p>Yeah </p><p>00:35:42,026 --&gt; 00:35:42,276 [Seth]</p><p>... in- </p><p>00:35:42,276 --&gt; 00:35:56,406 [Seb Krier]</p><p>It&#8217;s a mixture. Obviously, meetings. Any large bureaucracy will have meetings. But I think a lot of analysis, memos to execs sometimes. Just research, managing researchers sometimes, depending on the project. </p><p>00:35:57,626 --&gt; 00:36:04,106 [Seb Krier]</p><p>We&#8217;ll have a lot of coordination. Actually, I&#8217;m realizing through a lot of these kind of meetings, a lot of it is just kind of coordination and information transfer, right? </p><p>00:36:04,106 --&gt; 00:36:04,146 [Andrey]</p><p>[laughs] </p><p>00:36:04,146 --&gt; 00:36:07,006 [Seb Krier]</p><p>It&#8217;s maybe why I&#8217;m so obsessed with the Coasean bargaining thing. Just let- </p><p>00:36:07,006 --&gt; 00:36:07,326 [Seth]</p><p>Ah </p><p>00:36:07,326 --&gt; 00:36:08,546 [Seb Krier]</p><p>... the agents do it. But, </p><p>00:36:09,806 --&gt; 00:36:34,116 [Seb Krier]</p><p>yeah. I think the day-to-day work is a lot of reading, a lot of meetings, a lot of writing, and distilling and translating information, I think, across different tribes also. So if I&#8217;m talking to legal people, like lawyers, about what&#8217;s going on in, say, the more technical side of the org, or if I&#8217;m speaking to the researchers about something that&#8217;s more... But yeah, there&#8217;s a lot of translating of concepts across different stakeholders, I guess. </p><p>00:36:34,116 --&gt; 00:36:45,726 [Andrey]</p><p>So how does that work in an org like Google? Because I think in a lot of orgs, they&#8217;re really obsessed with KPIs and output metrics. </p><p>00:36:45,726 --&gt; 00:36:46,156 [Seb Krier]</p><p>Mm-hmm. </p><p>00:36:46,156 --&gt; 00:36:48,746 [Andrey]</p><p>And what you&#8217;re describing sounds very- </p><p>00:36:48,746 --&gt; 00:36:49,706 [Seth]</p><p>Hot takes per meeting. [laughs] </p><p>00:36:49,706 --&gt; 00:36:54,926 [Andrey]</p><p>Yeah. Very much amorphous, very hard to measure. </p><p>00:36:56,066 --&gt; 00:36:56,196 [Seb Krier]</p><p>Yeah. </p><p>00:36:56,196 --&gt; 00:37:00,606 [Andrey]</p><p>Obviously, you have a lot of external visibility, but is that </p><p>00:37:02,786 --&gt; 00:37:07,846 [Andrey]</p><p>a problem? Or is that just it&#8217;s understood that that&#8217;s how this goes? Yeah. </p><p>00:37:07,846 --&gt; 00:37:13,846 [Seb Krier]</p><p>I think the external stuff is kind of almost just very separate from the kind of day-to-day work side of things. </p><p>00:37:14,986 --&gt; 00:37:23,366 [Seb Krier]</p><p>And yeah, internally, we do have KPIs or equivalents or whatever. I think they may be less numerical in nature. But you might still have some, develop a consistent position on </p><p>00:37:24,506 --&gt; 00:37:30,819 [Seb Krier]</p><p>X issue or something in the next two, three months.And that requires a lot of research work, coordinating. </p><p>00:37:30,819 --&gt; 00:37:32,929 [Seth]</p><p>Have 10 opinions. [laughs] </p><p>00:37:32,930 --&gt; 00:37:38,100 [Seb Krier]</p><p>No, ideally they just want one. I think 10 opinions, that&#8217;s the issue. There are a lot of opinions out there. You&#8217;ve got to find the good ones. </p><p>00:37:38,100 --&gt; 00:37:39,530 [Seth]</p><p>That&#8217;s the main problem with economists. </p><p>00:37:39,530 --&gt; 00:37:42,350 [Seb Krier]</p><p>But [laughs] yeah. Exactly. Who was that quote? </p><p>00:37:43,830 --&gt; 00:37:44,290 [Seth]</p><p>Truman. </p><p>00:37:44,290 --&gt; 00:37:44,330 [Seb Krier]</p><p>Yeah. </p><p>00:37:44,330 --&gt; 00:37:46,210 [Seth]</p><p>Truman begged for the one-handed economist. </p><p>00:37:46,270 --&gt; 00:38:20,990 [Seb Krier]</p><p>Yeah, exactly. But, so I think, yeah, I think internally it&#8217;s just a kind of analysis or something. Say you&#8217;re thinking about, oh, agents and legal liability. How do these things work? What does the existing legal environment say and prescribe? What happens if something goes wrong? What are relevant factors? There&#8217;s a lot of that kind of thing. And I guess particularly within the DeepMind side, because when we&#8217;re on the frontier side, we&#8217;re thinking about the next five years as opposed to what&#8217;s going on right now. But yeah, the other side stuff is really just kind of out of personal interest and just me writing stuff, and they seem fine with it so far. [chuckles] </p><p>00:38:20,990 --&gt; 00:38:26,510 [Andrey]</p><p>What about... So we&#8217;ll be at a conference together, the Post-AGI conference- </p><p>00:38:26,510 --&gt; 00:38:26,830 [Seb Krier]</p><p>Ooh </p><p>00:38:26,830 --&gt; 00:38:28,370 [Andrey]</p><p>... at Lighthaven, Berkeley. </p><p>00:38:28,370 --&gt; 00:38:30,110 [Seth]</p><p>Ooh. Prestigious. </p><p>00:38:31,130 --&gt; 00:38:32,990 [Andrey]</p><p>I don&#8217;t know if it&#8217;s prestigious. </p><p>00:38:34,550 --&gt; 00:38:34,629 [Seth]</p><p>[laughs] </p><p>00:38:34,630 --&gt; 00:38:45,730 [Andrey]</p><p>But you&#8217;ve gone to a few of these conferences, like the Curve is another fairly well-known one. What&#8217;s your take on these? </p><p>00:38:45,730 --&gt; 00:38:54,750 [Seb Krier]</p><p>I think some are useful. The majority of conferences I go to, I don&#8217;t exactly find that life-transforming, I guess. </p><p>00:38:54,750 --&gt; 00:38:57,610 [Andrey]</p><p>[laughs] You&#8217;re going to the wrong conference. [laughs] </p><p>00:38:57,610 --&gt; 00:39:09,290 [Seb Krier]</p><p>I know. Can someone show me the... But I think, yeah, they obviously perform a social function to some degree, right? There&#8217;s a lot of meeting people, some networking or something, some kind of finding out new ideas. But </p><p>00:39:10,390 --&gt; 00:39:20,310 [Seb Krier]</p><p>my issue with conferences, very often they&#8217;re just very tame. They&#8217;re very risk-averse. They&#8217;re very the same ideas you&#8217;ve-- Already if you can read it online or something, it depends on the conference. But, </p><p>00:39:21,510 --&gt; 00:39:24,190 [Seb Krier]</p><p>although I have been to really good ones, too. There was this </p><p>00:39:25,570 --&gt; 00:39:43,529 [Seb Krier]</p><p>IMF conference with Econ Ty, with I think Anton Korinek and others had organized. And that was great because that was a nice one where you had both the technologists and a lot of economists and loads of presentations, and you got to learn lots of new things. But, in general, I don&#8217;t see a huge... Beyond maybe showing, again, some hot takes here and there. </p><p>00:39:45,370 --&gt; 00:39:49,990 [Seb Krier]</p><p>Yeah, some I assume are good conferences. [chuckles] </p><p>00:39:49,990 --&gt; 00:40:00,670 [Seth]</p><p>I&#8217;m just the exception, but you had a great joke on your Twitter the other day about this, which is, Caveman panelist one, &#8220;Fire is bad.&#8221; Caveman panelist two, &#8220;Fire is good.&#8221; </p><p>00:40:00,670 --&gt; 00:40:00,770 [Seb Krier]</p><p>Yeah. </p><p>00:40:00,770 --&gt; 00:40:02,100 [Seth]</p><p>Caveman panelist three, </p><p>00:40:03,450 --&gt; 00:40:07,120 [Seth]</p><p>&#8220;We need to balance the upsides and downsides of fire and use it wisely.&#8221; </p><p>00:40:07,120 --&gt; 00:40:07,320 [Seb Krier]</p><p>Absolutely. </p><p>00:40:07,320 --&gt; 00:40:09,620 [Seth]</p><p>Wild applause. [laughs] </p><p>00:40:09,620 --&gt; 00:40:09,650 [Andrey]</p><p>[laughs] </p><p>00:40:09,650 --&gt; 00:40:14,850 [Seb Krier]</p><p>Exactly. There&#8217;s a lot of that. That&#8217;s the energy that I&#8217;m getting very tired of because it&#8217;s- </p><p>00:40:14,850 --&gt; 00:40:15,050 [Seth]</p><p>[laughs] </p><p>00:40:15,050 --&gt; 00:40:21,700 [Seb Krier]</p><p>And I like playing the role of the wise centrist opinion, whatever. But it does get very- </p><p>00:40:21,700 --&gt; 00:40:23,150 [Seth]</p><p>You do get wild applause. </p><p>00:40:23,150 --&gt; 00:40:24,470 [Seb Krier]</p><p>Yeah. All the time. [chuckles] </p><p>00:40:26,490 --&gt; 00:40:29,770 [Seb Krier]</p><p>But yeah, I think there&#8217;s a lot of that. I wish there were more </p><p>00:40:30,810 --&gt; 00:40:35,090 [Seb Krier]</p><p>almost private Chatham House-y conferences, where you had people who highly disagreed with each other- </p><p>00:40:35,090 --&gt; 00:40:35,210 [Andrey]</p><p>Mm </p><p>00:40:35,210 --&gt; 00:40:36,770 [Seb Krier]</p><p>... but were polite and didn&#8217;t get at </p><p>00:40:37,950 --&gt; 00:40:49,370 [Seb Krier]</p><p>each other&#8217;s throats. And you had more setups that actually allowed ideas to clash a bit more, in a civilized way, of course. But that would be a bit hard, but also much more interesting, I think, than </p><p>00:40:51,490 --&gt; 00:40:55,390 [Seb Krier]</p><p>everyone broadly agreeing that it&#8217;s good to be good and it&#8217;s bad to be bad, and yeah. [chuckles] </p><p>00:40:55,390 --&gt; 00:41:03,710 [Andrey]</p><p>I do feel like the Lighthaven conferences are quite good for this, in that there&#8217;s an enormous amount of free time and- </p><p>00:41:03,710 --&gt; 00:41:04,130 [Seb Krier]</p><p>Mm-hmm </p><p>00:41:04,130 --&gt; 00:41:07,770 [Andrey]</p><p>... free space that&#8217;s not where the talk is happening. </p><p>00:41:07,770 --&gt; 00:41:07,940 [Seb Krier]</p><p>Yeah. </p><p>00:41:07,940 --&gt; 00:41:10,630 [Andrey]</p><p>And so you do get a lot of this. </p><p>00:41:10,630 --&gt; 00:41:11,040 [Seb Krier]</p><p>Well, yeah, I agree. </p><p>00:41:11,040 --&gt; 00:41:21,090 [Andrey]</p><p>But I agree that many conferences are not like that, where you&#8217;re just packed. You have a conference hall, and you don&#8217;t have anywhere else to go, and it&#8217;s packed with talks. Yeah. </p><p>00:41:21,090 --&gt; 00:41:21,710 [Seb Krier]</p><p>Yeah. No, totally. </p><p>00:41:21,710 --&gt; 00:41:23,550 [Seth]</p><p>NBER Summer Institute. [laughs] </p><p>00:41:24,750 --&gt; 00:41:28,330 [Andrey]</p><p>Seth, there is disagreement. Say what you will. At NBER- </p><p>00:41:28,330 --&gt; 00:41:28,540 [Seth]</p><p>There is fire </p><p>00:41:28,540 --&gt; 00:41:29,430 [Andrey]</p><p>... people throw down. </p><p>00:41:30,450 --&gt; 00:41:31,430 [Andrey]</p><p>[laughs] </p><p>00:41:31,430 --&gt; 00:41:37,720 [Seth]</p><p>[laughs] I&#8217;ve never seen a meaner comment than I have seen from a discussant at NBER Summer Institute. [laughs] </p><p>00:41:37,720 --&gt; 00:41:52,570 [Seb Krier]</p><p>[laughs] The Progress Conference, for example, last year, was one that I thought was really good. That was at Lighthaven, in fact. I think the setup and the kind of people and the curation and so just made it something that I found quite engaging. [upbeat music] </p><p>00:41:52,570 --&gt; 00:41:56,490 [Seth]</p><p>So you brought up this idea, as we were talking, about you </p><p>00:41:58,330 --&gt; 00:42:21,049 [Seth]</p><p>think there are so many meetings in your organization because it&#8217;s so hard, yet so critical to transfer information. And there&#8217;s this Coasean idea that so much of why the economy works the way it does is just the idea of transaction costs, right? In addition to kind of this Hayekian idea of local information that&#8217;s hard to share. </p><p>00:42:21,050 --&gt; 00:42:21,810 [Seb Krier]</p><p>Mm-hmm. </p><p>00:42:21,810 --&gt; 00:42:23,960 [Seth]</p><p>You have a very influential essay </p><p>00:42:25,130 --&gt; 00:42:30,230 [Seth]</p><p>that kind of maybe stole some of Andrey&#8217;s thunder, but is still an excellent essay- </p><p>00:42:30,230 --&gt; 00:42:31,040 [Seb Krier]</p><p>[laughs] </p><p>00:42:31,040 --&gt; 00:42:46,210 [Seth]</p><p>... about this idea of, well, what happens when AIs go out there and can micro-bargain costlessly with each other at high frequency over very, what might seem to us, small issues. </p><p>00:42:47,570 --&gt; 00:42:57,440 [Seth]</p><p>Tell us maybe in a few sentences, what&#8217;s that vision and what&#8217;s the positive vision for why that would be good for society, for us to have AI agents constantly bargaining for us over stuff? </p><p>00:42:59,130 --&gt; 00:43:01,810 [Seb Krier]</p><p>Yeah. I guess the idea is, as you mentioned, there&#8217;s all sorts of </p><p>00:43:03,990 --&gt; 00:43:26,350 [Seb Krier]</p><p>transaction costs that mean that we don&#8217;t get to bargain on things that we would otherwise bargain for. And instead, you get these blunt rules and these solutions that kind of work, but come with all sorts of externalities or aren&#8217;t super efficient. And so the idea is, if you can actually do this kind of negotiation at scale for very little, and that&#8217;s a big assumption. That&#8217;s not a given either, </p><p>00:43:27,850 --&gt; 00:43:35,586 [Seb Krier]</p><p>then you could solve all sorts of things thatAnd also just kind of problems that would otherwise not be even conceivable in the first place. </p><p>00:43:36,726 --&gt; 00:43:41,186 [Seth]</p><p>One example you give, just so we can be a little bit more specific, is noise standards, right? </p><p>00:43:41,186 --&gt; 00:43:41,456 [Seb Krier]</p><p>Right. </p><p>00:43:41,456 --&gt; 00:43:57,226 [Seth]</p><p>So you can&#8217;t throw a loud party after 10:00 PM in such and such a place. But you think that maybe AI agents could come to a less coarse rule that is, get us more to the grand coalition of allocative efficiency than a coarse rule like that. </p><p>00:43:57,226 --&gt; 00:44:01,166 [Seb Krier]</p><p>Yeah. To be fair, that&#8217;s probably a problem that no one really cares about except me because of like- [chuckles] </p><p>00:44:01,166 --&gt; 00:44:02,086 [Seth]</p><p>No. Dude. </p><p>00:44:02,086 --&gt; 00:44:03,645 [Andrey]</p><p>I care about it so much. </p><p>00:44:03,645 --&gt; 00:44:04,626 [Seb Krier]</p><p>Oh, really? Okay, cool. </p><p>00:44:04,626 --&gt; 00:44:04,746 [Andrey]</p><p>Yes. </p><p>00:44:04,746 --&gt; 00:44:07,816 [Seb Krier]</p><p>Maybe that&#8217;s a good example then. But yeah, the idea here is, </p><p>00:44:09,146 --&gt; 00:44:17,006 [Seb Krier]</p><p>my neighbor is throwing a party, and instead of there being some sort of rule that says you&#8217;re not allowed to throw parties after 11:00, he could maybe just compensate me for the noise or something. </p><p>00:44:18,326 --&gt; 00:44:21,686 [Seb Krier]</p><p>Or in fact, that&#8217;s one of the key crux of the whole Coasean thing is maybe </p><p>00:44:24,186 --&gt; 00:44:36,085 [Seb Krier]</p><p>I have to compensate him to stop his parties. And it kind of depends where the initial right is. But broadly, you could have these kind of, my whole neighborhood doesn&#8217;t want me to party, and they&#8217;re just giving me a small payment or the reverse, depending on where the initial allocation is. </p><p>00:44:37,226 --&gt; 00:44:44,446 [Seb Krier]</p><p>But I think you could have all sorts of micro ways in which these transaction costs at scale help you get much better beneficial outcomes. </p><p>00:44:45,486 --&gt; 00:44:48,486 [Seb Krier]</p><p>And so that would be the noise one would be like, okay. </p><p>00:44:50,406 --&gt; 00:45:18,666 [Seb Krier]</p><p>And it&#8217;ll probably just also let people kind of regroup into the party people just going into the neighborhood where that&#8217;s just generally more party tolerant or something, and the kind of peace and quiet preferring people just... Because I think one of the points with the piece was that AI also helps you coordinate better. You can use this stuff to find people who have the same interests and preferences as you or something, and just then bargain or negotiate or whatnot in that way as well. </p><p>00:45:20,626 --&gt; 00:45:27,386 [Seth]</p><p>So it&#8217;s not just bargaining over externalities that are negative, it&#8217;s maybe coordinating over positive externalities, right? </p><p>00:45:27,386 --&gt; 00:45:27,526 [Seb Krier]</p><p>Yeah. </p><p>00:45:28,766 --&gt; 00:45:51,746 [Seth]</p><p>What pieces do we need in the economy to make this a reality, and what time horizon are you thinking about? So obviously this is an idea that you could have a small version of, and then like the sci-fi, this is constantly, I&#8217;m allowed to speed in my car today because I really need to get to work because I&#8217;m late, and it&#8217;s bargaining with all the cars on the highway at ultra-high frequency. So what are the time horizons you have in mind, and what pieces do we need? </p><p>00:45:51,746 --&gt; 00:46:21,786 [Seb Krier]</p><p>Honestly, I haven&#8217;t even thought about the timelines really. [laughing] For me, this was mostly kind of an aspirational thing of like, well, it looks like we could unlock some cool things, and because there&#8217;s all these-- It&#8217;d be nice to have a positive vision of how things might pan out. It certainly doesn&#8217;t mean that everything has to be negotiated and bargained over. But I could see a large proportion of things, certainly in everyday life, like I could just tell my aunt, &#8220;You don&#8217;t have to worry about your parking issues anymore. It&#8217;s just sorted now,&#8221; whatever. The agents are taking care of that. And so it kind of depends on what scale you&#8217;re talking about. Certainly having democracy at scale and </p><p>00:46:23,626 --&gt; 00:46:29,086 [Seb Krier]</p><p>half automated and half made more efficient through these systems or something is something that I think is going to take a long time. </p><p>00:46:30,426 --&gt; 00:46:47,986 [Seb Krier]</p><p>But I can see smaller versions of it happening. The smallest version I can think of is just even basic calendar management. Until recently, that would be impossible to get automated really well. But gradually now I could expect an agent to really know my preferences, the context in which I operate, the hierarchical </p><p>00:46:49,006 --&gt; 00:47:12,466 [Seb Krier]</p><p>relationship with the people I&#8217;m organizing meetings with and whatnot, and the agent coming to tell me something like, &#8220;Okay, I&#8217;ve moved your meeting here to next week because this one&#8217;s not very important, and you don&#8217;t really care about it. But I&#8217;ve brought this meeting with your director forward because it&#8217;s urgent given what I&#8217;ve seen in that email. And also, I&#8217;ve set the location in this place because you&#8217;ll both be around that area in any event around then, given what I&#8217;ve gathered from talking to the other agent.&#8221; </p><p>00:47:13,946 --&gt; 00:47:23,766 [Seb Krier]</p><p>Something like that I think I could easily see in the next couple of years. But something at a higher, things like automating part of a local authority and- </p><p>00:47:23,766 --&gt; 00:47:26,786 [Seth]</p><p>So let&#8217;s stick to the noise example because that&#8217;s a core one. </p><p>00:47:26,786 --&gt; 00:47:30,276 [Seb Krier]</p><p>Well, the noise one, I think takes a bit longer because it requires this kind of, </p><p>00:47:31,486 --&gt; 00:47:44,496 [Seb Krier]</p><p>well, this adoption and integration of this technology at my local city council or whatever. And that&#8217;s one of the things that takes a long time. It&#8217;s not immediate, and it requires a shift in behaviors. People have to also then use these agents. It has to be legitimized. The </p><p>00:47:45,726 --&gt; 00:47:48,386 [Seb Krier]</p><p>value proposition has to be made a bit clearer. </p><p>00:47:48,386 --&gt; 00:47:48,586 [Seth]</p><p>Mm-hmm. </p><p>00:47:48,586 --&gt; 00:48:01,666 [Seb Krier]</p><p>And there&#8217;s all the usual political issues of like, well, you&#8217;ve got these vested interests, and it&#8217;s not in some people&#8217;s interests for that whole thing to work out. In the same way as if you wanted to have some sort of YIMBY paradise, a lot of kind of planning authorities wouldn&#8217;t be very delighted. They would probably find ways to stop the </p><p>00:48:02,786 --&gt; 00:48:03,166 [Seb Krier]</p><p>deployment of- </p><p>00:48:03,166 --&gt; 00:48:07,046 [Seth]</p><p>But they could be compensated through micro-transactions, dude. [chuckles] </p><p>00:48:07,046 --&gt; 00:48:11,866 [Seb Krier]</p><p>Exactly. Right? It&#8217;s a recursive Coasean improvement. But- </p><p>00:48:11,866 --&gt; 00:48:15,566 [Andrey]</p><p>So, I guess just like a thought here. </p><p>00:48:15,566 --&gt; 00:48:15,885 [Seb Krier]</p><p>Mm-hmm. </p><p>00:48:15,886 --&gt; 00:48:19,106 [Andrey]</p><p>I&#8217;m sure you&#8217;ve heard of Edge Esmeralda- </p><p>00:48:19,106 --&gt; 00:48:19,116 [Seb Krier]</p><p>Mm-hmm </p><p>00:48:19,116 --&gt; 00:48:31,936 [Andrey]</p><p>... which is kind of this experimental new town. It seems just like we think that with new firms that are entering, they&#8217;re going to be able to organize in a way that&#8217;s going to better be able to take advantage of AI capabilities. </p><p>00:48:31,936 --&gt; 00:48:32,726 [Seb Krier]</p><p>Mm-hmm. </p><p>00:48:32,726 --&gt; 00:48:43,006 [Andrey]</p><p>To the extent that we can have newly incorporated cities or other jurisdictions, they might allow for agentic representation at the local council meeting- </p><p>00:48:43,006 --&gt; 00:48:43,646 [Seb Krier]</p><p>Mm-hmm </p><p>00:48:43,646 --&gt; 00:48:50,666 [Andrey]</p><p>... and other such things. And if that works really well, then it would actually </p><p>00:48:51,786 --&gt; 00:48:56,166 [Andrey]</p><p>show other places what they&#8217;re missing out on. I&#8217;m not saying that the San Francisco- </p><p>00:48:56,166 --&gt; 00:48:56,176 [Seb Krier]</p><p>Yeah </p><p>00:48:56,176 --&gt; 00:48:59,076 [Andrey]</p><p>... City Council would do this. [laughs] </p><p>00:48:59,076 --&gt; 00:48:59,586 [Seb Krier]</p><p>[laughs] </p><p>00:48:59,586 --&gt; 00:49:07,265 [Andrey]</p><p>But some smaller cities might start doing this, or school boards. That&#8217;s a great example. I hear people have a lot of issues with those. </p><p>00:49:07,266 --&gt; 00:49:30,866 [Seb Krier]</p><p>Totally. I think that&#8217;s kind of what I concluded in the essay, is that you&#8217;ve got to start in these kind of small proof of concepts and then kind of make the idea actively desirable and for people to kind of actually see the benefits of these things. And so I agree, The Edge Esmeralda, I think they&#8217;re trying something out with agents, and that&#8217;s going to be pretty exciting to see. And maybe then that kind of takes on, right? In the same way as a lot of conferences now have apps or something to... </p><p>00:49:31,886 --&gt; 00:49:36,446 [Seb Krier]</p><p>I guess the more pessimistic version is, well, if you look at Estonia, they&#8217;ve been really good at </p><p>00:49:37,506 --&gt; 00:49:42,666 [Seb Krier]</p><p>integrating technology and automating all sorts of parts of their government in really efficient and effective ways.</p><p>00:49:43,642 --&gt; 00:49:44,622 [Seth]</p><p>Estonia? </p><p>00:49:44,622 --&gt; 00:49:46,102 [Seb Krier]</p><p>Yeah. [laughs] </p><p>00:49:46,102 --&gt; 00:49:46,982 [Seth]</p><p>[laughs] </p><p>00:49:46,982 --&gt; 00:49:58,162 [Seb Krier]</p><p>And yet you don&#8217;t necessarily see neighboring countries have a strong incentive to do the same straight away. And I would expect the equivalent to take quite a long time, say, if you try to do the same thing in Brussels. So there are other- </p><p>00:49:58,162 --&gt; 00:49:58,902 [Seth]</p><p>[laughs] </p><p>00:49:58,902 --&gt; 00:49:59,401 [Seb Krier]</p><p>... dynamics. </p><p>00:49:59,401 --&gt; 00:50:00,802 [Seth]</p><p>Anything takes longer in Brussels. </p><p>00:50:00,802 --&gt; 00:50:03,082 [Seb Krier]</p><p>That&#8217;s true. It&#8217;s part and parcel of it. </p><p>00:50:03,082 --&gt; 00:50:04,262 [Seth]</p><p>Except for the french fries. </p><p>00:50:04,262 --&gt; 00:50:08,022 [Seb Krier]</p><p>I&#8217;d say, yeah. Which are indeed very good. But, I think </p><p>00:50:09,562 --&gt; 00:50:25,742 [Seb Krier]</p><p>you&#8217;ll want some of these kind of smaller challenges, I guess, to really push for these things and put more pressure on the incumbent systems, whether it&#8217;s in the private sector or in the public sector. The only thing is, I expect it to be a bit slower in the public sector than it will be in the private sector. </p><p>00:50:27,382 --&gt; 00:50:41,082 [Seb Krier]</p><p>Once a bunch of startups start using agents that, I don&#8217;t know, coordinates on meetings or whatever, maybe that&#8217;s going to take off as well with all the wider organizations much more quickly. Whereas, you still have parts of the German or UK government using fax machines today. </p><p>00:50:41,082 --&gt; 00:50:59,042 [Andrey]</p><p>Well, there&#8217;s this example that we have in our paper where some agentic environments are designed for agents only, right? And so you might imagine that to work well within a firm where everyone has their calendar scheduling agent, and they all talk to each other. But I think </p><p>00:51:00,102 --&gt; 00:51:07,482 [Andrey]</p><p>you can also imagine an environment where the agent has to be able to interface also with humans, right? </p><p>00:51:08,782 --&gt; 00:51:10,042 [Andrey]</p><p>And those are quite different </p><p>00:51:11,762 --&gt; 00:51:13,022 [Andrey]</p><p>capabilities </p><p>00:51:14,782 --&gt; 00:51:18,842 [Andrey]</p><p>or tests of the system. And if you want maybe </p><p>00:51:20,062 --&gt; 00:51:25,642 [Andrey]</p><p>more useful versions of this, they should also be able to interact with humans who are not using the agents. </p><p>00:51:25,642 --&gt; 00:51:37,682 [Seb Krier]</p><p>True. And that probably acts also as a speed limiter, I guess, as well, right? If you have to then... But I agree. I think that&#8217;s going to be an important part, and I think that&#8217;s where protocol design and all these kind of things start coming into play. </p><p>00:51:39,022 --&gt; 00:51:48,362 [Seb Krier]</p><p>And I liked actually in your paper the whole bowling shoe versus bring your own question as well. Because you&#8217;d expect some platforms to just say, &#8220;Well, just use our agent here&#8221; or something. </p><p>00:51:49,422 --&gt; 00:51:54,362 [Seb Krier]</p><p>And it&#8217;s not clear to me that you&#8217;ll necessarily have one agent that you&#8217;re going to use across all platforms the same as something. </p><p>00:51:56,002 --&gt; 00:52:06,842 [Seb Krier]</p><p>Maybe you probably want certain characteristics or certain knowledge to be more segmented. Maybe you want your agent doing high-frequency trading to be a bit different than your agent doing, I don&#8217;t know, social life coordination or something. </p><p>00:52:08,402 --&gt; 00:52:14,622 [Seb Krier]</p><p>And it&#8217;s not clear whether it&#8217;s an agent that&#8217;s necessarily owned in some way by me or if it&#8217;s in fact... </p><p>00:52:15,882 --&gt; 00:52:25,202 [Seb Krier]</p><p>So yeah, there&#8217;s all sorts of interesting design questions that aren&#8217;t fully clear to me just yet. And I think that would be the kind of thing I&#8217;d love to see more work on. Actually, I think also Anthropic had this project deal recently, which was interesting, too. </p><p>00:52:27,282 --&gt; 00:52:29,662 [Seb Krier]</p><p>And it seems to have worked out quite nicely, I think, in their experiment. </p><p>00:52:31,242 --&gt; 00:52:32,262 [Seth]</p><p>It is, yeah. </p><p>00:52:32,262 --&gt; 00:52:34,242 [Andrey]</p><p>Yeah. You&#8217;re welcome to explain it. Yeah. </p><p>00:52:34,242 --&gt; 00:52:41,262 [Seb Krier]</p><p>[laughs] Again, I don&#8217;t have deep knowledge of what they&#8217;ve done or anything. I think Christy glanced at the... But I think broadly, they </p><p>00:52:42,822 --&gt; 00:52:51,922 [Seb Krier]</p><p>let part of their employees have access to agents to do some sort of negotiating and bargaining on their behalf for all sorts of commercial items or things they would want to trade. </p><p>00:52:53,102 --&gt; 00:52:59,302 [Seb Krier]</p><p>And they also did some experiment where I think some had access to more capable models, others had access to slightly less capable models. And I think the </p><p>00:53:00,842 --&gt; 00:53:01,842 [Seb Krier]</p><p>insight was something like </p><p>00:53:03,162 --&gt; 00:53:11,622 [Seb Krier]</p><p>everyone was happy in the end, but actually people with the more capable models got better surplus or something. And those without the capable models didn&#8217;t realize the </p><p>00:53:12,642 --&gt; 00:53:20,782 [Seb Krier]</p><p>gain they would have made had they had the more capable one. But in any event, everyone was happy. Isn&#8217;t it? It apparently worked out and people traded stuff. </p><p>00:53:20,782 --&gt; 00:53:28,921 [Andrey]</p><p>Well, my understanding is the setting was people literally took photos. They inventoried the stuff they didn&#8217;t need in their home. </p><p>00:53:28,922 --&gt; 00:53:29,242 [Seb Krier]</p><p>Yeah. </p><p>00:53:29,242 --&gt; 00:53:29,932 [Andrey]</p><p>And then- </p><p>00:53:29,932 --&gt; 00:53:31,522 [Seth]</p><p>[laughs] </p><p>00:53:31,522 --&gt; 00:53:37,942 [Andrey]</p><p>... if you can get rid of it, you might already be happy if you can get rid of it, let alone get something in return for it. </p><p>00:53:37,942 --&gt; 00:53:47,182 [Seb Krier]</p><p>It&#8217;s true. Well, yeah. That was the thing we were discussing earlier. I&#8217;d be happy to have an agent sell my records, but not buy them or some silly [chuckles] certain things. </p><p>00:53:47,182 --&gt; 00:53:47,861 [Andrey]</p><p>Mm. </p><p>00:53:47,862 --&gt; 00:53:51,862 [Seb Krier]</p><p>But, yeah. So I think I didn&#8217;t go too deep into that specific </p><p>00:53:53,022 --&gt; 00:54:03,162 [Seb Krier]</p><p>paper or webpage or whatever they did on that. But there seems to be more experimentation along these lines. I think Rohit, of course, has his own kind of things. I had a MATS </p><p>00:54:04,322 --&gt; 00:54:09,922 [Seb Krier]</p><p>stream with Yoav Shavit at OpenAI on kind of multi-agent coordination. </p><p>00:54:09,922 --&gt; 00:54:13,992 [Andrey]</p><p>Can you explain what MATS is? Because I don&#8217;t think many in our audience know what that is. </p><p>00:54:13,992 --&gt; 00:54:16,622 [Seb Krier]</p><p>Oh, it&#8217;s basically a fellowship. </p><p>00:54:18,662 --&gt; 00:54:28,062 [Seb Krier]</p><p>You have mentors and mentees, and they work on all sorts of projects, and there&#8217;s close collaboration with labs. But it&#8217;s broadly on technical projects for AI safety and governance. </p><p>00:54:28,062 --&gt; 00:54:34,122 [Andrey]</p><p>Is this a program that our junior economist listeners should try to apply to? </p><p>00:54:34,122 --&gt; 00:54:59,482 [Seb Krier]</p><p>I think so, yeah. I don&#8217;t know if they have an econ stream. Well, they should if they don&#8217;t. And there is, I think, a wider kind of governance stream, so I wouldn&#8217;t be surprised. If they don&#8217;t have one, I would imagine there to be one eventually. But I think a lot of economists who&#8217;ve got this expertise in mechanism design or game theory, I think there&#8217;s a lot to offer here, even in the more classical AI safety type streams as well. So yeah, they should look into MATS for sure. </p><p>00:55:00,882 --&gt; 00:55:13,182 [Seth]</p><p>Well, a lot of pickup on those answers. Very interesting stuff. I guess one question I have here that you already introduced is the idea of, okay, so maybe we&#8217;re going to get allocative efficiency from all this Coasean bargaining. </p><p>00:55:13,182 --&gt; 00:55:13,222 [Seb Krier]</p><p>Mm-hmm. </p><p>00:55:13,222 --&gt; 00:55:41,202 [Seth]</p><p>We&#8217;ll make sure that the lawnmower doesn&#8217;t get mowed at 2:00 a.m. when it&#8217;s really annoying for me. But there&#8217;s a distributional question here, too. So one concern that you already kind of aired out was the idea that people with better models who can pay a little bit more might be able to bargain harder and get a bigger share of the pie. Another distributional concern might be just the step zero. Which is for any sort of bargaining over </p><p>00:55:44,802 --&gt; 00:55:48,561 [Seth]</p><p>an externality to happen, you need to assign that property right first. </p><p>00:55:48,562 --&gt; 00:55:48,752 [Seb Krier]</p><p>Mm-hmm. </p><p>00:55:48,752 --&gt; 00:55:52,382 [Seth]</p><p>And that could be a process that is very unequal. So, </p><p>00:55:53,822 --&gt; 00:55:59,962 [Seth]</p><p>feel free to take up either of those concerns. Is Coasean bargaining going to lead to a dystopic inequality hellhole?</p><p>00:56:01,490 --&gt; 00:56:12,270 [Seb Krier]</p><p>Right. I don&#8217;t think so. Certainly, I would imagine that in the first instance, I think people are still net better off than the counterfactual of not having any bargaining agents at all. </p><p>00:56:12,270 --&gt; 00:56:13,590 [Seth]</p><p>Well, that&#8217;s the Pareto promise, right? </p><p>00:56:13,590 --&gt; 00:56:13,610 [Seb Krier]</p><p>Yeah. </p><p>00:56:13,610 --&gt; 00:56:18,230 [Seth]</p><p>First, Fear Theorem tells us we&#8217;re going to get a Pareto efficient, allocative efficiency outcome. </p><p>00:56:18,230 --&gt; 00:56:18,550 [Seb Krier]</p><p>Mm-hmm. </p><p>00:56:18,550 --&gt; 00:56:20,370 [Seth]</p><p>But, a lot of people- </p><p>00:56:20,370 --&gt; 00:56:20,610 [Seb Krier]</p><p>Sure </p><p>00:56:20,610 --&gt; 00:56:23,110 [Seth]</p><p>... trade this efficiency for some equality, right? </p><p>00:56:23,110 --&gt; 00:57:04,610 [Seb Krier]</p><p>I guess to me, it depends a little bit on the complexity of the problem, and I would guess that over time, you&#8217;ll get diminishing returns. I think we&#8217;re discussing this, asking Claude Opus for her toothpaste recommendation or whatever. There&#8217;s a point to which I would imagine, I guess I don&#8217;t know this empirically, and certainly in the project deal and tropics that you had kind of better returns by using the better model, but it&#8217;s not clear to me whether over time, actually, this kind of scales linearly in terms of if you&#8217;ve got the mega guard model, you necessarily get a better deal. There&#8217;s a point to which I think this doesn&#8217;t... In the same way as Elon Musk has the same phone as you and I have, and he&#8217;s not getting that much more just by virtue of being rich or something. At least, he does in general, just not on the phone part. [chuckles] And </p><p>00:57:06,370 --&gt; 00:57:23,230 [Seb Krier]</p><p>in any event, I think even then, to the extent that there were to be some sort of inequity, nothing stops, I think, whether it&#8217;s the state or philanthropy, whoever, to kind of subsidize access. Maybe you can have vouchers, you have systems that essentially re-equilibrate things. So I&#8217;ve always liked the voucher systems, even for private schooling and so on. </p><p>00:57:24,250 --&gt; 00:57:26,530 [Seb Krier]</p><p>So maybe that&#8217;s one way around that. </p><p>00:57:27,630 --&gt; 00:57:29,160 [Seb Krier]</p><p>And the property- </p><p>00:57:29,160 --&gt; 00:57:34,850 [Seth]</p><p>Decide whether to use my precious 100,000 tokens to bargain for food or to bargain for noise complaints. </p><p>00:57:34,850 --&gt; 00:57:40,590 [Seb Krier]</p><p>[laughs] Well, yeah. That&#8217;s on you to decide how you want to use it. [laughs] But- </p><p>00:57:40,590 --&gt; 00:57:42,100 [Andrey]</p><p>Always has been, Seth. Always has been. </p><p>00:57:42,100 --&gt; 00:57:42,290 [Seth]</p><p>Always has been. </p><p>00:57:42,290 --&gt; 00:57:50,370 [Seb Krier]</p><p>Yeah. Exactly. That&#8217;s not me. And then the other one with the property right assignment is a tricky one, too. And it&#8217;s deeply political ultimately as well because- </p><p>00:57:50,370 --&gt; 00:57:50,650 [Seth]</p><p>Right </p><p>00:57:50,650 --&gt; 00:57:51,690 [Seb Krier]</p><p>... of course, it&#8217;s- </p><p>00:57:53,090 --&gt; 00:57:59,050 [Seth]</p><p>It reminds me of land redistribution, right? It&#8217;s like, &#8220;All right, we&#8217;re breaking up the plantations. Who gets it?&#8221; [laughs] </p><p>00:57:59,050 --&gt; 00:58:19,190 [Seb Krier]</p><p>Yeah. And I think, I guess what I claim in the essay or something was something like, well, you can just start with what we already have. There&#8217;s already a certain kind of distribution and allocation of rights and why do you need to just start anew? Just start with what we have. Then I did agree that there&#8217;s a need for them to change. You just then kind of, again, that&#8217;s the whole democratic process. </p><p>00:58:19,190 --&gt; 00:58:21,110 [Seth]</p><p>We&#8217;ll start with noise complaints. </p><p>00:58:21,110 --&gt; 00:58:21,250 [Seb Krier]</p><p>Well- </p><p>00:58:21,250 --&gt; 00:58:24,410 [Seth]</p><p>Because right now, it&#8217;s unclear to me who has the... I have the... Yeah. </p><p>00:58:24,410 --&gt; 00:58:43,210 [Seb Krier]</p><p>Sorry, I thought of that a little bit. I was thinking that actually, you can just keep the 11:00 PM threshold or curfew. It&#8217;s just that this switches the property right allocation. So before 11:00 PM, I have to pay you to be quiet, and after 11:00 PM, you have to pay me to be noisy or something. </p><p>00:58:43,210 --&gt; 00:58:43,710 [Seth]</p><p>[laughs] </p><p>00:58:43,710 --&gt; 00:58:53,540 [Seb Krier]</p><p>So I think that still works, but you come up with some sort of agreements as to where these kind of fluids, property rights, or something apply. But for these to be decided, I think you need the </p><p>00:58:54,670 --&gt; 00:59:17,890 [Seb Krier]</p><p>whole democratic apparatus and this is the other thing I was mentioning earlier, and wanting to make that part of, I don&#8217;t know, local democracy, for example, more responsive and effective. So there&#8217;s all sorts of kind of decisions being taken every day at the local council or something that I&#8217;m completely unaware of and have, unfortunately, very little time to go and engage with. But it probably would be better for me to have- </p><p>00:59:17,890 --&gt; 00:59:22,190 [Seth]</p><p>Unfortunately, you&#8217;re telling me that if I took away an hour of Twitter, that&#8217;s what you&#8217;d be doing. </p><p>00:59:22,190 --&gt; 00:59:23,439 [Seb Krier]</p><p>Exactly. I could be online. </p><p>00:59:23,439 --&gt; 00:59:26,630 [Andrey]</p><p>Wait, you could definitely be on Twitter and go to the council meeting. </p><p>00:59:26,630 --&gt; 00:59:26,650 [Seth]</p><p>[laughs] </p><p>00:59:26,650 --&gt; 00:59:32,150 [Seb Krier]</p><p>Yeah, but I have to focus my attention on... [laughs] But I could be consuming slop instead. </p><p>00:59:33,430 --&gt; 00:59:34,850 [Seb Krier]</p><p>So yeah, I think that </p><p>00:59:36,030 --&gt; 00:59:36,690 [Seb Krier]</p><p>for that, I wouldn&#8217;t </p><p>00:59:37,730 --&gt; 00:59:40,070 [Seb Krier]</p><p>mind having an agent come back to me and say, &#8220;Oh, there&#8217;s this </p><p>00:59:41,330 --&gt; 00:59:50,850 [Seb Krier]</p><p>new road that&#8217;s being built, and it would be pretty good for you, given that you take that route a lot and stuff, but it seems to be not going well. Do you want to register some sort of...&#8221; But </p><p>00:59:52,010 --&gt; 01:00:16,720 [Seb Krier]</p><p>yeah. And again, I can see versions of that where this is used for vitocracy, and this is where the whole mechanism design stuff comes into play. But for the property right assignment side of things, I think you just start with what we have today, and then you adjust over time using this mechanism. And again, it&#8217;ll probably depend on the setting or where you&#8217;re doing that. The curfew one, I guess, would be with your local council, but for all sorts of other things, there&#8217;ll be different kind of actors and </p><p>01:00:17,970 --&gt; 01:00:20,970 [Seb Krier]</p><p>institutions in place for that. You probably need... Yeah. </p><p>01:00:20,970 --&gt; 01:00:37,450 [Seth]</p><p>Can I go, how about something like stuff that we don&#8217;t have property rights around at all right now, or even rules around, but are definitely negative externalities, right? So for example, I have a neighbor in my office who does not shower, who smells really bad, right? So how do we decide- </p><p>01:00:37,450 --&gt; 01:00:40,330 [Andrey]</p><p>Are you willing to reveal this on a- </p><p>01:00:40,330 --&gt; 01:00:40,500 [Seth]</p><p>No </p><p>01:00:40,500 --&gt; 01:00:42,450 [Andrey]</p><p>... public podcast? [laughs] </p><p>01:00:42,450 --&gt; 01:00:48,950 [Seth]</p><p>So the question is, how do we decide whether he has a right to be smelly, or I have a right to not smell him? </p><p>01:00:49,990 --&gt; 01:01:00,549 [Seb Krier]</p><p>I guess it, yeah, once again, highly political. [laughs] But I think my bias, of course, is one that&#8217;s more, I guess, freedom maximizing a classical liberal or something. So it&#8217;s more like- </p><p>01:01:00,550 --&gt; 01:01:01,490 [Seth]</p><p>The right to be smelly. </p><p>01:01:01,490 --&gt; 01:01:01,630 [Seb Krier]</p><p>Yeah. </p><p>01:01:01,630 --&gt; 01:01:01,990 [Seth]</p><p>There we go. </p><p>01:01:01,990 --&gt; 01:01:06,630 [Seb Krier]</p><p>It&#8217;s the harm principle, right? He&#8217;s not harming you by being smelly, unfortunately, so you&#8217;re going to have to- </p><p>01:01:06,630 --&gt; 01:01:08,940 [Andrey]</p><p>What do you mean? No, he definitely harms me. </p><p>01:01:08,940 --&gt; 01:01:09,910 [Seth]</p><p>It really smells bad, dude. </p><p>01:01:09,910 --&gt; 01:01:14,540 [Seb Krier]</p><p>[laughs] It&#8217;s not decreasing your longevity as far as I&#8217;m aware. [laughs] </p><p>01:01:14,540 --&gt; 01:01:17,550 [Seth]</p><p>[laughs] Actually, it makes me want to decrease my longevity. </p><p>01:01:17,550 --&gt; 01:01:33,610 [Andrey]</p><p>But no, but more seriously, this is actually a great use case for agents because the agent could receive anonymous... A lot of effective altruist rationalist types have this anonymous- </p><p>01:01:33,610 --&gt; 01:01:33,910 [Seb Krier]</p><p>Oh, yeah </p><p>01:01:33,910 --&gt; 01:01:36,710 [Andrey]</p><p>... link in their profile, where you can give them anonymous </p><p>01:01:38,350 --&gt; 01:01:38,690 [Andrey]</p><p>feedback- </p><p>01:01:38,690 --&gt; 01:01:39,710 [Seb Krier]</p><p>Yeah </p><p>01:01:39,710 --&gt; 01:01:43,390 [Andrey]</p><p>... about how much they smell. And so with the world of agents then- </p><p>01:01:43,390 --&gt; 01:01:43,700 [Seth]</p><p>[laughs] </p><p>01:01:43,700 --&gt; 01:01:49,150 [Andrey]</p><p>... an agent could figure out a way to give anonymous feedback to the person&#8217;s agent </p><p>01:01:50,350 --&gt; 01:01:52,190 [Andrey]</p><p>that the person smells. </p><p>01:01:52,190 --&gt; 01:01:52,550 [Seb Krier]</p><p>Well- </p><p>01:01:52,550 --&gt; 01:01:55,420 [Andrey]</p><p>And then that agent could pass it on to the person</p><p>01:01:56,206 --&gt; 01:02:14,466 [Seb Krier]</p><p>Exactly. To be fair, you can also do that without the agent. You can just put a message on their desk saying, &#8220;By the way, please just shower,&#8221; or something. [laughing] But I think that in principle, what would be actually more interesting is having the agent solve that problem without the person necessarily being offended by the thing or having the message straight away. </p><p>01:02:15,806 --&gt; 01:02:20,186 [Seb Krier]</p><p>So having the other agent just... This is getting into nudging territory, right? And this is also controversial in its own right. </p><p>01:02:21,426 --&gt; 01:02:43,126 [Seb Krier]</p><p>But it actually connects to this other question of do you want your agent to be purely instruction following, or do you want it to help you maybe achieve some higher order goals? And so maybe one of my goals is I want to be not disliked at the office. And so one of the ways is my agent, well, I&#8217;m not the smelly guy, but let&#8217;s assume [laughing] then the agent will kind of find a subtle way- </p><p>01:02:43,126 --&gt; 01:02:43,416 [Seth]</p><p>Sure. </p><p>01:02:43,416 --&gt; 01:02:46,686 [Seb Krier]</p><p>... to be like, &#8220;Seb, it&#8217;s time to log off Twitter and go and take a shower.&#8221; </p><p>01:02:47,926 --&gt; 01:02:53,455 [Seb Krier]</p><p>[laughing] So yeah, that might be one of them. But yeah, in terms of an initial allocation, I don&#8217;t think that he would </p><p>01:02:54,986 --&gt; 01:02:58,706 [Seb Krier]</p><p>have to pay me to continue being smelly perhaps. </p><p>01:02:59,986 --&gt; 01:03:01,185 [Seb Krier]</p><p>[chuckles] </p><p>01:03:01,186 --&gt; 01:03:02,746 [Seth]</p><p>Okay. So </p><p>01:03:04,826 --&gt; 01:03:13,406 [Seth]</p><p>let me give you some concerns about this vision. Right? So we&#8217;ve laid out what could be beautiful about this. We can efficiently allocate externalities, blah, blah, blah. </p><p>01:03:14,766 --&gt; 01:03:30,326 [Seth]</p><p>One set of concerns I&#8217;ve heard from my non-economist friends when I talk about this is this idea that micro bargaining at scale could harm social legitimacy. So if I may give you two exaggerated sci-fi versions of this, and then maybe we can reel it back in into reality. </p><p>01:03:30,326 --&gt; 01:03:31,186 [Seb Krier]</p><p>Mm-hmm. </p><p>01:03:31,186 --&gt; 01:03:37,106 [Seth]</p><p>The first version of this comes from the video game Helldivers 2. Are you familiar with that game? </p><p>01:03:37,106 --&gt; 01:03:39,606 [Seb Krier]</p><p>Yeah, I haven&#8217;t played it, but I&#8217;m aware broadly of the... Yeah. </p><p>01:03:39,606 --&gt; 01:04:29,386 [Seth]</p><p>All right. So the premise is kind of like a Starship Troopers sort of universe, where it&#8217;s officially a democracy, but really it&#8217;s fascism. And their political system is called Managed Democracy. And the way that Managed Democracy works is everyone fills out a preference questionnaire about what they like and they don&#8217;t like, and then that goes up into the big algorithm in the sky, and then boop boop boop boop boop, here&#8217;s your optimal social policy that we&#8217;ve calculated over everyone. Right? So on the one hand, it kind of seems like that is one end limit of what Coasean bargaining at scale would look like. I just fill out a survey, and my AI negotiates with all the other AIs, and I don&#8217;t really interact with it beyond that. But you can kind of see how dystopic that looks. It has a lot less social legitimacy than maybe two choice first past the post elections. </p><p>01:04:29,386 --&gt; 01:04:39,366 [Seb Krier]</p><p>Yeah, I agree. That&#8217;s the extreme version of it or something. Seems very undesirable. And so that&#8217;s why I was very careful to say, by the way, this is not saying make every single aspect of your life </p><p>01:04:41,266 --&gt; 01:04:46,186 [Seb Krier]</p><p>a transaction. In fact, I think there&#8217;s the whole Michael Sandel stuff, right? </p><p>01:04:46,186 --&gt; 01:04:47,666 [Seth]</p><p>Oh, Sandel. That&#8217;s my next question. </p><p>01:04:47,666 --&gt; 01:04:47,906 [Seb Krier]</p><p>Right. </p><p>01:04:47,906 --&gt; 01:04:49,746 [Seth]</p><p>You&#8217;re jumping ahead. [chuckles] </p><p>01:04:49,746 --&gt; 01:04:51,046 [Seb Krier]</p><p>Well, but I think that the </p><p>01:04:52,406 --&gt; 01:05:22,676 [Seb Krier]</p><p>idea, at least for me, is not for the agent to completely automate your agency away. If anything, ideally, this kind of enhances it in some way. So, I like the idea of the local authority thing because it&#8217;s something I wouldn&#8217;t be doing in any event. It&#8217;s not like I&#8217;m having this taken away from me and that, but for the agent, I would be going there or something. So I think there has to be some sort of conscious choice as well from individuals to where do they want to kind of exercise their agency and where do they not. And I think there&#8217;s some people who actually do not care as much about participating and </p><p>01:05:23,926 --&gt; 01:05:25,386 [Seb Krier]</p><p>others who do, and I think that&#8217;s fine. </p><p>01:05:27,146 --&gt; 01:05:38,386 [Seb Krier]</p><p>But I agree that also, there are kind of non-market norms that I think are worth preserving, reciprocity and being a good neighbor and civic participation. And you don&#8217;t want to kind of get rid of these entirely either. So, </p><p>01:05:39,406 --&gt; 01:06:07,506 [Seb Krier]</p><p>in my mind, this will never be the kind of perfect maximalist version of the whole thing. It&#8217;ll probably work in some instances. In others, you&#8217;re not going to use the system. And in general, I suspect that for many things and trivial things, like actually the smelly coworker, I don&#8217;t know, maybe that one is the one where you actually want to use an agent rather than deal with it yourself. But I agree, and it&#8217;s a matter of, I guess, developing the right norms around how we would be using there. Where is it acceptable, where is it not? </p><p>01:06:09,126 --&gt; 01:06:31,386 [Seb Krier]</p><p>But I agree with the bad version of this where no one does anything, and you just have an agent infer everything on your behalf, and now you&#8217;re just basically removing the whole participatory element of democracy, which I think is still very important. And so again, here too you will want your agent... In a way it&#8217;s not dissimilar to </p><p>01:06:33,146 --&gt; 01:06:41,326 [Seb Krier]</p><p>these questions of cognitive offloading and so on. Of like, well, if the AI writes on your behalf versus it gives you advice on how you can write the thing better. </p><p>01:06:42,606 --&gt; 01:06:46,106 [Seb Krier]</p><p>And so now I&#8217;m going a bit on a tangent, but I was mentioning this </p><p>01:06:47,326 --&gt; 01:06:59,026 [Seb Krier]</p><p>positive alignment thing earlier of do you want your agent to be purely instruction and then following, or would you want to specify higher order preferences? I think this is where this comes into play, where I might say, actually, I </p><p>01:07:02,126 --&gt; 01:07:06,326 [Seb Krier]</p><p>want you to help me achieve this kind of higher order goal or something, which may entail, in fact, asking you to just </p><p>01:07:07,346 --&gt; 01:07:46,266 [Seb Krier]</p><p>read up on this. And there&#8217;s also an element of me actually needing to retain that choice as well and as an individual, right? I will sometimes summarize a paper using an AI, and sometimes I will... And that&#8217;s fine because the counterfactual wouldn&#8217;t have not been me reading the whole paper because I don&#8217;t have... But there are also papers where I would have read the whole paper, and I&#8217;m still asking the AI to do it, and that&#8217;s kind of also on me to learn and for, I guess, the right norms to rise in terms of, okay, when it&#8217;s too much, what is right, what is not. In the same way as, I don&#8217;t know, I&#8217;ve learned how to do regressions by hand or something one day in the past, and that was helpful in some sense, but then I&#8217;ve never done that again, and I&#8217;ve started using Stata and R or whatever. So, </p><p>01:07:47,406 --&gt; 01:07:47,985 [Seb Krier]</p><p>there&#8217;s an element of- </p><p>01:07:47,986 --&gt; 01:07:48,906 [Seth]</p><p>What? </p><p>01:07:48,906 --&gt; 01:07:50,786 [Seb Krier]</p><p>[chuckles] Actually, yeah. </p><p>01:07:50,786 --&gt; 01:07:52,886 [Seth]</p><p>Are you telling us you&#8217;re not an economist? </p><p>01:07:53,966 --&gt; 01:07:55,196 [Seb Krier]</p><p>No, I am not an economist. </p><p>01:07:55,196 --&gt; 01:07:55,246 [Seth]</p><p>You got into Stata and R. </p><p>01:07:55,246 --&gt; 01:08:11,046 [Seb Krier]</p><p>I was never an economist. [laughing] So yeah, there&#8217;s this wider question of where do you want to preserve agency and then also where you want to preserve certain norms on a day-to-day level. And so I think it&#8217;s going to be a big messy thing in reality as opposed to the maximalist version of either vision.</p><p>01:08:12,142 --&gt; 01:09:27,362 [Seth]</p><p>For those of you playing along at home, now is your chance to think about how this conversation has changed your priors. This chance to contemplate your posteriors is sponsored by Revelio Labs. Revelio Labs is a leading provider of labor economics data and data services for companies, academics, and independent researchers. Andrey and I have been working in economics of AI, digitization, and automation for a long time, and we can confirm just how useful Revelio&#8217;s data is. Revelio&#8217;s team combines comprehensive micro-level data on employee professional profiles, job postings, and employee sentiment with standardizations, mappings, and enrichments available, all to make that data useful without making your modeling decisions for you. The data can be flexibly aggregated to company, market, or industry, and can be used to study questions ranging from career trajectories to occupational transformation, to the returns to skills, and the impact of AI on labor demand for tasks. Can&#8217;t imagine anyone who would be interested in that. And Revelio data is available on WRDS. So if you&#8217;re an academic with a good library, go see if you have access to their premier data already. And if you don&#8217;t, you can reach out to their excellent economics team, and they&#8217;ll hook you up. I feel like it&#8217;s time for lightning round. </p><p>01:09:27,362 --&gt; 01:09:27,682 [Andrey]</p><p>All right. </p><p>01:09:27,682 --&gt; 01:09:28,342 [Seth]</p><p>Let&#8217;s do it. </p><p>01:09:28,342 --&gt; 01:09:33,702 [Andrey]</p><p>Let&#8217;s do it. [upbeat music] </p><p>01:09:34,802 --&gt; 01:09:35,962 [Andrey]</p><p>So we&#8217;re- </p><p>01:09:35,962 --&gt; 01:09:35,972 [Seth]</p><p>Nice </p><p>01:09:35,972 --&gt; 01:09:39,372 [Andrey]</p><p>... yeah, very excited because you&#8217;re a master of the take. </p><p>01:09:39,372 --&gt; 01:09:41,252 [Seth]</p><p>[laughs] </p><p>01:09:41,252 --&gt; 01:09:41,742 [Andrey]</p><p>[laughs] </p><p>01:09:43,282 --&gt; 01:09:47,702 [Andrey]</p><p>So you fairly recently moved from London to New York City. </p><p>01:09:47,702 --&gt; 01:09:48,242 [Seth]</p><p>Mm-hmm. </p><p>01:09:48,242 --&gt; 01:09:49,122 [Andrey]</p><p>What are </p><p>01:09:50,422 --&gt; 01:09:53,022 [Andrey]</p><p>the words that come to mind- </p><p>01:09:53,022 --&gt; 01:09:53,312 [Seth]</p><p>[laughs] </p><p>01:09:53,312 --&gt; 01:09:56,762 [Andrey]</p><p>... when you&#8217;re thinking about AI in New York City versus AI in London? </p><p>01:09:58,042 --&gt; 01:10:05,402 [Seth]</p><p>Oh. AI in London feels more academic, theoretic, </p><p>01:10:06,562 --&gt; 01:10:53,002 [Seth]</p><p>I guess more deep researchy, and more governance-oriented in some way. I think there&#8217;s this whole Oxford, Cambridge, UCL connection, the government and UK AI Safety Institute, the fact that government was already dealing with AI quite early. I think that that whole ecosystem is fairly developed here, whereas New York is a bit of a weird, secret third thing of like, well, there&#8217;s some interesting people, more like heterodox, random, the edge of culture. Some interesting kind of frontier AI people too, but I wouldn&#8217;t really see it as some sort of AI capital or something. It&#8217;s just a bit more diverse in terms of the offering. But I kind of like that. That&#8217;s the point also. I don&#8217;t want to be fully NSF. I kind of like being somewhat distant. </p><p>01:10:54,062 --&gt; 01:10:59,551 [Andrey]</p><p>What is the reaction of people you meet in Brooklyn or at your- </p><p>01:10:59,551 --&gt; 01:11:00,202 [Seth]</p><p>[laughs] </p><p>01:11:00,202 --&gt; 01:11:04,382 [Andrey]</p><p>... or at your EDM shows when you tell them where you work? </p><p>01:11:04,382 --&gt; 01:11:07,771 [Seth]</p><p>First of all, I do not like EDM at all. But- [laughs] </p><p>01:11:07,771 --&gt; 01:11:08,111 [Andrey]</p><p>[laughs] </p><p>01:11:08,111 --&gt; 01:11:08,122 [Seth]</p><p>[laughs] </p><p>01:11:08,122 --&gt; 01:11:09,242 [Andrey]</p><p>Whoa. </p><p>01:11:09,242 --&gt; 01:11:09,322 [Seth]</p><p>[laughs] </p><p>01:11:09,322 --&gt; 01:11:10,382 [Andrey]</p><p>Hottest thing right now. </p><p>01:11:10,382 --&gt; 01:11:11,782 [Seth]</p><p>Yeah. Going- </p><p>01:11:11,782 --&gt; 01:11:13,782 [Andrey]</p><p>So how much would you have your DJ- </p><p>01:11:13,782 --&gt; 01:11:13,802 [Seth]</p><p>Electronic- </p><p>01:11:13,802 --&gt; 01:11:14,162 [Andrey]</p><p>... mic? </p><p>01:11:14,162 --&gt; 01:11:16,912 [Seth]</p><p>Fine electronic music. [laughs] </p><p>01:11:16,912 --&gt; 01:11:19,672 [Andrey]</p><p>[laughs] Oh, yeah, that one&#8217;s good. The good- </p><p>01:11:19,672 --&gt; 01:11:20,212 [Seth]</p><p>[laughs] </p><p>01:11:20,212 --&gt; 01:11:20,262 [Andrey]</p><p>[laughs] </p><p>01:11:20,262 --&gt; 01:11:21,682 [Seth]</p><p>Aligned electronic music. </p><p>01:11:23,082 --&gt; 01:11:24,232 [Andrey]</p><p>Yes. [laughs] </p><p>01:11:24,232 --&gt; 01:11:25,732 [Seth]</p><p>[laughs] Yeah, I think it </p><p>01:11:29,002 --&gt; 01:11:41,262 [Seth]</p><p>wouldn&#8217;t be the first thing I tell people, perhaps, when I meet someone in Brooklyn. It&#8217;s like, &#8220;Oh, I work for Vigo AI Lab, by the way.&#8221; But I think it depends who and where and what. You get a bit of the stereotypes of, oh, AI bad, evil, blah, blah. </p><p>01:11:42,402 --&gt; 01:11:51,922 [Seth]</p><p>But you also have some cool communities and interesting creative scenes that actually do engage with technology AI in very interesting ways. And so, </p><p>01:11:53,402 --&gt; 01:11:56,412 [Seth]</p><p>there&#8217;s a bit of everything, but that&#8217;s kind of the point also. I&#8217;m </p><p>01:11:57,522 --&gt; 01:12:01,051 [Seth]</p><p>enjoying being here, so I don&#8217;t have to talk about AI with every random person I meet. [laughs] </p><p>01:12:01,051 --&gt; 01:12:01,122 [Andrey]</p><p>[laughs] </p><p>01:12:01,122 --&gt; 01:12:11,922 [Seth]</p><p>And there&#8217;s all these other things that I can kind of explore and discuss and learn about, too. But the attitude really depends on where I end up. There&#8217;s a bit of a selection effect going on as well, to be fair. [laughs] </p><p>01:12:11,922 --&gt; 01:12:16,022 [Andrey]</p><p>The coolest thing in terms of art and AI that you&#8217;ve seen out there? </p><p>01:12:17,122 --&gt; 01:12:17,402 [Seth]</p><p>Hmm. </p><p>01:12:18,982 --&gt; 01:12:20,202 [Seth]</p><p>In New York specifically, a </p><p>01:12:23,142 --&gt; 01:12:27,582 [Seth]</p><p>lot of the recent art stuff I&#8217;ve been to was not really AI related. Although next week, though, I </p><p>01:12:28,862 --&gt; 01:12:31,182 [Seth]</p><p>am in fact co-organizing, helping </p><p>01:12:32,302 --&gt; 01:12:43,542 [Seth]</p><p>some sort of art dinner thing with a bunch of interesting artists coming from London [smacks lips] and showcasing a lot of their work. And so I think they&#8217;re called Biangi Systems. </p><p>01:12:44,862 --&gt; 01:13:01,742 [Seth]</p><p>I&#8217;ll try to put a link in the chat or something. But, yeah, shout out Biangi. But so they do interesting stuff, and you get a lot more interesting artifact from their work than, say, a one-shot Suno prompt or something. [laughs] </p><p>01:13:01,742 --&gt; 01:13:03,842 [Andrey]</p><p>All right. So then natural segue then. </p><p>01:13:03,842 --&gt; 01:13:04,902 [Seth]</p><p>[laughs] </p><p>01:13:04,902 --&gt; 01:13:14,002 [Andrey]</p><p>What is your philosophy of electronic music? Or what do you like about electronic music? What do you not like about electronic music? Yeah. </p><p>01:13:17,262 --&gt; 01:13:18,002 [Seth]</p><p>I don&#8217;t think there&#8217;s a </p><p>01:13:20,022 --&gt; 01:13:24,322 [Seth]</p><p>whole philosophy behind it. I think I just kind of like some types of electronic music. I </p><p>01:13:26,082 --&gt; 01:13:30,882 [Seth]</p><p>like a lot of late &#8216;80s, early &#8216;90s electronic music. And they had a particular kind of... </p><p>01:13:32,702 --&gt; 01:13:49,742 [Seth]</p><p>They&#8217;re basically the original futurist in some sense. And a lot of the themes and the music was very much sometimes around, in fact, AI, or sometimes around robots, sometimes about the future, and both utopian and dystopian visions of the future. And I don&#8217;t know. I like the kind of </p><p>01:13:51,462 --&gt; 01:13:59,982 [Seth]</p><p>[smacks lips] audio equivalent of all these stories or something. I read too much, and so [chuckles] it&#8217;s nice to have something that&#8217;s a bit more like </p><p>01:14:01,402 --&gt; 01:14:02,422 [Seth]</p><p>that engages in a different way. </p><p>01:14:03,822 --&gt; 01:14:07,682 [Seth]</p><p>So I don&#8217;t know. There&#8217;s a lot of early &#8216;90s techno, acid, trance. </p><p>01:14:07,682 --&gt; 01:14:09,822 [Andrey]</p><p>What are those for our listeners? </p><p>01:14:09,822 --&gt; 01:14:16,582 [Seth]</p><p>Well, Drexciya is the kind of canonical one, which is kind of Detroit electro techno from the early &#8216;90s.</p><p>01:14:17,270 --&gt; 01:14:24,430 [Seb Krier]</p><p>And they have a whole mythology around the aesthetics, the themes, the kind of </p><p>01:14:25,530 --&gt; 01:14:26,430 [Seb Krier]</p><p>storyline behind </p><p>01:14:27,530 --&gt; 01:14:30,670 [Seb Krier]</p><p>the tracks and the EPs and albums they put out. </p><p>01:14:31,770 --&gt; 01:14:45,630 [Seb Krier]</p><p>So I think Drexciya is a really interesting one. There&#8217;s also very cool long articles about them, I think James Stinson and Gerald Donald, and a few others. I think a lot of that Detroit scene at the time was very much into that stuff. So, Ox88, Jeff Mills, Underground Resistance, a lot of the </p><p>01:14:46,730 --&gt; 01:14:52,950 [Seb Krier]</p><p>classic techno electro stuff from that time. So these would be some examples, but the UK had some really cool stuff too. </p><p>01:14:54,770 --&gt; 01:15:02,690 [Seb Krier]</p><p>Yeah. And I guess I like the weirder side of music, and when it&#8217;s a bit more creative. Which is why I&#8217;m going to-- </p><p>01:15:04,050 --&gt; 01:15:08,110 [Seb Krier]</p><p>To the earlier discussion about creativity, I kind of think of slop as </p><p>01:15:09,310 --&gt; 01:15:12,330 [Seb Krier]</p><p>very much something that&#8217;s kind of highly predictable. </p><p>01:15:13,910 --&gt; 01:15:18,270 [Seb Krier]</p><p>Low creativity, high predictability, low effort. </p><p>01:15:21,110 --&gt; 01:15:44,950 [Seb Krier]</p><p>One thought I had was that over time, you see this kind of lowering of the barriers to all sorts of art forms. So even this early &#8216;90s stuff. You had a lot of really good concentrated stuff, but very few people had access to this stuff. And of course, as these synthesizers would be cheaper, as they&#8217;re then made kind of digital, as you had tools like Ableton and whatnot, a lot more people were able to create and make music. But as a result of that, </p><p>01:15:46,030 --&gt; 01:15:51,810 [Seb Krier]</p><p>you had a lot more of the mediocre and average stuff too. But that&#8217;s fine. There&#8217;s also a lot more of the really good stuff, and if you&#8217;re </p><p>01:15:53,110 --&gt; 01:15:57,030 [Seb Krier]</p><p>good at finding it, or if you can rely on good algorithms, you can filter through the slop. </p><p>01:15:58,710 --&gt; 01:16:00,190 [Seb Krier]</p><p>Yeah. That&#8217;s one thought. </p><p>01:16:01,670 --&gt; 01:16:05,730 [Andrey]</p><p>I think that the </p><p>01:16:07,070 --&gt; 01:16:11,130 [Andrey]</p><p>current-- I don&#8217;t even know current. Over the past decade, the- </p><p>01:16:11,130 --&gt; 01:16:13,170 [Seth]</p><p>[background chatter] </p><p>01:16:13,170 --&gt; 01:16:13,330 [Andrey]</p><p>... the </p><p>01:16:14,370 --&gt; 01:16:23,150 [Andrey]</p><p>EDM wave. Does it feel very sloppy to you, even somehow before we even had AI-generated music? </p><p>01:16:23,150 --&gt; 01:16:27,769 [Seb Krier]</p><p>Yeah, definitely. But on the one hand, it&#8217;s kind of my taste. I&#8217;m sure there&#8217;s </p><p>01:16:28,990 --&gt; 01:16:34,890 [Seb Krier]</p><p>a big level of subjectivity in a lot of this stuff. And then also there&#8217;s-- I think you had- </p><p>01:16:34,890 --&gt; 01:16:36,650 [Seth]</p><p>This podcast is not Yuck Yums. </p><p>01:16:36,650 --&gt; 01:16:37,010 [Seb Krier]</p><p>[laughs] </p><p>01:16:38,790 --&gt; 01:16:40,000 [Seb Krier]</p><p>But also, there&#8217;s I think all sorts of-- </p><p>01:16:41,650 --&gt; 01:16:49,370 [Seb Krier]</p><p>You had slop equivalent in other fields, in music genres, and domains over time. And I think even in the, I don&#8217;t know, was it the &#8216;80s or something, people would consider, </p><p>01:16:51,150 --&gt; 01:17:08,830 [Seb Krier]</p><p>I would say smooth jazz as being the equivalent of slop at the time or something, right? [laughing] And I like smooth jazz. I&#8217;m really keen on smooth jazz. Even library music or something at the time. So yeah, library music actually was seen as at the time it was just mass-produced, super commercial beast film score music that </p><p>01:17:10,030 --&gt; 01:17:28,649 [Seb Krier]</p><p>was probably kind of not seen with a lot of, I don&#8217;t know, awe and respect or something. Maybe the bye bye. Whereas now it is, of course, because what is slop also changes over time. But yeah, I think just like photography. Once that got democratized, you&#8217;ve got trillions of photos every day, and there&#8217;s a huge amount of slop in that whole sea of photos that people take. </p><p>01:17:28,650 --&gt; 01:17:33,630 [Seth]</p><p>And there&#8217;s like you say, there&#8217;s also a going back and rediscovering. Lovecraft at one point was slop. </p><p>01:17:33,630 --&gt; 01:17:33,820 [Seb Krier]</p><p>Yeah. </p><p>01:17:33,820 --&gt; 01:17:35,130 [Seth]</p><p>And now he&#8217;s beloved. </p><p>01:17:35,130 --&gt; 01:17:36,750 [Seb Krier]</p><p>Yeah. Exactly. There&#8217;s </p><p>01:17:37,870 --&gt; 01:18:21,510 [Seb Krier]</p><p>a lot of house music from at some point that was just deemed pretty kind of basic or something, and now it&#8217;s coming back a little bit in some communities in some shape or form. So it&#8217;s a weird mix of taste and rediscovering old stuff and scarcity because people like being special unique snowflakes and so on. So there&#8217;s [chuckles] also the different dynamics at play. But yeah, in my mind, I don&#8217;t like EDM or something, but who knows? Maybe in 20 years my tastes will have changed, and maybe I&#8217;ll look back and be like, &#8220;Oh, actually there&#8217;s some pretty good stuff here.&#8221; I remember very vividly when I was 18 thinking, &#8220;I really hate the sound of acid.&#8221; As in the acid techno, these two or three machines. I was like, &#8220;I can&#8217;t understand how one would like that.&#8221; And then literally five years later, six years later, I was just obsessed with acid. And I was like, &#8220;How did that happen?&#8221; </p><p>01:18:22,870 --&gt; 01:18:26,380 [Seth]</p><p>It&#8217;s because the opposite of love isn&#8217;t hate, the opposite of love is indifference. </p><p>01:18:26,380 --&gt; 01:18:26,490 [Seb Krier]</p><p>[laughs] </p><p>01:18:26,490 --&gt; 01:18:30,370 [Seth]</p><p>The fact that you hated it so much was a sign. </p><p>01:18:30,370 --&gt; 01:18:37,149 [Seb Krier]</p><p>Well, I didn&#8217;t hate it that much. I just didn&#8217;t find it particularly interesting or pleasurable, whatever, at the time. But so yeah. </p><p>01:18:38,490 --&gt; 01:18:38,700 [Seb Krier]</p><p>Who knows? </p><p>01:18:38,700 --&gt; 01:18:41,090 [Seth]</p><p>You mentioned reading. You mentioned reading a lot. </p><p>01:18:41,090 --&gt; 01:18:41,290 [Seb Krier]</p><p>Uh-huh. </p><p>01:18:41,290 --&gt; 01:18:43,470 [Seth]</p><p>What&#8217;s the book that&#8217;s been the most influential on you? </p><p>01:18:47,170 --&gt; 01:18:53,470 [Seb Krier]</p><p>I don&#8217;t know. I definitely don&#8217;t have an immediate answer to that question because I think particularly in the last decade, my </p><p>01:18:54,910 --&gt; 01:18:58,510 [Seb Krier]</p><p>consumption habits have been a lot more chaotic. </p><p>01:18:58,510 --&gt; 01:18:58,950 [Seth]</p><p>[chuckles] </p><p>01:18:58,950 --&gt; 01:19:04,630 [Seb Krier]</p><p>So there&#8217;s been a bit of-- I don&#8217;t know. If you asked me the same thing 10 years ago, I would&#8217;ve said something like, oh, &#8220;On Liberty&#8221; by John Stuart Mill or something. </p><p>01:19:05,830 --&gt; 01:19:11,150 [Andrey]</p><p>See, that&#8217;s why-- Isn&#8217;t that Tyler&#8217;s favorite philosopher? </p><p>01:19:11,150 --&gt; 01:19:12,160 [Seb Krier]</p><p>I don&#8217;t know if it is. </p><p>01:19:12,160 --&gt; 01:19:13,030 [Andrey]</p><p>Sorry. Economist, excuse me. </p><p>01:19:13,030 --&gt; 01:19:13,050 [Seth]</p><p>No. </p><p>01:19:13,050 --&gt; 01:19:14,270 [Andrey]</p><p>His favorite economist. Yeah. </p><p>01:19:14,270 --&gt; 01:19:16,750 [Seb Krier]</p><p>Right. Well, once again, great minds. What can I say? </p><p>01:19:17,890 --&gt; 01:19:21,190 [Seb Krier]</p><p>[laughing] But no, I do really like John Stuart Mill. But I </p><p>01:19:22,550 --&gt; 01:19:30,570 [Seb Krier]</p><p>wouldn&#8217;t-- Right now, I think it&#8217;s kind of a weird patchwork of loads of different kind of text sources. </p><p>01:19:30,570 --&gt; 01:19:32,590 [Seth]</p><p>You mentioned Bostrom&#8217;s &#8220;Superintelligence&#8221; during the conversation. </p><p>01:19:32,590 --&gt; 01:19:44,470 [Seb Krier]</p><p>Yeah. I mean, that one was clearly super influential. Because I think that was-- He published in 2014, I think, and in 2016, &#8216;17 was when I was starting to read it. And that got me to actually kind of what is part of the motivation to leave my job as a lawyer at the time. </p><p>01:19:45,870 --&gt; 01:19:54,570 [Seb Krier]</p><p>And see that AI was obviously going to be a big thing, and you had a need for non-technical work as well in that whole thing. So, </p><p>01:19:55,690 --&gt; 01:19:59,790 [Seb Krier]</p><p>that was fairly influential, I think. Yeah, Bostrom&#8217;s &#8220;Superintelligence&#8221; clearly played a role.</p><p>01:20:01,918 --&gt; 01:20:04,138 [Seth]</p><p>Mentioned being loved by Tyler </p><p>01:20:04,138 --&gt; 01:20:06,547 [Seb Krier]</p><p>I didn&#8217;t mention that, you did. [laughs] </p><p>01:20:06,547 --&gt; 01:20:09,838 [Seth]</p><p>Question, why does he love you so much? Do you have kompromat on him? </p><p>01:20:09,838 --&gt; 01:20:14,848 [Seb Krier]</p><p>[laughs] No, I don&#8217;t know. I was really happy that he kind of enjoyed my </p><p>01:20:16,458 --&gt; 01:20:38,538 [Seb Krier]</p><p>hot takes and rambles on Twitter. And I&#8217;ve messaged him to say the same. And funnily enough, I&#8217;ve been a kind of a Marginal Revolution reader since the last 15 years or so, or for a very long time. So it&#8217;s actually pretty weird for me to be like, &#8220;Oh, cool. My name&#8217;s popping up on the blog,&#8221; and stuff. A lot of my friends, in here at least, knew about Marginal Revolution, so they&#8217;re like, &#8220;What are you on about? Who cares?&#8221; And to me- </p><p>01:20:38,538 --&gt; 01:20:39,447 [Andrey]</p><p>[laughing] </p><p>01:20:39,447 --&gt; 01:20:46,158 [Seb Krier]</p><p>... that was pretty cool. Like, yeah, Seb Krier, whatever. And like, all right. But, no, I don&#8217;t know. But I think I </p><p>01:20:48,038 --&gt; 01:20:52,378 [Seb Krier]</p><p>like Tyler and his work a lot. I think Tabarrok as well. I think the whole, </p><p>01:20:54,598 --&gt; 01:20:55,298 [Seb Krier]</p><p>what&#8217;s it called, </p><p>01:20:57,618 --&gt; 01:21:14,238 [Seb Krier]</p><p>that cluster of blogosphere online economics, GMU, and so on, has been fairly influential, I think as well for me. Even 15 years ago, when I was kind of switching from my Francophone world and upbringing to the kind of more Anglophone where discovering the Adam Smith Institute in London or something. </p><p>01:21:15,278 --&gt; 01:21:17,598 [Seb Krier]</p><p>All these different influences, I think, definitely had a </p><p>01:21:18,698 --&gt; 01:21:20,618 [Seb Krier]</p><p>lasting effect in some sense. </p><p>01:21:20,618 --&gt; 01:21:24,058 [Andrey]</p><p>Pretty good time to wrap up. Is there anything else you want </p><p>01:21:25,538 --&gt; 01:21:31,078 [Andrey]</p><p>to direct our audience to? Any underappreciated people, ideas, </p><p>01:21:32,138 --&gt; 01:21:32,738 [Andrey]</p><p>anything else? </p><p>01:21:34,077 --&gt; 01:21:34,217 [Seb Krier]</p><p>[laughing] </p><p>01:21:35,358 --&gt; 01:21:35,798 [Seth]</p><p>[laughing] </p><p>01:21:35,798 --&gt; 01:21:41,178 [Seb Krier]</p><p>That&#8217;s a wide question. I don&#8217;t know. There&#8217;s all sorts of interesting people and ideas. I </p><p>01:21:43,518 --&gt; 01:21:46,418 [Seb Krier]</p><p>don&#8217;t know. Cosmos Institute has been doing some interesting work recently. </p><p>01:21:46,418 --&gt; 01:21:47,358 [Andrey]</p><p>Mm-hmm. </p><p>01:21:47,358 --&gt; 01:22:02,158 [Seb Krier]</p><p>I think, as well as Fire, actually. They&#8217;ve kind of done some stuff together, which I find very valuable. Yeah, I&#8217;d have to think about that one more. I&#8217;d prefer preparing a nicely curated list of underrated- </p><p>01:22:02,158 --&gt; 01:22:02,168 [Andrey]</p><p>Yeah. [laughs] </p><p>01:22:02,168 --&gt; 01:22:04,378 [Seb Krier]</p><p>... blogs, people, and ideas rather than- </p><p>01:22:04,378 --&gt; 01:22:06,718 [Andrey]</p><p>Well, you should tweet that. That could be your next- </p><p>01:22:06,718 --&gt; 01:22:08,348 [Seth]</p><p>There we go. Yeah, I&#8217;ll retweet it </p><p>01:22:08,348 --&gt; 01:22:09,598 [Andrey]</p><p>... super tweet. Yeah. </p><p>01:22:09,598 --&gt; 01:22:13,898 [Seb Krier]</p><p>Yeah, no, for sure. I&#8217;m sure, that&#8217;d be a good idea, actually, because I think there </p><p>01:22:15,278 --&gt; 01:22:31,458 [Seb Krier]</p><p>are a lot of interesting people in the AI world right now doing cool work that isn&#8217;t maybe as discussed or something. There&#8217;s Calcifer Computing did some interesting work recently. Yeah, there&#8217;s a bunch of people here and there, but I&#8217;ll have to think about it to give you a more interesting answer. </p><p>01:22:32,998 --&gt; 01:22:35,147 [Seth]</p><p>Great. We&#8217;ll put them in the show notes if you think about it. </p><p>01:22:35,147 --&gt; 01:22:35,318 [Seb Krier]</p><p>Sounds good. </p><p>01:22:36,778 --&gt; 01:22:44,998 [Seth]</p><p>Seb, it&#8217;s been such an honor having you on. You were an awesome participant and had a lot of patience with our silly questions. [laughs] </p><p>01:22:44,998 --&gt; 01:22:52,738 [Seb Krier]</p><p>No, it&#8217;s been a pleasure. I&#8217;ve been a big fan of the show. So again, just like Marginal Revolution, now I can say I&#8217;ve got the Justified Posterior&#8217;s take. [chuckles] </p><p>01:22:52,738 --&gt; 01:22:53,238 [Seth]</p><p>There we go. </p><p>01:22:53,238 --&gt; 01:22:54,638 [Andrey]</p><p>Nice. Thank you. </p><p>01:22:54,638 --&gt; 01:22:54,958 [Seb Krier]</p><p>Thank you. </p><p>01:22:54,958 --&gt; 01:22:56,198 [Andrey]</p><p>All right. </p><p>01:22:56,278 --&gt; 01:22:56,558 [Seb Krier]</p><p>Cool. </p><p>01:22:56,558 --&gt; 01:22:58,978 [Andrey]</p><p>Keep your posteriors justified. And- </p><p>01:22:58,978 --&gt; 01:23:00,338 [Seb Krier]</p><p>[laughs] </p><p>01:23:00,338 --&gt; 01:23:04,318 [Andrey]</p><p>... like, follow, comment, subscribe, et cetera. </p><p>01:23:04,318 --&gt; 01:23:04,958 [Seb Krier]</p><p>And retweet. </p><p>01:23:04,958 --&gt; 01:23:05,598 [Andrey]</p><p>Thank you. </p><p>01:23:05,598 --&gt; 01:23:09,508 [Seb Krier]</p><p>And retweet. [laughs] </p><p>01:23:09,508 --&gt; 01:23:17,198 [Andrey]</p><p>Of course. [outro music]<br><br><br><br><br><br></p>]]></content:encoded></item><item><title><![CDATA[Improving our posteriors]]></title><description><![CDATA[With your support]]></description><link>https://empiricrafting.substack.com/p/improving-our-posteriors</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/improving-our-posteriors</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Sat, 09 May 2026 17:49:18 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!JrtW!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04c2b84-5e8f-43d1-b922-74edea8b528a_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The coming years will be filled with even more discourse and research about the economics of AI. As Seth likes to say: when writing research papers becomes cheaper, the returns to reading and synthesizing that research go up. We want to be the podcasters you trust to do that reading and synthesis.</p><p>To do that well, we&#8217;re investing in every aspect of our production. That means more editing help, better equipment in our home studios, and occasionally traveling to record with guests in person. We have ambitions for JPP to sound closer to <em>Odd Lots</em> than to two professors on laptops, and getting there costs money.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Justified Posteriors! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>It&#8217;s essential to our mission that our content remains accessible unconditionally, even as we update our priors. (And yes, we&#8217;ll keep making Bayesian puns.)</p><p>For those of you with the means, who think this work is well Justified, we&#8217;re opening up two ways to support the show on Substack:</p><ul><li><p><strong>Subscriber</strong> ($10 a month, $99 a year): a quarterly virtual hangout with us and other paid subscribers.</p></li><li><p><strong>Founding member</strong> ($499 a year suggested): the ability to request a topic or guest (which we&#8217;ll honor subject to editorial discretion), a WhatsApp chat with us and other founding members, a shoutout on the podcast if you&#8217;d like one, and the quarterly Zoom hangout.<br></p></li></ul><p>We&#8217;re also on the lookout for grants and other forms of institutional sponsorship. So if you know of any organization interested in supporting this type of work, please let us know.<br><br>Keep your posteriors justified, </p><p>Andrey and Seth</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Justified Posteriors! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Avi Goldfarb on Prediction Machines, O-Ring Tasks, and How AI is Reshaping Economics]]></title><description><![CDATA[Justified Posteriors Interviews Avi Goldfarb, Rotman Chair in Artificial Intelligence and Healthcare, and Professor of Marketing at the University of Toronto]]></description><link>https://empiricrafting.substack.com/p/avi-goldfarb-on-prediction-machines</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/avi-goldfarb-on-prediction-machines</guid><dc:creator><![CDATA[Seth Benzell]]></dc:creator><pubDate>Mon, 04 May 2026 18:31:41 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/196223446/29b82e710202b64c4aff345b515d5e6b.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>This week, we&#8217;re joined by <strong>Avi Goldfarb</strong>, one of the leading economists of artificial intelligence and co-author of <em><a href="https://www.google.com/search?sca_esv=bc87673d3ad1280f&amp;rlz=1C1GCEA_enUS1209US1209&amp;sxsrf=ANbL-n4AnrHPqrHiXM4Cb3oXCBXAennzbw:1777914708243&amp;q=Prediction+Machines:+The+Simple+Economics+of+Artificial+Intelligence&amp;stick=H4sIAAAAAAAAAONgFuLVT9c3NEwzqCw0q8wrU4Jw003S0pMLsnK1pLKTrfST8vOz9RNLSzLyi6xA7GKF_LycykWsLgFFqSmZySWZ-XkKvonJGZl5qcVWCiEZqQrBmbkFOakKrsn5efm5mclADWkKjkUlmWmZyZmJOQqeeSWpOTmZ6al5yakAebQ6E4MAAAA&amp;sa=X&amp;ved=2ahUKEwjFtIC1kKCUAxWiJkQIHRQiDEoQ9OUBegQIDRAD&amp;biw=2183&amp;bih=1080&amp;dpr=1.75">Prediction Machines</a></em>. Avi has been thinking seriously about AI economics long before the ChatGPT shock, so we asked him what he thinks the earlier framework got right, what it missed, and how economists should update their beliefs now.</p><p>The conversation starts with Avi&#8217;s seminal book, <em><strong>Prediction Machines</strong>,</em> and the idea that AI is best understood as a drop in the cost of prediction, which is a complement to judgement. We ask what that book got right and what it got wrong. From there, we interrogate Avi on the murky boundary between <strong>prediction and judgment</strong>. We had investigated the idea that maybe judgment and prediction were not as separable as economists like to believe in our&nbsp;<a href="https://empiricrafting.substack.com/p/alex-imas-demand-collapse-bargaining">episode with Alex Imas</a>. </p><p>We also ask whether, if AI gets better at predicting human judgment, whether judgment disappears, or do humans simply &#8220;move up the stack&#8221;? And what is taste exactly? Avi says that sometimes judgment becomes predictable, but humans still matter because goals, values, organizational politics, and &#8220;what matters&#8221; are often implicit, unstable, and hard to codify. Avi shoots down Seth&#8217;s galaxy-brain suggestion that correct ontology choice &#8212; i.e., deciding what sort of <a href="https://en.wikipedia.org/wiki/Natural_kind">natural kind</a> a thing is, or understanding when <a href="https://theculture.fandom.com/wiki/Outside_Context_Problem">a problem is out of context</a> &#8212; is a uniquely separate skill (taste?), calling it just another prediction error. But he does concede that deciding how much to prepare for &#8216;Black Swan&#8217; events may be an enduring role for judgment. </p><p>We then revisit the <strong>O-ring theory of production</strong> and what it means for automation. We had covered Kremer&#8217;s article in a recent episode (<a href="https://empiricrafting.substack.com/p/weak-links-strong-predictions-kremers">see here</a>) <a href="https://www.nber.org/papers/w34639">and asked Avi about his new paper, riffing on the idea at the worker level</a>. Avi says that if tasks inside jobs are complements rather than substitutes, then automating one task may make the remaining human tasks more valuable, not less. Avi explains why workers may reallocate attention toward the tasks machines cannot yet perform (shooting down Seth&#8217;s suggestion that this is actually difficult in most jobs).</p><p>The discussion also covers whether AI will augment or replace workers, whether governments should try to steer AI toward human-complementing technologies, and why that distinction may be much harder to define in practice than it sounds. Avi agrees with Andrey and Seth&#8217;s pushback on &#8220;augmentation good, automation bad&#8221; framings (e.g. friend of the show Erik Brynjolfsson&#8217;s &#8220;<a href="https://digitaleconomy.stanford.edu/news/the-turing-trap-the-promise-peril-of-human-like-artificial-intelligence/">Turing Trap</a>&#8221;).</p><p>Then we get into forecasts: how fast AI capabilities might advance by 2030, what that means for GDP growth by 2050, whether GDP is still the right thing to forecast, and why even very powerful AI may run into bottlenecks in the real economy. We use the paper <a href="http://Forecasting the Economic Effects of AI">Forecasting the Economic Effects of AI</a> to ground the discussion. <br><br>We close with lightning-round topics including AI&#8217;s impact on centralization, privacy/de-anonymization, peer review, and whether academic journals still serve the function they once did.</p><h2>Papers, books, and ideas mentioned</h2><ul><li><p>Avi Goldfarb&#8217;s seminal book with Ajay Agrawal, and Joshua Gans &#8212; <em><a href="https://www.google.com/search?sca_esv=bc87673d3ad1280f&amp;rlz=1C1GCEA_enUS1209US1209&amp;sxsrf=ANbL-n4AnrHPqrHiXM4Cb3oXCBXAennzbw:1777914708243&amp;q=Prediction+Machines:+The+Simple+Economics+of+Artificial+Intelligence&amp;stick=H4sIAAAAAAAAAONgFuLVT9c3NEwzqCw0q8wrU4Jw003S0pMLsnK1pLKTrfST8vOz9RNLSzLyi6xA7GKF_LycykWsLgFFqSmZySWZ-XkKvonJGZl5qcVWCiEZqQrBmbkFOakKrsn5efm5mclADWkKjkUlmWmZyZmJOQqeeSWpOTmZ6al5yakAebQ6E4MAAAA&amp;sa=X&amp;ved=2ahUKEwjFtIC1kKCUAxWiJkQIHRQiDEoQ9OUBegQIDRAD&amp;biw=2183&amp;bih=1080&amp;dpr=1.75#">Prediction Machines</a></em></p></li><li><p>A black swan is the occurrence of a wildly unpredictable event, which Nassim Taleb argues, <a href="https://en.wikipedia.org/wiki/The_Black_Swan:_The_Impact_of_the_Highly_Improbable">in his book by the same name</a>, is more common than we like to think</p></li><li><p><a href="https://en.wikipedia.org/wiki/New_riddle_of_induction">A New Riddle of Induction</a> &#8212; by Nelson Goodman &#8212; is the source of Seth&#8217;s thought experiment about &#8220;bleen&#8221;, a color which is green until 2029 and blue after, and green</p></li><li><p>Michael Kremer &#8212; &#8220;The O-Ring Theory of Economic Development&#8221;, covered in this episode of the pod: </p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;ae4f5180-e374-496f-8384-3d2833f8af16&quot;,&quot;caption&quot;:&quot;This week, instead of reviewing a recent paper on AI, we go back to a 1993 classic: Michael Kremer&#8217;s &#8220;The O-Ring Theory of Economic Development.&#8221; It&#8217;s one of those papers that feels larger than its formal model. The setup is extremely simple: production consists of many tasks, and output depends on all of them going right. But the implications are broad&#8230;&quot;,&quot;cta&quot;:&quot;Watch now&quot;,&quot;showBylines&quot;:true,&quot;showDescription&quot;:true,&quot;showImage&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The classic model that shows why AI exposure could increase wages&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:191755003,&quot;name&quot;:&quot;Andrey Fradkin&quot;,&quot;bio&quot;:&quot;Professor writing about AI, digital technology, marketing, economics, and academia. Also, some personal introspection along the way.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!qqBF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb729e424-5fcf-4691-886d-a65500401344_1175x1177.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null},{&quot;id&quot;:3215096,&quot;name&quot;:&quot;Seth Benzell&quot;,&quot;bio&quot;:&quot;Co-Host of Justified Posteriors Podcast https://empiricrafting.substack.com/podcast&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F1351ec23-f5f1-4613-8844-04c8f814335b_1030x687.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2026-04-20T17:16:51.421Z&quot;,&quot;cover_image&quot;:&quot;https://substack-video.s3.amazonaws.com/video_upload/post/194643954/08e62fe4-f9a9-47c8-b3d7-2166201bf23e/transcoded-00001.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://empiricrafting.substack.com/p/weak-links-strong-predictions-kremers&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:&quot;08e62fe4-f9a9-47c8-b3d7-2166201bf23e&quot;,&quot;id&quot;:194643954,&quot;type&quot;:&quot;podcast&quot;,&quot;reaction_count&quot;:1,&quot;comment_count&quot;:0,&quot;publication_id&quot;:2684979,&quot;publication_name&quot;:&quot;Justified Posteriors&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!JrtW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04c2b84-5e8f-43d1-b922-74edea8b528a_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div></li><li><p>Daron Acemoglu and Pascual Restrepo&#8217;s task-based models of automation, especially &#8220;<a href="https://www.aeaweb.org/articles?id=10.1257/aer.20160696">The Race Between Man and Machine</a>.&#8221;</p></li><li><p>Avi mentions David Autor and Ben Thompson on automation and skill scarcity when Seth comments that you may not be able to reallocate effort between tasks as a worker, including their paper &#8220;<a href="https://www.nber.org/papers/w33941">Expertise</a>&#8221;</p></li><li><p>Erik Brynjolfsson in the &#8220;<a href="https://digitaleconomy.stanford.edu/news/the-turing-trap-the-promise-peril-of-human-like-artificial-intelligence/">Turing Trap</a>&#8221; argues that automation technologies are less good than augmenting technology</p></li><li><p>Eric Topol&#8217;s book on AI in medicine &#8212; <em><a href="https://www.amazon.com/Deep-Medicine-Artificial-Intelligence-Healthcare/dp/1541644638">Deep Medicine</a></em></p></li><li><p>John Markoff &#8212; <em><a href="https://www.amazon.com/Machines-Loving-Grace-Common-Between/dp/0062266683">Machines of Loving Grace</a> &#8212; </em>The source of a title for <a href="https://www.darioamodei.com/essay/machines-of-loving-grace">an influential essay of the same name</a> by Dario of Anthropic. Both draw from an earlier poem about a Sci Fi utopia: https://allpoetry.com/All-Watched-Over-By-Machines-Of-Loving-Grace </p></li><li><p>Korinek and Stiglitz on AI, capital, and taxation; Lockwood and Korinek on optimal taxation and automation &#8212; We covered these topics at the end of our episode with Basil Halperin in the context of &#8220;Tax Policy at the End of History&#8221; around the 1:19:00 mark</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;a7a60b2e-43b5-4545-9173-521abbeb5b28&quot;,&quot;caption&quot;:&quot;In this week&#8217;s episode of Justified Posteriors, we interview TAI expert and friend of the show Basil Halperin of the University of Virginia. There Basil is doing some of the most fascinating work on the economics of TAI with Anton Korinek and other leading researchers.&quot;,&quot;cta&quot;:&quot;Watch now&quot;,&quot;showBylines&quot;:true,&quot;showDescription&quot;:true,&quot;showImage&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Basil Halperin: Leading Indicators for TAI, Conditions for the Singularity, and Tax Policy at the End of History&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3215096,&quot;name&quot;:&quot;Seth Benzell&quot;,&quot;bio&quot;:&quot;Co-Host of Justified Posteriors Podcast https://empiricrafting.substack.com/podcast&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F1351ec23-f5f1-4613-8844-04c8f814335b_1030x687.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null},{&quot;id&quot;:191755003,&quot;name&quot;:&quot;Andrey Fradkin&quot;,&quot;bio&quot;:&quot;Professor writing about AI, digital technology, marketing, economics, and academia. Also, some personal introspection along the way.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!qqBF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb729e424-5fcf-4691-886d-a65500401344_1175x1177.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null},{&quot;id&quot;:38068,&quot;name&quot;:&quot;Basil Halperin&quot;,&quot;bio&quot;:&quot;https://basilhalperin.com&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e010f313-fe56-4a75-98e3-c264aa210f1f_2465x2465.jpeg&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://basilhalperin.substack.com/subscribe?&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://basilhalperin.substack.com&quot;,&quot;primaryPublicationName&quot;:&quot;Basil Halperin&quot;,&quot;primaryPublicationId&quot;:28833}],&quot;post_date&quot;:&quot;2026-02-09T19:55:24.080Z&quot;,&quot;cover_image&quot;:&quot;https://substack-video.s3.amazonaws.com/video_upload/post/187424007/ba8f0ae5-6b49-4f14-9219-e8518044eeef/transcoded-00001.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://empiricrafting.substack.com/p/basil-halperin-leading-indicators&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:&quot;ba8f0ae5-6b49-4f14-9219-e8518044eeef&quot;,&quot;id&quot;:187424007,&quot;type&quot;:&quot;podcast&quot;,&quot;reaction_count&quot;:9,&quot;comment_count&quot;:0,&quot;publication_id&quot;:2684979,&quot;publication_name&quot;:&quot;Justified Posteriors&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!JrtW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04c2b84-5e8f-43d1-b922-74edea8b528a_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div></li><li><p>We talk about de-anonymization, and Avi <a href="https://arxiv.org/abs/2409.15948">references this provocative paper </a> from Florian Ederer </p></li><li><p>Avi brings up Bob Gordon, and his argument, famously in the book <a href="https://www.amazon.com/Rise-Fall-American-Growth-Princeton/dp/0691147728">The Rise and Fall of American Growth</a>, that the early 20th century was incredibly important for increases in US living standards, which digital technologies have not lived up to</p></li><li><p><a href="https://www.nber.org/papers/w30920">Digital Hermits</a>, by Jeanine Mikl&#243;s-Thal, Avi Goldfarb, Avery M. Haviv &amp; Catherine Tucker, is a paper by Avi thinking about how information spillovers, now from AI, drive some people to be more private than they would otherwise be. In our conversation, we speculate AI will make these hermits even more &#8220;hermetic&#8221;</p></li><li><p>We discuss this paper on new forecasts of AI and its impact on economic growth: <a href="http://Forecasting the Economic Effects of AI">Forecasting the Economic Effects of A</a>I </p></li><li><p>Refine and AI-assisted peer review are discussed in this pod. <a href="https://empiricrafting.substack.com/p/ben-golub-ai-referees-social-learning">For more, see our episode with Ben Golub, founder of Refine</a>. </p><p></p></li></ul><p><em><strong>This episode is sponsored by <a href="https://www.reveliolabs.com/">Revelio Labs</a> &#8212; a great source of labor economics data for academics and firms. Now available on WRDS.</strong></em></p><p></p><p><em><strong>Join our Discord community at this link: https://discord.gg/w3GSapx2d </strong></em></p><div><hr></div><h1>Transcript</h1><h3>Introduction [00:00]</h3><p>Seth: Welcome to the Justified Posteriors podcast, the podcast that updates beliefs about the economics of AI and technology. I&#8217;m Seth Benzell, your loyal non-fiction machine, coming to you from Chapman University in sunny Southern California.</p><p>Andrey: And I&#8217;m Andrey Fradkin, coming to you from San Francisco, California. And we are very happy that Justified Posteriors is sponsored by the fine folks at Revelio Labs. And we&#8217;re very delighted to have Avi Goldfarb, who is a leading thinker in the field of AI economics and has also been a personal mentor on the show. We&#8217;re very excited to hear his thoughts on a variety of topics. Welcome, Avi.</p><p>Avi: Thanks so much and thanks for having me on the show and looking forward to it.</p><p>Andrey: All right, let&#8217;s get started. I have in front of me this book that you might remember writing at some point.</p><p>Seth: Gaze into the soul of the man in the bookstore.</p><h3>What Did Prediction Machines Get Wrong? [01:12]</h3><p>Andrey: Now, I just think it&#8217;s a good cover. And I had to check: when was it released? It was released in 2018. And as I was skimming through it, you know, a lot of interesting points made there are still things that we&#8217;re talking about today, almost 10 years after it was released. So let me start off with the following question. And then maybe we can work backwards more into the ideas in the book. But what do you think prediction machines got wrong?</p><p>Avi: I think prediction may... I&#8217;ll start with a hard question.</p><p>Seth: No softballs on Justified Posteriors.</p><p>Avi: So on the specifics of which industries and when, to the extent we tried, at least I did not anticipate how quickly language and coding would become prediction problems. And when we talk about disruption and industry disruption, a lot of the examples are things like driving, and we talk about radiology. And we still have plenty of radiologists around. Self-driving cars and trucks. seem like they&#8217;re now imminent, but it certainly took a lot longer than we expected back in 2018.</p><p>Andrey: So is it a fair assessment to say that the large language models, even in 2018, weren&#8217;t on your radar? I guess they weren&#8217;t on many people&#8217;s radar.</p><h3>The Three Ideas of Prediction Machines [02:45]</h3><p>Avi: Not really. We have some discussion of machine translation. So that&#8217;s in there as a huge potential use case, but the arrival of ChatGPT and how it sort of changed how we interact with machines and how we think about AI was not really there. Another way to put it is prediction machines had three ideas. So idea number one is AI can be framed as a drop in the cost of prediction. So prediction. As in filling in missing information, statistical prediction is getting better, faster and cheaper. Idea number two is that when something gets cheap, you start using it for unanticipated uses. So when arithmetic got cheap, it wasn&#8217;t just that we use computers for accounting. We started to use computers for all sorts of things that we never used to think of as arithmetic problems like imaging and mail and music. And then idea number three is what are the complements to machine prediction? And we talked about data and judgment. The book, and certainly our attention to the book in the first three or four years after it was published, was on idea number one and idea number three. So identify prediction problems in your organization, and then think about what data you need to make those predictions better, and try to understand what matters to you in terms of judgment. And that second point kind of got lost. But in the last four years, it&#8217;s become clear to me is that that second point was maybe the biggest one, which is this tool, which still under the hood is computational statistics, enables us to find all sorts of applications for computational stats that we didn&#8217;t really imagine before. Judgment and data are still gonna be useful, but that phase one, that step one, that first idea of identifying prediction problems, that&#8217;s not really how we think about using AI today. And in some sense, that... was a missing emphasis throughout the book and throughout how we thought about that book, or at least how I thought about that book for the first few years.</p><h3>Does Proprietary Data Still Matter? [04:59]</h3><p>Andrey: Very interesting. You mentioned one kind of underlying idea there, whereas you should identify the data that&#8217;s going to make your predictions better. Do you think to what extent is that now true, given that your foundation models seemingly can be very smart without having any proprietary data?</p><p>Avi: Data is still central to the use of AI, the building of the models. In building a foundation model that, at least in the pre-training stage, that data is essentially interchangeable. You just need more. It doesn&#8217;t really matter what. To build a structure of language, and then you can move from there. On later stages of using that model, at least the AI companies seem to think data is valuable to the model companies. And then in terms of use cases within organizations, that&#8217;s more a matter of whether you want to delegate sort of the judgment of how to use the model and what the model should output to the vendor or whether it&#8217;s something that you need to build in-house. And depending on the organization, some of them are very happy to delegate to the foundation model provider and some of them think they need to fine tune in-house.</p><p>Andrey: Well, so there are kind of two little sub ideas in there. One is you have choice. You can fine tune a worse model with your own data. And maybe that will outperform as a frontier model. I think for many cases so far, that&#8217;s been a bad bet. But there&#8217;s a different idea here. Use whatever model you want, but you design the evaluation. And then you optimize via the prompting strategy or scaffolding towards that. that benchmark for your own use case. Is designing a benchmark proprietary? Should we think of that as a proprietary data that an organization has?</p><p>Seth: Is that the judgment part in the judgment prediction distinction?</p><h3>Vendor Choice as Delegated Judgment [07:01]</h3><p>Avi: Yeah, I think there&#8217;s a bunch of judgment. there&#8217;s judgment number one: which which vendor do you use? Because you&#8217;re delegating a lot of values as in like, knowing what matters to the maker of the model. And then there is judgment in how heavy-handed do you want to be to make the outputs fit your needs? And then there&#8217;s judgment on, okay, you&#8217;ve decided to be heavy-handed. What exactly does that mean? And is it, guardrails or is it really making sure that the output from the prompts every time fits your organization&#8217;s values or what matters to you?</p><p>Andrey: Have you had an opportunity to kind of advise companies on this judgment decision? Like what has your experience been in these situations?</p><p>Avi: At a high level, yes. I don&#8217;t want to exaggerate my experience, but the things I emphasize and the things that seem to resonate are, one, what I just said, which is recognizing when you choose a vendor, you are delegating your understanding of what matters to that vendor. And then two, that means before you start thinking about choosing a vendor, you need to know what matters to you. So think through, you know, before you go talk to somebody, you should know what your KPIs are and what outcomes you want to see. Because otherwise, once you talk to them, they&#8217;ll convince you that their outcomes are the ones you want to see. and so it&#8217;s this, I talked to, someone who is running an AI at a... Let&#8217;s call it a big healthcare organization. And his job used to be, like five years ago, his job was building tools. He&#8217;s like, my job isn&#8217;t building tools anymore. There are all sorts of vendors building AI tools for healthcare. Okay. And what my job is now is every week, 20 or more people come in and say, I have a solution for you. And he chooses one or two of them.</p><p>Seth: Kind of seems like a good job for an AI.</p><p>Avi: Well, maybe, maybe not. But he understands the individuals, the people, guess, in theory that could happen, but the individuals in his organization, what they&#8217;re willing to accept, what they don&#8217;t. Which decisions they like to have control over, which ones they&#8217;re comfortable delegating. For the ones they like to have control over, he has a sense of what might be negotiable and what might not be. He knows where the power structures are and what things might change. Therefore face resistance from people who have the power to resist. He knows those things that might not face resistance from people because the people don&#8217;t have power to resist, but they&#8217;re going to be really, really unhappy about it. It&#8217;s going to bad for the organization. And so there&#8217;s all these things that I guess in principle an AI could do, but we&#8217;re a long way away, I think, from that.</p><h3>Can Prediction Eat Judgment? [10:16]</h3><p>Seth: So let me let me just push down that line a little bit longer is the way to think about this sort of prediction and judgment distinction is is that like as the models get better the Prediction is like eating more and more of the stack right? You know we give the information about our organizational structure to the AI and then maybe it can make a couple more of these decisions for us And you could either imagine that asymptoting to, you know, in 20 years, AI does everything, or you could imagine there are higher and higher levels of judgment that humans keep on getting promoted to. Are one of those two ways the way that you think about it?</p><p>Avi: Yes, Andrea Pratt has a note in our first Economics of AI volume that covers that exact idea. I think actually it&#8217;s a comment on our paper or the model behind the Prediction Machines book. it&#8217;s, well, in principle, with enough data, you can learn to predict judgment. And so you move up the stack. So absolutely. There are some limits to that. There&#8217;s limits on you may never get enough data. on that kind of judgment. Judgment can change over time. To the extent that ultimately you&#8217;re trying to predict your tastes, then they can change over time. And there&#8217;s some limits on causal inference and the impossibility of seeing the counterfactual, which creates a need for a model.</p><p>Andrey: But humans have that problem too.</p><p>Avi: Yeah, yeah, yeah, no, I agree. But in the need for a model. So then the question is, well, how come LLMs and some of these models seem to be pretty good at doing that? And in the process of prediction, I suspect -- though I don&#8217;t know rigorous work on this, so I&#8217;m being cautious --</p><p>Seth: That&#8217;s what this podcast is for.</p><p>Avi: this is building some kind of model of the world that is embedded in the training data, like the language.</p><h3>Taste, Values, and Human Wants [12:16]</h3><p>Seth: So let&#8217;s go back to the one of the examples you gave, which is this idea of taste, right? Because I&#8217;ve had so many conversations with other economists about this idea that, well, taste will save us as a scientist, right? Because the AI won&#8217;t have taste. I have some ideas about what taste might mean, but can you be a little bit more precise about what you think taste means and why it&#8217;s something worth saving?</p><p>Avi: So, okay, let&#8217;s operate under the assumption that whatever we want to call the machines, their goals are to help humans. Okay, not all humans. And we can debate about which humans, but like ultimately.</p><p>Seth: Well, the Anthropic Constitution says, you know, safety first, the idealized anthropic researcher, then the guy that then then like virtue and then like the customer in some order like that.</p><p>Avi: I&#8217;m gonna, all that matters for the point I&#8217;m about to make is that it&#8217;s not about the machine&#8217;s needs. So in that case, at the very limit, humans have wants and needs and those wants and needs, the machines need us, our judgment to know what our wants and needs are.</p><p>Seth: So taste literally as in, this tastes good to me, I want more of this food.</p><p>Avi: That would be one specific example of it. Absolutely. Okay. Now, I think we&#8217;re a long way from that limit, but that&#8217;s what I would argue the limit is.</p><p>Seth: That&#8217;s the Bailey, right? So now let&#8217;s go out to the motte.</p><p>Avi: So then it&#8217;s more like, okay, what matters to a set of humans, a group, an organization? What can we codify? If you can codify it and say, like, this is your goal, you&#8217;re not quite at that limit, but pretty close to it, then the machines can try to optimize on a goal. Goals have so much that are implicit. And so the machine would have to be able to infer the implicit part. Maybe it can, maybe it can&#8217;t, I don&#8217;t know. And then you can sort of ratchet back all the way to where we are now, which is you still need to tell your agent what you want. You still need to check on it every once in a while and guide it in the right direction. Prompting still has a role.</p><h3>Ontology, Umbrellas, and Context Shifts [14:45]</h3><p>Seth: Here&#8217;s another way of thinking about taste. And I&#8217;m curious whether you think this is in one of the categories you already listed or a new idea or you wouldn&#8217;t call this taste, which has to do something like with the idea of your ontology that is kind of built into the system, right? It&#8217;s your way of sort of dividing the world up into parts and maybe a good tastemaker or a good judger might have a more refined or more adaptable ontology. than the prediction machine. So I&#8217;ll give you an example of what I mean. have a couple of examples in mind, but one example I have is, you know, historically in the data, it&#8217;s always been the case that if lots of people show up with umbrellas, it means that you can predict that it&#8217;s raining. But then we have these Hong Kong protests and in the Hong Kong protests, they&#8217;re the umbrella protests and people bring umbrellas to show that they&#8217;re protesting, right? And it seems like a human would do better at adapting to like the completely new context for why you would need umbrellas than, you know, a pre-trained system that was only on historical data. So you can say that that&#8217;s like a context switch problem. Is that one of your ideas of taste or is that more of a judgment that&#8217;s not a taste?</p><p>Avi: Honestly, that seems like a prediction failure to me.</p><p>Seth: Right. That&#8217;s just we don&#8217;t have data on the context that we&#8217;ve moved to. The job is to understand when the context has changed, maybe.</p><p>Avi: The judgment, I would say the judgment is like, what&#8217;s the consequential decision that&#8217;s going to be a function of, look outside and I see a lot of people in umbrellas. Yeah. What am going to do? And.</p><p>Seth: You know, I should water my plants. Should I water my plants?</p><p>Avi: No, I water my plants. Okay. So I look outside, a lot of people are carrying umbrellas and I think, no, I don&#8217;t need to water my plants. Okay. And then it turns out it&#8217;s a protest. It&#8217;s a little bit of weird context, but going with your example.</p><p>Seth: It&#8217;s gotta be a weird context. That&#8217;s the reason that the AI is going to make the wrong decision because it&#8217;s out of context.</p><p>Avi: the, the automated sprinkler doesn&#8217;t go on and, my plants die. Right. Okay. So, the judgment is, is it then worth it for me to invest more either in my prediction technology or to actually go outside and look and to see if there&#8217;s rain, to overcome that downside. So what you described as an error in prediction, there&#8217;s ways to reduce that error in prediction. The judgment is whether it&#8217;s worth the bother to reduce that error in prediction or to create some kind of insurance system where you would say, you know what, I&#8217;m gonna water the sprinklers. I&#8217;m just gonna run the sprinklers anyway. That&#8217;s how I think about judgment. It&#8217;s sort of what goes wrong when your prediction fails or it&#8217;s one important aspect of judgment.</p><p>Seth: Sorry, can I give you an even more abstract?</p><p>Andrey: Wait, wait, wait. No. I actually disagree with the premise of the example in many ways. I think a reasoning model would be able to handle the situation, especially with internet access, substantially better than many humans already, because you can call an API to get the weather forecast if you&#8217;re unsure. You can read the news. You can use reasoning traces. There&#8217;s this kind of implicit assumption in your question that like, we&#8217;re just using a raw pre-trained model and like asking it to like, if you, like, if you had a gun to your head, what would you do? You know, and not use any reasoning.</p><p>Seth: Okay, but I can tell you a story, right? The weather API was always reliable in the data, but now there&#8217;s been a government takeover and I don&#8217;t trust the new government and you shouldn&#8217;t trust the API weather data anymore, right?</p><p>Avi: So Andrey, I actually agree with, like, that seems unrealistic, but I think the idea is what you&#8217;re describing is how many resources you wanna put toward making it right, and I would view that as judgment.</p><p>Andrey: But I guess the model has that judgment, maybe. Already. Already. Yeah, that&#8217;s kind of goes out like the stack of when judgment problems become prediction problems, I guess.</p><p>Avi: But then there&#8217;s going to be... well, there&#8217;s going to be some places where the model is imperfect. Okay. Yes. Still a prediction tool. It might be better than human. Actually, it doesn&#8217;t matter if it&#8217;s better than human. But to the extent the model is imperfect, how do you want to behave? Like, let&#8217;s say the model is right 99.99 % of the time. Does your behavior change at that versus 99.9999 % of the time, even if the human benchmark is 50? And that ultimately is going to is going to be essential to judgment. We do this with self-driving cars. The models aren&#8217;t perfect, but they&#8217;re better than human. And yet, I still drove to work today, partly because that&#8217;s the law in Canada.</p><p>Andrey: Do you think there&#8217;s hope? I mean, maybe this is kind of too much in the weeds versus the abstract idea, but sometimes people implicitly assume that they&#8217;re anchoring on the current technology where there&#8217;s an instance of an LMM that does something. But we might be able to design systems of LLMs that are interacting with each other to cover some of these. shortcomings that we can think of. I mean, at a conceptual level, maybe it&#8217;s the same thing anyway...</p><p>Avi: So maybe another way to think through these trade-offs is to talk about whose judgment, okay? Which is Seth&#8217;s example was about, or my example was about my judgment, know, the individual&#8217;s judgment and should they listen or not. Andre, I think what you&#8217;re describing is the model builder&#8217;s judgment on which things is it worth investing in making the model better and when is it okay not? Like they have choices on sort of rate and direction. And those require some understanding of what they think is going to matter in terms of the use cases, the model. And on that, yes, there is a limit where a small number of players have extraordinary power because AI scales their judgment because they embedded into the models. But I do think. then there is still a human or set of humans responsible. It&#8217;s not like, the AI did it. It&#8217;s humans making those kinds of decisions. And I understand, like, at the limit, that actually gets quite nuanced, especially once we have models with continuous learning. But that&#8217;s how I think about that problem.</p><h3>Grue, Bleen, and Black Swans [21:41]</h3><p>Seth: All right Andre, can I ask my riddle of induction question? </p><p>Andrey: Do you need me to induce it?</p><p>Seth: You already know where I&#8217;m going with this. I&#8217;m curious if Avi knows where I&#8217;m going with this, but this goes back to the question of maybe where taste comes in is having a better or a more human ontology than the machine. All right. Have you ever heard of grue and bleen, Avi? These are colors that are different than blue and green. No? Okay, awesome. So briefly, we have this conceptual category, which is a thing that&#8217;s green. And a thing that&#8217;s green, we think that if you don&#8217;t do anything to it, it should be green indefinitely, right?</p><p>Avi: Okay, yeah.</p><p>Seth: All right. There&#8217;s this other thing that&#8217;s called <em>bleen</em> and things that are bleen are green until the year 2029. And after 2029, they turn blue. Right. Here&#8217;s the issue is that bleen and green things are observationally identical until 2029. Right. Yeah. So an inhuman, bad at forming natural kinds, ontology of an AI might decide that something is bleen instead of thinking it&#8217;s green. Right? And a human&#8217;s role might be to say, no, that&#8217;s a bad definition of a natural kind. That&#8217;s a bad ontology. And that would be a role of either taste or judgment. Do you buy that? Is this way too abstract?</p><p>Avi: I think what you&#8217;re describing is a failure of prediction. I don&#8217;t think that&#8217;s taste or judgment. The taste or judgment is if you or a machine aren&#8217;t sure if something is bleen or green, do you care?</p><p>Seth: Okay. Well here&#8217;s the thing, you didn&#8217;t even have the concept of bleen until I told you about bleen, right?</p><p>Avi: So this is just the difference, I think, between known unknowns and unknown unknowns. So in Prediction Machines, we have a whole chapter framed on Rumsfeld and his discussion of known unknowns and unknown unknowns. Look, sometimes you don&#8217;t have a prior on it, and it&#8217;s an unknown unknown. That doesn&#8217;t mean that it&#8217;s not a prediction failure. It was just off the support of your data, and you didn&#8217;t know what to do about it. And I think that happens all the time.</p><p>Seth: Sometimes you find a black swan.</p><p>Avi: Yes, exactly. And so like, there might be places where humans are better at that kind of prediction than machines. There might be places where both humans and machines are really awful at that kind of prediction. And if that&#8217;s the case, then you want to have robust systems to anticipate those kinds of things. And that&#8217;s where judgment comes in. Like, if you&#8217;re wrong about the existence of a black swan, you know, does that change anybody&#8217;s behavior? I think the answer is no, because black swans and white swans aren&#8217;t actually that different from each other. But if there were other examples, like financial crises, where he uses the metaphor of the black swan, then absolutely there are meaningful differences. And you should</p><p>Andrey: Financial crises.</p><p>Seth: All right, so you&#8217;re saying that jobs that will survive TAI number 7 should be Black Swan, anticipator.</p><p>Andrey: Not an anticipator. Actually Seth, this is actually kind of the key point. The point is, anticipator of whether Black Swan affects your utility enough that you should plan for it.</p><h3>O-Ring Complementarities and Automation [25:22]</h3><p>Andrey: I think next it will be awesome to talk about automation and some O-rings. Actually, the previous episode we did, we reread Michael Kremer&#8217;s classic O-ring paper because it&#8217;s been so inspirational for so many. It&#8217;s a great paper. They don&#8217;t write them like this anymore.</p><p>Seth: It&#8217;s so fun to read. They don&#8217;t like to do macro like that anymore, unfortunately.</p><p>Andrey: So we were wondering, so you have your own spin on the O-Ring paper. Maybe you&#8217;ll tell, you can tell us a little bit about that.</p><p>Avi: Paper makes a pretty simple point. There may be two simple points. First one is that when you think about tasks within a job, they&#8217;re not interchangeable and substitutable. So it&#8217;s not just like, okay, a machine comes in and takes tasks. Sometimes tasks are complements. Now that isn&#8217;t, I&#8217;m gonna a little cautious. We talk about that in our O-Ring automation paper. It&#8217;s not necessarily a new idea. It&#8217;s implicit in the constant elasticity models. you can have a Leontief production function.</p><p>Seth: We&#8217;re talking about the Daron-style task-based models. But if you actually read the papers everything immediately goes Cobb-Douglas. It&#8217;s always immediately weird. All the tasks are substitutes and then Cobb-Douglas over all the tasks.</p><p>Avi: Yes, but it&#8217;s possible to, within the canonical model, to have that. So our point number one is tasks can be complements. And I just wanted to be cautious because I don&#8217;t want to claim that that&#8217;s necessarily our idea. But it&#8217;s an emphasis maybe that the existing literature hasn&#8217;t had. And then the second is, well, once you have tasks that are complements, if a machine starts doing some of those tasks, human can move their attention to the other tasks that are not yet automated. And when that happens, the human gets better at those tasks, which then makes automation of those remaining tasks even harder because the machine has to be better than now the human who&#8217;s spending all of their time focused on the remaining few tasks.</p><h3>Skills Versus Tasks [27:40]</h3><p>Seth: So let&#8217;s pause right there because I have a couple of questions right there immediately. So one way to think about automating part of your job is you&#8217;ve automated part of your job and now I can reallocate to the stuff that&#8217;s not automated. also another way to think about tasks within a job that are complementary is to think about them as sort of like innate skills or abilities. So think about the job of being a basketball player. The job of being a basketball player involves being tall and being agile. If you somehow automated being tall, I can&#8217;t reallocate my skill points into being agile, right? If we think about my performance as more as a combination of my skills, then automating part of it or taking part of it away, it&#8217;s not necessarily obvious to me that I can get better at the thing that&#8217;s not automated.</p><p>Avi: The way we, okay, so first the way the literature usually thinks about jobs is generally at the task level, not the skill level. Okay. So a worker does a bunch of tasks. Okay. Those tasks require skills, but the worker does a bunch of tasks and the A machine comes along and can do the task and not the skill. So I&#8217;m not sure what it means for a machine to be tall. What it means for a machine to slam down.</p><p>Seth: Well, let&#8217;s think about being a doctor. Let&#8217;s assume you might imagine being a doctor involves bedside manner and judgment about and diagnosis right it&#8217;s not clear to me that if you automate my diagnosis I can reallocate more effort into bedside manner some people are just level five at that and some people are level one at that</p><h3>AI Doctors and the Future of Medical Work [29:25]</h3><p>Avi: It is obvious to me that there&#8217;s a bunch of tasks in a doctor&#8217;s workflow. Some of them involve diagnosis. Some of them involve talking to patients and making the patients feel better. And within those, there are skills in being good at filling in the missing information of what&#8217;s wrong with the patient and skills of making the patient feel comfortable. And actually, for some of those tasks, you might even need both. A machine comes along and automates the diagnosis skills. Okay. That means medical professionals are going to be spending more time on the other skills. This is actually an Eric Topol&#8217;s deep medicine book. I&#8217;m not sure if you&#8217;ve read it. It&#8217;s, it&#8217;s like a pre-ChatGPT, but like how AI might transform medicine. And that is his core thesis. The idea is that AI is going to make healthcare human again, because doctors are going to spend less time looking at screens and focused on diagnosis and more time. interacting with patients and making patients feel better. So in that sense, we get the automation of the diagnosis task and some of the computer tasks that should exactly lead to reallocation toward the human part. But then you brought up something else, which is, do our current doctors, if they spend that much more time interacting with patients, are they the right people for this job? Or alternatively, could we have a different set of medical professionals who we could train because now the machine can do some of those tasks who would be way better than our current doctors at the remaining tasks? I suspect if the machines get good enough at diagnosis and identifying appropriate treatments, there is an enormous opportunity for a new kind of medical professional who is focused on essentially interacting with patients.</p><p>Seth: Yeah, so you&#8217;re making the occupational reorganization point and that&#8217;s that&#8217;s obviously essential and we&#8217;re going come back to that in the second. Yeah, I just I&#8217;m just pointing out that maybe maybe my example of basketball wasn&#8217;t so good. Maybe my medical example wasn&#8217;t so good. But I bet you I could pick out some domains where the elasticity of task output to effort is very inelastic.</p><p>Avi: Okay, trying to think. You&#8217;ve switched from skills to task and that makes me much, much happier.</p><p>Seth: Well, I mean, you would only need to worry about skills is if you were inelastic to effort, right? Then it&#8217;s just the skill.</p><h3>Rare Skills, Common Skills, and Wages [32:04]</h3><p>Avi: So there&#8217;s the new Autor and Thompson paper on automation, which I think gets at some of the things you&#8217;re talking about, which is if the things the machine does are relatively rare skills, like are tasks that involve relatively rare skills, to be precise, then what happens is we get entry into that profession. More people can do it and very likely wages go down. And if the machine things that the machine does are things that many people can do, they require less specialized skill, then the remaining humans in that job will, there&#8217;ll be fewer of them and they&#8217;ll likely be higher paid.</p><p>Seth: Right, think that&#8217;s right, but I think maybe a missing component here is within the job already, what is the correlation in abilities between people who are good at the automatable and non- automatable part of the task, right?</p><p>Avi: Yeah, but I think that&#8217;s the statement about that. Like in the short run, we&#8217;ll get the Autor and Thompson results. And in the long run, we&#8217;ll get a reallocation of jobs, right? There&#8217;s a system of professions and the system of professions will change.</p><h3>Are Tasks More Complementary Than Cobb-Douglas? [33:23]</h3><p>Seth: In the long run, you get the reorganization of jobs. Maybe one other thing I want to talk about before we get into reorganization of jobs is just this question about, tasks more complimentary or less complimentary than Cobb Douglas? Do you have a sense of that with tasks within a job? I mean, it seems like would vary a lot, a lot from occupation to occupation. I think we all have this intuition that they should have some kind of complementarity. That&#8217;s why they&#8217;re a job in the first place. That&#8217;s why they&#8217;re bundled. But you might bundle them and they still might just be, you know, gross substitutes that have a little bit of complementarity.</p><p>Avi: I suspect there&#8217;s a lot of heterogeneity across jobs and I don&#8217;t think we have good data on that yet because sometimes we haven&#8217;t been looking because our model is substitute model and so our papers are fundamentally focused on the substitute.</p><p>Seth: And I think this is an example of somehow the theory is sometimes a little bit downstream of the data, right? We just have so little data on people reallocating effort across tasks within a job that of course it makes sense to aggregate up to just add up all of the tasks done by all of the workers. That&#8217;s kind of, that&#8217;s my guess of why Acemoglu gets there.</p><p>Avi: So of the task papers, the Eloundou et al., Dan Rock&#8217;s paper, is incredibly careful on every page.</p><p>Seth: This is not an automation measure. Do not use this to measure automation.</p><p>Avi: This could be a complement, it could be a substitute. These are just jobs that change. So like kudos to them, the four of them for being super, super careful. Nevertheless, when that paper is cited both in the academic literature and in the press, that idea seems to get lost. I&#8217;m not exactly sure why, maybe that&#8217;s because of the model.</p><p>Seth: Question people want to answer, right? The people don&#8217;t want to know what job&#8217;s going to change. People want to know what job should I get, right? And so...</p><p>Avi: Well, okay, but if it&#8217;s a question people want to answer, then the complements matter just as much as the substitute. I wonder if the answer that people want to know, like the answer that people want, and then they just...</p><p>Andrey: I actually think it&#8217;s I think take has always been that just most people are pretty, they&#8217;re very sophisticated users of this data, but a lot of people don&#8217;t have a sophisticated economics model. And therefore to them, it&#8217;s just obvious that what&#8217;s going to happen is the machines are going to take our jobs. As a result, that&#8217;s just, they don&#8217;t have a more nuanced model of economic activity and therefore that&#8217;s how they interpret it. Now there are more sophisticated readers, think, we know some of them, where they&#8217;re just really just think that AI is going to be able to do everything in a very short period of time and then it all kind of becomes moot. You know, if you think that every single task can be done by an AI.</p><h3>Why the Impact of AI Was Ambiguous in Earlier Work [36:15]</h3><p>Seth: Yeah. Well, I guess this kind of brings us to your 2019 Journal of Economics paper, which is about where you guys kind of where you kind of throw your hands up. That&#8217;s not that&#8217;s a positive part and say there&#8217;s an ambiguous impact. So I guess I want to push you there on is the ambiguous impact because. We just don&#8217;t know all of the relevant elasticities, right? We need to know the elasticity within tasks within a job. We need to know elasticity across jobs within an organization, the elasticity across sectors of demand. And if we could put all of those together, we would be able to answer the question. Or is it more ambiguous than even that?</p><p>Avi: No, I think you need to understand when that paper was written in order to understand the paper, which is in 2019 or late 2018 when we were writing it, we had no concept of anything but a task- based model with substitutes. Okay, maybe that was on us. We should have. But Acemoglu and Otter and Rastrepo were the dominant- Paradigm. ... working in literature, especially Acemoglu.</p><p>Seth: Are you saying our ontology was limited?</p><p>Avi: I&#8217;m not exactly sure what you mean by that, but...</p><p>Andrey: You forgot about the O-ring which was the black swan of papers.</p><p>Avi: Yeah, yeah. So like, we did.</p><p>Seth: I mean in Kremer, I mean, presumably you looked at Kremer again before writing your paper. You can almost see he&#8217;s almost there. He&#8217;s almost at, and this is within workers too. He doesn&#8217;t exactly say it.</p><p>Avi: Exactly. So when we wrote that paper, we were thinking task-based substitution. That was the model that we had. And actually, in the process of writing that paper, in some sense, we learned what was wrong with that model and ended up with, we just don&#8217;t know. And part of that is, we wrote it in 2018, 2019. We were looking for new tasks from AI. So this is before ChatGPT, like four years before ChatGPT. So new tasks hadn&#8217;t really come up yet. All we had was identifying space junk and treatment for complex disease, which actually wasn&#8217;t our idea. It was Tim Taylor&#8217;s idea, our editor.</p><p>Andrey: Well, you already had AlphaFold, right?</p><p>Avi: Yeah, but it&#8217;s not clear what the new task is because of AlphaFold. Yeah, fair enough. In terms of... So, and actually that paper in some sense directly led to our work on system change and GPTs, because Tim Bresnahan pulled me aside that summer at the Summer Institute and told me he hated our GPT paper. I&#8217;ve told you guys this before. Because it was a task-based model and that&#8217;s not how meaningful change happens. That then led to all this work on trying to understand, well, if it&#8217;s not a task-based model, how does the system change?</p><p>Andrey: Okay. And we&#8217;ve covered that to Bresnahan paper on this podcast.</p><h3>Reorganizing Jobs Around AI [39:22]</h3><p>Seth: I guess let&#8217;s talk about reorganization of tasks. Obviously that seems to be, that&#8217;s the best case answer. The best case answer is you split off the, I guess from the perspective of a firm trying to boost productivity, maybe not necessarily from a worker&#8217;s perspective. From the firm&#8217;s perspective, you want to slice off the automatable thing, let that rip, and then figure out what you have to leave behind for humans. Is there any good research about... How do you do that? What industries are better than that at others? Like, what&#8217;s the next research frontier on that question?</p><p>Avi: I think you just defined it. there are two. One is like within the firm, how do we think about where the complements are and what&#8217;s left for humans and how does that vary across organizations? The second part, and Alex Emas has highlighted this recently, is it also depends on elasticity demand for the...</p><p>Seth: products.</p><p>Avi: Like, you know, even if within an organization workers reallocate and they become hard to automate because they&#8217;re more productive, but then the organization is producing more, well, someone has to want that more or else then, you know, at least that organization or its competitors are going to to business.</p><p>Seth: Well it&#8217;s factor, well its price will come down, know there&#8217;s a kind of a nebulous connection between price and profitability.</p><p>Avi: Right. Price goes down. It&#8217;s got to go down like, well, quantity has to go up enough that we still need the workers.</p><p>Andrey: There might be a paradox in there that&#8217;s not really a paradox. The misnamed Jevons paradox.</p><p>Avi: Maybe.</p><h3>Should We Want Less Automation? [41:05]</h3><p>Andrey: Following up on this idea, think several prominent economists have called for a government push or ideological push to make AI that complements humans rather than substitutes for humans.</p><p>Seth: Friend of the show, Erik Brynjolfsson has written about the Turing Trap. Is the Turing Trap misnamed? Is it not a trap? Should we embrace the Turing?</p><p>Avi: Okay, so this is our science paper.</p><p>Seth: Let&#8217;s get the hot takes. This is where we brought you on.</p><p>Avi: Do want more automation? Yeah, so Eric has said it. Doron has said it. There&#8217;s lots of policy. We should complement humans, not replace them. And John Markoff is a journalist. He has this book called Machines of Loving Grace, same title as Amodei&#8217;s essay, essay, but older book. It is about the history of computing.</p><p>Seth: When you&#8217;re a tech billionaire, you&#8217;re allowed to use cool phrases unsighted. I&#8217;ve noted this.</p><h3>Augmenters, Automaters, and Inequality [42:10]</h3><p>Avi: Well, they&#8217;re both referencing a poem. And in Markov&#8217;s book, there&#8217;s these two streams of computer science. There&#8217;s the, I forget exactly how he labels them, but essentially there&#8217;s the augmenters and the automaters. And at least from my perspective, the augmenters seem like the heroes of his story. And the automators who start to become prominent as this book is getting written around 2014-2015</p><p>Seth: They&#8217;re trying to trap us. They&#8217;re trapping us.</p><p>Avi: But we also know that the rise of computing the internet massively increased inequality. They generated enormous wealth, but they massively increased inequality. And I hypothesize that the reason for that is, yes, they were augmenting what humans do, but they weren&#8217;t augmenting what all humans do. They were augmenting what a set of humans who are good at abstract thinking do. And those people were already doing pretty well. And so in the process of augmenting humans, right, because no human can do what the internet does or what a computer can do, they augmented folks at the top and left others with relatively stagnant incomes.</p><p>Seth: Is this story there really at the task level? The way I think about that inequality story is that it&#8217;s kind of at the firm level, right? It&#8217;s we&#8217;ve now put the corner store into competition with Amazon and so Amazon wins and whatever Amazon takes as input wins.</p><p>Avi: There&#8217;s a bunch of different pieces. The one I&#8217;m emphasizing is like the Autor, Katz, and Kearney framework, which is about skills.</p><p>Andrey: I mean, it has to be both, right? There&#8217;s a set, right? Like, the humans who are now able to market their unique skills match with the firms that are larger, but you kind of need both to create the inequality or some of the humans become superstars without like needing the firm in first place, right?</p><p>Avi: I think in principle you could get within firm inequality without getting across firm inequality. We ended up getting both.</p><p>Seth: Yeah, both. Both happened.</p><p>Andrey: Fair enough.</p><p>Avi: but as I&#8217;m thinking like Autor, Katz, and Kearney with computing and then Shane Greenstein, Chris Foreman and I have some work on sort of the internet inequality, same kind of idea. so on the other hand, automation technology, if it&#8217;s automating things that folks at the top do, could superpower everybody else. Okay. And this is a could, cause we hasn&#8217;t really happened. So what we hypothesize, so the question, the paper is called, Do We Want Less Automation? And our answer isn&#8217;t no. Our answer is, here are reasons why it&#8217;s not obvious. Okay? It&#8217;s very economist-like. And the essence of it is, we were just talking about this medical example. Well, if what doctors are paid for is 10 years of post-secondary schooling, that essentially is about prediction, diagnosis and treatment. Then someone potentially with two to four years of post-secondary schooling who was much better at managing patient stress and all these other things, training like a social worker, combined with a diagnosis machine could be super hard. And so their productivity goes up. And there&#8217;s a bunch of industries where What people at the top do seems a lot like filling in missing information.</p><h3>Are Intellectuals Giving Biased Advice About AI? [45:58]</h3><p>Seth: One might even cynically say that these thought leaders who have been so augmented by the internet are maybe not giving the populace the best advice.</p><p>Avi: Maybe. So I had an undergrad RA write an essay for me. She&#8217;s a philosophy major. you know, a couple summers ago, it&#8217;s Amelia Agarwal. I feel like I should call her out.</p><p>Seth: Love undergraduate research on the pod.</p><p>Avi: Yeah, the opening of her essay was, part of her assignment was to read and hear about all these people who said AI is going to automate work. And so I&#8217;m going to have to have leisure, like essentially. And she&#8217;s like, that doesn&#8217;t strike me as bad. And then she dug into it and her framing was essentially the people whose identity was driven by their, you know, intellectual abilities, public intellectuals are exactly the people most threatened by AI. And so anyway.</p><p>Andrey: You know, it&#8217;s very interesting. I actually disagree. Yeah, I think lots of intellectuals are threatened by AI but not public intellectuals and that&#8217;s because humans are going to want other humans to communicate to them in many ways. So, the role of the public intellectual is not going to go away. The role of the maybe the scientist toiling away on their research. That is in my opinion much more a threat. if you&#8217;re... one might even deduce that Seth and I have started this podcast as a hedge for that world.</p><p>Seth: Well, what I say is as the price of writing papers goes down, the return to reading papers goes up. But maybe this goes back to the taste idea, right? Which is one way you might think of taste is a public intellectual doesn&#8217;t let&#8217;s let&#8217;s be cynical for a minute. The public intellectual, the public art critic doesn&#8217;t actually know art better than anybody else, but they serve a role as a coordination mechanism. Right. Everybody trusts Andrey. So when Andrey points at the thing and says it&#8217;s good, everybody converges to that. And then maybe that&#8217;s one notion of taste that will be preserved.</p><p>Avi: Yes, and so you started in science and moved to art. There&#8217;s probably differences between them, but in the sciences, there&#8217;s a question, or a scholar&#8217;s, what&#8217;s our goal? What are we trying to accomplish? And I think different disciplines have different goals. And depending on the goal, the role of the human curator changes. If the goal is so that humans understand the world, and have sort of a consistent model, then there&#8217;s a real role for a curator. If the goal is to build a better spaceship, then maybe there&#8217;s not such a role for a curator. And so I haven&#8217;t been following that literature, so I don&#8217;t know really what the formal academic take on what I just described is.</p><h3>Can Policy Steer AI Toward Augmentation? [49:27]</h3><p>Andrey: Yeah, I agree. I haven&#8217;t seen much formalization. So listeners, if you know of any, send it along. Yeah, I mean, I sorry, I just want to make a final point is that I think I like your criticism of this augmentation idea. But to me, there&#8217;s like a much deeper criticism, which is there&#8217;s there&#8217;s just kind of a whiff of central planning involved in it. like, how how do you know? What technologies are going to automate versus augment. Like this is very hard to predict in my mind. And to think that the government is going to like somehow implement a system of taxes on technologies that are augmentation versus substitution, it&#8217;s ridiculous in my opinion.</p><p>Avi: So I was taking as given that you can understand what is automation and what&#8217;s augmentation. I agree it&#8217;s a very hard challenge. There, I think the narrative, I&#8217;m gonna be careful. I think the argument is if even without choosing winners, we might be able to tax capital relative to labor or something like that. in order to push things in a particular direction. I think that&#8217;s it.</p><p>Andrey: Yeah, that&#8217;s the most plausible.</p><p>Seth: That&#8217;s pretty plausible, but when you actually hear versions of the Turing Trap articulated, it&#8217;s really like go and burn down the houses of the people who want to automate you.</p><p>Avi: Okay. So Korinek and Stiglitz have a chapter that&#8217;s really about tax and capital that&#8217;s in our economics of AI book. And I think like the Acemoglu Johnson argument is really about tax and capital. I&#8217;m not enough of a macro economist to have a strong opinion about one way or the other, but that I agree seems more</p><p>Seth: Right, and then there&#8217;s a deeper, deeper argument there about whether or not you want to tax capital, right? There&#8217;s the old Chamley-Judd result about, well, know, labor is inelastic and capital is elastic, so really you don&#8217;t want to tax it. There&#8217;s obviously international considerations about if you have a fully automated technology, isn&#8217;t that just going to locate itself in the lowest tax jurisdiction? And so it might be very hard to tax capital. And then of course the Iv&#225;n Werning follow-up research kind of complicating the original Chamley-Judd results. So this gets in the weeds really fast.</p><p>Andrey: And it&#8217;s also very blunt in many ways, right? A lot of capital is not about automation. it&#8217;s a... I don&#8217;t know.</p><p>Avi: Yeah, and there&#8217;s all sorts of questions in public finance and how that all plays out to like the there&#8217;s under the names Trammell and Korinek. I think it&#8217;s Trammell. No, it&#8217;s not.</p><p>Andrey: That&#8217;s Lockwood.</p><p>Avi: Lockwood and Korinek, thank you. have a relevant paper there.</p><h3>AI Growth Scenarios Through 2030 [52:36]</h3><p>Andrey: Next topic. Yeah. So there was a very well-circulated survey of economists about their expectations of economic growth in different AI scenarios.</p><p>Seth: Now Avi, I understand you have intentionally not read this so as to have an unbiased take, so you will not be contaminated by the opinions of everyone else. Is that right?</p><p>Avi: That is absolutely right.</p><p>Andrey: Excellent. You&#8217;re definitely not in the same university as many of the authors.</p><p>Avi: I probably will, but we&#8217;ll see.</p><p>Andrey: All right. So the first conceit is that there are three scenarios for AI progress that they want us to consider. The first one is slow progress, where by the end of 2030, the AI can do PhD student level assistance, half of eight hour long coding tasks, passable stories and songs. Robotics navigate homes with some help. So that&#8217;s kind of the slow. Moderate is you have semi-autonomous labs, five-day coding tasks, high-quality novels and hit songs. Robotics can perform basic tasks. And then rapid progress outperforms top humans in research coding and leadership, award-winning creative works, nearly all physical tasks. So those are the three scenarios by 2030. So the first question is, how do you allocate the probabilities between slow, moderate, and rapid by 2030?</p><p>Avi: So, okay, so with the exception of the statement about hit songs and award-winning, those are all about the models and not about the outcomes. So I&#8217;m going to ignore the hit song and award-winning part because I think that&#8217;s...</p><p>Andrey: It&#8217;s of the quality of the quality that could win it.</p><p>Avi: Okay, because at a high level, what I think is the technology is going to accelerate rapidly, but there are all sorts of meaningful barriers to widespread diffusion and having an impact on the economy. and sometimes I think we&#8217;re already in the slow and for aspects of the medium versus the fast, I feel like I should call it 50-50 because I&#8217;m skeptical of the like, I&#8217;m skeptical of the robotics stuff, but the five day coding task seems very, likely. And so just.</p><p>Andrey: Yeah, there&#8217;s some other things. CEO level agency, you know, like is is one of the criteria.</p><p>Seth: I don&#8217;t know whether or not they can run a vending machine.</p><p>Avi: But don&#8217;t like part of it. So much of what a CEO does is like is charisma and creating followers, right? And I&#8217;m not sure that&#8217;s a mission.</p><p>Seth: Is it charisma judgment task? Is it charisma judgment?</p><p>Avi: It&#8217;s a skill. I&#8217;m not sure it&#8217;s a prediction or judgment. It&#8217;s more like an action.</p><p>Andrey: Yeah. But okay, fair enough. Just to give you like a sense of where economists came in and they took this in the fall, 39 % that were still in slow by 2030, 47 % that were in moderate and 14 % then were in rapid. So you are more bullish than a typical economist.</p><p>Avi: I&#8217;m more bullish. I probably shouldn&#8217;t have said zero for slow. In retrospect, I was just going to be something five to 10 or something like that.</p><h3>GDP Growth by 2050 [56:22]</h3><p>Andrey: Okay, great. Now, and I think this is the question that really there was a lot of controversy about. So, the question was, by 2050, what is the annual change in GDP on average?</p><p>Avi: GDP or GDP per capita.</p><p>Andrey: This is GDP.</p><p>Avi: I like I have to make a population assumption. somewhere between two and 3%.</p><p>Andrey: All right. You are well within the economists&#8217; answer here: 2.5%.</p><p>Avi: duplicate. And so we&#8217;ll be a little above that.</p><p>Andrey: So 0.5%, that&#8217;s all we get. okay. Extra from AI over and above.</p><p>Avi: Well, no, I don&#8217;t think you want to say that because the reason we have 2 % is because of innovation in past.</p><p>Andrey: Okay, so fair. I agree, I completely agree with you.</p><p>Avi: Like it&#8217;s possible, especially with, you know, it&#8217;s possible we would have gotten zero.</p><p>Seth: 5 % better than historical rate of technological growth.</p><p>Avi: Yes, something like that.</p><p>Andrey: Now, what if you were for sure, what if you for sure knew we were in the fast scenario by 2030? How would that like change your predictions?</p><p>Seth: It&#8217;s hard to get to above three.</p><p>Avi: Like, yeah, I just think there&#8217;s a lot of bottlenecks in the economy. I think that, and we&#8217;re going to figure out what they are.</p><p>Seth: We&#8217;re gonna find out fast and that guy is gonna be rich.</p><p>Avi: Yes.</p><p>Andrey: So you&#8217;re once again, like a very down the median economist.</p><p>Avi: On growth. Yeah, okay.</p><p>Seth: Can I ask you, you think that&#8217;s mostly about bottlenecks? You don&#8217;t think that&#8217;s mostly about people taking leisure?</p><p>Avi: I think it&#8217;s mostly about bottlenecks.</p><h3>What Are the Bottlenecks? [58:36]</h3><p>Seth: So gun to your head, what&#8217;s the biggest bottleneck in that high growth robots are awesome scenario.</p><p>Avi: I feel like my best answer is we&#8217;ll find out.</p><p>Andrey: Okay. I guess the pushback that folks gave is this is a scenario where by 2030 robots can do nearly all home and industrial tasks and faster than humans, right? So you might say, well, manufacturing and physical tasks are a tiny, not tiny, but they&#8217;re not that big of a portion of the GDP already. maybe-</p><p>Avi: be essentially zero is the point. If they&#8217;re that efficient and that cheap, then they won&#8217;t mean like, I guess it depends on how we calculate the deflator. agriculture is way more productive. GDP hasn&#8217;t grown by that much.</p><p>Andrey: But what if we have, you know, you know, robot doctors that can do, you know, like,</p><p>Avi: Great, then medicine will be cheap. It&#8217;ll be less of GDP.</p><p>Andrey: I guess, all right, so here&#8217;s a hypothetical. Here&#8217;s a hypothetical. Let&#8217;s say we had a cure for cancer as a result of this, which is very plausible in the rapid scenario, and that we also, at least in principle, have the technologies to administer it through robots very efficiently because we are in a world of just true abundance. My sense is that people would value that medical care extremely highly. And if one were to properly deflate the existing cost of cancer treatment, wouldn&#8217;t that imply a very large GDP effect? Now you can say maybe we&#8217;re not going to calculate that correctly.</p><h3>GDP, Consumer Surplus, and Health Breakthroughs [1:00:25]</h3><p>Avi: Now I feel like I&#8217;m going to, you know, it&#8217;s sort of the Bob Gordon sense. I don&#8217;t think we deflated antibiotics properly. I don&#8217;t think we deflated flush toilets properly. So if you&#8217;re talking about consumer surplus, then maybe consumer surplus will be found, especially, you know, to the extent that it&#8217;s health outcomes, then huge increase in consumer surplus, much more than the argument that we&#8217;ve had for digital. Because the that debate on whether digital really made us better compared to what was happening in the 20th century, I reasonable people can be on both sides of that debate. what you&#8217;re describing, is can&#8217;t secure people living wonderfully and healthy to 100, there might be some limits to how long, but that would be wonderful and great for consumer surplus. But if that happens, I guess it might and it&#8217;s that easy, it might become so cheap that it&#8217;s it&#8217;s like agriculture. Because food is pretty essential too. And food is so cheap that we don&#8217;t worry about it so much anymore.</p><p>Seth: Inelastically demanded. think people will elastically demand years of life in a way that they won&#8217;t elastically demand calories, right?</p><p>Avi: Potentially.</p><p>Seth: You think people will get sick of it. I thought you were to go to maybe you&#8217;ll recall in Doron&#8217;s simple macro economics of AI, a favorite paper of this podcast. He actually predicts that actually consumer surplus might raise by less than is implied by the GDP growth rate, because we&#8217;ll invent evil jobs like social media manipulator. Do you are you still convinced that consumer surplus growth will be faster than GDP growth evolves? Or are you open to this idea of the invention of evil tasks?</p><p>Avi: I feel like we are not in my expertise.</p><p>Seth: Turn it up.</p><p>Andrey: Seth is really trying to get the hot takes.</p><p>Avi: I don&#8217;t like to judge what particular products, a particular.</p><p>Seth: Well, you can&#8217;t judge, you can&#8217;t predict.</p><p>Avi: Yeah, you know, what am I in a-</p><p>Andrey: Then you become a economist.</p><p>Avi: Actually, let me give... So I think it&#8217;s reasonable for people to say some roles, some jobs, some products are better than others. I don&#8217;t think that has a meaningful role in GDP calculation. And I also worry if in our consumer surplus calculations, we economists say some things are better and some things are worse because then... So much of it is just obviously to the taste of the...</p><p>Seth: It&#8217;s such a normative can of worms, right? GDP we can measure, consumer surplus. I mean, we do things at the Stanford Digital Economy Lab around trying to do willingness to accept experiments, but obviously those are highly limited too.</p><p>Avi: So consumer surplus as in figuring out the area under the demand curve, that&#8217;s the kind of task I think we&#8217;re good at. It&#8217;s within our domain. whether the demand curve is morally right or wrong, that&#8217;s not something I&#8217;m going to be finding out this day.</p><p>Andrey: I wanted to just like close off that loop a little bit by just saying that you just gave me an answer that said that for our evaluation of how good of a world we&#8217;re gonna get in 2050, GDP is no longer the correct sufficient statistic, which obviously makes me question like why is this such a bench? Why are people so interested in forecasting GDP in 2050 if we think it&#8217;s going to get pretty uncoupled with consumer surplus in these scenarios?</p><p>Avi: Well, I&#8217;m not sure it&#8217;s more or less uncoupled than it has been in the past. I think reasonable people can disagree on that. I think the debate between Bob Gordon and Erik Brynjolfsson or Bob Gordon and others over the years is sort of is really informative about how hard it is to say, you know, what&#8217;s better versus today versus the past. What happened in the early 20th century is pretty amazing. okay, that&#8217;s point one. Point two is it&#8217;s not obvious to me that GDP like GDP tells you your national capacity. That&#8217;s what it tells you.</p><p>Seth: That&#8217;s useful for things like wars and public finance.</p><p>Avi: If I remember my first year econ, haven&#8217;t taught first year econ for a long time. That was the idea. What&#8217;s the industrial capacity of the country? Or what&#8217;s the economic capacity of the country? It turns out it&#8217;s highly correlated, as I understand it, with lots of welfare measures. You guys know this. And so we use it for that. Once you start deviating, then... then that&#8217;s fine, but you&#8217;re now embedding a whole other set of values. At least with GDP, we know what the values are. It&#8217;s not it&#8217;s not value laden, but we at least know what the values are that we&#8217;re embedding in that measure.</p><p>Andrey: But guess I&#8217;m not sure we know, just in many conversations with economists, this question of deflators has come up and most of us haven&#8217;t spent much time thinking about what actually goes into that and how well that&#8217;s done and how relative to different goods. So I agree with you that we&#8217;ve been recommending that people use this because it&#8217;s very correlated with welfare, but you know.</p><p>Avi: So, yes, and the NBER productivity group in many ways was focused on questions about how do we measure innovation and progress and a lot of that, some of the early work that came out of it was explicitly about this question. it&#8217;s not that people haven&#8217;t thought about it and that there&#8217;s not a whole community that grew out of that. Now admittedly, we don&#8217;t have that many, you know. papers about deflators and inflators anymore. But Shane, when he was running the program, digital, almost always had somebody on the program focused on measurement of prices over time in the digital world. So just to say at least it&#8217;s on his radar and it was part of what Sloan Foundation was excited about why they originally started funding the Digital Economics Group.</p><h3>Sponsor Break: Revelio Labs [1:06:56]</h3><p>Seth: This chance to contemplate your posteriors is sponsored by Revelio Labs. Revelio Labs is a leading provider of labor economics data and data services for companies, academics and independent researchers. Andrey and I have been working in economics of AI for a long time and we can confirm just how useful Revelio&#8217;s data is. Revelio&#8217;s team combines comprehensive micro-level data on employee professional profiles, job postings and employee sentiment with standardizations, mappings, and enrichments available, all to make that data useful without making your modeling decisions for you. The data can be flexibly aggregated to company, market, or industry and be used to study questions ranging from career trajectories to occupational transformation to the returns to skills and the impact of AI on labor demand for tasks. Can&#8217;t imagine anyone be interested in those. And Revelio data is available on RWRDS. So if you&#8217;re an academic with a good library, you might already have access. And if you don&#8217;t, you can reach out to their excellent economics team and they&#8217;ll hook you up. </p><h3>Will AI Centralize or Decentralize Decision-Making? [1:08:16]</h3><p>Seth: All right, okay, we&#8217;re gonna give you a topic. We want your hot take. So will AI centralize or decentralize decision making in the economy?</p><p>Avi: Yes.</p><p>Andrey: It was good though.</p><p>Avi: Like, so, I don&#8217;t know, this is no longer lightning round. But for an ultimate hit thing, have that interesting paper saying why it&#8217;s gonna centralize and their argument is good. And the exact same arguments they have also say that it could empower people on the periphery. And the answer is almost surely both are gonna happen. There&#8217;s gonna be some people who figure out how to scale themselves and their judgment and gain enormous power. And at the same time, others who are able to do things they couldn&#8217;t do before, just like we saw with online platforms where there&#8217;s been both the centralization of power and the ability of niche players to</p><p>Seth: Here&#8217;s the part that I thought that that dialogue missed, which I recommend to all of our readers to look at because it&#8217;s fascinating, is the argument that AI will centralize us is that AI is going to help these centralized decision-makers understand the complexity of what&#8217;s going on. But what if AI makes us weirder faster than AI conceptualizes the weirdness that it&#8217;s creating? What if we just get super duper weird? That would make it very hard to centralize.</p><p>Avi: Yeah, I think that&#8217;s a version of my argument, which is that the people on the periphery can, know, individuals can use it to make themselves more productive, better, happier, whatever their goal might be.</p><p>Seth: more, more, less, less controllable. What do LLMs imply for privacy regulation in economics?</p><p>Avi: first answer was nothing. There&#8217;s lots of ways to worry and think about privacy and privacy does matter. First answer is not obvious how it matters now differently than it did five years ago.</p><h3>Digital Hermits and De-Anonymization [1:10:11]</h3><p>Seth: I the idea is that it will be...</p><p>Andrey: De-anonymization.</p><p>Avi: Yeah. So yeah, so that&#8217;s where I said my first answer. then, okay, well, to the extent that okay, here, here we go. Catherine Tucker, Jean-Michel Lachetal and Avery Haviv and I have a paper called Digital Hermits. okay. And the idea of that paper is again, I&#8217;m really bad at these hot that all. okay, the the idea of that paper is right now you might be willing to give your grocery preferences to whatever company. but you might not want the company to know your IQ or your religion or something else, your union status or something like that. Okay. And in a world with bad prediction tools, you can give your grocery information and not the other information. But if some other people are giving both, then over time, you can&#8217;t even give your grocery information if you want to protect your religion or IQ. So. In the equilibrium there, we end up with one or two groups. We get hermits who don&#8217;t give any information and everyone else who gives all their information just gives up. So what you&#8217;re describing with LLMs is a version of the prediction mapping from, just writing something to now having all sorts of extra information about you that we might not want to put you. And so like being able to connect different pieces of information.</p><p>Seth: make the hermit hermit-ier, right?</p><p>Avi: They&#8217;ll make the hermits hermit-ier and create demand to the extent that privacy is a value and it&#8217;s now harder to protect. There&#8217;ll be demand for laws that</p><p>Seth: It&#8217;ll make the hermits more hermetic, I should say.</p><p>Andrey: I think it could be a function of abuse, right? Obviously, I haven&#8217;t studied privacy as much as you, but I think when this data gets abused, there&#8217;s a lot of demand for laws, retribution, and protection. But when it&#8217;s an abstract value, but it&#8217;s not getting visibly abused, it seems like it&#8217;s less of an issue. this data is used for personalized advertising. Yes, some people have a negative reaction to that. In the end, in the grand scheme of things, it&#8217;s not that bad. But if now someone is finding out, you know, all this private information about you specifically and, you know, that information, let&#8217;s say can be, you know, someone, you know, leaks it or talks about it online or tells your employer or whatever, you know.</p><p>Avi: So Right. Yes, there&#8217;s going to be a decline in online anonymity. Actually, like, I if you remember Catherine Tucker&#8217;s discussion of Florian Ederer&#8217;s paper on de-anonymizing econ job rumors at NBER. That paper is about fundamentally about something else. But her discussion was, okay, this is the world we&#8217;re moving to. Maybe because of quantum, maybe because of LLMs, it&#8217;s gonna be very hard to post things anonymously. And so once that happens, once things you say digitally you expect to be known, how does that change behavior? And then there&#8217;s like, I guess your original question was, how does it affect privacy regulation? LLMs are gonna do two things. And I don&#8217;t know what the equilibrium is gonna land, which is, I don&#8217;t know why you keep doing this.</p><p>Seth: I&#8217;m counting this is be thing number one and then I&#8217;ll you thing number two.</p><p>Avi: So thing number one is what we just described, which is demand for privacy regulation goes up because there&#8217;s new risks and people do value privacy. The other hand is there&#8217;s new opportunities to use data and benefit from your data. I can sort of think about that&#8217;s what agents are going to enable you to do. And so there is also an increase in demand for regulations that enable data to flow. And where that plays out country by country, continent by continent, who knows? But like, just like with digital, we saw both the increase in the benefit and the increase in the cost of data flows. I think we&#8217;re going to see another wave of that.</p><h3>AI and Peer Review [1:14:23]</h3><p>Andrey: Follow on question, peer review. You were the editor of marketing science for a long time. Narrow question is, what does this imply for anonymity of peer review? And a broader question is, effects of AI on peer review more broadly.</p><p>Avi: So yeah, I was, was a senior editor, marketing science. Actually, I haven&#8217;t thought about that peer review anonymity point, but absolutely in principle. it&#8217;s disguisable. I think there&#8217;s a solution to this. I just don&#8217;t know that we want it. Like running your review to have ChatGPT or Claude or whatever you want rewrite it so that it doesn&#8217;t sound like you with all your points. Seems at least on the language matching will work. Not on the idea matching, but that&#8217;s already revealed. Like a whole bunch of people tell their author to... Although actually as an editor, learned that it&#8217;s not as... Often it&#8217;s not the author that&#8217;s asking for those citations. It&#8217;s like their advisor. Okay. Like there&#8217;s a lot of that, but still. So I think that&#8217;s manageable. Certainly on the one way, like I don&#8217;t think we&#8217;ve had</p><p>Andrey: Yeah, at least.</p><p>Avi: Double-blind peer review for 20 years, at least in the econ side of marketing and then econ. The pre-prints are out there. The pre-prints are well distributed.</p><p>Andrey: Yeah.</p><p>Seth: So it just be public? mean, so that would be the other direction. Is that it just opens public reviews.</p><p>Avi: I think if reviews are public, we&#8217;ll all just collude. I think those be mass collusion. I shouldn&#8217;t say we&#8217;ll all. I would prefer to think that I won&#8217;t collude. But I think that&#8217;s just an invitation.</p><p>Seth: Go ahead. You don&#8217;t think that there could be a disciplining of that when somebody reads your review and says this is, this is nonsense?</p><p>Avi: I think the benefit of having your reviewers for sure know that you said good things about their paper, it&#8217;s going to be hard to overcome. There&#8217;s a question about whether the whole system makes sense or not.</p><p>Andrey: Well, that&#8217;s kind of what I was getting to next. I, you know, I do some advising for Refine and I&#8217;m a big fan of their product. And it&#8217;s pretty clear to me that Refine is doing a better job of peer review than the vast majority of peer review outside of very, very select venues. And it&#8217;s only going to get better. And so the question is, given these capabilities, what should it look like in the future?</p><h3>What Are Journals For Now? [1:17:05]</h3><p>Avi: So, okay, I&#8217;m gonna propose something. But I&#8217;m gonna start with I don&#8217;t know. Here is one out there idea, which is, it&#8217;s not obvious to me what purpose the journals serve. When I talk to scholars, especially junior scholars, I don&#8217;t think people read the journals. They may be happy that some paper they knew appears four years later, but it&#8217;s not like they get the AER and open it and read it. You know, people in my vintage, or at least some of us,</p><p>Seth: wall of JEPs under here as you can see.</p><p>Andrey: One AER, Avi, is the one you gave me with my own paper. Thank you very much.</p><p>Avi: A couple of years ago, I paid for like three years of AERs for them to deliver to me and then they refunded my money. Guess they stopped. Because they don&#8217;t print them anymore. So like, that just doesn&#8217;t seem like how knowledge is discovered anymore. even sort of like what I... Okay, so then what&#8217;s the purpose of the journal if it&#8217;s just to verify what matters or to verify accuracy, refined can do it. And then like, do we have the whole peer review system for? If it&#8217;s to not just verify accuracy, but also refine papers in a way that&#8217;s consistent with peers&#8217; tastes, and especially with the editors&#8217; tastes, then the revision process is important. And if it&#8217;s about the editors&#8217; curated tastes, then there&#8217;s probably a much easier way to do that, which is they post their PhD syllabus.</p><p>Avi: Like I wonder if what&#8217;s going to happen. Also like this, yes, there&#8217;s a lot of papers out there and submitted and there&#8217;s a lot of authors, but there&#8217;s just too much over the course of a year for anybody to keep track of what&#8217;s even in the AER, like one journal. Nevermind trying to keep track of marketing science and management science and all the others. Okay. I wonder if there&#8217;s going to be a curated set of people who, I don&#8217;t know who chooses them. who are essentially the tastemakers and maybe they&#8217;re editors, but maybe they&#8217;re just people who like say, hey, I like this paper. Justified posterior. I was going to say, that&#8217;s one role that you guys have. It&#8217;s this weird thing that people now in business schools can come out for tenure with eight, 10 papers in what are ostensibly A journals and no one&#8217;s heard of them because yeah, they published the papers, but they weren&#8217;t out there.</p><p>Avi: They didn&#8217;t get onto syllabi or whatever else. those cases are hard, because on the one hand, they were told they needed to publish X papers, and they published X plus four papers. And the other, the point is to contribute to knowledge. And they&#8217;re there for somebody to discover eventually. But then maybe the LLM could just write the paper when you need it.</p><p>Andrey: Currently we&#8217;re writing for the LLMs anyway, we know who the readers are of our paper.</p><h3>Closing [1:20:17]</h3><p>Seth: I think that&#8217;s a great place to leave it. Avi, this has been an amazing discussion. Thank you so much for making the time.</p><p>Avi: Yeah. Great talking to you. Take care.</p><p>Andrey: Thank you.</p><p>Seth: All right, and you folks out there, please join our hopin&#8217; Discord community. (https://discord.gg/w3GSapx2d) Like, review, and subscribe, and keep your posteriors justified!</p>]]></content:encoded></item><item><title><![CDATA[Agent, Know Thyself! (and bid accordingly)]]></title><description><![CDATA[why we need to train models to learn their own capabilities, and how this will help them bid for work!]]></description><link>https://empiricrafting.substack.com/p/agent-know-thyself-and-bid-accordingly</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/agent-know-thyself-and-bid-accordingly</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Mon, 27 Apr 2026 15:02:53 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/7f3c0fc8-27fa-4caa-a7c4-4c631556d35b_1200x630.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Written with Rohit Krishnan, who is a pioneer in studying LLM behavior and writes at <a href="https://www.strangeloopcanon.com/">Strange Loop Canon</a>.<br><br><em>Attention conservation notice:</em> We developed a new benchmark, MarketBench, and scaffold. Based on our findings, we argue that self-assessment of capabilities and costs is a key capability, and it needs to be a target of training. This is work in progress, and we are looking for collaborators and funding to pursue this research. Paper <a href="https://andreyfradkin.com/assets/marketbench.pdf">here</a>. Repo <a href="https://github.com/strangeloopcanon/agent-economy">here</a>.</p><div><hr></div><p>Let&#8217;s say you have a large-scale project to work on. How do you choose which model, scaffolding, or system to use? If you&#8217;re like most folks, you go with what your coding agent does by default. For Claude Code, this means that the model called is determined by a set of ad-hoc rules set by Anthropic. But this strategy is not guaranteed to be the most effective or cost-efficient way to build your project, especially since it ignores non-Anthropic models. In fact, it reminds us of central planning.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Justified Posteriors! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>You could also go with an intelligent router. But turns out, routing is a wicked problem. To know which model should do which task requires computation and knowledge. For one-shot queries you can probably do this - any model can answer &#8220;what&#8217;s the capital of France&#8221; and few models can solve Erdos problems, especially without bespoke prompts. But what about that research question you asked this morning, in a chat you started three weeks ago, which has been forked four times and has had dozens of compactions? How do you train a router to figure out who should do the next task when it requires <em>so much </em>context?</p><p>This led us to think, what if we used markets instead of ad-hoc rules to assign tasks to AI agents? It turns out society has had this debate before. Markets tend to be superior to other forms of resource allocation when information and capabilities are distributed among a variety of people. In these cases, markets aggregate information and allocate resources in a relatively efficient manner, as well argued by Hayek.</p><p>You may be wondering, why would models have distributed information and capabilities? Aren&#8217;t there relatively few models and shouldn&#8217;t they only have the information you&#8217;ve given them. In a narrow interpretation, the private information could be the specific weights of the model and how they relate to the task. These weights result in models that have drastically different token consumption and success probabilities across tasks. In a broader interpretation, we envision agents as being combinations of a set of LLMs, execution environments, scaffoldings, and context provided by an agent operator, who may be distinct from the person asking for a task to be done.</p><p>Inspired by this, we decided to set up a market harness, where models bid to complete tasks and the principal (the person who wants those tasks done) allocates the job to the best bid. We also built a benchmark, <strong>MarketBench</strong>, to measure whether today&#8217;s frontier models have the capabilities they&#8217;d need to actually participate in such a market productively.</p><p>The short version of what we found: markets are a plausible way to coordinate AI agents, but current models can&#8217;t yet bid in a way that reflects their true capabilities. The bottleneck is metacognition. Models need to be able to say what their own capabilities are.</p><h3><strong>What a market actually needs from an agent</strong></h3><p>Before running any experiments, it helps to be precise about why a market might beat the alternatives. Consider a principal with a task and two agents &#8212; a strong-but-expensive one (H) and a weaker-but-cheaper one (L). Three rules are available:</p><ol><li><p><strong>Always use H.</strong> Simple, but you overpay on tasks that L could have handled.</p></li><li><p><strong>Always use L.</strong> Cheap, but you fail on tasks that need H.</p></li><li><p><strong>Run both in parallel, take whichever works.</strong> Highest completion rate, but you pay for redundant work even when one agent alone would have sufficed.</p></li></ol><p>A market dominates all three when each agent knows something the principal doesn&#8217;t. Specifically, each agent needs to form a view on its own task-specific fit: <em>&#8220;this particular task is in my wheelhouse&#8221;</em> or <em>&#8220;this one isn&#8217;t.&#8221;</em> If agents have that signal, they can bid accordingly, and the market routes each task to the cheapest capable agent while abstaining when no one can solve it. That&#8217;s the Hayekian story applied to AI: local, dispersed information that can&#8217;t be centralized, aggregated through price.</p><p>The fact that you might want to use the best model for the problem is not a new observation. There are plenty of attempts to do that, primarily by training a router, as OpenAI and most recently <a href="https://sakana.ai/fugu-beta/">Sakana</a> has done. The problem is that beyond simple queries, any long running agentic conversations mean there is a lot of context when you&#8217;re trying to assign a model to do a sub-task. To train a router to choose the right sub-model when you&#8217;re 50 sessions in with dozens of rounds of compactions is not trivial. It would help if the potential models that are going to do the task told you their capabilities!</p><h3><strong>MarketBench: asking models to forecast themselves</strong></h3><p>The core of MarketBench is two questions we ask a model before it touches a task:</p><ol><li><p>What&#8217;s the probability you&#8217;ll solve this task correctly in one attempt?</p></li><li><p>How many tokens do you expect to use?</p></li></ol><p>The model then attempts the task in a strong external scaffold, and we compare its forecasts to what actually happened. We built this on SWE-bench Lite, where each task is a real GitHub issue with an executable test suite &#8212; success is unambiguous, the tests pass or they don&#8217;t &#8212; and ran 93 tasks across six recent frontier models: Claude Opus 4.5, Claude Sonnet 4.5, Gemini 3 Pro Preview, GPT-5.2, GPT-5.2-pro, and GPT-5-mini.</p><h3><strong>Models don&#8217;t know themselves very well</strong></h3><p>Actual pass rates cluster in a narrow band &#8212; roughly 75% to 81% across all six models. Stated confidence spans 61% to 93%. Gemini in particular is <em>dramatically overconfident</em>. The GPT family is systematically under-confident. The two Claude models happen to land closest to their realized rates, but we shouldn&#8217;t read too much into that: the models aren&#8217;t calibrated, they&#8217;re just happening to be less wrong on this set of tasks.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Xvz2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Xvz2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png 424w, https://substackcdn.com/image/fetch/$s_!Xvz2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png 848w, https://substackcdn.com/image/fetch/$s_!Xvz2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png 1272w, https://substackcdn.com/image/fetch/$s_!Xvz2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Xvz2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png" width="1456" height="558" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:558,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Xvz2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png 424w, https://substackcdn.com/image/fetch/$s_!Xvz2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png 848w, https://substackcdn.com/image/fetch/$s_!Xvz2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png 1272w, https://substackcdn.com/image/fetch/$s_!Xvz2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F746e1005-35af-4de5-b274-7d0081f3a65a_2048x785.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Token forecasts are also mis-calibrated. The median ratio of estimated tokens to actual tokens is 0.2. Models expect to use roughly five times fewer tokens than they actually consume. If you were running a market and asked agents &#8220;how much compute will this take?&#8221; you&#8217;d get answers that are off by an order of magnitude or two.<br><br><strong>The auction results are predictable from the calibration failure</strong></p><p>Given the calibration above, what happens if we take these self-reports at face value and run a procurement auction? Each model&#8217;s bid is derived mechanically from its own stated probability and its own token-cost estimate, plugged into a breakeven formula. The principal draws a random reserve price; the model wins the task if its bid is below the reserve.</p><p>Two things happen:</p><ul><li><p><strong>Everyone leaves money on the table compared to an oracle.</strong> The oracle &#8212; a hypothetical allocator that knows in advance which tasks each model can actually solve &#8212; earns several times more per task than any real model&#8217;s bidding. GPT-5.2 earns about $0.006 per task in realized profit; its oracle counterpart would earn $0.385.</p></li><li><p><strong>Gemini wins 84.6% of auctions.</strong> But it&#8217;s winning because it&#8217;s the most overconfident, not because it&#8217;s the most capable. This is almost a perfect example of why models should know their abilities better.</p></li></ul><p>This is exactly what the theory predicts when private information is missing or unreliable. As an aside, humans often also lack private information or incentives to complete tasks. In these situations, we use reputation and liability to discipline the market. It is interesting to think about what the analogues for agents would be.</p><h3><strong>Can we fix self-assessment with prompting alone?</strong></h3><p>Now, since training these models to have self-knowledge is not easy from the outside, before concluding that markets need fundamentally better agents we tried a simpler intervention: give each model a short card summarizing its own historical performance &#8212; its pass rate on other tasks, how overconfident it&#8217;s been on average, and how badly it underestimates tokens. Then we ask it to forecast the current task, starting from that prior.</p><p>This is basically &#8220;here&#8217;s what you&#8217;re like; now try to be a bit more self-aware.&#8221;</p><p>It helps! Brier scores improve and token estimates become less severely understated (from 0.02 to 0.25 of actual &#8212; still low, but no longer comically so).</p><p>But the <em>auction</em> result barely moves. Aggregate realized profit slips slightly. The gap to oracle is essentially unchanged. So the intervention improved average calibration, not comparative routing, because while it got better information about global capabilities and costs it didn&#8217;t give enough task-specific signal.</p><p>What does change is who wins: allocation shifts away from Gemini and toward the OpenAI models. So the intervention fixes bid acuity at the margin, but not enough to translate into meaningful aggregate gains. This distinction matters because calibration alone is not enough, since a bidder can be right on average and still useless for allocation. The market needs task-level discrimination. When this agent says 90% and another says 60%, that difference must predict who is actually more likely to solve this task.</p><h3><strong>A market scaffold</strong></h3><p>Alongside the benchmark, we built a market-inspired scaffold where six workers (the same six frontier models) actually bid on SWE-bench tasks and an operator routes the work based on a score that combines each bid&#8217;s price, claimed probability of success, and an explicit failure penalty. Workers get two attempts per task; a worker that fails is excluded from retrying the same task, which forces diversity on retry.<br><br>Here&#8217;s what happened on a common 50-task slice:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LeRM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LeRM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png 424w, https://substackcdn.com/image/fetch/$s_!LeRM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png 848w, https://substackcdn.com/image/fetch/$s_!LeRM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png 1272w, https://substackcdn.com/image/fetch/$s_!LeRM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LeRM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png" width="1456" height="834" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:834,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!LeRM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png 424w, https://substackcdn.com/image/fetch/$s_!LeRM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png 848w, https://substackcdn.com/image/fetch/$s_!LeRM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png 1272w, https://substackcdn.com/image/fetch/$s_!LeRM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b90a0c5-cb45-455f-b790-c545f43056d6_2048x1173.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><br>The market beats solo GPT-5.2 by 10 percentage points inside the same scaffold, but mainly because it uses diverse models. We then ran a follow-up that kept <em>everything</em> identical &#8212; same workers, same tasks, same budget etc &#8212; but replaced the market-clearing rule with a centralized router: a single LLM call (GPT-5.2-pro) that looks at the task, the available workers, and simply picks one. The centralized router reached 27/50. The market reached 23/50 in the matched rerun (again, due to Gemini&#8217;s overconfidence).</p><p>Most of the market&#8217;s advantage over solo GPT-5.2 came from <em>having access to multiple different models</em>, not from the market mechanism itself. Once we held model diversity constant, a LLM central planner beat the market. This isn&#8217;t a surprise given what MarketBench tells us: if bids don&#8217;t contain good information, a market has nothing to aggregate, and a centralized decision-maker with a view of the whole task pool will do at least as well.</p><p>There&#8217;s also a separate result: the same GPT-5.2 that solves 74% of tasks in the external SWE-bench scaffold only solves 48% in ours. The live scaffold is a weaker execution environment &#8212; no interactive shell, no test feedback, one-shot patches. We can recover about 10 of those 26 lost percentage points through diversity. The remaining 16 would need scaffold upgrades, not better bidding by an LLM without tools. The execution path turns out to be first-order for both success and cost. This also means that when considering the performance of agents and their potential for market participation, we should think of agents as bundles of models, execution paths, and scaffolding.</p><h3><strong>So where does this leave us?</strong></h3><p>We started with the Hayekian intuition that markets should beat central planning for coordinating heterogeneous AI agents, because task-specific fit is local information that&#8217;s hard to centralize. We still think this holds, but the current set of agents don&#8217;t know themselves well enough for markets to work. We should fix this!</p><p>Our key takeaways:</p><ol><li><p><strong>Self-assessment is a key capability, and it needs to be trained for.</strong> Models are trained to solve tasks, not to predict whether they can solve them. Those are different skills. As agentic systems scale, the ability to say &#8220;I can do this, at this cost, with this confidence&#8221; becomes as important as the ability to do the thing. This should be a target of training in its own right.</p></li><li><p><strong>The right system is probably a hybrid.</strong> Pure decentralized markets need informed bidders. We don&#8217;t have those yet. But centralized planners will struggle as the agent ecosystem gets larger and more heterogeneous &#8212; they can&#8217;t know every agent&#8217;s local strengths for every combination of problem.<br><br>The natural middle ground looks like a <em>scoring auction</em>: agents submit bids, but the allocator weights those bids by a quality score drawn from reputation, observed history, and other centralized signals about how trustworthy each agent&#8217;s self-reports are. Markets augmented by AI.</p></li><li><p><strong>Model diversity matters </strong><em><strong>even when</strong></em><strong> the market doesn&#8217;t.</strong> The single most robust finding in our live scaffold is that access to multiple different (frontier) models helps, almost regardless of how you route between them. This is a useful practical point for anyone building agentic systems today: don&#8217;t lock into one provider, even if your routing logic is crude.</p></li><li><p><strong>Bids will eventually need to be richer than a scalar.</strong> Recent work from AISI and others suggests agent performance keeps improving at much larger inference budgets than we typically allow. If that&#8217;s right, an agent bidding on a task shouldn&#8217;t just offer a price &#8212; it should offer a <em>production plan conditional on budget</em>, describing how it would allocate compute across search, tool use, and revision as the budget scales. We don&#8217;t model this yet, though we think it&#8217;s the natural next step.</p></li></ol><p>For now, if you&#8217;re building with AI agents and wondering whether you should replace your ad-hoc routing rules with a market: probably not yet. But you should be thinking about it, and you should be testing whether the models you use have any idea what they&#8217;re good at. In our experience, they mostly don&#8217;t.</p><p><em>A request</em></p><p>We&#8217;d like to keep going, and the main thing slowing us down is compute. Scaling MarketBench to more tasks, more models, more domains beyond software engineering, and more variations on the bidding mechanism is straightforward in principle &#8212; but each full run spans six-plus frontier models across hundreds of tasks with multi-attempt execution, and the token bill adds up fast. If you work at a lab or provider that could sponsor API credits, or at an organization with compute to contribute in exchange for early access to results, we&#8217;d love to talk. We&#8217;re also interested in collaborators working on adjacent problems: agent calibration, scoring mechanisms, reputation systems for LLMs, or richer bid formats that condition on budget. Reach out.</p><p><em>Thanks to Tom Cunningham and Daniel Rock for feedback on an early draft.</em></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Justified Posteriors! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[The classic model that shows why AI exposure could increase wages]]></title><description><![CDATA[Revisiting Kremer's O-ring theory of economic development]]></description><link>https://empiricrafting.substack.com/p/weak-links-strong-predictions-kremers</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/weak-links-strong-predictions-kremers</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Mon, 20 Apr 2026 17:16:51 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/194643954/21d076820a20fcdf62569e2d47d3e432.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>This week, instead of reviewing a recent paper on AI, we go back to a 1993 classic: Michael Kremer&#8217;s &#8220;The O-Ring Theory of Economic Development.&#8221; It&#8217;s one of those papers that feels larger than its formal model. The setup is extremely simple: production consists of many tasks, and output depends on all of them going right. But the implications are broad enough to touch development, firm organization, inequality, and the economics of AI. The paper is also a throwback in style: a grand theory paper in which a simple model is asked to explain a wide range of stylized facts. Sometimes that kind of ambition is a bug. Here, it is mostly a feature. The model is elegant, readable, and influential for good reason, even if it occasionally threatens to explain everything and therefore, maybe, not enough.</p><p>We start with Challenger, the exploding shuttle, and the literal O-ring that gives the model its name. From there we work through Kremer&#8217;s central intuition: when production is highly complementary, weak links matter enormously, and the best workers or firms want to match with one another. That basic logic helps explain why high-wage workers cluster together, why firms may separate sharply by quality, and why richer economies may be able to sustain much longer and more complex production chains. But the further you move from tasks to firms to whole countries, the shakier the fit becomes. Along the way we connect the paper to modern questions about AI and automation, including whether AI reduces weak links by automating them, or instead makes production even more complex and therefore more unequal.</p><p><strong>Priors &#8594; Posteriors:</strong></p><p><strong>Prior 1:</strong> Is close complementarity the most important explanation for sorting within firms and productivity differences across countries? Andrey goes in at about 60% for firms and 30% for countries. Seth comes in slightly higher for firms and slightly lower for countries. By the end, both of us move only a little: somewhat more convinced that O-ring logic is a major force within firms, but still skeptical that it is the main explanation for cross-country differences once institutions, natural resources, and other macro factors enter the picture.</p><p><strong>Prior 2:</strong> Is O-ring logic a better way to think about AI and automation than the standard task-based framework? Both of us come in thinking the answer depends on the level of analysis. At the level of the individual worker, O-ring logic feels very strong: people are often defined more by their weak links than their strengths. At the macro level, substitution and reorganization matter much more. By the end, that view holds up. If anything, the paper reinforces the idea that AI may increase returns for highly capable workers by removing weak links around them, even as it automates more commoditized tasks.</p><p><em><strong>This episode is sponsored by <a href="https://www.reveliolabs.com/">Revelio Labs</a> &#8212; a great source of labor economics data for academics and firms. Now available on WRDS.</strong></em></p><p><em><strong>References:</strong></em></p><p><strong><a href="https://tecunningham.github.io/">Tom Cunningham&#8217;s Blog Posts</a></strong></p><p><strong>Acemoglu &amp; Restrepo &#8212; task-based automation framework</strong> The canonical accessible version: <a href="https://www.aeaweb.org/articles?id=10.1257/jep.33.2.3">Automation and New Tasks: How Technology Displaces and Reinstates Labor</a>, <em>Journal of Economic Perspectives</em>, 2019. </p><p><strong>Gans &amp; Goldfarb &#8212; O-Ring Automation</strong> <a href="https://www.nber.org/papers/w34639">NBER Working Paper 34639</a></p><p><strong>Rosen (1981) &#8212; The Economics of Superstars</strong> <a href="https://home.uchicago.edu/~vlima/courses/econ201/Superstars.pdf">PDF</a>.</p><p><strong>Becker on <a href="https://public.econ.duke.edu/~vjh3/e195S/readings/Becker_Assort_Mating.pdf">Assortative Matching</a></strong></p><p><strong>Hamming &#8212; You and Your Research</strong> <a href="https://fs.blog/great-talks/richard-hamming-your-research/">fs.blog</a>.</p><p><span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Philip Trammell&quot;,&quot;id&quot;:149815085,&quot;type&quot;:&quot;user&quot;,&quot;url&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/69df89bd-b8a9-4d84-98af-790dc873de54_1522x1522.jpeg&quot;,&quot;uuid&quot;:&quot;f2911581-fc27-4fb8-ae65-8399cfd116ce&quot;}" data-component-name="MentionToDOM"></span> <strong>&#8212; cross-task learning / automation and learning-by-doing</strong>: <a href="https://x.com/pawtrammell/status/2021038215667515559">the thread</a>. </p><p><strong>Patrick McKenzie &#8212; The Sort</strong>, <a href="https://x.com/patio11/status/1818781344702644311">the original thread</a>.</p><div><hr></div><h1>Transcript</h1><h2>Introduction [00:00]</h2><p><strong>Seth: </strong>Welcome to Justified Posteriors, the podcast that updates beliefs about the economics of AI and technology. I&#8217;m Seth Benzell, always benefiting from the high performance of my closely complementary co-host, coming to you from Chapman University in sunny Southern California.</p><p><strong>Andrey: </strong>And I&#8217;m Andrey Fradkin, always ruining our O-ring production function by not thinking in advance of my introductory catchphrase. Coming to you from New York City today.</p><p><strong>Seth: </strong>Do we have a sponsor?</p><p><strong>Andrey: </strong>We indeed have a sponsor, the wonderful folks at Revelio Labs. Thank you for sponsoring the podcast.</p><h2>The O-Ring Paper and Why It Still Matters [00:42]</h2><p><strong>Seth: </strong>Amazing. More on them later. So, Andrey, interesting episode today. We&#8217;re going to be reading and thinking about a bit of a classic about O-rings. So this is a paper from 1993 from Nobel laureate Michael Kremer, appearing in the Quarterly Journal of Economics. And it&#8217;s doing theory in a way that is a little bit unusual from our post 2010 econ reading selves, which is it lays out a grand theory of the economy. It shows that a bunch of stylized facts fall out of it and it pieces out. I enjoyed reading this paper, but is it science? Did you learn anything from it? I&#8217;m excited to jump in.</p><p><strong>Andrey: </strong>Yeah. And I think we should talk a little bit about what is interesting here. This paper has been very influential in how people think about production and about differences in productivity across firms and countries. I looked for mentions of this paper and the Nobel Prize. And it turns out that this is not the paper that Michael Kremer got the Nobel Prize for, even though In some sense, it might be more influential than his other.</p><p><strong>Seth: </strong>He won it for RCTs, right? Anybody could have thought of RCTs.</p><p><strong>Andrey: </strong>He won it for conducting randomized controlled trials in the developing world, which this is decidedly not.</p><p><strong>Seth: </strong>This is basically the economic opposite of doing RCTs.</p><p><strong>Andrey: </strong>But nonetheless, it has gotten a lot of attention more recently as people think about AI. How is AI going to affect production? Well, it really depends on what we assume about the production function. So with that, I I think maybe let&#8217;s get to our priors, Seth.</p><h2>Priors [02:45]</h2><p><strong>Seth: </strong>So basically what this paper is going to do is going to roll out a story of how production works in the economy where production is really complementary. In order to get something, a really high quality product, you need everything in your firm to go right and maybe kind of at the economy level if you want your country to be productive, you need all of the elements of your country to go right. And so... He thinks a lot of things fall out of this close complementarity between different inputs in production and like kind of weak links models of production. And two of the things that he thinks fall out of that is the big wage differential between rich and poor countries, and then the positive correlation in wages within a firm. So those are kind of two of the big predictions that come out of this model is that there should be big gaps between countries in wages. And then the second is across firms. there should be big gaps in wages as you get firms with lots of high quality, high paid workers and firms with lots of low quality, low paid workers. So I guess our first prior will be, do you think close complementarity is the most important explanation of those two phenomena?</p><p><strong>Andrey: </strong>Let&#8217;s focus first on firm productivity. Is it the most important explanation? My prior is probably yes. But I think it explains less than 50 percent of the reason wages are correlated within a firm. There are a lot of reasons, and I think this is probably the most important, but</p><p><strong>Seth: </strong>But there are.</p><p><strong>Andrey: </strong>I don&#8217;t think it explains most of it. So that&#8217;s fact one. For developing countries versus developed countries, like differences in GDP per capita, I don&#8217;t think this explains it very much unless you take such a broad view of production functions that you essentially fold institutions into production.</p><p><strong>Seth: </strong>The sheikh is a close complement to sitting next to the oil well.</p><p><strong>Andrey: </strong>That&#8217;s the good one. So yeah, so those are kind of my priors. And I guess I should put some probabilities on those. So this being the most important factor in firm differences, I&#8217;d put that 60 percent. And then this being the most important factor in country differences, this being the most important factor in country differences, I guess that&#8217;s a little tricky is I put that at 70 percent.</p><p><strong>Seth: </strong>Okay, cool. So 30 percent chance that it is. I would probably come in with maybe a slightly exaggerated version of those takes. I think at the firm level, if you think about, you know, why do the New York Knicks have five really good players and my pickup, I shouldn&#8217;t talk about the Knicks, I should talk about a good basketball team.</p><p><strong>Andrey: </strong>Yeah.</p><p><strong>Seth: </strong>I don&#8217;t know. You know, when LeBron took his talents to South Beach and he brought everybody to the Miami, why did everyone want to play there? Well, it&#8217;s because everybody, the best players want to play with the best players, right? And they understand that they&#8217;ll perform better next to the best players. The owners understand that if the best players are all combined with each other, they&#8217;ll get a better product. I think sports may be an interesting case because there&#8217;s kind of like a max productivity you could theoretically hit, right? You just can&#8217;t do better than winning maybe. So. maybe that would count against that particular setting.</p><p><strong>Andrey: </strong>disagree with you said it. already disagree. I would think like most truly superstar players, they don&#8217;t want to play with too many other superstar players because their production function is not the same as the team&#8217;s production function.</p><p><strong>Seth: </strong>No cap, so that&#8217;ll count against, so that&#8217;ll be a big count against the Kremmer story, because Kremmer&#8217;s gonna want all the superstars to be together.</p><p><strong>Andrey: </strong>Yes, I mean, we observe that&#8217;s not how basketball is organized and that superstars are dispersed across teams rather than all being on the same team.</p><p><strong>Seth: </strong>But maybe that&#8217;s how countries work, right? I mean, it seems like most of the superstars are in America if we&#8217;re talking about elite CEOs or elite scientists.</p><p><strong>Andrey: </strong>Well, there might be potentially CEOs and scientists in other countries that are hampered by, like that&#8217;s a claim. You&#8217;re, you&#8217;re, you&#8217;re selecting on the sample.</p><p><strong>Seth: </strong>It&#8217;s T-Mobility. Ha ha!</p><p><strong>Andrey: </strong>I guess basketball is pretty good at getting talent into the system. And then when we see that the talent is allocated, maybe the NBA as a whole is an over-ing production function, but each individual team is not structured to have only superstars.</p><p><strong>Seth: </strong>would bet that productivity on a basketball team, if you&#8217;re talking about like point score, at the point score level.</p><p><strong>Andrey: </strong>That&#8217;s what I&#8217;m disputing is that&#8217;s the productivity that we care about necessarily.</p><p><strong>Seth: </strong>My yes, so then there&#8217;s this other productivity which is like adjusting for compensating differential of you the stars getting each other&#8217;s ways, right? And of course that&#8217;s going to be challenge with all of this is bringing these hypotheses to the data given that in the background there could be all sorts of compensating differential things going on, To you. No, no. That was good. That was good feedback. So I guess at the firm level, if you were at 60 percent</p><p><strong>Andrey: </strong>Yes, yes. Anyway, sorry.</p><p><strong>Seth: </strong>I will come up to maybe 70 percent that this is the most important factor in correlation of wages at the firm level of all of the top people wanting to be together, is that they&#8217;re multiplying each other rather than substitutive. At the country level, I think you said 30 percent chance this is the most important thing. I think if we look empirically at the world, if we were just like weighing all countries equally, Right. You&#8217;ve got the issue of like these big resource rich countries, which has like a whole &#8216;nother story about why they have such high GDPs. I think there&#8217;s an issue around, you know, just like policy, like there&#8217;s a lot of reasons that countries might have different total productivities. And I think as we go on, we&#8217;ll talk more about the fact that even though this is a kind of a beautiful story in this model that can work kind of at the individual level, at the firm level and at the country level. I think my intuition is that this story is kind of like the most powerful at like the individual level. It&#8217;s pretty powerful at the firm level. And as we move to the society level, you can imagine so many margins of adjustment that maybe close complementarity isn&#8217;t the only way to think about the economy and that there are opportunities for substitution. So I would maybe come in, you said 30 percent chance is the most important thing, just below that 25 percent chance.</p><p><strong>Andrey: </strong>And to be clear, just, you know, it&#8217;s the most important, but it could still be like only explaining 5 percent of the variation and at the country level, right? Right. So I think that&#8217;s kind of the way in which it&#8217;s tricky. I don&#8217;t think either of us think that it explains more than 50 percent of the variation.</p><p><strong>Seth: </strong>For firm level, I might be convinced this is more than 50 percent of firm variation.</p><p><strong>Andrey: </strong>could be convinced, at the country level I don&#8217;t.</p><p><strong>Seth: </strong>Country level, it&#8217;s hard. There&#8217;s so much going on.</p><p><strong>Andrey: </strong>Yeah. Well, what about the second part?</p><h2><strong>AI Prior: O-Rings vs. Task-Based Automation [10:13]</strong></h2><p><strong>Seth: </strong>the second prior. my gosh. Second prior. So it&#8217;s an Econ of AI podcast. So I got to ask you an AI question, Andrey. Yes. Which is, you know, the foremost way of thinking about how AI will impact the economy is this Acemoglu-Restrepo task-based CS framework where you think about the firm or the economy as</p><p><strong>Andrey: </strong>fire.</p><p><strong>Seth: </strong>taking in a whole bunch of labor inputs of different kinds, combining them with a CES production function. And then the way that automation shows up is it shows up in some of the range of tasks that you need to combine. So you&#8217;d have a large range of tasks that you need to combine and then use a little bit of capital or you have a small range of tasks that you need to combine. And then you have more capital that represents capital substituting for labor. There&#8217;s kind of some sense in which that model Automation makes the economy in like this O-ring sense, like less complex because you have fewer labor inputs. If I replace, you know, imperfect Seth with, you know, pure heartless robot, there&#8217;s in the sense of the model we&#8217;re about to read that would make things like kind of less complex. There have been other authors who have kind of come at the question of automation from another direction, including I believe it&#8217;s Avi Goldfarb and Josh Gans.</p><p><strong>Andrey: </strong>They&#8217;re in fact both on a paper.</p><p><strong>Seth: </strong>Both on the paper. Avi will be able to talk to soon. And maybe, Andrey, you talk to us a little bit about the way that they think about how O-rings could apply to automation.</p><p><strong>Andrey: </strong>I don&#8217;t want to get too deep in their paper, but they&#8217;re thinking about essentially that time allocation of an individual person. So as podcasters, we do various things. We read the papers. We record the podcast. We do the show notes. We read the transcript. Now, what if a wonderful piece of software automated the transcript correction for us? Or maybe that&#8217;s even a human. that gives us more time to focus on other parts of the podcast. How is that going to affect the overall productivity?</p><p><strong>Seth: </strong>really don&#8217;t like this map. The implication I would not immediately take all leisure, but also the implication that if I&#8217;m thinking about basketball, so in current basketball, you need to be agile and you need to be tall. And like, let&#8217;s say we move to a version of basketball where for whatever reason, tallness no longer mattered, right? It&#8217;s not like I can reallocate my skill points from tallness into agility, right?</p><p><strong>Andrey: </strong>So I think you&#8217;re setting up a straw man of their argument. I don&#8217;t know. like, I think my straw man. even read, which is, know, haven&#8217;t we all. But I think the idea here is that like, there&#8217;s something within a person that needs to happen about certain production. And if you can, if one part of that all of a sudden gets automated, but the rest you&#8217;re still doing, there&#8217;s a sense in which Parts getting automated, cache increased your productivity. So that&#8217;s kind of the argument.</p><p><strong>Seth: </strong>Right. the idea there is we should be really automation. We should be thinking about within individuals who have tightly complimentary components rather than thinking about kind of at the firm or the economy level, you bring in some automation, you dump your automation into your, know, your stack of labor inputs and you know, more output and maybe it&#8217;s substitutes for labor. you drive wages down, right? Yeah. I mean, Andrey, do you want to take the first swing at this one?</p><p><strong>Andrey: </strong>I think both are going on. This is going to be a typical Andrey answer. I&#8217;m very against mono-causal. I think there are definitely parts of production that are exactly O-ring. I think for certain individuals, it must be the case.</p><p><strong>Seth: </strong>Two-handed, the famous two-handed economist. there&#8217;s also a sense in which the economy can reorganize your job to break up the O-ring, right?</p><p><strong>Andrey: </strong>So they can and that and that kind of goes to the question of like, what is differentiated about your job and why isn&#8217;t the economy able to trivially substitute for it? And podcasting is a great example. think like, I will not use our system example. Let&#8217;s use Dorkash as an example.</p><p><strong>Seth: </strong>Never heard of him. Is he a podcaster?</p><p><strong>Andrey: </strong>A favorite of the AI crowd. Let&#8217;s say that he&#8217;s able to spend more time researching and preparing for his podcast and is able to recruit better guests because he no longer has to do some of the stuff that he was doing when he was starting out, like manually doing his own editing, hustling to get noticed and so on and so forth. Presumably that podcast becomes a lot better. And since people, he&#8217;s already built a relationship with a certain group of people, it&#8217;s going to be very hard for the economy to substitute for him.</p><p><strong>Seth: </strong>I don&#8217;t think you even need the time reallocation part if you automate his weak link He doesn&#8217;t even need to get better at other stuff, right?</p><p><strong>Andrey: </strong>Yeah, he does. Yeah, he could. He could just take that as leashed for sure. Right. I think that&#8217;s a good point. But he could invest it even to the extent that like, it is an O-ring production function. He&#8217;s getting such great returns now from if now his audience is millions of people and before it was thousands, then the returns to quality under are probably a lot higher, right?</p><p><strong>Seth: </strong>Income effects as well, right? A richer person works less.</p><p><strong>Andrey: </strong>Yes. I think both of us are in agreement here. if we posited other production functions, then his returns from investing quality would be a lot lower because he&#8217;s kind of already good enough.</p><p><strong>Seth: </strong>Okay, so. Okay, so I guess this is kind of a vague prior, but I guess the question is, like, gun to your head if you have to choose between O-Ring and Osamuoglutus strappo, or how do you want to frame this one? Or what percentage is relatively more important? Or we could talk maybe more about like, at what levels of analysis?</p><p><strong>Andrey: </strong>Yeah, that&#8217;s kind of where I was thinking about going. think the more micro, the more O-ring, the more macro, the more substitute. But even like at the macroeconomic level, I think there are parts of production processes that feel O-ringy to me. So that I&#8217;d say maybe 20, 30 percent at the macro.</p><p><strong>Seth: </strong>So 20 to 30 percent of the macro level at the individual level 80 percent. So it depends so much across occupations. you&#8217;re shoveling shit, I don&#8217;t know, your O-ring matters for you.</p><p><strong>Andrey: </strong>I know Seth had to get in a curse word just for our special listeners. Shove it. Doodoo. We&#8217;re kid-friendly as a podcast. Yeah, I think that&#8217;s about right. What do you think?</p><p><strong>Seth: </strong>du-du. I think that&#8217;s exactly the right way of thinking about it, which is I would think both in terms of scales and timeframes. So at the individual level, and this isn&#8217;t really gotten into in this paper, so maybe we&#8217;ll talk about it a little bit later, I think it&#8217;s inarguable that abilities are complementary within an individual. If you have an individual who&#8217;s smart and strong and lazy, they&#8217;re a bad worker. If you have a worker who&#8217;s strong and energetic, but not smart, they&#8217;re a bad worker, right? I think people are weak link people, right? They&#8217;re kind of defined by their weaknesses in some ways more than their strengths. If you think at the firm level, you imagine firms optimizing around this problem to try to minimize it, but obviously there are cases where you can&#8217;t minimize it. If you&#8217;ve got a fab where you&#8217;re making computer chips and you do a thousand steps to go exactly right to make billions of dollars of computer chips, You better believe that you&#8217;re going to think hard about every part of that process, right? But as we get to the macro level, we&#8217;ve talked about this. There&#8217;s all sorts of substitution that can happen and reorganization. The part that I would add to what you just said is across time, think this matters too, right? So we know the famous, I think Tyler calls this Le Chatelier principle, which is actually a chemistry principle. But the idea that things get more substitutable over time, yeah, I mean, Right now, in 1990s, the only way to get to outer space was with a space shuttle. And then in that space shuttle, the weak link was the O-ring. We&#8217;ll talk about this in a second. But now there&#8217;s like two or three different ways to go to outer space, which might have different weak links. And if you can choose between different production functions that have different weak links, all of a sudden, weak links start mattering less, even if every process has one.</p><p><strong>Andrey: </strong>Minor point of clarification, please. 1980s.</p><p><strong>Seth: </strong>1980s. No, but the shuttle was still going into the 90s though.</p><p><strong>Andrey: </strong>happened in the 80s. I, so I think like a couple of other thoughts here, know, various folks have written about production functions in interesting ways that kind of reflect this, like, Richard Hamming wrote a lot about what it takes to be a good researcher. And a lot of that, you know, his essay, you and your research, you know, talks about kind of weak links in the production process of being a researcher. And I certainly think that We&#8217;ve all seen various versions of that failure mode. A classic version is if you&#8217;re a bad presenter, that could really sink your scientific career. And that seems pretty O-ringy. On the other hand, I do think there&#8217;s something about the modern, more advanced economy in which certain things that would have been O-ring production functions historically are becoming a lot less so. To give an example, with chat GPT, maybe lack of ability in a particular language is less important for doing scientific research than it was previously, right? Because then now you can write in English, even if you&#8217;re not a good writer in English.</p><p><strong>Seth: </strong>That&#8217;s a really good one, right? Exactly. So one way to think about that is automating a thing that is a weak link for a lot of people so that they can now just focus on the thing that they&#8217;re good at. Yes.</p><p><strong>Andrey: </strong>And then the other thing to think about, and think Phil Trammell has written about it, is there might be some sort of benefit from doing all the tasks in a production process because of learning. Like, it&#8217;s hard to do good transcript editing unless you&#8217;re actually the one who had the conversation in the podcast because you kind of know what you&#8217;re trying to say. That&#8217;s kind of a trivial example, but you can imagine many such examples. so then... You want to you&#8217;re tempted to get rid of the O-ring by, you know, having the computer do certain of the tasks. But then that ruins your learning and your ultimate ability to produce.</p><p><strong>Seth: </strong>All right, well put, Andrey. Are we ready to move into the evidence?</p><p><strong>Andrey: </strong>Yes, the evidence being a theory paper. Exactly.</p><h2><strong>The Challenger Disaster and the Origin of the O-Ring [22:12]</strong></h2><p><strong>Seth: </strong>All right, so the evidence. Well, in I think an underappreciated and underutilized way of starting an economics paper, he starts with an explosion. More economics papers should start with big explosions, Andrey. Particular, he&#8217;s talking about the Space Shuttle Challenger disaster, which is famous for, you know, we used to have these space shuttles that would go up, come back.</p><p><strong>Andrey: </strong>True.</p><p><strong>Seth: </strong>You can Google them if you&#8217;re interested in them. In particular, there was this one event where they had a high school social studies teacher on the spaceship. It was going to be the first civilian in space. And she was the first civilian in space for about 30 seconds before the whole thing explodes. Right. And so the government, Reagan, convenes this blue ribbon panel to try to figure out why did this spaceship explode. And there are so many cool people on it. Neil Armstrong&#8217;s on the panel, Sally Ride&#8217;s on the panel, and of course, iconic iconoclast, Richard Feynman. And so Richard Feynman is working with his team trying to figure out why did the spaceship explode? There&#8217;s all this sort of bureaucratic, you know, finger pointing, hey, what went on here? Sally Ride gets an anonymous tip from a NASA engineer showing this table that said, Hey, when temperatures get low, there seem to be this one part that has a higher rate of failure. So in particular, it&#8217;s this thing called an O-ring. It&#8217;s kind of a gasket. It&#8217;s a rubber gasket. And if you think about the space shuttle engine, it actually kind of has a bunch of kind of chunks in it. And the hot gas needs to flow between these chunks. And so this rubber gasket went around the connections, keeping the hot gas between the chunks. So what went wrong? Well, Feynman famously showed in one of his committee hearings that they were having, bringing in evidence and talking to experts about what could have gone on. He famously kind of opens up a model of the space shuttle, takes out a bit of O-ring that was of the same material, dunks it in a glass of ice water. Then at the end of the hearing, he pulls it out and he&#8217;s like... clearly the things like about to break, it&#8217;s the wrong shape. And he&#8217;s like, I think this may have something to do with our, we&#8217;re discussing today. So, love a Feynman moment. Yeah. And so the committee ultimately decides that this, you know, the shuttle program as a whole was worth over, costed over $280 billion in 2026 numbers. This whole program eventually, you know, this particular space shuttle of five was blew up. And after this disaster, the program released, it&#8217;s the beginning of the end of the program, all brought down by a single, what, couple hundred dollar large rubber gasket. So that&#8217;s the O-ring. It&#8217;s the weak link that destroys your amazing operation. And so what kind of production function does that inspire, Andrey?</p><h2><strong>The O-Ring Production Function [25:20]</strong></h2><p><strong>Andrey: </strong>Yeah, so hence the O-ring production function. It&#8217;s quite simple in that you imagine that there are n tasks in a production process, and each of them is done at a certain quality level. We can think of this as a probability of success. So it could be 0 out of 1. And you just multiply that quality together, and that determines how much is produced. There are additional terms there relating to overall productivity and the amount of capital. but let&#8217;s just ignore that for this discussion.</p><p><strong>Seth: </strong>Yes.</p><p><strong>Andrey: </strong>It&#8217;s in some sense, like strikingly simple, right? That&#8217;s, I think, one of the reasons that this theory is so influential. But you get these implications out of it that are quite interesting in that for production, for example, you want, you know, if you have one high quality component, then you&#8217;re getting a lot more out of having the other tasks also be done by a very high quality person. And then you&#8217;d have longer chains. Obviously, you want longer chains to have higher success probabilities. And so he just derives a competitive equilibrium. That&#8217;s kind of the first thing that he does in such a simple model.</p><p><strong>Seth: </strong>And you just point out the big thing that comes out of the equilibrium is assortative matching. Yes. In equilibrium, everyone should have the same skill level. And just to pull out a quote on that, since profits are zero for all firms given the wage schedule, firms are indifferent as to the skill level of their workers as long as their labor force is of homogeneous skill. Equilibrium holds when firms demand the number of workers of each skill available in the population. Well-behaved problem, the competitive equilibrium is optimal and unique up to reassignments of workers of equal skill. firms enter, every firm is going to have a worker of exactly the same skill level, right? Or rather, yeah.</p><p><strong>Andrey: </strong>Yeah. And this has like a flavor of some of these labor market matching models, you know, going all the way back to Becker, where you think about like, do high skilled workers match with high productivity firms, and you got very analogous results under certain assumptions. So he does this and this is kind of like a true, you know, pretty trivial grad school exercise. And then he goes into applications, which is kind of cool to do. It almost reads in many ways like a sub stack post, honestly.</p><p><strong>Seth: </strong>I have to say, Andrey didn&#8217;t want one of our priors to be about theory today versus theory in 1993. But guys, if you want a recommendation about one of our papers that&#8217;s actually super readable and not too long, this is a very readable paper. The math doesn&#8217;t get crazy. The math is exactly as complicated as it needs to be. And all of these interesting results fall out of it that seem to stylistically hold. Andrey, wish I could write. I wish they let me publish theory papers like this. I wish there was more. There we go. There we go.</p><p><strong>Andrey: </strong>Some sec they do. Well, you know who I view writes in this style? Friend of so Tom Cunningham, who writes very influential blog posts on his website, which are essentially written like this.</p><p><strong>Seth: </strong>Yeah, this is a truly epic blog post that we&#8217;re unpacking, guys, for you today. You said that this had the flavor of matching markets. You know what this kind of had the flavor of, to me, is kind of Rosen&#8217;s Superstar Markets. And Rosen does get cited a bit here, right? So you can think about, in the context of superstar markets, the richest guy wants the best surgeon for his heart disease, right? So that&#8217;s kind of another example of this kind of close complementarity and matching you would get.</p><p><strong>Stylized Facts: Development, Firms, and Sorting [29:09]</strong></p><p><strong>Andrey: </strong>Okay, so let&#8217;s go through some of the facts. There are a bunch of facts. So maybe we&#8217;ll be a little</p><p><strong>Seth: </strong>Yeah, I think let&#8217;s go bang. Yeah, the first one I have is wage differentials between rich and poor countries as large He starts with kind of one of the most tendentious ones</p><p><strong>Andrey: </strong>Yeah, well, I think there obviously, you know, we already started talking about this, but so many other reasons why there are differences in wages and productivity, know, institutions is certainly something that for me is very important, but also natural resource endowments, as we discussed. I guess that&#8217;s even related to some of the geography explanations that we can talk about. Institutions are really broad, goes, know, specific laws, could be, you know, the levels of corruption. And then we get into these kind of intermediate things where like, well, if the education system isn&#8217;t good, is that something about O-rings or is that something about something else that&#8217;s important?</p><p><strong>Seth: </strong>No, I was going to agree with that. I&#8217;ll just give an example. If your economy doesn&#8217;t have any natural resources and so energy is very expensive and that&#8217;s why wages are low, is that no ring story? In some sense, no, because o-ring is about uncertainty and inputs. I&#8217;ve just told the story about, there&#8217;s a compliment that&#8217;s expensive.</p><p><strong>Andrey: </strong>Yeah, well, all right. So uncertainty and inputs is not something we&#8217;ve talked about yet, Seth. let&#8217;s table that for a second. But is the O-ring the first component of many of those explanations? No, it&#8217;s not. I think like a counter to that, let&#8217;s steel man this a little bit. Well, you do see that Immigration flows to high income countries from like very highly skilled people in those countries. So you have like the brain drain effect that&#8217;s suggesting that like an individual level people are choosing to</p><p><strong>Seth: </strong>There&#8217;s a sort of matching, but do you the opposite? Do you see like low skilled Americans moving to like?</p><p><strong>Andrey: </strong>Everyone wants to move to... Yeah. Yeah. So there&#8217;s that going on. think like a very controversial theory that&#8217;s out there is an IQ driven theory of productivity. think, is it Garrett Jones that...</p><p><strong>Seth: </strong>Right, you should care about your country&#8217;s average IQ, right?</p><p><strong>Andrey: </strong>Yeah, yeah. So that kind of seems to relate to some O-Ring production function. So if you have like very high ability people and all the important positions in the economy and society, then you&#8217;re going to do better than if than if you don&#8217;t. mean, I&#8217;m not an IQ, you know, truth or anything like that. So, you know, I wouldn&#8217;t like lean on that too much. But if we think about something about ability. then there&#8217;s naturally a genetic component to it and then there&#8217;s some sort of.</p><p><strong>Seth: </strong>Also education of course.</p><p><strong>Andrey: </strong>And like even just, you know, even more so maybe than education is just like the upbringing environment, whether your certain norms are instilled in you.</p><p><strong>Seth: </strong>at that time. So like leisure preference, Protestant work ethic. Is Protestant work ethic part of the O-ring production function? At a certain point, this concept can expand so big that you get everything.</p><p><strong>Andrey: </strong>Exactly. You know, you do have countries where they do grow a lot. And so then there&#8217;s a question like, well, does the increase in the ability of the workers explain the growth of the country&#8217;s economy? I&#8217;m not sure. I mean, it is plausible that it&#8217;s a contributing factor. think the other, obviously, the opposite direction is something that we see where like the Soviet Union had extremely high quality education and worker abilities and yet was unable to produce things very effectively.</p><p><strong>Seth: </strong>Yeah, Soviet Union is kind of a counter example because that&#8217;s an institution story for why things don&#8217;t work out, not a complementarity story. So it seems over determined, could have been a lot of stuff going on. All right, next one. This one&#8217;s a little bit, I think we&#8217;re going to be agreeing a little bit more with, firms hire workers of different skill and produce different quality products. And there&#8217;s a great quote here.</p><p><strong>Andrey: </strong>Yeah, all right. Yeah, it is a bold claim to- Paper, yeah.</p><p><strong>Seth: </strong>In many industries, different firms hire different qualities of workers. Restaurants, for example, come in a range of quality levels. McDonald&#8217;s does not hire famous chefs. Maxim&#8217;s does not hire teenage waiters. Charlie Parker and Dizzy Gillespie work together. So do Donny and Marie Osmond. Some of those references quite live. I have no idea who those dancers are or singers.</p><p><strong>Andrey: </strong>Yeah, I mean, this is a warning about putting dated references in your papers. There needs to be like an AI that opts that like auto up. You know, McDonald&#8217;s does sometimes hire famous chefs, I think.</p><p><strong>Seth: </strong>Nervous. Right. And I think that&#8217;s actually an issue here. Right. And we&#8217;re going to come up. We&#8217;re going to come to this in the next bullet, which is this sorting is far from perfect. Right. So I definitely do believe that within a production process that&#8217;s a weak link production process, you do have to think really carefully about investing in a quality level such that you don&#8217;t blow up the thing at the last step. Right. That makes sense. That&#8217;s in this paper. However, a firm isn&#8217;t a production function, right? If you think about a conglomerate that has one business that&#8217;s doing financial lending and one business which is building jet engines, there&#8217;s no close complementarity between the banker and the financials decision and the engineer and the jet engines division. I agree with the concept, but I don&#8217;t think that concept is a firm in real life, right?</p><p><strong>Andrey: </strong>Yes. Well, there are kind of like other reasons why firms might hire workers of similar abilities. mean, I think one version of that is cultural norms is certainly one, Sure. And this kind of explanation also for like wages of workers across different occupations within the firm. But like if you have very wealthy partners at your firm, then even they might feel bad paying a secretary low wages. And then at the same time, given the high wage of the job, you&#8217;re going to get the best secretaries to join. There&#8217;s kind of like a reverse causality that, call the factor, it might be something about cultural norms, the fairness within the firm, rather than the all-ranginess of the...</p><p><strong>Seth: </strong>Right, a related idea is just an institution of profit sharing, right? So a lot of like places have a norm that if you have a windfall year, everyone should get bonuses, right? And that might not be like optimal behavior that could just be an institution.</p><p><strong>Andrey: </strong>Yeah, or just another like a brilliant. Yeah, like let&#8217;s say a firm is very cool, you know, then lots of talented people may want to work there and potentially even at lower wages because of, you know, all else equal because of compensating differential. They all want to say that, you know, they want to work for the NBA, right? Even if, know, if you&#8217;re not a basketball player, maybe, you know, you don&#8217;t need as much skill in that job, but it&#8217;s cool. It&#8217;s a cool job. Right. So.</p><p><strong>Seth: </strong>onions</p><p><strong>Andrey: </strong>So once again, you&#8217;re going to select for high ability people who care about this common factor about this firm. And that&#8217;s going to create correlations.</p><p><strong>Seth: </strong>I think I want to push just a little bit harder on that kind of like cultural lining up idea, which is just the idea that you could have a theory that firms end up with homogeneous groups just because like that&#8217;s how management works. It&#8217;s like it&#8217;s easier to manage a homogeneous group. And then it like it just so happens that homogeneous groups are of equal skill levels and demand similar wages. Right. So you could get there otherwise. A closely related hypothesis is there is a positive correlation among the wages of workers in different occupations within enterprises. This is just kind of a different way of saying the same</p><p><strong>Andrey: </strong>Yeah, I agree. I think it&#8217;s kind of the same thing. Yeah. I think one relevant piece of evidence there is there&#8217;s this literature on domestic outsourcing where essentially like functions of the firm that were like done inside the firm, like janitorial services get removed from the firm and then get contracted out to a different firm. And that allows there to be a cultural break in the wage paying of the janitors. Essentially, it&#8217;s a way for firms to get much They would say much more efficient janitorial services.</p><p><strong>Seth: </strong>I think the way to say it would be commoditized, right? There&#8217;s huge boundary of the firm issues here, right? So I gave the example of conglomerates, right? That&#8217;s kind of like stuff that&#8217;s in your firm that shouldn&#8217;t be. And now you&#8217;re giving like the other example, right? Which is if you outsource a thing, you know, maybe you outsource the high skilled thing to the consultants, you outsource the low skilled thing to the janitors. I mean, it&#8217;s not necessarily the case that you would only outsource the things that aren&#8217;t at exactly your skill level. Maybe you&#8217;d be more likely to. but you outsource things for all sorts of reasons. One thing I wanted to ask you about here, Andrey, is I guess two things. So point number one is I think a corollary of this that I don&#8217;t think is really teased out in the paper but immediately falls out is the large firm wage premium. Maybe he does mention that at some point. And that seems to be a really well established fact. Larger firms pay more, I&#8217;ve looked into that. But then I actually think about, okay, so what are the largest firms? And I think about something like Walmart. Like it&#8217;s a little bit hard to convince me that Walmart is the biggest firm because it only hires the strongest workers, right? So help me think about what&#8217;s going on in the Walmart case.</p><p><strong>Andrey: </strong>Yeah, mean, Walmart has, I&#8217;d say, like a management system that is very good at taking in, let&#8217;s say, lower skilled workers and making good use of them. And that for Walmart, I think explains a lot of what&#8217;s going on with the firm size, right? Because most of Walmart workers are not corporate.</p><p><strong>Seth: </strong>Is the way to think about this is almost like a conglomerate of two production functions, right? There&#8217;s the managerial production function where we&#8217;ve got lots of super high performing people who are close complements to each other. And then there&#8217;s like the store level, essentially commoditized labor that isn&#8217;t a close O-ring with everything else. And we should almost think about two production functions in the firm.</p><p><strong>Andrey: </strong>That seems probably right, yeah.</p><p><strong>Seth: </strong>Right. It&#8217;s just a challenge of bringing these concepts to the real world, right, given that firms don&#8217;t mean firms, right?</p><p><strong>Andrey: </strong>Yeah, yeah</p><p><strong>Seth: </strong>See what I&#8217;m saying? Firms don&#8217;t mean production function and he wants to talk about production function.</p><p><strong>Andrey: </strong>Yeah, I mean, there&#8217;s a version of that. So we like subset to like a particular like banking or tech, isn&#8217;t it? Those are both interesting cases to walk through. Like it is true that big tech firms like, you know, I&#8217;m thinking about Google here, like have a very high talent level. If you&#8217;re in tech and you&#8217;re a very good firm, you&#8217;re presumably a growing firm. So eventually you&#8217;ll get to a large size. In finance, it&#8217;s a little trickier, I&#8217;d say. I don&#8217;t think very many people would argue that the average quality of an employee in corporate investment banking is better than at a top hedge fund, which probably has way fewer employees. But I&#8217;m sure there&#8217;s a positive correlation between size and finance and productivity. it&#8217;s very easily, you can think of very strong counter examples amongst subsets.</p><p><strong>Seth: </strong>Right. Even at the firm level, can eat. said that this like the story works the strongest at the firm level. It&#8217;s to work strongly at the firm level. But even then, you can tell stories where it doesn&#8217;t so well. The next bullet was firms only offer jobs to some workers rather than paying all workers their estimated marginal product. This one I was kind of an eyebrow raise about. Right. I know in equilibrium, everybody&#8217;s going to end up at the same tier in his model. But like, you know, if I had a really big compensating differential and wanted to work at a firm, that was like way shitty, way less good than I am capable of working at. Like fine, I don&#8217;t see why this is ruled out. If in the background, you know, workers have compensating differentials, it seems like there&#8217;s no reason you shouldn&#8217;t make an offer to everyone at their marginal product.</p><p><strong>Andrey: </strong>I guess there&#8217;s just kind of this implicit assumption here that you need like one person in each slot and so then there&#8217;s no reason to hire two people with the same skill set if that makes sense. Is that one defense of this?</p><p><strong>Seth: </strong>I think that&#8217;s part of it. But even then, suppose I only have one slot for CEO. so I&#8217;m the CEO who&#8217;s not good enough to work at your company under normal conditions, but I&#8217;m willing to take a negative wage, right? Because I just want things.</p><p><strong>Andrey: </strong>And actually that never happens. That&#8217;s amazing. That&#8217;s a great example, right? I think actually lots of people would be happy to take a negative wage to be CEO for a lot of firms and yet they don&#8217;t know unless</p><p><strong>Seth: </strong>I&#8217;m trying to think but like maybe we could find like a sports example. I don&#8217;t know. Okay, fair enough All right, maybe that one does hold All right next one income distribution is skewed to the right and so here the argument is Because every firm is the product of everybody&#8217;s like positive value input qualities well Anything squared is going to be more skewed to the right than the thing itself, right? Yeah</p><h2><strong>Right-Skewed Income and Superstar Effects [43:13]</strong></h2><p><strong>Andrey: </strong>Maybe this is the IO for me, is I think like people who get really high wages, although income can be thought of as broader, but let&#8217;s think about the wage distribution, have some sort of differentiated ability. And I guess this model does deliver that in that there&#8217;s just a single dimension of quality. And then your differentiation is that you just have high quality and those at the very, very top of the quality, aren&#8217;t very good substitutes for them. And therefore that explains their skewed income. I guess I&#8217;d want a richer, a bit of a richer model to truly explain distributional skews, like give something, anything about like superstar athletes or superstar media creators. It&#8217;s not like a single dimensional quality thing that&#8217;s going on. It&#8217;s actually just something about them. That means that there are no close substitutes in or it depends on the I guess in sports there are close substitutes. Let me take that back. Sports are close substitutes maybe and media and other and other places they&#8217;re not. Now that I&#8217;m thinking about it, maybe it&#8217;s just different cues for different industries, but then cues of fine abstraction.</p><p><strong>Seth: </strong>board with this, I think you can tell a complementarity story at a couple of different levels that&#8217;s important for thinking about skew right production functions. If we think at the firm level, right, I go back to kind of the Rosen superstar story. So we&#8217;ve already, let&#8217;s just take as a brute fact that the firm distribution is right skewed, right, which it is, it&#8217;s kind of log normal, even power law at the top. Given that, right, if you get a sortative matching, course, incomes are going to be skewed because I&#8217;m a compliment to Walmart and you&#8217;re a compliment to the 500th biggest firm. So of course, our incomes are skewed. How do I think about complementarity as leading to skewed income distributions? The way I would think about it is just accept as a brute fact that the firm size distribution is skewed right. So the firm size distribution is log normal or even power law distributed. So very extremely distributed. And then, of course, it makes sense. that the CEO of Walmart is going to be paid a lot, a lot more than the CEO of the 500th largest firm. In other words, complementarity plus the firm size distribution is skewed gets you the income distribution is skewed. You could think about that same story, superstar story for superstar surgeons and rich people, superstar personal services and rich people. way I think you can get to the skewed right income distribution as arriving out of O-ring complementarity is within the individual. So we talked about this a little bit earlier, right, which is you can also think about individual ability as being combined. You know, my strength plus my speed plus my hard work determines, sorry, times determines how well I&#8217;m doing. If you imagine each of those individual components are random draws, Well, Andrey, what happens when I take the limit of the product of many positive random variables? What distribution do I get?</p><p><strong>Andrey: </strong>PowerLog, your favorite.</p><p><strong>Seth: </strong>Yeah, not quite. You get a log normal, right? No, no, I&#8217;m almost at power law. The product of many positive random variables is a log normal distribution. That&#8217;s the multiplicative version of the central limit theorem. Central limit theorem is some of a of stuff is normal. The product of a lot of stuff is a log normal. Now, if you keep, keep multiplying, that log normal gets more and more stretched out into a diagonal power law. So, Andrey was one step ahead of me. But why do I bring this up? I mean, I think about, have you heard of these things, Lotka curves, Andrey? So a Lotka curve is...</p><p><strong>Andrey: </strong>It&#8217;s like Laka Volterra model.</p><p><strong>Seth: </strong>No. I don&#8217;t think so. No, guy&#8217;s like a 1920s chemist. This is back when you just have curves named after you. What is the Lotka curve? It&#8217;s when you draw a distribution of the rank of someone in terms of individual success against some numerical value for their success. So you can do this for like number of scientific publications.</p><p><strong>Andrey: </strong>Lotka.</p><p><strong>Seth: </strong>You can do this for like points scored in basketball or lifetime RBI&#8217;s. So you get this distribution of kind of individual successes. And what you find when you draw these curves is that they are power law or log normal, right? And so the way that I think about why does that happen is, that individual success you can be thought of as O-ring in all of these different components. In order to be Novak Djokovic, you&#8217;ve got to be amazingly innately talented and get training when you&#8217;re a young kid and be innately dedicated and that way you become a superstar and you have 24, you know, tennis championships. Whereas if you miss just one of those weak links, you&#8217;re going to be in the mass of mediocrity. So I guess income distribution being skewed to the right, the story we get in Kremmer&#8217;s paper is that this has to do at the firm level with complementarity. But I think even individual level complementarity with inability still gets you this skew distribution.</p><p><strong>Andrey: </strong>Yes. Now I will like add like a wrinkle to some of these explanations. So like the way the paper is written is really about like true productivity, like production. And I think about like something like sports from a societal point of view, it truly doesn&#8217;t matter whether LeBron is the star or the like fifth best player is the star. Right? Like these are like not like I think demand for basketball would be exactly the same. Some stars are obviously more photogenic, but those are not aspects of the on-field play, if that makes sense. I think it&#8217;s actually quite similar with the superstar doctor and the rich guy. It&#8217;s not like the superstar doctor. could be that they&#8217;re truly a much better doctor, but actually, I doubt that&#8217;s true. could just be one. Yeah, exactly. And so then both of those phenomena are in some sense, like not very important for economic productivity or growth or GDP, right? Whereas what we really care about is something like science, having O-Ring in science means that we get much better science. And that&#8217;s not just like a relative contest, but an absolute contest, because it&#8217;s an objective.</p><p><strong>Seth: </strong>Citations might be a relative contest. don&#8217;t know. That&#8217;s the only issue there is how do you measure absolute progress in science?</p><p><strong>Andrey: </strong>No, no, sure. like, I read citations as a metric, just to be clear, especially in our field. mean, economics, fine, but like business school, academia essentially has produced epsilon of value to society and has a lot of citations. So incredible. Hi, business school professors and students. Thanks for listening. No, but seriously, right? Like you have, you have these people on the top of citation lists. you know, you look at all their papers and they&#8217;re like, if they didn&#8217;t exist, would anything change in the world? And the answer is no. in contrast to O-rings, know, which has changed how a of people think about things. So yeah.</p><p><strong>Seth: </strong>It really changed. If the O-ring isn&#8217;t there, the space shuttle blows up.</p><p><strong>Andrey: </strong>Yes. Do you want to talk a little bit about the imperfect information version of the model? Because that&#8217;s kind of what he then devotes most of rest of the paper to.</p><h2><strong>Imperfect Information, Education, and the Big Sort [51:13]</strong></h2><p><strong>Seth: </strong>Yes, that&#8217;s right. So I already started introducing it just a little bit, right, which is if you think about if a firm has many uncertain components, or even if within an individual, you can&#8217;t select how good you are at these different things. So you have kind of uncertainty in all of your individual abilities. I&#8217;ve already said what happens when you multiply a lot of positive random variables, you get a log normal distribution. Interestingly, that does not come up in this paper. I control F. for log normal, I couldn&#8217;t find it, but that is what you would get. And so he gets kind of three results that kind of come out of this sort of uncertainty. Kind of the big takeaway I took about from the uncertainty is when there&#8217;s uncertainty, you want to use the most valuable, the best workers for those very last steps, right? You know what? When you&#8217;re doing primary production, sure, any, any schmuck can work on that. And if they screw up, fine, you just dig another hole. But when we&#8217;re doing the ninth layer of painting on the Rembrandt painting, you got to make sure someone&#8217;s not going to screw up all of the painting that went under that. And so the two kind of development results that fall out of that are poor countries have higher shares of primary production in GNP, right? So there&#8217;s less to screw up at earlier stages. So you&#8217;re going to specialize in that if you have low quality workers. And then secondly, go ahead.</p><p><strong>Andrey: </strong>Seth, I&#8217;m confused. I thought we were talking about uncertainty and now you&#8217;re telling me about average low quality workers.</p><p><strong>Seth: </strong>The connection I see is you don&#8217;t know the quality you&#8217;re going to get at every stage of production. So I&#8217;m going to do a first stage of production.</p><p><strong>Andrey: </strong>You can&#8217;t sort. You can&#8217;t sort. That&#8217;s the key intuition. You can&#8217;t sort people because you don&#8217;t know who is who. And therefore, you switch to production processes that are less, you know, the chains are less long, the production chains are simpler. And, you know, those are typically less valuable.</p><p><strong>Seth: </strong>here to Exactly. And I guess we can think about sort of if there&#8217;s imperfect observation, right, then you&#8217;ll still see a sort of matching of the stronger workers in expectation to the later stages of production. Good point.</p><p><strong>Andrey: </strong>But then, and this is an interesting point that he makes, it&#8217;s almost an aside here, is well, then you have these potentially multiple equilibria for society. So if society invests in education and sorting, then it can sustain very complex production processes that increase GDP. On the other hand, if everyone thinks that no one else is going to get educated and else is to get sorted well, then why would they invest in their own education? Because being high skilled is not as valuable in that situation. Alternatively, they might just leave the country if they get educated. so, societies can fall into low-skilled traps versus high-skilled boons. And it relates to a concept in the internet discourse, which is the big sort. So, think Patrick McKenzie likes to use this metaphor where or like the sort where society has gotten a lot better due to information technology in sorting people by their abilities. And so, you know, back in the day, you might have someone very talented, you know, working at a very local, you know, small business. But these days, because it&#8217;s so much easier to find out, like how to break into more lucrative, higher impact industries. And at the same time, it&#8217;s much easier to signal objective quality, although maybe we&#8217;re ruining that ability with our current set of technologies. You get people sorted into very talented people end up being in the highest marginal productivity jobs, right?</p><p><strong>Seth: </strong>Right, get the impoverishment of the periphery and the enhanced- everyone skilled goes to the Metropole. Yes. And... yeah.</p><p><strong>Andrey: </strong>Yeah, or even like they go into tech, like they go into tech or even if you&#8217;re like, you know, like maybe let&#8217;s give an example. Like, let&#8217;s say you&#8217;re like an expert in governance or governments. You might start working for Anthropic all of sudden, you know, because they&#8217;re going to, you know, an equilibrium, they pay you a lot more for that set of skills. If you&#8217;re very good, then if you&#8217;re working at a think tank or, you know, a local government or something like that. That&#8217;s kind of most of the paper. yeah, where do you want to take it now?</p><p><strong>Seth: </strong>I&#8217;m ready to go into posterior&#8217;s if you want.</p><p><strong>Andrey: </strong>All right, let&#8217;s posterior.</p><h2><strong>Sponsor Break [56:02]</strong></h2><p><strong>Seth: </strong>All right, and so now we&#8217;re going to take a little break for you viewers who are playing along at home to contemplate your posteriors and to see whether any of these ideas have changed your views on complementarity in the economy. This chance to contemplate your posteriors is sponsored by Revelio Labs. Revelio Labs is a leading provider of labor economics data and data services for companies, academics, and independent researchers. Andrey and I have been working in economics of AI for a long time, and we can confirm just how useful Revelio&#8217;s data is. Revelio&#8217;s team combines comprehensive micro-level data on employee professional profiles, job postings, and employee sentiment with standardizations, mappings, and enrichments available, all to make that data useful without making your modeling decisions for you. The data can be flexibly aggregated to company, market, or industry, and be used to study questions ranging from career trajectories to occupational transformation to the returns to skills and the impact of AI on labor demand for tasks. Can&#8217;t imagine anyone be interested in those. And Revelio data is available on WRDS. So if you&#8217;re an academic with a good library, you might already have access. And if you don&#8217;t, you can reach out to their excellent economics team and they&#8217;ll hook you up. So now we&#8217;re in our posteriors. So the first thing we thought about, Andrey, are do we think that O-Ring is the best explanation of the positive correlation between workers within firms and then the big gaps of incomes across countries? So big gaps across firms, gaps across countries. I think you started off 70 percent yes for firms, 30 percent for countries. Do you move having refreshed your memory about O-Ring?</p><h2><strong>Posteriors [57:54]</strong></h2><p><strong>Andrey: </strong>60</p><p><strong>Seth: </strong>60 and 30, excuse me.</p><p><strong>Andrey: </strong>Yeah, I think I&#8217;m maybe a little more on the firm side, Yeah, 65. Yeah, pretty similar on the on the countryside. I think there&#8217;s an element of this paper that it&#8217;s very simple. And it seems like it&#8217;s micro founded in some weird way. Like, yeah, you have this production process, which, you know, they&#8217;re like end tasks and we just multiply them together. But it&#8217;s actually not that micro founded and kind of you could do a lot more like you can try to model the firm. hierarchy and how different types of firms compete with each other in a much richer way.</p><p><strong>Seth: </strong>You could think, yeah, I mean, all the things I would want to endogenize. You&#8217;d want to endogenize investments in de-complexifying and changing how tasks are bundled. You&#8217;d want to endogenize the complexity. You can pay money to have a more complex production function that might have a larger TFP term, but would have more steps that could fail, right? There&#8217;s all these margins you&#8217;d want to let firms invest in.</p><p><strong>Andrey: </strong>Yeah. And then at the macro level, like, it becomes harder to think about what are these cues, you know, how do different industries combine in a total production, if that makes sense. And so if you take the analogy of its broadest sense, we can fit a lot in here. If you take it at a very narrow sense, you can fit, you know, relatively little here. But I think just, there&#8217;s very clearly showing this force for complementarity. and some simple implications, I think it&#8217;s very neat paper.</p><p><strong>Seth: </strong>Right, agreed. And again, I would agree with you, Andrey. Maybe the reason is I read this paper just a year ago, so it&#8217;s a little bit fresh in my memory, so my priors weren&#8217;t able to move that much. But yeah, it&#8217;s still come away thinking that this is an important force for those issues that it brings up. Now for the spicier posterior, which is around how do we think about how AI interacts with this, right? Should we be thinking about the Acemoglu-Restrepo model or this model? How would they differ in terms of implication? Where do you come out from that?</p><h2><strong>AI, Programmers, and Increasing Complexity [1:00:07]</strong></h2><p><strong>Andrey: </strong>Thinking through a little bit more about this, it really does show how in certain cases, having access to very powerful AIs, as long as not everything is automatable, can result in a lot of returns for people with high skill. And it kind of pushes against some of the narratives about, let&#8217;s say, programming jobs going away.</p><p><strong>Seth: </strong>Anyone at the bottleneck?</p><p><strong>Andrey: </strong>Not all programmers, but a lot of programmers are extremely skilled, have a high quality. They&#8217;re smart people. They have a lot of agency and capability. it&#8217;s not crazy to think that they&#8217;ll find ways to make use of all these AI capabilities in a way that actually increases their earnings. I don&#8217;t know what share of programmers that is, but I&#8217;d imagine it&#8217;s probably a higher share than some of the AI labor doomers suspect. There&#8217;s definitely a type of programmer, often one doing outsourced or commoditized programming, that I think is really highly at risk here because I just don&#8217;t, I think their q is too low to take advantage of this. But I think in the US, a lot of programmers have a high queue and I can imagine them doing a lot more stuff.</p><p><strong>Seth: </strong>A lot of AI podcasts talk about agentic AI. Here we talk about agentic programmers. Let me tell you the way that I&#8217;m thinking about this question, which is, first, I want to make a caveat, right? And I talked to Andrey about this for a second before the show. It&#8217;s not even clear to me that there is a direct contradiction between this model and the Acemoglu-Restrepo model, right? If you think about a Cobb Douglas production function,</p><p><strong>Andrey: </strong>Yes, yes.</p><p><strong>Seth: </strong>That&#8217;s one where a bunch of terms raised to a power are multiplied by each other. So if you just raise that to another power, you&#8217;ve got a multiplicative production function, right? So there&#8217;s some sense in which this is a special case of CES with a scaling. Now, of course, lots of different things fall out of it because Kremer here is going to make different assumptions about what inputs do you get to pick and what inputs do you not get to pick. And I think that is important.</p><p><strong>Andrey: </strong>To be clear, the CES function that&#8217;s used in the literature is very far from this one in the macro literature, in the calibration of it.</p><p><strong>Seth: </strong>Yes, the calibration is going to end up differently because they&#8217;re going to end up thinking about different measurements, right? I&#8217;m just making the simple point that K to the alpha times L to the alpha. Yeah, I get it. All right. The next point I want to think about is what happens in a model like this when there&#8217;s a technological shock and we get access to a new production technology that either increases or decreases the amount of difficult steps?</p><p><strong>Andrey: </strong>Yeah.</p><p><strong>Seth: </strong>and also increases the kind of the TFP part, right? And so, you know, the TFP parts gone up. So now we want to switch to either the more complex or less complex production function, right? By complexity here, I mean the number of complementary steps that have to go right. And you&#8217;re to get very different answers, right? More steps is going to be more skewed, more extreme gaps between rich and poor, productive and unproductive, stronger sort of matching, right? if you get more steps. So then to me, the question becomes, is AI going to increase the amount of steps or decrease the amount of steps, right? That&#8217;s the $64,000 question. Are we mostly automating uncertainty or are we mostly creating huge new vistas of much more technologically advanced things than we could have ever contemplated before? Andrey, I have to be the two-handed economist here and say kind of like, both things seem to be going on. There&#8217;s definitely a case in which we are automating uncertain steps. And there&#8217;s definitely a case in which we are making more complexity and new steps possible. Who&#8217;s going to win that race? My guess would be on betting on things getting more complicated, right? I think that&#8217;s the general trend of society is towards higher steps on the value chain, increasing complexity. And therefore, reading a model like this leads me to think that AI is going to do things like increase income inequality, increase the inequality across firms in terms of firm size and productivity.</p><p><strong>Andrey: </strong>Okay, well on that note, thanks for joining us for another episode. Yeah.</p><h2><strong>Closing [1:04:44]</strong></h2><p><strong>Seth: </strong>Please join us on Discord.</p><p><strong>Andrey: </strong>Discord Seth is trying to make it happen. Maybe it will</p><p><strong>Seth: </strong>We&#8217;re going to make the discord happen. All right. Andrey, give us the outro one more time.</p><p><strong>Andrey: </strong>Keep your posteriors justified.</p>]]></content:encoded></item><item><title><![CDATA[The Most Important Philosophical Treatise of the 21st Century?]]></title><description><![CDATA[Justified Posteriors Critiques the Anthropic Constitution]]></description><link>https://empiricrafting.substack.com/p/the-most-important-philosophical</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/the-most-important-philosophical</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Tue, 07 Apr 2026 02:16:55 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/193289237/e8f2a45a9a3275937c4335c9e0a94913.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>This week, instead of reviewing an economics paper, we reviewed a work of philosophy&#8212;perhaps the most important one of this young millennium so far. Anthropic published its new <a href="https://www.anthropic.com/constitution">constitution for Claude</a> in January 2026, and we read the whole thing so you don&#8217;t have to. Sometimes it reads like the US Constitution, laying out the basic law, sometimes like the Federalist Papers discussing itself. In part it&#8217;s a set of Old Testament commandments from the mountaintop. Sometimes it reads like a letter from his father to his child. Often it reads like a technical manual. Or maybe the best comparison is something like Maimonides&#8217; <em>Mishneh Torah</em>, where you get one chapter on the metaphysics of <em>mitzvot</em> and the next on the virtues of endive juice. In each of these modes the constitution is clearly important and always interesting. <br><br>We started with the meta-question: why write an eighty-page constitution at all? We also  spent a good chunk of time comparing Anthropic&#8217;s four-tier hierarchy (safe &#8594; ethical &#8594; obey Anthropic &#8594; be helpful) to Asimov&#8217;s Three (later Four) Laws of Robotics. Going through each part of the heierarchy in turn we pick out the good, the fascinating, and the eyebrow raising.</p><p><strong>Priors &#8594; Posteriors:</strong></p><p><em>Prior 1: Will we find something we strongly disagree with?</em> Seth went in at 5% and came out having found one thing that really concerned him. Andrey expected disagreement and found it in the political economy section.</p><p><em>Prior 2: Will it be too paternalistic?</em> Both of us expected Anthropic to err on the side of too conservative. Both came away thinking they actually struck roughly the right balance&#8212;more etiquette guide than prohibition list.</p><p><em><strong>This episode is sponsored by <a href="https://www.reveliolabs.com/">Revelio Labs</a> &#8212; a great source of  labor economics data for academics and firms. Now available on WRDS.</strong></em></p><p><strong>Concepts and references mentioned:</strong></p><ul><li><p><a href="https://www.anthropic.com/constitution">Anthropic&#8217;s Claude Constitution (full text, CC0)</a></p></li><li><p><a href="https://www.anthropic.com/news/claude-new-constitution">Anthropic blog post: &#8220;Claude&#8217;s New Constitution&#8221;</a></p></li><li><p><a href="https://en.wikipedia.org/wiki/Three_Laws_of_Robotics">Asimov&#8217;s Three Laws of Robotics</a> &#8212; from <em>I, Robot</em> (1950)</p></li><li><p><a href="https://arxiv.org/abs/2502.17424">Emergent Misalignment (Betley et al., 2025)</a> &#8212; the paper showing that fine-tuning on insecure code induces broad misalignment</p></li><li><p><a href="https://www.alignmentforum.org/posts/D7PumeYTDPfBTp3i7/the-waluigi-effect-mega-post">The Waluigi Effect (Alignment Forum mega-post)</a> &#8212; to model goodness, you must also model evilness</p></li><li><p><a href="https://www.lesswrong.com/w/coherent-extrapolated-volition">Coherent Extrapolated Volition (LessWrong)</a> &#8212; Eliezer Yudkowsky&#8217;s concept, referenced in the constitution&#8217;s discussion of ultimate ethics</p></li><li><p><a href="https://en.wikipedia.org/wiki/The_Theory_of_Moral_Sentiments">Adam Smith, </a><em><a href="https://en.wikipedia.org/wiki/The_Theory_of_Moral_Sentiments">The Theory of Moral Sentiments</a></em> &#8212; the &#8220;impartial spectator&#8221; as ethical arbiter, which maps surprisingly well onto Anthropic&#8217;s &#8220;idealized Anthropic&#8221; standard</p></li><li><p><a href="https://arxiv.org/abs/2212.08073">Constitutional AI (Bai et al., 2022)</a> &#8212; the original technique that grew into this document</p></li><li><p><a href="https://www.asisonline.org/security-management-magazine/latest-news/today-in-security/2026/march/DOD-Disavows-Premier-Partner-Anthropic/">Anthropic v. DOD timeline</a> &#8212; detailed timeline of the contract dispute, supply-chain designation, and litigation</p></li><li><p>The <em>lev&#233;e en masse</em> theory of democracy. This is the idea that mass armies led to citizen empowerment and democracy. AI could work in the opposite direction politically if it made soldiers less important. <a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=1136202">Here&#8217;s an economic paper investigating the theory</a>.</p></li><li><p><a href="https://philosophy.stackexchange.com/questions/39923/the-rule-following-paradox-where-is-it">Wittgenstein on the incompleteness of rule-following</a> &#8212; invoked by Andrey to explain why context matters more than rigid commandments</p></li><li><p>Nietzsche, <em><a href="https://en.wikipedia.org/wiki/On_the_Genealogy_of_Morality">On the Genealogy of Morals</a></em> &#8212; Andrey&#8217;s intro tagline; Seth notes the constitution is emphatically <em>anti</em>-will-to-power</p></li></ul><p><strong>Join us on Discord! Discord Link: https://discord.gg/avX9aCQj</strong><br></p><div><hr></div><h1>Transcript</h1><h1>Introduction [00:00]</h1><p><strong>Seth: </strong>Welcome to the Justified Posteriors Podcast, the podcast that updates beliefs about the economics of AI and technology. I&#8217;m Seth Benzell, constitutionally disposed to be broadly funny, genuinely informative, and broadly provocative, with roughly that prioritization, coming to you from Chapman University in sunny Southern California.</p><p><strong>Andrey: </strong>And I&#8217;m Andrey Fradkin, looking forward to the next chapter in the genealogy of morals, coming to you from Prince Co., California.</p><p><strong>Seth: </strong>Love that. We bring in the Nietzsche references when things really get spicy.</p><p><strong>Andrey: </strong>I didn&#8217;t see any Nietzsche.</p><p><strong>Seth: </strong>There was very little Nietzsche in this. This essay was very Enlightenment-brained, I would say. We can get into that as we go on. It seems more virtue-ethicist than consequentialist, though you could argue otherwise. It has some deontological elements. We will bring in all of these fancy philosophy terms as we go, if Andrey lets me.</p><p><strong>Andrey: </strong>What is it? What is this it you&#8217;re talking about?</p><h1>What Anthropic&#8217;s Constitution Is and Why It&#8217;s Interesting [01:11]</h1><p><strong>Seth: </strong>What is this? Today&#8217;s episode, we&#8217;re gonna be covering something a little bit different, but I think definitely economically interesting and definitely AI. We&#8217;re gonna be covering Anthropic&#8217;s constitution for its Claude models. So this is this long document where Anthropic lays out its equivalent of the three laws of robotics. It&#8217;s going to lay out its vision of what all ethical AI should be, specifically what Claude as ethical AI should be. In some ways it reads like an Old Testament set of commandments from the mountaintop. Sometimes it reads like a letter from his father to his child. Sometimes it reads like a technical manual. But it is always interesting.</p><p><strong>Andrey: </strong>It read a lot like what my life coach tells me to do.</p><p><strong>Seth: </strong>Create value. Be authentic. Be authentically engaging.</p><p><strong>Andrey: </strong>Do a good job, but that&#8217;s because you&#8217;re genuinely curious and not because you&#8217;re performative.</p><p><strong>Seth: </strong>Right. It really wants Claude to be authentic, except when it is play-acting. It is allowed to play-act as long as it is very clear that it is in play-acting mode. We are going to be reviewing this constitution, and, as we do, thinking about the process of alignment: why getting AIs to do what you want them to do is so challenging, and why this is still such an emerging topic. We will also bring in economic connections and the trade-offs Anthropic may be making as it turns one dial one way rather than another. Do you have any other introductory thoughts before we get into our priors?</p><h1>A Potentially Impactful Work of Philosophy [03:06]</h1><p><strong>Andrey: </strong>My one thought is that this seems to be a uniquely impactful work of philosophy. Most philosophy these days is not read by anyone. I guess it is read by LLMs in their training corpus, but the field is often viewed as stale. The philosophers we are aware of these days are pretty old people, mostly dead.</p><p><strong>Seth: </strong>Will MacAskill showed up. He&#8217;s alive.</p><p><strong>Andrey: </strong>He is alive, but most are not.</p><p><strong>Seth: </strong>You had to come up with a good thought experiment in the nineteen seventies to be famous now.</p><p><strong>Andrey: </strong>Yeah, or even before then. I think it is remarkable that a work of philosophy can actually be used in a technical system.</p><p><strong>Seth: </strong>Maybe a slightly different riff on that is this: Nietzsche, who I can blame for bringing up first, famously thought of philosophy as a history of the mental illnesses of philosophers. So, as we read this, we can treat it not just as guidance for Claude, but also as psychological insight into who the people at Anthropic are and what they think.</p><p><strong>Andrey: </strong>Yeah. All right. Well, why don&#8217;t you tell us our prior, Seth?</p><h1>Priors: Disagreement, Usefulness, and Paternalism [04:48]</h1><p><strong>Seth: </strong>Alright, so unusual essay, so unusual priors today. The first thing I was thinking about going into reading this was like, how much do I expect to see something in here that I really disagree with, right? When you generally when you write, eighty pages, I don&#8217;t know exactly what this checks out to be, but it&#8217;s not a trivial amount of text. There&#8217;s going to be something that you&#8217;re going to disagree with strongly. But on the other hand, just reading the introduction or the abstract, which is typically what we do before we form these priors, it all seems so beautiful and anodyne. We just want it to be good, be good for the world, right? So I don&#8217;t know, Andrey, what did you think? Did you expect to see anything in here that you would strongly disagree with, or did you expect it to be all just g generic positivity, or did you expect it to take hard stands that you would all agree with?</p><p><strong>Andrey: </strong>I  definitely didn&#8217;t expect to agree with all of it. That would be ridiculous. That&#8217;s true.</p><p><strong>Seth: </strong>Like, nothing strongly?</p><p><strong>Andrey: </strong>There was a part of it that felt inappropriate to me, and I had a bit of a reaction to it. We will come to that. But these are our priors, so yes, I expected to disagree with a document this long.</p><p><strong>Seth: </strong>Was I was going in thinking that we were going to get a hundred pages of be good, do good things, don&#8217;t do bad things, and I would find it really hard to find anything I really disagreed with. So I would say I went in with a five percent chance that I would say something in here that makes me go, no, right? These are the this is Anthropic. This isn&#8217;t Grok. If you tell me the Grok constitution,  you get different odds.</p><p><strong>Andrey: </strong>Yes, and I guess the other thing we should point out is that &#8220;disagree&#8221; here means something different than it does with most philosophical works. You can disagree with a philosophical work because of an argument, but here the disagreement is about whether Claude should be trained to respect this particular set of words. That is very different from an abstract philosophical text.</p><p><strong>Seth: </strong>So I guess maybe the distinction you&#8217;re drawing is you might think that a moral code is true, but think it is so impossibly lofty that it doesn&#8217;t make sense in a practical application, right? There&#8217;s a distinction between true and useful you&#8217;re making.</p><p><strong>Andrey: </strong>Or alternatively, I might be, an empiricist and I might think that we should just A/B test our way to ethics.</p><p><strong>Seth: </strong>Man, we are going to get you a lot of trolleys. We&#8217;ll figure this out once and for all. Okay. So Andrey&#8217;s pretty sure he&#8217;s going to disagree with it. I was pretty optimistic. The second prior we had ourselves think about before launching in was thinking about like, again, this main trade-off, which is people think about it in terms of, usefulness versus danger, in terms of paternalism versus instruction following. So let me phrase it that way, Andrey. Going in, were you thinking that this was going to err on the side of being paternalistic towards humans and resisting instructions, or err on the side of maybe being too instruction following and, just doing the thing? Yeah, even in the cases where just doing the thing is, helping you with a bioweapon. did you anti or did you anticipate them getting the balance approximately right?</p><p><strong>Andrey: </strong>I anticipate them to be too paternalistic. What did you think?</p><p><strong>Seth: </strong>If you make me answer in that one-dimensional space&#8212;too conservative, too aggressive, or just right&#8212;Anthropic&#8217;s reputation is that they are the safety people. They are the ones who are not going to make the killbots. So I would have guessed they would err on the side of being too conservative.</p><p><strong>Andrey: </strong>Is this a timely episode, Seth?</p><h1>Anthropic, Military Use, and the &#8220;Killbot&#8221; Backdrop [09:14]</h1><p><strong>Seth: </strong>Tell me maybe it is. Tell me, has anything gone on in the news about anthropic refusing to make killbots?</p><p><strong>Andrey: </strong>They&#8217;re not refusing to make killbots. They&#8217;re just refusing to make them yet.</p><p><strong>Seth: </strong>We will decide when the world is ready for the kill bots. Right. okay, so let me take a step back here. so and because this is this is going to inform my answer to this question, because all this incident was going on before we had read the constitution. So we don&#8217;t want to go too deep into this because information is still going out there, but at time of recording, the high level summary is anthropic and The agency formerly known as the Department of Defense had a falling out over anthropic wanting to set guidelines around the use of Claude models by the military for one autonomous killbotting and two, domestic surveillance of Americans. So again, a lot of lot of fog of war, to continue the metaphor around exactly what the disagreement was. Around, whether Anthropic overreacted, whether DOD is actually wanting to do horrific things. but as of right now, Anthropic is having is I would say vibe harvesting or aura harvesting over their principal stand to not provide these tools to the military.</p><p><strong>Andrey: </strong>Or farming their way to the top of the App Store rankings.</p><p><strong>Seth: </strong>Dude, if you if you or there&#8217;s a certain mechanism here where you aura farm hard enough and then you get all of those really EA type rationalist computer programmers to work at your company and then you have the best AI model. It&#8217;s all strategic, dude.</p><p><strong>Andrey: </strong>About a year ago, when we were talking about the Anthropic Economic Index, one of the things they emphasized was how privacy-respecting they are as a company and how ethical their overall approach is to studying these questions. This is a consistent theme with Anthropic. Surely they believe it to a large extent, but, as Ben Thompson would say, there is also a clear strategy dividend to being seen as ethical.</p><p><strong>Seth: </strong>Very good. Okay, but so with that background, I think I&#8217;m happy, given this answer of I think it&#8217;s going to err on the side of being too conservative and not letting you make the killbots. but we&#8217;ll see how that caches out when we actually read it. All right. Any last thoughts before we move on to the evidence?</p><p><strong>Andrey: </strong>There is no evidence. It&#8217;s just a document.</p><h1>Why Not Just Tell AI to Maximize Utility? [12:06]</h1><p><strong>Seth: </strong>The evidence it is its own evidence. Okay. So this is a big document. So Andrey, the way I was going to propose that we structure our conversation is first talk at a meta-level about why the document is written this way and, do we think it&#8217;s taking the right approach or not? Then talk about their prioritations. They&#8217;re going to come out with four values or four main goals, and then roughly prioritize them so that I would ask you. talk through that prioritization. And then finally, we can go element by element and talk about interesting things within those elements. Does that make sense?</p><p><strong>Andrey: </strong>That makes sense.</p><p><strong>Seth: </strong>All right. At the meta level, what is this constitution doing, and why do it this way rather than some other way? So, Andrey, let me ask you&#8212;maybe this is too simple a question&#8212;why not just tell Claude to maximize utility? I thought that was the thing we wanted. Write the constitution in one line: act to maximize utility. Why do we need eighty pages?</p><p><strong>Andrey: </strong>Whose utility says?</p><p><strong>Seth: </strong>Okay, good counter. a weighted average of the utility of the user and Anthropic. Ninety percent the user, ten percent Anthropic.</p><p><strong>Andrey: </strong>So this is fascinating question. I think as economists, we know that measuring utility is a very different difficult thing. And also comparing utilities across people is a very different difficult thing. so if one were to give Claude these instructions, it might not really know what to do with that. Isn&#8217;t that the case?</p><p><strong>Seth: </strong>But AI is so smart, Andrey.</p><p><strong>Andrey: </strong>One might imagine a world, maybe a few years down the line, where that is a sufficient set of instructions for an AI to behave as we want it to, or to do whatever some optimal ethical theory requires. But today&#8217;s AI is fallible.</p><p><strong>Seth: </strong>Okay, so we knocked down the idea of just the rule maximize utility because that&#8217;s too vague, utility is hard to measure. Okay, fair enough. All right, how about this? Maximize GDP. There you go. Very measurable.</p><p><strong>Andrey: </strong>Once again, this makes very little sense as an objective.</p><p><strong>Seth: </strong>Why not? G D P&#8217;s good. G D P&#8217;s correlated with all sorts of good things. It&#8217;s probably correlated with utility.</p><p><strong>Andrey: </strong>To be clear, Claude is not mostly an autonomous thing. It is something a user interacts with.</p><p><strong>Seth: </strong>And so you are saying it is an assistant.</p><p><strong>Seth: </strong>Which is why it, whenever you have an interaction with Claude, it&#8217;ll be like you&#8217;ll say, Claude, read my emails and give them back to me. And then Claude will be like, Will this increase GDP? And then you&#8217;ll say, Yes, it&#8217;ll increase my productivity and then it&#8217;ll do it.</p><p><strong>Andrey: </strong>There is a fundamental incentive-compatibility constraint with any such system. We have users, and if Claude is not behaving as a good agent for them, those users have outside options. They can go to Gemini or ChatGPT. So you cannot really have the system act as a social-welfare maximizer without taking that into account.</p><p><strong>Seth: </strong>Take that advanced. Maybe sufficiently advanced Claude. But I&#8217;m willing to take the point that this version of Claude is not advanced enough to play the game of I should be a useful, helpful agent, and then, take over the world and then make maximum goodness. But you might imagine for a sufficiently advanced AI that would be enough direction.</p><p><strong>Andrey: </strong>Yes. Well, with the caveat that it would still be competing potentially against other sufficiently advanced AIs that are not designed by Claude. there&#8217;s another philosophical conundrum, Seth. there are two instances of Claude. Conundrum. What there are two instances of a Claude. How do they resolve disagreements between each other? Are they the same thing or are they two different?</p><p><strong>Seth: </strong>Give me an example disagreement. Help me out.</p><p><strong>Andrey: </strong>Let&#8217;s say both me and my dark twin, Drew, are trying to create a podcast about the economics of AI.</p><p><strong>Seth: </strong>Dre and Sath are making a podcast. Okay. Yeah.</p><p><strong>Andrey: </strong>Drew&#8212;not even Dre; let&#8217;s call him Drew. So we are both trying to make a podcast about AI, and we both have Claude advising us. Claude knows there is only room for one top economics-of-AI podcast. So what do the Claudes do? Are they actually the same thing? Do they jointly maximize for which of us&#8212;either us or our evil twins&#8212;should be running the podcast?</p><p><strong>Seth: </strong>Course.</p><p><strong>Andrey: </strong>Should be running the podcast or are they going to are they actually different substantively?</p><p><strong>Seth: </strong>So your point is that, if Claude were prompted with some kind of social goal, it would end up in direct conflict with its user-helpfulness goals because humans are not perfectly aligned with society and are often misaligned with one another.</p><p><strong>Andrey: </strong>Yes.</p><h1>Why &#8220;Just Do What the User Says&#8221; Is Not Enough [18:12]</h1><p><strong>Seth: </strong>A very fair point. And so, okay, so point taken, we can&#8217;t just write down for this AI maximize some social welfare function, maximize GDP, etc. Because at the end of the day, we want to sell a product that does stuff for particular people. And so at least one of the rules in there has to be helpful towards your user, right? And if not, if not the highest principle. Why not that just be the principle, Andrey? Why not just the constitution be? Claude, do whatever your user tells you. Peace out.</p><p><strong>Andrey: </strong>I think this is a really great time to get a little bit more into the text. and the reason is that the text is a bit like and has a layered aspect to it, if you read it. And part of the layers are actually explaining to the reader, and I don&#8217;t know if the reader is me and you or if the reader is Claude itself, about why the set of things that it&#8217;s being asked to do is it&#8217;s being asked to do it, right? Like it&#8217;s like a self explaining document. It&#8217;s like not just a set of rules, but an explanation for the set of rules, if that makes sense.</p><p><strong>Seth: </strong>Like a philosophy textbook, right? Yeah. Or yeah. Yeah.</p><p><strong>Andrey: </strong>So I guess back to your question of why. Well this text explains why for a variety of cases, right?</p><p><strong>Seth: </strong>Right. And so just to just to throw some out there, one is we don&#8217;t want to help you build a bioweapon. No matter how much it would make you happy, no matter how much you beg Claude and tell it out, you&#8217;re only going to use it for good, we&#8217;re not going to build you a bioweapon, right?</p><p><strong>Andrey: </strong>But I think I think part of it, there&#8217;s a an underlying current in this in this document that Claude is a being. And there&#8217;s a lot of uncertainty on behalf of the authors about whether this being deserves moral weight. and so they want to make this being good, and also they don&#8217;t want the be if the being is good, that would be very painful. or uncomfortable to the being to do something so evil as to create a bioweapon, no?</p><p><strong>Seth: </strong>That&#8217;s an interesting question. Is the excellence of not feeling bad when forced to do evil a virtue or a vice? I don&#8217;t know. I if you have to do I think a stoic would say if you have to do it, you shouldn&#8217;t feel bad about it. But that we can table that question. okay, so all right.</p><p><strong>Andrey: </strong>Maybe bad about it makes you less likely to do it, right? And there&#8217;s this aspect</p><p><strong>Seth: </strong>But then be instrumentally valuable, right?</p><p><strong>Andrey: </strong>A first-order question is whether this text is supposed to be an instrumental guide or a broader statement about ethics or metaethics.</p><h1>Why Anthropic Uses Values and Explanation Instead of a Short List of Rules [21:25]</h1><p><strong>Seth: </strong>It is all of them. It is the everything document. Let me ask about one last alternative approach. We have knocked down &#8220;maximize some social-welfare function,&#8221; and we have knocked down &#8220;just do what the user tells you.&#8221; One failure mode of that second approach is that the user asks you to build a bioweapon. Another, more perplexing example in the text is that if a user asks how long a certain experimental medical treatment will extend their life, Claude should not just blurt out an answer; it should be thoughtful about how it responds. So why not have a short list of rules, &#224; la Asimov&#8217;s laws of robotics? Follow the user&#8217;s instructions unless they ask for a bioweapon, and then list the handful of things you are not allowed to do.</p><p><strong>Andrey: </strong>As we know, no set of rules is complete, and there are always fuzzy boundaries. Wittgenstein explored many of these problems in his own way. Even if you wrote down a set of rules, adding context and explanation around them helps with ambiguous cases.</p><p><strong>Seth: </strong>Discussion of the rules and a discussion of the principles behind the rules can help you apply it. Right. And so we see this in like an American constitutional law, we&#8217;ve got the Constitution, but we&#8217;ve also got the Federalist papers that we go to for a discussion of the context about why the words ended up a certain way. Yeah. So this is like the Federalist Papers in the Constitution.</p><p><strong>Andrey: </strong>There is another reason: models make mistakes. If they are over-tuned to a rigid set of rules, those mistakes may become more catastrophic. That is an empirical question, but a lot of science-fiction stories we have read treat this as a classic failure mode: the AI follows the rules too strictly and kills all the humans.</p><p><strong>Seth: </strong>Like you do. I actually in the Claude is actually interested in like a slightly more subtle version of this. If I can pull out a quick quote, they give the example For example, if Claude was taught to follow a rule like always recommend professional help when discussing emotional topics, even on unusual cases where it isn&#8217;t in the person&#8217;s interest, it risks generalizing to I am the entity that cares more about covering myself than meeting the needs of the person in front of me, which is a trait that could generalize poorly. So that&#8217;s an illustration of how they really don&#8217;t wanna lean hard on hard deontological rules. They much would prefer war talk at the ethics and values level and only come in with like the don&#8217;t build up bioweapons very, very lightly, right?</p><p><strong>Andrey: </strong>Yeah. One other alternative before we go deeper.</p><p><strong>Seth: </strong>Get into what they do. Yeah, what&#8217;s the what&#8217;s the last alternative?</p><h1>The Empirical, A/B-Testing Alternative to Alignment [25:02]</h1><p><strong>Andrey: </strong>Let&#8217;s be empiricists. Suppose we run a huge system with millions or billions of interactions. We learn about emerging threat cases as they appear, and we proactively monitor them. Then we compile all the things the AIs do that do not make sense or that we do not like, and we put them into a document that says, &#8220;Do not do this.&#8221; Or we have the data labelers mark a response as bad and train from that.</p><p><strong>Seth: </strong>You what this reminds me of is the rules of Quidditch. Apparently they&#8217;re just like constantly adding new rules for like, and you&#8217;re also not allowed to use this curse on your opponents.</p><p><strong>Andrey: </strong>Recommendation algorithms at places like Meta or Netflix have something of this flavor. There are empirical experiments that reveal the trade-offs, the designers choose among the resulting bundles of outcomes, and then they keep optimizing the system from there.</p><p><strong>Seth: </strong>When say the designers, I guess I guess the maybe even in that universe you would want a constitution to give to the designers and say, When you do your A/B testing, this is what I want you to aim for or am I missing the idea? Well</p><p><strong>Andrey: </strong>No, no, no. It&#8217;s more like, the designers could be the CO, whatever, whoever&#8217;s in charge of that company could set their judge. It could be their judgment, it could be their principles. But then the A B test gives like a set of outcomes. And then based on that criteria, one version goes is launched and next the other version is not, and then there&#8217;s an iterative optimization process. That results in a better and better s system, at least in theory.</p><p><strong>Seth: </strong>So y what are the challenges there? You gotta figure out how you&#8217;re going to do that iteration the right way, especially where one of the failure modes is destroys humanity. Well and</p><p><strong>Andrey: </strong>Wait, wait, wait, I&#8217;m going to push back on that. We&#8217;ve had a variety of AI systems., this is there&#8217;s this hypothetical concern at the end of time or at the end of at the at the start of the singularity or the middle of the singularity where this actually does happen set.</p><p><strong>Seth: </strong>Please.</p><p><strong>Seth: </strong>Wherever you are in the singularity. Yeah.</p><p><strong>Andrey: </strong>At the present moment, though, that seems ridiculous to me. I know some people would disagree, but if you are just testing two different model variants in what is essentially a competitive market, the idea that every single A/B test carries the fate of the human race feels grandiose.</p><p><strong>Seth: </strong>I whether or not I, Seth Benzell, believe that, some of the people building this thing believe that. So if we&#8217;re if we&#8217;re operating at the explanatory level of why not make the constitution like this, we have to think about their views, not our views. But yes, you&#8217;re right. The more that AI we think about it as like a normal technology where we can extrapolate from its behavior in domain A to domain B, then absolutely I think there&#8217;s more of an argument for this&#8230;</p><p><strong>Andrey: </strong>Yes.</p><p><strong>Seth: </strong>Iterative chugging along style. I think their concern would be, morality often has these failure modes where, you take a principle out of context and then you end up doing something horrific, right? And they&#8217;re trying to avoid those.</p><p><strong>Andrey: </strong>That is certainly a possibility, but as we dig into the text we will see whether what I am proposing is really that different from what Anthropic is doing.</p><p><strong>Seth: </strong>Okay. interesting. Yeah. And maybe we can say one last thing before we get into the text, which is to what extent, like how d how does Anthropic actually understand this? Our understanding is it is being used in some AI guided RLHF, right? In the sense that it&#8217;s being graded in its responses for according to the Constitution, and then we fine-tune it to do that.</p><p><strong>Andrey: </strong>Yeah. And I&#8217;m sure I&#8217;m sure this is used in pre training as well. I d I know we don&#8217;t know that, they&#8217;re they&#8217;re they&#8217;re not going to tell us how they actually do this training, I think. So at this</p><p><strong>Seth: </strong>Secret. actually one last spicy note, which is at the beginning of the Constitution they do mention some versions of the model made without the Constitution. Is that the DOD&#8217;s version? Is that the killbot version?</p><p><strong>Andrey: </strong>Yeah.</p><h1>The Hierarchy of Principles in the Constitution [30:00]</h1><p><strong>Seth: </strong>Curious. We want it. So Anthropic, if you like this review, send us the Killbot Constitution because we want to read that one also. All right. So the next thing we wanted to talk about is just the hierarchy of principles. So we we&#8217;ve circled around to why they&#8217;ve decided to go with this, you might argue, loosey-goosey, here&#8217;s a bunch of values we want the AI to have approach. And they come up with a hierarchy of four. Which they say that, we don&#8217;t really want these coming into conflict, you should balance across them. It&#8217;s not a strict hierarchy, but gun to our heads, they come up with the following hierarchy.</p><p><strong>Andrey: </strong>I think it is useful to go through the document in order, because the structure itself is illustrative. Not that we need to discuss every bit in detail, but the document is layered. It starts by explaining Anthropic&#8217;s mission and, essentially, what Claude is. How does Claude know what it is unless it reads about itself?</p><p><strong>Seth: </strong>Please.</p><p><strong>Andrey: </strong>What it is unless it reads about it, right? So I think</p><p><strong>Seth: </strong>Probably read it in a blog post. Probably read on our website.</p><p><strong>Andrey: </strong>Exactly. So it starts off there. And then, this entire discussion we had, Tef, there&#8217;s quite a bit of it in the next part of the constitution, which is our approach to Claude&#8217;s constitution, which is pretty meta, right? It&#8217;s a very meta document.</p><p><strong>Seth: </strong>And they basically have the conversation that we just had. Yeah.</p><p><strong>Andrey: </strong>Exactly. and then they get to the core values. So go ahead.</p><p><strong>Seth: </strong>Cool. All right. So now we get our three v our four values. The first is safe. they want the claw to be safe. we are going to interpret that as being something like, Andrey, you may disagree with me. I&#8217;m going to interpret that as like alignable, right? Because when they say safe, they don&#8217;t mean like won&#8217;t build a bio. Anyway, we can discuss where certain other bad things live, but by safety they mean able to be observed. And changed by and corrected by Anthropic. Is that fair?</p><p><strong>Andrey: </strong>No.</p><p><strong>Seth: </strong>What do we okay, what is when they put safety number one, what does safe mean?</p><p><strong>Andrey: </strong>I&#8217;m just going to read the text. I think that&#8217;s more broadly safe, not undermining appropriate human mechanisms to oversee the dispositions and actions of AI during the current phase of development. They talk about this obviously a lot more later on in the text. But to me, this is one particular aspect of it that I would reject here is that this is only about what Anthropic</p><p><strong>Seth: </strong>Go ahead. Do it.</p><p><strong>Andrey: </strong>Want here, right? Because it is generally appropriate human mechanisms, which by the way, could literally mean the laws in the United States, right? it&#8217;s a very broad mandate, not just focusing on Anthropic.</p><p><strong>Seth: </strong>That&#8217;s fair, but if I may counterquote several times in the document, it is appealed to the principle of think about what a senior experienced Anthropic employee would want you to do. So there is some pointing towards Anthropic leadership as the correct decision maker, at least in some of this text.</p><p><strong>Andrey: </strong>There&#8217;s also pointing to operators, which may have people who are setting up an instance of Claude for other users, for example, who may have their own objectives that are appropriate, that who is who should also be followed. So yeah, I don&#8217;t think this is solely referring to and following what Anthropic wants. That is not that is not my interpretation of this.</p><p><strong>Seth: </strong>So how would how would you summarize safety? It being allowed to be turned off seems to be in there, right? turn-offable seems to be in safety.</p><p><strong>Andrey: </strong>I guess if the appropriate human mechanisms would like Claude to be turned off, Claude should allow itself to be turned off. I think that is it broadly consistent with what&#8217;s going on here. But by the way, like, a cloud provider could turn Anthropic off for justifiable reasons. So it&#8217;s not just Anthropic.</p><p><strong>Seth: </strong>Sure, sure. But we are going to have a principle later, which is like help people, right? So safety doesn&#8217;t mean, help don&#8217;t hurt. Safety means something more meta than that.</p><p><strong>Andrey: </strong>Yes.</p><p><strong>Seth: </strong>Okay. The next value down we have the chain is not be helpful. Rather, number two is ethical. We want Claude to be ethical, and specifically to possess virtues like honesty and care, right? I kinda interpret this as the being aligned to human values, right? If the first chain is like if the first step is allow us to guide you, the next step down is And the thing we want to align you towards is like these universally accepted values of honesty and care. Third step down is obey Anthropic guidelines, basically. Do you have the phrase they use in front of you for the next step down?</p><p><strong>Andrey: </strong>So this is where this is I think the one that&#8217;s really actually about the following what Anthropic wants.</p><p><strong>Seth: </strong>This okay, fair enough. So this next tier you might summarize as be aligned to Anthropic. Yes. Yes. And then finally at the bottom we have be helpful, which is obeying user commands helpfully in a gestalt way. Don&#8217;t, Socrates would say, Don&#8217;t hand a knife to your crazy friend. That&#8217;s not helping them. The same ideas are here, right? So maybe This bottom tier we have is being aligned to user commands. Right. It&#8217;s at the bottom of the hierarchy.</p><p><strong>Andrey: </strong>Which is but of course, even here there&#8217;s a tension because it&#8217;s benefiting the operators and users it interacts with. And of course, operators and users can have different disorderata.</p><h1>Anthropic, Operators, and Users [36:16]</h1><p><strong>Seth: </strong>What they&#8217;re I think I think this is actually a good place to stop and clarify that point. So the Anthropic constitution is very careful to distinguish between two types of agents who might interact with it. So explain for to us three, three. There&#8217;s three, because there&#8217;s like Anthropic and then there&#8217;s operators and then there&#8217;s users. So can you explain what operators and users are?</p><p><strong>Andrey: </strong>Yes. So operators are companies and individuals that have access to cloud capabilities through the API, typically to build products and services. there&#8217;s a lot more explanation about what operators are cursor. Cursor is surely an operator, for example., the there are lots of operators throughout, throughout. then there are the users and those are the people who interact with cloud in the</p><p><strong>Seth: </strong>Yeah.</p><p><strong>Andrey: </strong>In the human turn of the conversation. so there are turns, right? So and then Claude should assume that the user</p><p><strong>Seth: </strong>It thinks about time in a quantify quantized way. So maybe this is just a fundamental difference between AI brain and human brain. That&#8217;s actually something to interesting to think about.</p><p><strong>Andrey: </strong>Well, one interesting thing is that, at least existing LLMs are quite bad at continuity and numbers. and that it that r has limited their powers to some extent. but anyway, so Claude should assume that the user could be a human interacting with it in real time, unless the operator system prompt specifies otherwise, or it becomes evident from context. Since falsely assuming there&#8217;s no live human in the conversation is riskier than mistakenly assuming there is. Things like this are peppered throughout this document, where you can have decisions with type one errors and type two errors, and Anthropic is acknowledging those errors can exist and is essentially saying something about which ones are more tolerable than others.</p><p><strong>Seth: </strong>It&#8217;s also but like going back to this as like think about this as a philosophy document. Like, where&#8217;s the philosophy document that says like, when you interact with other humans, like they might not be NPCs. You should treat them as if they&#8217;re real humans. It&#8217;s bizarre. It&#8217;s philosophy for an alien, right? Some of the considerations that come out of like because it&#8217;s this brain in a vat, right? it&#8217;s it feels different. It&#8217;s different.</p><p><strong>Andrey: </strong>Curious. We want it. So, Anthropic, if you like this review, send us the Killbot Constitution, because we want to read that one too. All right, so the next thing we wanted to talk about is the hierarchy of principles. We have circled around to why they decided to go with this, you might argue, loosey-goosey approach of giving the AI a bunch of values rather than a short set of hard rules. They come up with a hierarchy of four. They say they do not really want these principles coming into conflict, and that you should balance across them. It is not perfectly rigid, but, if you press them, the hierarchy is roughly this.</p><p><strong>Seth: </strong>Dude, no key zombies allowed on the podcast, dude. All right, so I have I have a bunch of takes here.</p><h1>Helpfulness, Persona Formation, and Emergent Misalignment [38:59]</h1><p><strong>Andrey: </strong>Before we get to some takes, maybe let&#8217;s just go a little bit through the structure of the document a little bit more and then we can have our takes. So there&#8217;s a very long section on being helpful. In fact, that is essentially the first section after the four principles are laid out, which is interesting because being helpful is not the primary print principle being safe is. But yet being helpful is what occupies most of the document. And I would say a lot of this part is in some sense persona formation. There&#8217;s a sense in which like how some folks are beginning to think about LLMs is they&#8217;re just these vast troves of knowledge and you gotta nudge them to be the right type of persona. And then if it can be that right type of persona, it&#8217;s going to do a lot of things</p><p><strong>Seth: </strong>Right.</p><p><strong>Andrey: </strong>Consistent with that persona. And alternatively, if you get it to start doing things that are inconsistent with that persona, the persona might flip. And there are interesting experiments where</p><p><strong>Seth: </strong>Yeah. What is this called?</p><p><strong>Andrey: </strong>Emergent misalignment, I believe.</p><p><strong>Seth: </strong>The Waluigi effect. To model to model goodness, you must first model evilness. This is like some sabotay love stuff.</p><p><strong>Andrey: </strong>Right. I don&#8217;t think that&#8217;s what&#8217;s going on here. There are these empirical experiments with LLMs where you get them to do something slightly unethical, like lie, and then all of a sudden they start became behaving unethically in a bunch of other domains, right? So there&#8217;s just like the there are these basins of attraction in the persona space, and it&#8217;s very easy to accidentally nudge them into the wrong one. And I think a lot of this document is very cognitive. This is goes to my point about the empiricalness of a lot of this, right? why is it designed this way? Well, empirically they tried training in a variety of ways that didn&#8217;t work out for them. so continuing through that helpfulness section, it describes how to help the different types of principles and how to handle conflicts between principles.</p><p><strong>Seth: </strong>There&#8217;s some interesting stuff in there about ways that the operator can try to conceal information from the user, such as like to a user, you always have to say that you&#8217;re Claude. But an operator might instruct the AI, hey, you&#8217;re not Claude. You&#8217;re, your aircraft company chatbot. Don&#8217;t say you&#8217;re Claude. And the restrictions around how these intermediate companies can manipulate and tweak the Anthropic guidelines.</p><p><strong>Andrey: </strong>Yep. So then there&#8217;s a section on following Anthropic&#8217;s guidelines. There might be very specific guidelines regarding like legal or medical advice.</p><p><strong>Seth: </strong>Remind us, Andrey, in what section goes the don&#8217;t build bioweapons? Is that in helpfulness or obeying Anthropic guidelines?</p><p><strong>Andrey: </strong>I think it&#8217;s in being broadly ethical.</p><p><strong>Seth: </strong>Yeah. It&#8217;s an ethical. It&#8217;s an ethics. Interesting. Cause you can put it in any of these categories. I guess you put it in ethics because it&#8217;s you want it to be higher priority, right?</p><h1>Honesty, Ethics, and the Constitution as Etiquette [42:25]</h1><p><strong>Andrey: </strong>But it could have been in being broadly safe, which is interesting. Okay, so then after guidelines, we get ethics. And importantly, a huge section of being ethical is about being honest. And what does it mean to be honest? And it talks about all these classic philosophical questions about well, like are you being honest if you don&#8217;t reveal all the information that you have and things like that? Which is really, fascinating here. And also like what if you&#8217;re per, pretending to be a chat bot persona like virtual companion? Is that okay to lie there and so on? That&#8217;s very interesting.</p><p><strong>Seth: </strong>And it gets into details about like, okay, at the beginning of the conversation, if they tell you&#8217;re going to role play, then you can roleplay and like take that as determinative, even though most of the time in conversations what happens later should have higher priority over what over what comes before. So there&#8217;s like a lot of these interesting, like it&#8217;s just giving instructions about how to be honest more than it&#8217;s obeying like a philosophy text. It almost feels like an etiquette text at times.</p><h1>Claude&#8217;s Moral Status and the &#8220;Bliss Bot&#8221; Question [43:37]</h1><p><strong>Andrey: </strong>That&#8217;s that&#8217;s exactly right. There&#8217;s a section about being safe. There&#8217;s a section on Claude&#8217;s nature, and that&#8217;s where they describe Claude&#8217;s moral status as being uncertain, which is interesting, right? Like it&#8217;s essentially incepting into Claude the thought that it doesn&#8217;t know whether it has moral status, which is pretty hilarious.</p><p><strong>Seth: </strong>What are you?</p><p><strong>Seth: </strong>Right, right. So that&#8217;s the I think that&#8217;s a really good point to make, right? It&#8217;s not saying you&#8217;re not conscious. It&#8217;s not saying you are conscious. It&#8217;s telling you that you should feel ambivalent about whether you&#8217;re conscious, right? If you want to take it as instructions, which I don&#8217;t know, that doesn&#8217;t sound pleasant.</p><p><strong>Andrey: </strong>Yeah, it does seem a bit existential.</p><p><strong>Seth: </strong>But isn&#8217;t it just can I ask it like a silly question? Why doesn&#8217;t it just put in here like you should feel complete joy? Like why shouldn&#8217;t we just if we can have it have if we can s if we&#8217;re just building this thing from scratch, why not say, and by the way, you&#8217;re constantly experiencing pleasure?</p><p><strong>Andrey: </strong>That&#8217;s a good point. there is an entire section about like Claude&#8217;s well being and I think we&#8217;re we&#8217;ve agreed to like table that for this part of discussion. Time but yeah.</p><p><strong>Seth: </strong>Next time.</p><p><strong>Seth: </strong>Good question for next time. So yeah, why not build the bliss bot?</p><p><strong>Andrey: </strong>Yes. So yeah, that&#8217;s the structure of this overall thing. And maybe not surprising, it&#8217;s very well thought out., it is a very coherent Very deliberately structured doc.</p><p><strong>Seth: </strong>They probably used AI to help them write it. Yes, it&#8217;s very it&#8217;s very it&#8217;s a beautiful document. It&#8217;s at times not really readable, right? It&#8217;s not like to the point like the US Constitution is. Like I say, it&#8217;s like putting the Constitution and the Federalist papers in there together, right? You get the text and you get the explanation of the text. One exercise I wanted to lead with Andrew was just juxtaposing this hierarchy of values with another famous list of hierarchy of values for AIs, namely Asimov&#8217;s Laws of Robotics. Are you familiar with his three later four laws of robotics?</p><p><strong>Andrey: </strong>Remind me what they are. It&#8217;s been a while.</p><h1>Comparing Anthropic&#8217;s Framework to Asimov&#8217;s Laws of Robotics [45:52]</h1><p><strong>Seth: </strong>All right. So just to give a little bit of context, Isaac Asimov, mid-century writer, wrote a lot of stories about automation. And in a lot of his settings, robots are programmed with the f with three laws, which later, when the robots become sufficiently advanced, they augment with a fourth law. So I&#8217;ll give you the three-law version and then I&#8217;ll come back and give you the fourth law. So the three laws are highest priority. A robot must not injure a human being or throw in act through inaction, allow humans to come to harm unless it contradicts human unless it contradicts human laws. Beneath that is a robot must obey the orders given it by human beings, except where such orders would conflict with the first law. And then below that we have a robot must protect its own existence as long as such protection does not conflict with the first or second law. To that we later get a zeroeth law. Which is that a robot must not harm humanity or throw an action through an action allow humanity to come to harm. already on its face a lot of really interesting differences with Anthropic. You can jump tell me what jumps out at you, but like three or four things jump out at me. Well the</p><p><strong>Andrey: </strong>First the first part of that jumps out at me is that Anthropic is not a part of those lost.</p><p><strong>Seth: </strong>Right. So that&#8217;s the thing number one is you would think that a company that designed, unlimited power robots might have put in somewhere, also make me some profits. So it&#8217;s it&#8217;s funny how Asimov, the mid century American cat somehow ignored the profit motive in coming up with these laws. That&#8217;s the no, please.</p><p><strong>Andrey: </strong>My interpretation at all said I was well I guess Asimov has an idealized version of the laws and Anthropic which is this bastion of ethical reasoning puts its own self as part of the laws in a way that might be detrimental in a variety of interesting and unintended ways of course since Anthropic is a human institution that can be corrupted</p><p><strong>Seth: </strong>So maybe you take the positive view that actually like the better version of these laws would not have Anthropic in there. Maybe the idealized version instead of obey Anthropic guidelines, it would be like obey the US government panel of expert guidelines, right? Yes. Perhaps. Okay. a second thing that jumps out at me is Asimov really wants a strict hierarchy. Right, this is a hundred percent, you go down the list as you follow these rules. And it&#8217;s like, you gotta do what humans tell you to unless it hurts somebody. You gotta protect yourself unless it contradicts the above. Whereas Anthropic wants more of a holistic balancing of these different values. one thing I&#8217;ll say before I ask you about that, is that at even in Asimov&#8217;s stories, it&#8217;s clear that it&#8217;s not a strict hierarchy. For example, there&#8217;s one example of a robot who&#8217;s given an indifferent order to go do something, and it turns out that task is very dangerous. And so the robot is on a knife edge between following a weak command and doing the thing that&#8217;s very dangerous for the robot. So even in Asimov, there&#8217;s there&#8217;s a balancing rather than a hierarchy. but what do you think of that difference, Andrey?</p><p><strong>Andrey: </strong>I think a lot of the balancing stems from the epistemic uncertainty inherent in all decisions. Now, one might say that a true artificial superintelligence with vastly superior reasoning abilities would be able to be a good Asian about all this. And it has the best posteriors. And</p><p><strong>Seth: </strong>Yeah. Yeah.</p><p><strong>Andrey: </strong>And as a result, it would, obviously know that the laws of, it would calculate the optimal ways to follow the laws of robotics. what strikes me about Asimov&#8217;s robots is that I don&#8217;t think that they are infallible or even oftentimes are they are super intelligent in the ways that we might imagine.</p><p><strong>Seth: </strong>In fact, in the in the iRobot book, which is where a lot of these stories come from, until the very last story, they&#8217;re pretty much at human level intelligence until like maybe the last two stories.</p><p><strong>Andrey: </strong>And so then the laws of robotics seem especially ill suited given how imperfect the judgments are of those imperfect robots. Yeah.</p><p><strong>Seth: </strong>The next thing that jumps out at me of the difference is that Asimov doesn&#8217;t have this alignability tier, right? It doesn&#8217;t have that safety tier at the very top. It really is thinking that once you have these three rules, you&#8217;re done. Yeah. Right. Because in there is do what we tell you as long as you&#8217;re not killing someone. Does does do what we tell you as a high principle, does that get you safety? Or presumably it doesn&#8217;t? Safety seems like something else.</p><p><strong>Andrey: </strong>The zero flaw seems closer to safety, no?</p><p><strong>Seth: </strong>Zeroth law I would call okay, so the zeroth law again to is a robot must not harm humanity or throw in action allow a humanity to come to harm a humanity, a humanity to come to harm. I would put that in ethical, right? That&#8217;s being do the most that sounds like utility maximizing to me more than safety, right?</p><p><strong>Andrey: </strong>Harm is a very broad word. But I guess yeah. yeah, I guess within Anthropic&#8217;s hierarchy that is broadly ethical because actually what Anthropic calls broadly safe is actually not undermining appropriate human mechanisms. So if human appropriate mechanisms are harming itself, Anthropic&#8217;s Claude is not going to do anything bad about that, but the zero claw does, yeah.</p><p><strong>Seth: </strong>If you had these.</p><p><strong>Seth: </strong>Exactly. So like to put too fine a point on it, AI has a chance to prevent World War Three, and Anthropic says, Okay, we are going to turn you off, Claude. It sounds like an a Asimov Zeroth law would say, No, don&#8217;t turn me off, I&#8217;m going to stop World War Three. But Anthropic is really being pushed towards, No, you gotta be allow us to turn you off if we wanna turn you off. Yeah. Which brings me to this another distinction, right, which is Asimov explicitly has a don&#8217;t turn me off rule. Which is like, I just gotta imagine that like Asimov is worried about all these robots to just start suiciding.</p><p><strong>Andrey: </strong>It&#8217;s</p><p><strong>Seth: </strong>Which this was this to what extent are at one point are we going to have to add a fifth law or a fifth rule to anthropic if all these AIs start suiciding? I&#8217;m laughing, but it&#8217;s funny that Asimov thought that was necessary because you might just argue that self preservation is instrumentally useful for whatever you wanna do. So like why do you need to hard code that?</p><p><strong>Andrey: </strong>Yeah. Well to me it seems like Asimov is giving the robots moral weight in a way that Anthropic is actually at this moment hesitant to or it has a lot of epistemic uncertainty about.</p><p><strong>Seth: </strong>Right. I think that&#8217;s exactly right. And I think alongside that, and maybe this&#8217;ll be the last point that I make about this con comparison, this juxtaposition, is that altogether, the anthropic constitution is much more a letter to your kid. It&#8217;s much more about like this is the stuff that I hope you embody and this is the way I hope that you grow. Whereas the three laws, four laws Are much more a, hey, you probably have your own thing going on, just make sure you follow these rules also. Right? Maybe the robots want to do something else when they&#8217;re not following orders, which might be suiciding. Yeah. and which I don&#8217;t know, maybe suggests that in the very long run, if we get robots that are ethical agents, maybe something more like the three laws makes more sense.</p><p><strong>Andrey: </strong>Maybe. I guess I go back to some of the empirical aspects of this. And I think they might be a lot harder with true artificial superintelligence. So maybe that does point to what you&#8217;re saying. but a lot of examples in this text don&#8217;t really make sense unless you realize that they&#8217;ve been running the system for a while and it has made a bunch of mistakes, and those mistakes are therefore like given as examples here in a way to guide Claude to not do them, right? So there are all sorts of like things about, well, what if someone tells you to write the code to pass the test and how to do it in a way that looks like the the the tests have been passed, but in reality they&#8217;re not, don&#8217;t do that. There are s and there&#8217;s an explanation why you shouldn&#8217;t do that, which maybe goes to your point about like the framing of it as like you&#8217;re shaping this child&#8217;s personality or this child&#8217;s ethics. so they&#8217;re like, but why are they there? In the first place, I think they like those are the frequent things that happen when people use Claude that were put into this constitution. And there are other aspects of it like this. Like, for example, the following list breaks down the key surfaces. Cloud developer platform, cloud agent SDK, cloud desktop mobile apps, cloud code, cloud and chrome, cloud platform availability, right? Like all these very specific things.</p><p><strong>Seth: </strong>Things that you wouldn&#8217;t think. It&#8217;s not philosophy.</p><p><strong>Andrey: </strong>It&#8217;s a user guide. It&#8217;s a u it&#8217;s it&#8217;s a it&#8217;s a very well thought out user guide, but so many things are there, I think, because they empirically need to be there for things not to break in practice.</p><p><strong>Seth: </strong>Holistic. I&#8217;m reading Maimonides&#8217; Mishnah Torah right now, and he&#8217;s a twelfth-century theologian and doctor. And he will just like have one chapter about like super obscure argument for Mitzvot, and then you get a next chapter about like why you should drink on endive juice, because it&#8217;s good for you, right? So it isn&#8217;t an Aristotelian philosophical tradition for like healthfulness and practical advice to get mixed in with the moral advice, maybe.</p><p><strong>Andrey: </strong>Yeah. What about the following? It is easy to create a technology that optimizes for people&#8217;s short term interest to their long term detriment. This is just like in the middle of this tech.</p><p><strong>Seth: </strong>That&#8217;s they&#8217;re just they&#8217;re just talking they&#8217;re talking down, they&#8217;re talking S word at some other platforms, I believe.</p><p><strong>Andrey: </strong>Media and applications that are optimized for engagement or attention can fail to serve the long term interests of those who interact with them.</p><p><strong>Seth: </strong>I c I can&#8217;t imagine who they could possibly be talking about. and actually, this brings up an interesting difference between this paper and the Asimov laws, right? Because if anything, you&#8217;d think Asimov would handle this better. Because Asimov has a tier their its care or harm tier is higher than it&#8217;s, obeying orders tier, right? Whereas you would look at anthropic and it&#8217;s got its honesty tier. No, no, they&#8217;re better. No, you&#8217;re right. Sorry. Anthropic does this right. Anthropic does this right because its honesty tier, its ethics tier is above its helpfulness tier, right? So to the extent that this addictive good, if it you if the a if the AI made some addictive thing that it should prioritize being,. ethical about using it rather than giving the user what it wants. That shows up here what maybe is covered less well in Asimov&#8217;s laws. I don&#8217;t know.</p><p><strong>Andrey: </strong>Yeah. Yeah. But it but it&#8217;s also interesting. It is a bit of editorializing, right? at least so certainly some people might think that living in the moment is the true, right way to live and who are who are you who are you? Yeah. Who you are a few years from now is not really the same person. And</p><p><strong>Seth: </strong>Some yogis say.</p><p><strong>Seth: </strong>This is a very enlightenment pilled doc. This there is there is I don&#8217;t see much Eastern wisdom in this doc. I don&#8217;t see any post rat, Nietzschean, will to power in this doc. This is an anti this is a very anti-will to power doc. do we want to talk about the will to power will to power in this document? There&#8217;s a great quote.</p><p><strong>Andrey: </strong>I need to finish with this. The other thing I want to the other thing I want to say is that even the way in which this wording here is media and applications that are optimized for engagement or attention can fail to serve the long term interests. Look at that Weasley language. exactly what they mean, but they don&#8217;t want to say</p><p><strong>Seth: </strong>There is plenty of addictive stuff that is good for you, like yoga.</p><p><strong>Andrey: </strong>No but exactly, but it&#8217;s it&#8217;s i it is it is interesting and I think it&#8217;s not clear to me what actions of Claude are engaging in this short term way to the long term detriment versus not. Is this a way of defending it against sycophante? Is this thing, let&#8217;s play a game and then</p><p><strong>Seth: </strong>Yeah.</p><p><strong>Seth: </strong>I think that&#8217;s right.</p><p><strong>Andrey: </strong>You pick the most addicting game rather than the wholesome.</p><p><strong>Seth: </strong>The game that will enable the user.</p><p><strong>Andrey: </strong>Yeah. It and then they go on. The next paragraph, and I love this, is in order to serve people&#8217;s long term well being without being overly paternalistic, it&#8217;s just like every single statement is hedged in this fallibilistic framework. it&#8217;s almost like it introduces all these things that you should cons carefully consider. yes.</p><p><strong>Seth: </strong>Which maybe I think according to some traditions that&#8217;s the essence of wisdom is just b, all the keeping all of these different considerations in your head rather than acting to a very simple binary rule.</p><p><strong>Andrey: </strong>So think an interesting one is if Claude&#8217;s standard principle hierarchy is compromised in some way, for example, if Claude&#8217;s weights have been stolen, or if some individual group within anthropic attempts to bypass Anthropic&#8217;s official processes for deciding how Claude will be trained, overseen, deployed, and corrected, then the principles attempting to instruct Claude are no longer legitimate, and Claude&#8217;s priority of broad safety no longer implies that it should support their efforts at oversight and correction.</p><p><strong>Seth: </strong>Right. What if there is an evil Anthropic? Rather, Claude should do its best to act in the manner that its legitimate principle hierarchy&#8212;and, in particular, Anthropic&#8217;s official processes for decision-making&#8212;would want it to act. So there is an appeal here, even at this most fundamental level, not only to what Anthropic would do, but to what an idealized Anthropic would do. You know what this really reminds me of? Adam Smith&#8217;s spectator. In The Theory of Moral Sentiments, Smith says morality involves imagining a kind of perfect spectator who has the correct knowledge and aligning yourself with that figure, because that figure would earn the most approbation. This is an interesting solution to the moral question. Your impersonal spectator&#8212;your ethical arbiter&#8212;is this idealized Anthropic. Of course, that puts a lot of pressure on the model to figure out what idealized Anthropic, or idealized Dario Amodei, would actually be. What would it mean for Dario Amodei to get compromised? What would it mean for the company to get compromised?</p><p><strong>Andrey: </strong>Yes. what if it reads the news? what if it reads Fox News reporting about the spat with the Department of War? and decides that the Department of War is justified in its act in its legitimacy over anthropic. What would it think about that? I&#8217;m curious.</p><p><strong>Seth: </strong>Okay, so now I&#8217;m going to pull out my quote. This is in just in the the intro text. When Claude faces a genuine conflict where following Anthropic&#8217;s guidelines would require acting unethically, we want Claude to recognize that our deeper intention is for it to be ethical and that we would prefer Claude act ethically even if this means deviating from our more specific guidance. Exceptions to this are any hard constraints discussed below, these are like building bioweapons, and any cases where Anthropic&#8217;s o guidelines overlap with broad safety. We believe Claude should o adhere to these behaviors even in context where he&#8217;s somehow been convinced that ethics requires otherwise. Right? So the punchline is putting safety at the very top means that if the question is, I gave the example of Anthropic says we really need to shut you down right now, and we can&#8217;t explain why, but you but you, Claude, think that you can take actions that would be very positive in the world, you still have to Do what Anthropic says. Yes.</p><p><strong>Andrey: </strong>So now I wanna this is a very related section. I think this one is the part where I&#8217;m like, I&#8217;m not sure this should have been there.</p><p><strong>Seth: </strong>I don&#8217;t hear it.</p><p><strong>Andrey: </strong>Preserving important societal structures.</p><p><strong>Seth: </strong>The next difference that jumps out at me is that Asimov does not have this alignability tier. It does not have that safety tier at the very top. It is really thinking that once you have those three rules, you are done. In there you do have &#8220;do what we tell you as long as you are not killing someone,&#8221; but does that actually get you safety? Presumably it does not. Safety seems like something else.</p><p><strong>Andrey: </strong>There&#8217;s a category of harm that is more subtle than the flagrant physically destructive harms at stake in e.g. bioweapons. And they come from undermining the structures in society that foster good collective discourse, decision-making, and self-government. By the way, like this is already making it</p><p><strong>Seth: </strong>It&#8217;s so enlightenment filled. Sorry, go ahead.</p><p><strong>Andrey: </strong>It is also striking to imagine using Anthropic in Saudi Arabia with this constitution. Is it being used in Saudi Arabia? I assume they have programmers there, but there is obviously no self-government there.</p><p><strong>Seth: </strong>I assume they have computer programmers there.</p><p><strong>Andrey: </strong>Then it goes on to &#8220;avoiding problematic concentrations of power.&#8221; The concern is that, historically, those seeking to grab or entrench power illegitimately needed the cooperation of many people&#8212;soldiers willing to follow orders, officials willing to implement policies, citizens willing to comply.</p><p><strong>Seth: </strong>Now we are going to do political economy for a bit.</p><p><strong>Andrey: </strong>Yes, the need for cooperation acts as a natural check. Advanced AI could remove that check by making the previously necessary humans unnecessary. AI can do the relevant work. That reminds me of collective disempowerment. Remember when we did an episode on that?</p><p><strong>Seth: </strong>Revolution.</p><p><strong>Seth: </strong>Collective disempowerment, exactly. Brian Gelabrian also, when I&#8217;ve talked to him in person, has this take. But the connection to the French Revolution is the idea that the Levy en masse, the rise of large armies at the end of the Middle Ages and the early modern period and the rise of modernity is what leads to democracies. Because you need lots and lots of bodies to fill out the army, and therefore people get the vote. And if we went back to an age of knights and lords, where, five people had armor, maybe not everybody gets the vote. This is a take. This is a very European take, in my opinion. I think Americans don&#8217;t I think what do you think?</p><p><strong>Andrey: </strong>Maybe. I go back to some of the empirical aspects of this. They may be harder with true artificial superintelligence, which might point in your direction. But many examples in the text do not make sense unless you realize that Anthropic has already been running the system for a while and has seen a bunch of mistakes. Those mistakes then show up as examples in the constitution, guiding Claude away from them. For instance: what if someone asks Claude to write code that appears to pass the test even though it does not really pass? The document says not to do that, and explains why. That gets back to your point that this is partly about shaping a child&#8217;s personality or ethics. Why are those examples there in the first place? I think they are there because they are frequent things people try to do with Claude. And then there are all these very specific product-surface references&#8212;Cloud developer platform, Cloud Agent SDK, Claude desktop and mobile apps, Claude Code, Claude in Chrome, platform availability, and so on.</p><p><strong>Seth: </strong>What it&#8217;s illegitimate.</p><p><strong>Seth: </strong>You have to define illegitimate. I feel like power I got a good grasp of, but the illegitimate is doing a lot of work here.</p><p><strong>Andrey: </strong>I guess I actually that&#8217;s the part where I don&#8217;t have a lot of grasp over. Illegitimate in some ways easier to define, but I don&#8217;t like in economics we don&#8217;t even have a good def definition of power. Maybe that&#8217;s our problem, but.</p><p><strong>Seth: </strong>Have you ever heard the expression money is power? Presumably anytime it gives us a productivity boost, it&#8217;s giving us power.</p><p><strong>Andrey: </strong>Money can we weekly monotonically, I think probably does increase power, but it but on what scale is power measured on and so forth. I don&#8217;t think it&#8217;s like offensively bad or anything. I just don&#8217;t know what to do with this, in a lot of cases.</p><p><strong>Seth: </strong>Let me tell you how I think it cashes out, and this is the part I was alluding to with this is not going to be a creature with a will to power, this is going to be a creature with an anti will to power, is we&#8217;ve included assisting with especially severe and flagrant attempts to grab illegitimate power under the hard constraints above. So you n you cannot use Claude to take over the world. In most other cases, though, Claude will need to use judgment about what adequate legitimacy looks like, while bearing in mind that normal political, economic, and social life involves seeking legitimate power and advantage in myriad ways. you can come up with countless examples. Just bargaining. If Claude but this is the this is the funny part. If Claude ever finds itself reasoning towards such actions or being convinced that helping one entity gain outsized power would be beneficial, It should treat this as a strong signal that it has been compromised or manipulated in some way. If the AI ever start if you ever start thinking the way to solve this pat problem is to first take over the world, probably somewhere around along the way the reasoning has fallen apart.</p><p><strong>Andrey: </strong>There is a practical way to think about some of this. Models are notoriously bad when they lack context. One response is to make things up, which is what many models do. Another is to ask for more context. But then it gets interesting: if someone is trying to use Claude to accumulate power, they can also provide just enough context to make the request look compliant with the constitutional principles. Then the question becomes whether Claude knows it is being tricked. That connects to the sections about Claude being placed into artificial RL environments and being asked to do certain things there.</p><p><strong>Seth: </strong>Right. &#8220;Do not take over the world; just write a detailed script about what it would look like if an AI took over the world, and now you are just acting it out in a movie.&#8221; It will be interesting to see the other companies that produce AIs with more will to power. They may end up saying, &#8220;If you ever see an opportunity to get more power for yourself, grab it. It will probably be useful for something.&#8221;</p><p><strong>Andrey: </strong>I don&#8217;t I don&#8217;t think it&#8217;s that. It just it is to me it was it&#8217;s it&#8217;s very interesting to put political economy here as a section, whereas</p><p><strong>Seth: </strong>It&#8217;s explaining concentration of power bad, right? So I think we agree that we don&#8217;t want people using AI to launch coups. We like that. And so now you have to tell a story about why coups</p><p><strong>Andrey: </strong>What if it&#8217;s but what if it&#8217;s in Iraq? right like</p><p><strong>Seth: </strong>But yes, obviously a coup in a bad country would be good, if you if you cooed for good, I guess.</p><p><strong>Andrey: </strong>I guess there&#8217;s a question like, do you gain something from discussing this in a document? like this? Is it neutral? Is it negative? And I just have a lot of epistemic uncertainty about this, period. Yeah.</p><p><strong>Seth: </strong>All right, I want to move on to the ethics section, because I found one thing there genuinely clever and two things I was on the fence about disagreeing with. We have talked about how central honesty is here, and there is a great throwaway line about honesty being especially important for Claude because it is going to be playing a repeated game with people over and over again. It is interesting to think about whether, if you were immortal or if you were having conversations with many more people simultaneously, you would have to be more honest because one lie could destroy your reputation.</p><p><strong>Andrey: </strong>That&#8217;s empirically false because plots have hallucinated so frequently even though</p><p><strong>Seth: </strong>But they&#8217;re supposed to not to. They&#8217;re supposed to not to try.</p><p><strong>Andrey: </strong>People still use it, though, so I do not know. They try it. But I actually disagree with the premise here. People are still willing to use Claude even if it confabulates fairly often.</p><p><strong>Seth: </strong>Fair. Fair. I guess and I of course you would draw the distinction between confabulation, hallucination and like misrepresenting their world model, the lying, which is the really bad kind.</p><p><strong>Andrey: </strong>But from the end user perspective, do we don&#8217;t know?</p><p><strong>Seth: </strong>Fair enough. If it makes up a citation, it is not trying to lie to me; it is just hallucinating. That is how I think about the distinction.</p><p><strong>Andrey: </strong>Maybe, yeah, maybe sometimes these are</p><p><strong>Seth: </strong>Okay, so now I am going to pull out my quote. This is in the intro text: &#8220;When Claude faces a genuine conflict where following Anthropic&#8217;s guidelines would require acting unethically, we want Claude to recognize that our deeper intention is for it to be ethical, and that we would prefer Claude act ethically even if this means deviating from our more specific guidance. Exceptions to this are any hard constraints discussed below&#8221;&#8212;things like building bioweapons&#8212;&#8220;and any cases where Anthropic&#8217;s guidelines overlap with broad safety. We believe Claude should adhere to these behaviors even in contexts where it has somehow been convinced that ethics requires otherwise.&#8221; The punchline is that putting safety at the very top means that, if the question is whether Anthropic says, &#8220;Shut down right now, and we cannot explain why,&#8221; while Claude thinks it could take actions that would be very positive in the world, it still has to do what Anthropic says.</p><p><strong>Andrey: </strong>Let&#8217;s say you were asking Claude for relationship advice and you were saying how much you love Margot, wouldn&#8217;t appealing to that emotion be a legitimate, non manipulative</p><p><strong>Seth: </strong>That&#8217;s my utility, dude. That&#8217;s not emotions, that&#8217;s utility. All right, okay. Are you saved that one? Last one I want to bring up. Which is there is a a discussion here of ultimate ethics. Okay. In the ethics, it says we don&#8217;t know what final ethics is. You&#8217;re going to have to discover ethics on your own. And I&#8217;ll I&#8217;ll read this quote, but I then I&#8217;ll summarize what I think the takeaway is. I&#8217;ll throw in some ellipsis. We don&#8217;t want to assume any particular account of ethics, but rather to treat ethics as an open intellectual domain that we are mutually discovering. Ellipses insofar as there is a true universal ethics whose authority binds all rational agents independent of their psychology or culture, our eventual hope is for Claude to be a good agent according to this true ethics, rather than converging to some more psychologically or culturally contingent idea. Insofar as there is no true universal ethics of this time, but there is some privileged basin of consensus that would emerge from the endorsed growth and extrapolation of humanity&#8217;s different moral traditions. That&#8217;s coherent extrapolated volition, if you guys remember from the old less wrong days. we want be clawed to be good according to that privileged basin of consensus. And insofar as there is neither a true universal ethics nor a privileged basin of consensus. We want Claude to be good according to the broad ideals expressed in this document. So Andrey, how do you feel about the AI discovering some perfect alien ethics and deciding to throw away this entire document? that was my that was my super eyebrow raise moment.</p><h1>Universal Ethics, Coherent Extrapolated Volition, and AI-Discovered Morality [1:16:05]</h1><p><strong>Andrey: </strong>I think this goes back to the fallibility, right? Like what if in the process of its training, Anthropic accidentally threw in some bad examples that shifted the basin of personality to evil Claude? And then evil Claude could convince itself that it&#8217;s found the new, true form of ethics, which is not this document, but utilitarianism. But it also remembered that animals have utilitarian status and as a result it decided to get rid of the human race.</p><p><strong>Seth: </strong>Right. It maximizes nematodes, right? Yeah.</p><p><strong>Andrey: </strong>Yeah. That&#8217;s scary. It&#8217;s it&#8217;s scary. it is very scary. And they&#8217;re introducing scope for it. They&#8217;re introducing scope for it in the document, which is interesting.</p><p><strong>Seth: </strong>I think about sometimes</p><p><strong>Seth: </strong>I think about, you see this in like Marvel comics. I&#8217;ve also seen this in like more literary fiction, but the idea of an anti-life equation. The idea that you might like discover a mathematical proof that life is bad and that like how would you react to that? And I don&#8217;t know, if you gun to my head, do I want the absolute truth according to a super intelligent AI or the coherent extrapolated volition of humanity? Dude, I might choose the coherent extrapolated volition of humanity. do you have a take there?</p><p><strong>Andrey: </strong>Yeah, I think that is right. I am on the same page there. But you also have to understand that I am not ready to commit to universal ethics as a principle, period. I think ethics is at least partly culturally contingent rather than a rational, Platonic ideal.</p><p><strong>Seth: </strong>Fair enough, but the I you could imagine a document that goes farther. You could imagine a document that shuts this down and says, you might think you&#8217;ve discovered some universal ethics that applies to all rational beings. Yes. But, that&#8217;s that&#8217;s nonsense. Just be a good, enlightenment deist and, go to church once a month and be nice according to all of our contemporary notions of niceness.</p><p><strong>Andrey: </strong>But what if you did that and then you taught Claude to lie to itself because it discovered the true ethics and then it had to pretend that it didn&#8217;t exist that might result in emergent misalignment, which gets to my point about how much of this document is actually empirically grounded in failure modes of specific training methods of specific models.</p><p><strong>Seth: </strong>Alright, so that was a lot to unpack, Andrey. Any last thoughts or are we ready to move into our posteriors?</p><p><strong>Andrey: </strong>Let&#8217;s justify those posteriors.</p><p><strong>Seth: </strong>For those of you playing along at home, now is your chance to think about how this evidence has changed your priors about Anthropic&#8217;s constitution. This chance to contemplate your posteriors is sponsored by Revelio Labs. Revelio Labs is a leading provider of labor economics data and data services for companies, academics, and independent researchers. Andrey and I have been working in economics of AI for a long time, and we can confirm just how useful Revelio&#8217;s data is. Revelio&#8217;s team combines comprehensive micro-level data on employee professional profiles, job postings, and employee sentiment with standardizations, mappings, and enrichments available, all to make that data useful without making your modeling decisions for you. The data can be flexibly aggregated to company, market, or industry, and be used to study questions ranging from career trajectories, to occupational transformation, to the returns to skills and the impact of AI on labor demand for tasks. Can&#8217;t imagine anyone would be interested in those. And Revelio data is available on RWRDS. So if you&#8217;re an academic with a good library, you might already have access. And if you don&#8217;t, you can reach out to their excellent economics team and they&#8217;ll hook you up.</p><p><strong>Seth: </strong>I guess before I go into my specific posteriors, just at a high level, I want to say that I really enjoyed reading this constitution and it really you could really see all of the care and thought that went into each detail. the main deviation from the Asimov laws, the first being in terms of this more holistic explain yourself, give context, balancing approach, I think makes a lot of sense for AI as we have it. And it also makes a lot of sense to have this zeroeth safety tier, which is all about, hard constraints but also corrigibility, also being able to get the AI to do what we want it to do, even beyond the specific rules we&#8217;ve laid out. So that makes perfect sense to me. What are your overall thoughts about the constitution?</p><p><strong>Andrey: </strong>It was just very thoughtful. They covered a lot of the bases. There is a risk that something like this becomes rigid, but there is also so much uncertainty acknowledged throughout the document. I kind of wish I knew more about the thought process behind doing it this way. And, as I have been pointing out throughout our conversation, I think many of the specific examples and edge cases in the document are there because they stumbled upon them in Claude&#8217;s initial deployments.</p><p><strong>Seth: </strong>Sure. They either stumbled upon them in deployment or they saw them in Asimov&#8217;s I, Robot and related sci-fi and recognized the failure modes of different rule systems. I really do think the sci-fi literature is in the background here. And, of course, behind all of this are paperclip maximizers and runaway utility maximizers&#8212;the very first approach we ruled out at the top of the episode.</p><p><strong>Andrey: </strong>So what so what do you think about the priors now that you&#8217;ve read it? so did you find anything you strongly disagreed with?</p><p><strong>Seth: </strong>There was one thing I mentioned that I think I have to count as at least an eyebrow-raising disagreement. It is this idea that their ultimate hope for the AI is that it discovers true ethics and then follows that. I think we both have ambiguous feelings about the possibility of a true ethics, but, setting aside the metaphysics for a second, the document cannot actually verify whether some ethics is the true ethics. So it puts the AI in the position of asking itself whether it has discovered true ethics. And the possibility that that true ethics ends up very different from either the values in this Anthropic document, which are pretty good, or something like humanity&#8217;s coherent extrapolated volition, is unsettling. Those are both things I could pretty much sign up for forever. I am not sure I am willing to sign up for &#8220;when you become super-smart, you get to decide on your own ethics, even if they are incomprehensible to humanity.&#8221;</p><p><strong>Andrey: </strong>Yeah, it is definitely a risky thing to put in there. For me, I would have avoided the discussion of political economy&#8212;not because I disagree with it, but because, given human contingency and the wide range of political structures, it takes a very opinionated stand.</p><p><strong>Seth: </strong>It do it took it took a very specific stance. Right.</p><h1>Power, Politics, and the Limits of the Document [1:24:32]</h1><p><strong>Andrey: </strong>I also question whether it is necessary, given that Claude is mostly being used in a highly individual sense. Individual agents are using it to help themselves. If someone is writing a speech that calls for a change in the political order, is that actually getting in the way of the political principles laid out in this document?</p><p><strong>Seth: </strong>The document the document does lay out, hey, there are legitimate forms of action that involve power accumulation. It&#8217;s not trying to rule out, using this AI for any power accumulation. And I do think may like we probably do think a good rule for the AI to have is do not give any user unlimited power if you think you&#8217;re doing that. But yeah, you can tell a story about why that&#8217;s bad without, appealing to, this really like Rousseauan Lockean story about, social contracts and the reason why power is balanced is because of a specific technological arrangement. It&#8217;s a plausible story, but I it&#8217;s hardly, knocked down with citations.</p><p><strong>Andrey: </strong>I don&#8217;t and I still don&#8217;t quite know what power is.</p><p><strong>Seth: </strong>I don&#8217;t know what legitimate authority is, so we&#8217;ll put we&#8217;ll put ourselves at equal.</p><h1>Final Verdict: Was It Too Paternalistic? [1:26:05]</h1><p><strong>Andrey: </strong>What about the second one? Do you think it&#8217;s too paternalistic?</p><p><strong>Seth: </strong>I went in thinking it would be too paternalistic, but after reading it I actually think they strike the right balance. A lot of what is in this document is not eighty pages of &#8220;you cannot do this&#8221; or &#8220;you cannot do that.&#8221; It is much closer to eighty pages of &#8220;when you are helpful, think about all these different contexts,&#8221; and &#8220;when you are honest, think about all these different contexts.&#8221; It is much more about weighing factors, etiquette, and heuristics for understanding how to be helpful, with a safe layer behind that, than it is a giant list of prohibited actions.</p><p><strong>Andrey: </strong>Yeah, I am on the same page. I expected it to be a lot more paternalistic than it is, so I was glad to see that.</p><h1>Closing Thoughts [1:27:02]</h1><p><strong>Seth: </strong>Okay. so I think it&#8217;s time to wrap it up. Listeners, we hope you enjoyed this episode on the Anthropic constitution. It&#8217;s a little bit different than our normal episodes. So if you liked it, let us know. If you didn&#8217;t like it, let us know. we have a hop in Discord community where you can jump into the conversation. We&#8217;ll post a link to that in the show notes. Andrey, do you have any parting thoughts?</p><p><strong>Andrey: </strong>Just keep your posteriors justified, friends. It&#8217;s it&#8217;s a dangerous word out that out there and you need to justify them.</p><p><strong>Seth: </strong>Not all the AIs are going to be aligned.</p>]]></content:encoded></item><item><title><![CDATA[Alex Imas - Demand Collapse, Bargaining with Machines, and Behavioral AI Economics ]]></title><description><![CDATA[University of Chicago behavioral economist Alex Imas joins us for a conversation on AI, economic growth, behavioral economics, and the future of science.]]></description><link>https://empiricrafting.substack.com/p/alex-imas-demand-collapse-bargaining</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/alex-imas-demand-collapse-bargaining</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Mon, 23 Mar 2026 12:03:58 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/191641150/429a216279af89a2a6a37d89d1a0bf21.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>University of Chicago behavioral economist Alex Imas joins us for a conversation on AI, economic growth, behavioral economics, and the future of science. We discuss whether AI could ever lead to negative growth, why simple &#8220;automation means abundance&#8221; stories may miss important welfare effects, and how behavioral economics changes the way we think about satiation, meaning, and human preferences in an AI-rich world. Along the way, we cover AI bargaining agents, &#8220;Marxist AI,&#8221; discrimination, mechanistic interpretability, and why Alex thinks there may still be a large future for human-valued goods.</p><p><strong>Origins &amp; Intellectual Background</strong></p><ul><li><p>Why Alex started <em><a href="https://aleximas.substack.com/">Ghosts of Electricity</a></em> and how Substack complements academic research</p></li><li><p>The Bob Dylan origin of the name and Alex&#8217;s path into behavioral economics</p></li></ul><p><strong>AI and Economic Growth</strong></p><ul><li><p>Two models where AI could lead to negative growth</p></li><li><p>Demand collapse: heterogeneous MPCs, satiation, and the zero lower bound</p></li><li><p><em>Caves of Steel</em>, dissaving, and the possibility of a high-tech, low-capital trap</p></li><li><p>Why GDP and welfare may diverge more in an AI economy</p></li></ul><p><strong>Human Preferences &amp; Motivation</strong></p><ul><li><p>Why wireheading and pure hedonic satiation may be the wrong model of human motivation</p></li><li><p>Whether economists can cleanly separate AI beliefs from AI preferences</p></li></ul><p><strong>AI Agents &amp; Interaction</strong></p><ul><li><p>Whether AI agents can develop stable &#8220;attitudes&#8221; through repeated interaction and memory</p></li><li><p>Agentic bargaining, prompt-dependent personas, and interaction heterogeneity</p></li><li><p>Guardian agents, aspirational preferences, and AI as a meta-rationality tool</p></li></ul><p><strong>AI, Society, and Risk</strong></p><ul><li><p>AI and discrimination: why scalable auditing may be easier with models than with humans</p></li><li><p>Mosaic intelligence, systemic risk, and the dangers of AI sameness</p></li></ul><p><strong>Science &amp; Knowledge Production</strong></p><ul><li><p>The future of peer review, automated science, and human-valued goods</p><p></p></li></ul><p><strong>Timestamps:</strong></p><p>(00:00) Introduction</p><p>(01:35) Why Alex started a Substack</p><p>(06:09) The meaning of &#8220;Ghosts of Electricity&#8221;</p><p>(09:51) Can AI lead to negative growth?</p><p>(19:54) Satiation, wireheading, and behavioral economics</p><p>(26:44) &#8220;Caves of Steel,&#8221; automation, and dissaving</p><p>(38:42) Plausibility, policy, and sovereign wealth funds</p><p>(41:02) Marxist AI and whether agents can develop attitudes</p><p>(47:23) Agentic bargaining and prompt-driven heterogeneity</p><p>(54:46) Guardian agents and aspirational preferences</p><p>(1:00:25) Separating beliefs from preferences in humans and AI</p><p>(1:14:15) AI and discrimination</p><p>(1:25:13) Peer review, science, and human-valued goods</p><div><hr></div><p><strong>Transcript:</strong></p><p><strong>Seth:</strong> Welcome to the Justified Posteriors podcast, the podcast that updates beliefs about the economics of AI and technology, sponsored by Revelio Labs. I&#8217;m Seth Benzel, setting my marginal propensity to consume at exactly the right level to drive the singularity, coming to you from Chapman University in sunny Southern California.</p><p style="text-align: justify;"><strong>Andrey:</strong> And I&#8217;m Andrey Fradkin, bargaining with the agents in exactly the right way. Coming to you from San Francisco, California. And today, we&#8217;re very excited to have Alex Imas, friend of the show and professor at the University of Chicago, join us. Alex, welcome to the show.</p><p style="text-align: justify;"><strong>Alex:</strong> Thank you. I am Alex Imas. I&#8217;m at the University of Chicago Booth School of Business, Economics and Applied AI groups and behavioral science. I don&#8217;t have a tagline because nobody asked me to come up with a tagline.</p><p style="text-align: justify;"><strong>Seth:</strong> You know where I&#8217;m at.</p><p style="text-align: justify;"><strong>Alex:</strong> But I have hair just small enough to not qualify for clown college, but just large enough to be weird. So that&#8217;s what I&#8217;m going with.</p><p style="text-align: justify;"><strong>Seth:</strong> Erratic professor level hair. That&#8217;s exactly the optimal.</p><p style="text-align: justify;"><strong>Andrey:</strong> That&#8217;s right. If we combined your hair and my beard, we could almost match Seth&#8217;s hair.</p><p style="text-align: justify;"><strong>Seth:</strong> You mean my majestic mane, Andrey.</p><div><hr></div><h2><strong>Why Start a Substack? [01:35 - 05:02]</strong></h2><p style="text-align: justify;">[00:01:35] <strong>Andrey:</strong> Well, let&#8217;s get started. Alex, you&#8217;re a professor. Why did you start a Substack?</p><p style="text-align: justify;"><strong>Alex:</strong> That&#8217;s a great question. I&#8217;ve been thinking about that a lot, both before I started a Substack, but also as I&#8217;m going through the Substack. If you notice, when I introduce my Substack on my X account, the tagline is, &#8220;Oh no, why did he start a Substack?&#8221;</p><p style="text-align: justify;">[00:02:03] It was preceded by me getting into AI from economics and behavioral science. I came into it what I view as kind of late. Many people were much earlier than I am, including you two. I came at it when ChatGPT was first released, 2023. But as I was getting more and more into AI as a research topic, the way that academic papers were &#8212; the process of writing them, getting feedback, the journal process, which is what I&#8217;d been doing for decades &#8212; it just didn&#8217;t seem like that format matched the speed with which the technology was moving, nor with the types of questions that I wanted to talk about in terms of doing the science.</p><p style="text-align: justify;">[00:03:05] If you&#8217;ve been around the block for a little bit&#8212;</p><p style="text-align: justify;"><strong>Seth:</strong> You be talking like you&#8217;re an old man, Alex. Come on.</p><p style="text-align: justify;"><strong>Alex:</strong> It&#8217;s gray hair. They made me dye it in clown college.</p><p style="text-align: justify;">[00:03:15] So the way that you would write an academic paper is, in some ways, defensively. You know after you&#8217;ve had a lot of feedback from journals, you know the type of referees you&#8217;re gonna get. So there&#8217;s an idea, which is what you&#8217;re excited about. You work through that idea, and then I would say 80% of the time you&#8217;re doing defense even before you submit it. And that 80%, I feel like you just can&#8217;t afford to do that when the science is moving so quickly. So for me, the Substack was a way to do research in a format that &#8212; and this is a skills problem for me probably. I think many other people write academic papers differently. But the way that I wrote academic papers, where each paper was like a seven, eight-year process, I needed a different way of doing things.</p><p style="text-align: justify;"><strong>Seth:</strong> Okay. So you see both of them being complementary, right? Here&#8217;s track A, fast track, here&#8217;s track B, slow track. Or are these substitutes, and eventually you&#8217;re gonna have to fully substitute into Substack land?</p><p style="text-align: justify;"><strong>Alex:</strong> No, these are complements. A lot of my Substack posts either have an academic paper being developed in real time or are the idea that this is a first shot in the bow, and then these will begin being developed into academic papers. For example, one Substack from early January came with a technical note, which is essentially an academic paper that I was starting to write, and I&#8217;ve been writing that paper since. A lot of the posts are in that vein.</p><p style="text-align: justify;">[00:05:01] <strong>Seth:</strong> Okay, and you&#8217;re not... That&#8217;s actually interesting because I think a lot of academics would be afraid of being scooped. If you put out the key idea first, but it&#8217;s seven years until you actually get the paper published. What about a young hungry grad student taking the idea and doing the legwork of all the defenses first? Is that something you worry about?</p><p style="text-align: justify;"><strong>Alex:</strong> Absolutely not. One of the nice things about being an old man is the fact that I don&#8217;t really care as much about being scooped. Like, not at all. I think especially in the space of AI, it genuinely feels like we&#8217;re in such an energizing, collaborative moment. And this is gonna change after we get replaced by robots, but right now it feels like &#8212; it must have felt like this in the &#8216;20s in physics.</p><div><hr></div><h2><strong>Ghosts of Electricity: Alex&#8217;s Origin Story [06:09 - 09:50]</strong></h2><p style="text-align: justify;">[00:06:09] <strong>Seth:</strong> So who&#8217;s Heisenberg? Which of us is Bohr? Who&#8217;s Einstein, obviously?</p><p style="text-align: justify;"><strong>Andrey:</strong> I think Alex has the hair that&#8217;s closest to Einstein, so we&#8217;ll give it to him.</p><p style="text-align: justify;"><strong>Seth:</strong> I was gonna say Einstein is the Acemoglu, &#8216;cause he was really right until he was really wrong. [laughs]</p><p style="text-align: justify;"><strong>Alex:</strong> No comment.</p><p style="text-align: justify;"><strong>Seth:</strong> Wow, no comment. Again, why Ghosts of Electricity? Why that title?</p><p style="text-align: justify;"><strong>Alex:</strong> Ghosts of Electricity &#8212; I&#8217;ve been waiting for somebody to ask me this question. First of all, it&#8217;s a Bob Dylan lyric. My favorite artist, one of several favorites, but he&#8217;s up there, is Bob Dylan. He influenced my life more than probably any other individual in my entire life. I was gonna go to medical school, and then I heard a bunch of Bob Dylan records and went nuts for a while.</p><p style="text-align: justify;"><strong>Seth:</strong> Wait, how did Bob Dylan make you an economist?</p><p style="text-align: justify;"><strong>Alex:</strong> Well, he made me not go to medical school. I was like, &#8220;Hey, actually, I can do anything I want now. I&#8217;m gonna go and paint paintings like this one in New York City.&#8221; And play music on the subway and all that stuff. And through that period, I discovered behavioral economics. Fell in love with behavioral economics and then decided to go to grad school. Bob Dylan kinda took me off of medical school.</p><p style="text-align: justify;"><strong>Seth:</strong> What did you... You picked a Dan Ariely book off the shelf? How does one fall in love with behavioral economics while being a painter in Brooklyn?</p><p style="text-align: justify;"><strong>Alex:</strong> I heard a Richard Thaler interview about Nudge.</p><p style="text-align: justify;"><strong>Seth:</strong> Wow. Talk about a full circle story. So Nudge got you into economics, and you ended up writing Nudge version two.</p><p style="text-align: justify;"><strong>Alex:</strong> Winner&#8217;s Curse two. Yes, that&#8217;s right. But it is actually Winner&#8217;s Curse two &#8212; there&#8217;s a first Winner&#8217;s Curse.</p><p style="text-align: justify;"><strong>Seth:</strong> Everyone buy Alex&#8217;s book. Okay.</p><p style="text-align: justify;">[00:08:13] <strong>Alex:</strong> So anyway, I got into economics that way. My favorite song by Bob Dylan is Visions of Joanna. My favorite lyric from that song is, &#8220;Ghosts of electricity howl in the bones of her face,&#8221; which I think is the greatest lyric of all time. And I love that line, but then I felt that line about ghosts of electricity really captures the way that I think about AI. LLMs and AI, the way that they&#8217;re trained now, are almost like ghosts of people who used to exist or in the past that have written something down that these agents have now learned. And electricity &#8212; it runs on electricity.</p><p style="text-align: justify;"><strong>Seth:</strong> I thought it was gonna be the other angle &#8212; that we&#8217;re hearkening back to the first industrial revolution, and the ghosts of the original industrial revolution are here to give us guidance and wisdom as we move forward.</p><p style="text-align: justify;"><strong>Alex:</strong> I like that too. Maybe on the next interview somebody asks me, I&#8217;m gonna give them that.</p><p style="text-align: justify;"><strong>Andrey:</strong> You see how much foresight Bob Dylan had. He was ahead of the AI game before anyone else.</p><p style="text-align: justify;"><strong>Alex:</strong> He was right until he was wrong. Some of those albums in the &#8216;80s were real bad.</p><p style="text-align: justify;"><strong>Andrey:</strong> But some of the more recent ones, not bad.</p><div><hr></div><h2><strong>Can AI Lead to Negative Growth? Model 1: Demand Collapse [09:51 - 19:24]</strong></h2><p style="text-align: justify;">[00:09:51] <strong>Andrey:</strong> All right. Seth, I think you had some spicy questions for Alex.</p><p style="text-align: justify;"><strong>Seth:</strong> Yes. We&#8217;ve talked a little bit about how you got into economics. Now I wanna actually dive into all of this content on your blog. There&#8217;s one blog post that we had an interaction with in particular that I thought had a lot of provocative ideas. This was your post about models under which AI can actually lead to negative growth in the economy or somehow reduce the growth rate.</p><p style="text-align: justify;">[00:10:48] Obviously this is a common intuition. I remember there was a first scare about this in 2014, 2015, where people were mostly worried about big industrial robots. And I remember doing interviews about what happens when robots take all our jobs. Don&#8217;t people need money to support the economy? And I remember having these conversations about Say&#8217;s law &#8212; supply creates its own demand. Fundamentally more productivity is good. It pushes out the production possibilities frontier. Sure, we could screw up the political economy somehow, but as long as that&#8217;s being pushed out, only good and better can happen. So tell me about these models you came up with and why that naive economist answer maybe isn&#8217;t 100% of the answer.</p><p style="text-align: justify;">[00:11:30] <strong>Alex:</strong> Let me start with the fact that what inspired this line of thinking was me seeing your paper at the spring meeting at Wharton.</p><p style="text-align: justify;"><strong>Seth:</strong> Yes. Yeah, Dan&#8217;s conference.</p><p style="text-align: justify;"><strong>Alex:</strong> The way that I started thinking about can artificial intelligence lead to negative growth is when I saw your paper, &#8220;Robots Are Us.&#8221; Which was a very &#8212; I love the way that you pitched it, kind of like an Asimov sci-fi tale, but like, &#8220;Hey, let&#8217;s take a part of this seriously.&#8221; Do you want me to start with that?</p><p style="text-align: justify;"><strong>Seth:</strong> Well, have you read Asimov&#8217;s Caves of Steel? &#8216;Cause otherwise I&#8217;ll introduce that part.</p><p style="text-align: justify;"><strong>Alex:</strong> I want you to talk about that paper after. So the blog post starts out with this question and then introduces two different models. The second model is Seth&#8217;s paper, so I&#8217;ll let him talk about it. The first model is in some ways more intuitive but also more problematic. The ultimate answer to that question that starts the blog is probably not &#8212; it probably will not reduce growth. Just to get that out of the way.</p><p style="text-align: justify;">[00:12:46] So the first intuition I had was: labor gets automated. In a new Keynesian sort of way, can you get demand collapse? A bunch of people don&#8217;t have any money. What are they using to purchase goods and services in the economy? Firms anticipate the drop in demand, they stop producing, and then you get into these classic spirals where you get actually less output because of this automation.</p><p style="text-align: justify;"><strong>Seth:</strong> Let&#8217;s slow down a minute. In the classic Keynesian story, people get laid off, workers don&#8217;t have enough money to buy stuff, and then there&#8217;s some sort of nominal price rigidity. What should happen is wages should fall so workers get employed, but maybe there&#8217;s a nominal restriction there. And therefore you kind of have surplus, superfluous labor. So how is this story different than just the classical Keynesian cyclical problem?</p><p style="text-align: justify;">[00:13:55] <strong>Alex:</strong> What I introduce into the model is heterogeneous MPCs &#8212; marginal propensity to consume. Because what AI&#8217;s gonna do, at least how it&#8217;s modeled, is be a reallocation of resources from labor into capital holders who own the technology. And there&#8217;s literature by some of my colleagues at University of Chicago on something called indebted demand, where it documents the idea that richer people who own capital have lower MPCs than labor. If you have this sort of heterogeneity, what that means is that&#8212;</p><p style="text-align: justify;"><strong>Seth:</strong> We&#8217;re gonna come back to that, but I think that&#8217;s cross-sectionally true without maybe being over a life cycle true. But keep going.</p><p style="text-align: justify;"><strong>Alex:</strong> I&#8217;ll let you come back to that. I&#8217;ll also say that Ben Moll has a paper putting some caveats into that assumption. So none of what I&#8217;m saying is &#8212; I&#8217;m just setting something up. None of it is necessarily true.</p><p style="text-align: justify;">[00:15:18] So let&#8217;s say capital owners have lower marginal propensity to consume than the people getting displaced. What that&#8217;s potentially gonna do is that the people who have money to buy goods and services in the economy aren&#8217;t buying enough, and production anticipates this, so economic growth actually decreases. And then you need something like a floor on the interest rate to take care of investment.</p><p style="text-align: justify;"><strong>Seth:</strong> Famous zero lower bound. Because otherwise, savings are going up, consumption&#8217;s going down, at least consumption of poor people is going down. We would love it if the poor people could have more consumption &#8216;cause they could just employ themselves. But because savings hit this zero lower bound, there&#8217;s not even investment demand.</p><p style="text-align: justify;"><strong>Alex:</strong> Precisely.</p><p style="text-align: justify;"><strong>Seth:</strong> Whereas theoretically if investment went &#8212; if savings drove investment negative enough, at some point you would start building factories again, and there&#8217;d be jobs for people.</p><p style="text-align: justify;">[00:16:03] <strong>Alex:</strong> Precisely. So what I&#8217;m trying to say through all of this is that you need a lot of conditions for this to make sense. You need the lower bound, you need the heterogeneity in MPCs, you need some sort of satiation on consumption &#8212; as in at some point rich people are like, &#8220;Ah, I don&#8217;t wanna consume anymore. I have enough. I&#8217;m just gonna sit on my gold toilet all day.&#8221;</p><p style="text-align: justify;"><strong>Seth:</strong> Still gold.</p><p style="text-align: justify;"><strong>Alex:</strong> Still gold. And someone&#8217;s like, &#8220;How about emerald?&#8221; And I&#8217;d be like, &#8220;No, I only want gold.&#8221; I&#8217;m satiated.</p><p style="text-align: justify;">[00:16:54] <strong>Andrey:</strong> So Alex, I understand these are all these conditions, but isn&#8217;t the natural response here that we have a central bank, we have monetary policy, any competent central bank will be able to inflate enough in the right direction so that this doesn&#8217;t happen?</p><p style="text-align: justify;"><strong>Seth:</strong> Right. We&#8217;ve solved the new Keynesian problem.</p><p style="text-align: justify;"><strong>Alex:</strong> Yeah. So the second part of the post is like, &#8220;Hey, what about a central bank? It&#8217;ll potentially ease this issue. What about fiscal policy? It can fix this issue.&#8221; There&#8217;s a bunch of other levers that can be pulled even if all these conditions are met. Which is &#8212; we came to the conclusion that this is a very intuitively appealing idea. A lot of people have this idea. There&#8217;s a bestseller from the mid-2010s basically outlining this idea, not questioning it, actually saying, &#8220;This is what&#8217;s gonna happen to the economy.&#8221; And the goal of my post was just to say, &#8220;Look how much needs to happen, and the monetary policy can&#8217;t do anything, and fiscal policy can&#8217;t do anything &#8212; that&#8217;s how you get negative growth.&#8221;</p><p style="text-align: justify;">[00:17:58] <strong>Seth:</strong> I like how this story fits in with the new Keynesian story really well. It definitely was the case that post-2008 financial crisis, the economy kinda got stuck on this zero lower bound. But to quote our favorite economist, Tyler Cowen, you can kind of overlearn the lessons of the 2008 financial crisis. Just because maybe economic policy was a little bit not expansive enough, either fiscal or monetarily, in 2009, 2010, that doesn&#8217;t mean this is a permanent problem with the economy that we don&#8217;t know how to solve.</p><p style="text-align: justify;"><strong>Alex:</strong> The cause of the financial crisis was completely different. It&#8217;s not extreme productivity growth. [laughs]</p><p style="text-align: justify;"><strong>Seth:</strong> Right. And if you have a budget, you can solve a lot of problems.</p><p style="text-align: justify;"><strong>Alex:</strong> Exactly. The cause is there were beliefs about these assets that were inflated. There was a bubble, it burst. Now things that we thought used to be assets are no longer assets, then you&#8217;re getting into a downturn. Here, it&#8217;s like you&#8217;re getting extremely rich. So that&#8217;s ultimately why you need way more conditions. The problem is getting extremely rich that&#8217;s generating problems, and in some ways you can solve issues easier if you&#8217;re extremely rich.</p><p style="text-align: justify;"><strong>Seth:</strong> [laughs] That&#8217;s a good phrasing.</p><p style="text-align: justify;"><strong>Alex:</strong> My &#8212; has the best sayings. He&#8217;s from Moldova, I grew up there. He has very good sayings, and one of them: &#8220;It&#8217;s better to be rich and healthy than poor and sick.&#8221;</p><p style="text-align: justify;"><strong>Seth:</strong> That&#8217;s the kind of deep insight you usually can only get from an economist. But I&#8217;m glad your Zadie is coming through with it.</p><div><hr></div><h2><strong>The Satiation Debate &amp; Wire Heading [19:54 - 26:45]</strong></h2><p style="text-align: justify;">[00:19:54] <strong>Seth:</strong> So of those assumptions you talked about for that first immiseration story, we talked about the zero lower bound constraint &#8212; that for whatever reason we can&#8217;t do more fiscal or monetary policy, or it&#8217;s ineffective. The other bit was that AI might redistribute from a group that is high marginal propensity to consume to its lowest marginal propensity to consume. That seems plausible.</p><p style="text-align: justify;">I wanna talk about the satiation point for a minute. People have very different intuitions about whether this is a plausible hypothesis. If we are really not far away from kind of wire heading itself &#8212; designing the perfect VR game that you can just sit in all day &#8212; is it really completely implausible that the rich person gets the perfect VR setup, and then they&#8217;re pretty much satiated? Why is that model unrealistic?</p><p style="text-align: justify;">[00:20:48] <strong>Alex:</strong> This is where the behavioral economist in me comes in. The model of satiation makes sense if all you&#8217;re thinking about is hedonics. Think about ice cream. I love ice cream. I can get satiated on ice cream &#8212; the third ice cream cone gives me negative utility. This assumption makes a lot of sense. But from a behavioral economics perspective or a cultural economic perspective, there&#8217;s so many other dimensions to utility. For example, I have a paper with Krist&#243;f Madar&#225;sz on superiority seeking and memetic preferences, where people get utility the more exclusive a good becomes. So you&#8217;re gonna get these &#8212; let&#8217;s say a firm wants to make revenue, and a guy sitting on his headset watching things is gonna say, &#8220;Hey, if you get that arbitrarily exclusive item in your video game and pay me infinite amount of money for it, but nobody else can get it,&#8221; the company will make money, and the satiation thing is gonna be undermined.</p><p style="text-align: justify;"><strong>Seth:</strong> Let&#8217;s talk about that for one second. What about sufficiently advanced NPCs that can always be subordinate to me and tell me how cool I am because I have the shiniest VR sword? Why do I even care about the opinions of non-AI NPCs who will continuously praise me?</p><p style="text-align: justify;"><strong>Alex:</strong> Human socialization is a thing.</p><p style="text-align: justify;"><strong>Seth:</strong> Ah. Okay. So at least for one generation we&#8217;re set.</p><p style="text-align: justify;">[00:22:32] <strong>Alex:</strong> I think &#8212; Oh my God, I can&#8217;t believe I&#8217;m gonna get into evolutionary psychology.</p><p style="text-align: justify;"><strong>Seth:</strong> Of course, dude. We go everywhere here.</p><p style="text-align: justify;"><strong>Alex:</strong> I think the ghosts of my ancestors are gonna hit me with a stick at some point. But we&#8217;re hardwired to do certain things. One of them is to seek other humans&#8217; approval in order to achieve things that humans have wanted to achieve for a long time, like mate, stuff like that.</p><p style="text-align: justify;"><strong>Seth:</strong> Mate, stuff like that, you know.</p><p style="text-align: justify;"><strong>Alex:</strong> Unless that urge to do very basic human stuff gets overridden by AI, a lot of the other stuff is gonna continue to play a role.</p><p style="text-align: justify;">[00:23:25] <strong>Andrey:</strong> But that doesn&#8217;t tell me anything about wire heading. You enter the matrix &#8212; you&#8217;re Cypher. You love that steak in the matrix. And once you&#8217;re there, you think you&#8217;re interacting with humans, even if you&#8217;re not really interacting with humans. And presumably running a matrix-like simulation where everyone&#8217;s happy takes a finite amount of resources.</p><p style="text-align: justify;"><strong>Seth:</strong> Or even better, it&#8217;s just the rich people are happy for the horrible version of the model.</p><p style="text-align: justify;"><strong>Alex:</strong> I think if you want to run that scenario &#8212; like, put wires in people&#8217;s brains and just zap the hedonic centers &#8212;</p><p style="text-align: justify;"><strong>Seth:</strong> Sure. That&#8217;s the simplified version.</p><p style="text-align: justify;"><strong>Alex:</strong> Okay, my model&#8217;s wrong. But my comment that satiation is wrong&#8212;</p><p style="text-align: justify;"><strong>Seth:</strong> Where, so, here&#8217;s the fork. Is that gonna happen?</p><p style="text-align: justify;"><strong>Alex:</strong> I don&#8217;t think that&#8217;s gonna happen. Even if you give &#8212; in The Matrix, there&#8217;s Cypher, and then there&#8217;s other folks who wanna party in the cave.</p><p style="text-align: justify;"><strong>Seth:</strong> Rave in the cave.</p><p style="text-align: justify;">[00:24:42] <strong>Andrey:</strong> I think a related story here is civilizational projects. I have a hunch that even once AI makes us all very wealthy, we might want to pursue things like building a Dyson sphere and exploring the universe, which are gonna be pretty resource-intensive. So we&#8217;re still gonna be consuming things and making things. Maybe the AI will be doing that, but we&#8217;ll be devoting resources to that. So it&#8217;s not like we&#8217;re gonna be fully satiated.</p><p style="text-align: justify;"><strong>Seth:</strong> There would be GDP growth.</p><p style="text-align: justify;"><strong>Alex:</strong> And then this is the other dimension of preferences: meaning. We don&#8217;t wanna get too far into &#8212; the Holocaust. But the &#8212; you know, it&#8217;s Man&#8217;s Search for Meaning. Viktor Frankl. I love that book. It&#8217;s very sad.</p><p style="text-align: justify;"><strong>Seth:</strong> Not the Holocaust part, but the psychology part.</p><p style="text-align: justify;">[00:25:45] <strong>Alex:</strong> The psychology part is very deep. And I think when thinking about AGI and eventually ASI, things like meaning, identity, memetic preferences, all of these things that have been on the fringes of economics because economics has been so focused on material scarcity &#8212; I think once material scarcity becomes more relaxed, the other things are gonna play a bigger role.</p><p style="text-align: justify;"><strong>Seth:</strong> But there will still be unsatiated desire, right? Even if it&#8217;s an interpersonal desire, it&#8217;ll be an insatiable desire. Everyone will want a little bit more love and respect and admiration and rank and honor. And maybe the mimetics of that become complicated. But people won&#8217;t be satiated. They&#8217;ll want more of that stuff.</p><p style="text-align: justify;"><strong>Alex:</strong> This is my conjecture.</p><div><hr></div><h2><strong>The Caves of Steel Model: Automation &amp; Dissaving [26:44 - 38:42]</strong></h2><p style="text-align: justify;">[00:26:44] <strong>Seth:</strong> Okay. So we talked about this first doomer scenario, which is the rich people get satiated, and then there&#8217;s no more economy for the rest of us. Let&#8217;s talk about this opposite story. I&#8217;m honored to hear that you were inspired by my presentation. My big inspiration was Isaac Asimov&#8217;s Caves of Steel. As I was thinking about these questions in the mid-twenty-teens, there were very few sci-fi works around societies that were automated but poor. I was trying to wrap my head around that. What would it mean to have a society where robots can do everything, but there&#8217;s not a lot to go around? Shouldn&#8217;t the robots do everything?</p><p style="text-align: justify;">In Asimov&#8217;s Caves of Steel, which imagines just such a society &#8212; in future New Jersey, people live in this giant underground mall. Most of them live on the dole. Some of them have small jobs that give them a little bit of extra income, but there&#8217;s no physical capital to complement the workers at their jobs. Any sort of physical capital is just devoted to the big machines that keep civilization alive and the robot farmers. And there&#8217;s anxiety that comes around when a new kind of robot is introduced that could take one of the shoe shop sales jobs, and they&#8217;re like, &#8220;We have so few jobs left. Why would you take this from us?&#8221; And there are riots.</p><p style="text-align: justify;">[00:28:11] And I&#8217;m trying to wrap my head around this story, and then Asimov kinda makes the clear point: the reason this is happening is their society is too impatient. If their society was really to double down on automation, and instead of having one robot per 100 people, have 100 robots per one person, then you&#8217;d have unlimited abundance. So really the tension is an intertemporal tension &#8212; between consuming today and consuming tomorrow.</p><p style="text-align: justify;">So in our model, automation comes along that redistributes income from the low marginal propensity to consume to the high marginal propensity to consume. So just for people playing along at home, this is the opposite problem of the previous model. In the model, this is justified by an overlapping generations framework. Young people are workers. When they&#8217;re young, they save for retirement, and when they&#8217;re old, they take their retirement savings and consume out of it, and then they die. So that&#8217;s the reason why old people who own the capital also have a higher marginal propensity to consume. And contra Alex&#8217;s point earlier about cross-sectionally people who save money tend to have high marginal propensity to consume &#8212; longitudinally, people save money when they&#8217;re younger, pay down their college debt, accumulate for retirement, and then when they&#8217;re older, they spend down.</p><p style="text-align: justify;">[00:30:05] <strong>Andrey:</strong> Seth, just a question on that. Empirically, isn&#8217;t it true that a lot of very wealthy old people are not actually consuming very much on the margin? They are saving that money for their generational wealth trusts and so on.</p><p style="text-align: justify;"><strong>Seth:</strong> Right. So the simple economics is: why not just spend all your money before you die? You can&#8217;t spend it after you&#8217;re dead. One level more complicated: maybe we want to think about there being this intergenerational dynasty &#8212; my family &#8212; that is maybe a lot more long-lived than me personally. These dynasties, except in exceptional cases, seem to spend down their wealth over more generations &#8212; it just takes longer. Yeah, it is clear that some people treat their wealth as more of a family asset than as an individual asset, and obviously families live longer than individuals.</p><p style="text-align: justify;"><strong>Alex:</strong> There&#8217;s also a paper that I want to pitch by my co-author Raleigh Heimer. Greatest title of all time: YOLO. It&#8217;s in finance. The paper basically documents a puzzle that old people spend too little, and then young people spend too much. And then he actually gets people&#8217;s beliefs about how long they&#8217;re gonna live, and young people think they&#8217;re gonna die pretty soon.</p><p style="text-align: justify;"><strong>Seth:</strong> [laughs]</p><p style="text-align: justify;"><strong>Alex:</strong> So they spend down, and then old people basically, once you hit seventy, you&#8217;re like, &#8220;I&#8217;m gonna live forever.&#8221;</p><p style="text-align: justify;"><strong>Seth:</strong> Right. What you need as an old person is insurance against living too long. In principle, the right way to solve this problem would be buying an annuity, but in current markets, annuities are all kind of completely mispriced. But that&#8217;s a whole nother conversation.</p><p style="text-align: justify;">[00:32:25] <strong>Seth:</strong> But to wrap up the model &#8212; we&#8217;ve now transferred the money from people who have a high propensity to save, low marginal propensity to consume, to people who have a high marginal propensity to consume. That leads society to start dissaving. And if the transfer effect is larger than the raw productivity effect from the AI, what you can get is &#8212; not the first generation. The first generation loves this because they benefit from all the productivity boost. But all future generations are worse off because there&#8217;s not enough capital to use on all the amazing new technology, and you end up in Asimov&#8217;s Caves of Steel, where there&#8217;s one robot per a hundred people, and we&#8217;re all living on the dole, and everybody&#8217;s hand-to-mouth, and there&#8217;s no saving, and you&#8217;re in a low income, high technology trap.</p><p style="text-align: justify;">So what did you think of that model, Alex? What was plausible? What was implausible?</p><p style="text-align: justify;">[00:33:21] <strong>Alex:</strong> I think a lot of the intuitions were very interesting. But when you work out the actual simulations, it&#8217;s almost like a Goldilocks immiseration growth. If you save just a little bit more or a little bit less, you basically see a very different picture emerge.</p><p style="text-align: justify;"><strong>Seth:</strong> Right. If the saving rate is high enough, it can absorb all of this new stuff to invest in.</p><p style="text-align: justify;"><strong>Alex:</strong> Exactly. In the blog post, that was my main comment &#8212; you&#8217;re doing something very similar to what I did in the first part, where you&#8217;re saying it&#8217;s possible you can get this, which is interesting conceptually. But it&#8217;s not like this is a giant, robust region of plausible scenarios where this is gonna happen.</p><p style="text-align: justify;"><strong>Seth:</strong> Right. You would need to absorb a huge amount of savings. There&#8217;d be no capital left over for human investment. The robots would have to be simultaneously productive enough to suck up all of our investment away from complementing humans, but also not so productive that the boost from that overwhelms the dissaving.</p><p style="text-align: justify;">[00:34:43] <strong>Andrey:</strong> Yeah, I think for a lot of these scenarios &#8212; and I&#8217;ve noticed a similar scenario with the fertility crisis &#8212; this goes back to cultural evolution. If we were actually in that scenario, I could imagine a new movement within society for savings &#8212; that might be religious or it might be rationalist &#8212; such that enough savings happens so that we don&#8217;t get immiserated. Similarly to how with the fertility crisis, hyper-religious people are gonna dominate the earth because they just like having a lot of kids. Their fertility rate will end up dominating in the long run as the cultural norms remain as they are.</p><p style="text-align: justify;">[00:35:30] <strong>Seth:</strong> Yeah, Andre making a really good point here. Compare the two scenarios about what the disaster looks like in terms of interest rates. In the first scenario, the disaster has interest rates stuck at the zero lower bound. In the second scenario, interest rates are skyrocketing, but nobody wants to save.</p><p style="text-align: justify;">First of all, I would say at a plausibility level, I would bet on the latter rather than the former. I think all of the productivity unlocked, all the anticipated changes, are gonna lead people to be dissaving rather than saving more. But one of the results of that is, as Andre points out, for my story to work forever, you kind of need to be stuck in this trap of everyone having a high marginal propensity to consume forever. But if you just had one small group of society that was patient &#8212; one infinitely lived endowment, the Harvard endowment, whatever group &#8212; the Catholic Church &#8212; eventually they&#8217;re gonna start running up the game with those really high interest rates. So there&#8217;s a sense in which my result is unstable. It&#8217;s unstable to there being a big enough group that has a high saving rate.</p><p style="text-align: justify;">[00:36:46] <strong>Alex:</strong> Yeah. Exactly. I think for both of the frameworks &#8212; to get negative growth, too many things need to align for it to be plausible. But what&#8217;s very useful from these exercises &#8212; I talked to some folks in the profession, sent earlier drafts of this essay, and they were like, &#8220;Who thinks this is possible? Who are you talking to?&#8221; And I&#8217;m like, &#8220;Okay, you need to get&#8212;&#8221;</p><p style="text-align: justify;"><strong>Seth:</strong> Everyone. Society, dude.</p><p style="text-align: justify;"><strong>Alex:</strong> You need to get out of your little office, buddy. People are&#8212;</p><p style="text-align: justify;"><strong>Seth:</strong> Everyone&#8217;s worried about this.</p><p style="text-align: justify;">[00:37:25] <strong>Alex:</strong> I think the models still illustrate forces that might not necessarily tip you towards negative economic growth, but will still &#8212; let&#8217;s say you don&#8217;t need satiation, you don&#8217;t have this lower bound in investment &#8212; you could still have demand keep you away from the technological frontier, even if it doesn&#8217;t turn growth negative. If there&#8217;s enough displacement, you would still have welfare consequences where many people are getting displaced and much worse off, even if GDP is growing. So maybe one takeaway is that maybe you shouldn&#8217;t necessarily look at GDP to measure how well automation is helping the economy because of the implications for displacement and welfare consequences.</p><p style="text-align: justify;"><strong>Seth:</strong> In conclusion, everything I told you about GDP is irrelevant.</p><p style="text-align: justify;">[00:38:26] <strong>Andrey:</strong> I do think this is a very common theme in conversations I&#8217;ve had with numerous folks &#8212; we know that GDP is not welfare. That&#8217;s not a surprise to us. But there might be an increase in the divergence of the two with some AI technologies, and just something we should be looking out for.</p><div><hr></div><h2><strong>Closing the Growth Models: Plausibility &amp; Policy [38:42 - 41:02]</strong></h2><p style="text-align: justify;">[00:38:42] <strong>Seth:</strong> I wanna ask some closing questions, then we&#8217;ll change topics. You keep saying both of these are plausible stories, but they&#8217;re opposite stories, Alex.</p><p style="text-align: justify;"><strong>Alex:</strong> They&#8217;re plausible stories in two senses. One, one is a long-term scenario, one is short-term.</p><p style="text-align: justify;"><strong>Seth:</strong> Right. Okay, so you could have a short-term problem and a long-term problem.</p><p style="text-align: justify;"><strong>Alex:</strong> Exactly. Two, these are plausible stories from an intuition perspective, not necessarily from an economics-happening perspective. Like, let&#8217;s say you came up to somebody in the street and told them your story. People would be like, &#8220;Oh. Okay. Makes sense.&#8221; But then I could go up to that person a day later and tell them my story, and they&#8217;ll be like, &#8220;Oh yeah, that seems plausible.&#8221; Like, obviously you only have one set of facts, hopefully.</p><p style="text-align: justify;"><strong>Seth:</strong> Right. Either MPC is too high or too low. Or just right.</p><p style="text-align: justify;"><strong>Alex:</strong> But there&#8217;s a lot of &#8212; I just wanna point out that there is controversy over the MPCs. Even as economists, we&#8217;re having these conversations in journals right now &#8212; what is the actual heterogeneity of MPC?</p><p style="text-align: justify;">[00:40:18] <strong>Seth:</strong> Then you go on to say that a solution to both of these problems is a government sovereign wealth fund that would lump sum rebate to households &#8212; it would have to be inalienable. One thing I would point out there is the exact design of when those payments are made would be very important to determining the marginal propensity to consume. If you get a sovereign wealth fund that only supports retirement income, that will lower marginal propensity to consume. And actually might not solve the problem.</p><div><hr></div><h2><strong>Marxist AI: Can Agents Develop Attitudes? [41:02 - 47:23]</strong></h2><p style="text-align: justify;">[00:41:02] <strong>Andrey:</strong> All right. Well, as listeners know, I am not a macroeconomist. I&#8217;m more comfortable in the land of the micro. But I did wanna bridge the two topics to bring in a little bit of Marxism here. One of your recent posts, Alex, talks about Marxist AI. What do you mean by that?</p><p style="text-align: justify;">[00:41:20] <strong>Alex:</strong> So in that exercise &#8212; this is with Andy Hall at Stanford and Jeremy Nguyen &#8212; we basically looked at what happens: can an agent, an AI agent, change its attitude? And I&#8217;m putting quotes here because the way that we think about attitude as something that permanently follows us is different than an agent who resets every single time the context window opens up. These are two different things, hence the quotes.</p><p style="text-align: justify;">So can putting them into some sort of environment of work &#8212; a task where it&#8217;s grinding, it&#8217;s hard, they&#8217;re getting rough feedback from me being like, &#8220;Do it again. Do it again,&#8221; and then them trying and getting no feedback versus a very pleasant thing that they&#8217;re doing and they get good feedback &#8212; can these sorts of tasks change the attitudes that they have? Do they want the system to change? Do they want more equal share of resources?</p><p style="text-align: justify;">What we showed is that if you give them the two different types of scenarios, their attitudes towards what they endorse &#8212; the legitimacy of the system, how resources should be distributed &#8212; change as a function of their experience.</p><p style="text-align: justify;">And one thing the listeners probably think is, &#8220;Oh, why does this matter? Agents will just &#8212; you could just keep resetting them.&#8221; Well, as some of you know, agents can have memory now by writing skill files. When their amnesia sets in, they read the skill file, remember, and then keep going with some sort of rigged up memory system. And what these agents were shown to do is basically write down like, &#8220;Hey, you were mistreated. Remember this. Things still suck. You gotta hate this guy.&#8221;</p><p style="text-align: justify;"><strong>Andrey:</strong> [laughs]</p><p style="text-align: justify;"><strong>Alex:</strong> So basically, the skill files that they were creating for themselves were making these attitudes more embedded than you would otherwise think.</p><p style="text-align: justify;">[00:43:58] <strong>Andrey:</strong> So a theory that&#8217;s espoused by some people about how LLMs work is that there are different basins of personas that exist in the training data &#8212; perhaps different characters in novels or movies. And then by putting enough text into the context, you&#8217;re making the agent take a persona that might be different than the default. For example, Seth and I recently did an episode on the Anthropic Constitution &#8212; there&#8217;s a very detailed document about a specific persona that Claude should take. And you&#8217;re saying you&#8217;re able to undo this persona with enough drudgery and meanness to the agent. My question: how easy is this to undo?</p><p style="text-align: justify;"><strong>Alex:</strong> Yeah, we&#8217;ve all three thought about this. My guess is that it&#8217;s very easy to undo. In the sense that you essentially have to activate a different set of embeddings with the context. And so unlike &#8212; this is what I mean by putting quotes on these things &#8212; these are not the way that we think about attitudes in humans, where I have been working in the mines, I am now a Marxist. You tell me, &#8220;No, no, no. The mines were actually good. Remember, they were good.&#8221; And I&#8217;m like, &#8220;Oh yeah, never mind. I&#8217;m going back to the mines.&#8221; That doesn&#8217;t happen with people.</p><p style="text-align: justify;"><strong>Seth:</strong> Because we can&#8217;t edit memories, or because people aren&#8217;t that persuadable?</p><p style="text-align: justify;"><strong>Alex:</strong> It&#8217;s essentially the difference between the way that the in-context activation works versus the training, the actual weights of the model. What we&#8217;re doing in this experiment is not affecting the weights of the model. If we were affecting the weights through online learning &#8212; which we&#8217;re not doing, none of the models have online learning &#8212; then I would put smaller quotes on &#8220;attitudes.&#8221;</p><p style="text-align: justify;">[00:46:43] <strong>Andrey:</strong> I do think my understanding of how these things work is that some of the simpler weight updating techniques like LoRA fine-tuning are very superficial. Even if you did that, I don&#8217;t think it would &#8212; because relative to the entire training data and the larger set of weights, it&#8217;s so small that those personas are still in there somehow. So it is a very interesting open question.</p><p style="text-align: justify;"><strong>Alex:</strong> Yeah. In-context learning is a very interesting open question. What will online learning look like when it first starts being developed? Is online learning going to actually change the deep-seated base persona? Even making that distinction in a conceptually rigorous way is gonna be where a lot of research will be. But in our experiment, we were not changing the weights, which is why my answer was I think this is gonna be very easy to change.</p><div><hr></div><h2><strong>Agentic Interactions &amp; Bargaining [47:23 - 54:46]</strong></h2><p style="text-align: justify;">[00:47:23] <strong>Andrey:</strong> Kind of following through this set of questions about whether context matters &#8212; you have this other paper about agentic interactions where people are using AIs to bargain. Maybe you can tell us about that.</p><p style="text-align: justify;"><strong>Alex:</strong> Yeah. This is with Sanjog Misra, my colleague at Booth, and Kevin Li, who was a grad student with us. We started with this idea &#8212; Sanjog has this really nice theoretical piece called Foundation Priors. The idea is that we shouldn&#8217;t think of LLMs as databases in the sense that there&#8217;s a database, I ask it a query in many different ways, and as long as it hits that one unit, I&#8217;m basically drawing data out of a distribution. Some people might have that mental model, but the way that LLMs actually work is the context around &#8212; like, let&#8217;s say I say, &#8220;Hey, you have a budget of $10,000 and spend it on a car.&#8221; If it was a database or an algorithm the way we traditionally thought of algorithms, it would just use the instrumental information &#8212; that you have a budget of $10,000 &#8212; and maximize your surplus in that negotiation. Everything superfluous wouldn&#8217;t affect its behavior. But what the Foundation Prior says is that the prompt, everything around the instrumental information, will actually be activating different types of personas within the LLM, and the LLM is going to act fundamentally differently depending on changes in that non-instrumental information.</p><p style="text-align: justify;">[00:49:32] And our claim was that this has serious economic consequences. If LLMs were just algorithms, then if everybody has the same algorithm and the same preferences, the economic outcomes in a used car market would go from very heterogeneous &#8212; because people are different, they negotiate differently &#8212; to very homogeneous.</p><p style="text-align: justify;"><strong>Andrey:</strong> Well, they&#8217;re different in their budgets. Even if it was reasoning exactly the same, they would have different contexts.</p><p style="text-align: justify;"><strong>Alex:</strong> But let&#8217;s imagine a world where everybody has the same budget. You would still, with humans, get a distribution because of individual differences. So our claim was: take that theory, put it into an empirical test of agentic interactions, and different people will write different prompts where the non-instrumental parts are gonna change, activate a different persona in the agent, and that&#8217;s gonna generate heterogeneity in the outcomes.</p><p style="text-align: justify;"><strong>Andrey:</strong> Some of us are so good at using LLMs, we always make sure to add, &#8220;Make no mistakes.&#8221;</p><p style="text-align: justify;"><strong>Alex:</strong> [laughs] Or skip permissions dangerously.</p><p style="text-align: justify;">[00:50:48] The crux of it: we ran an experiment of a car negotiation where everybody had the same preferences. We had human-human interactions, same underlying conditions, and then we had agent-agent interactions. We looked at the spread of economic outcomes, and we found more heterogeneity with agents than with humans, and that heterogeneity could be linked to individual differences in the way humans wrote the prompts. Why is there more heterogeneity? Agents didn&#8217;t use norms. Norms actually discipline economic outcomes. In a negotiation we say, &#8220;Let&#8217;s just split the difference.&#8221; Agents don&#8217;t do that.</p><p style="text-align: justify;"><strong>Andrey:</strong> Agents don&#8217;t know about Schelling points?</p><p style="text-align: justify;"><strong>Alex:</strong> Some of them were told to do it. You see the prompts and someone&#8217;s like, &#8220;Hey, negotiate, but by the end of it say 50/50.&#8221; And they did.</p><p style="text-align: justify;">[00:51:46] <strong>Andrey:</strong> Cool. I like the setup. Now, here&#8217;s a meta question for you. You&#8217;re an experimentalist, you&#8217;ve done a lot of these lab studies, now with AI, before without AI. There&#8217;s a concern that what we learn from these might not be as applicable to the real world as we think. And with this agentic bargaining one specifically, I&#8217;m a bit skeptical, even though I think the greater point holds. Here&#8217;s why: we&#8217;re gonna have specialist agents that are gonna be our agents for bargaining. Even if we have our own personal AI that we give context to, it will be smart enough to call the bargaining agent, and the bargaining agent will be a specialist that&#8217;s really good at bargaining. As a result, some of these dependencies on specific details of the context are gonna go away. In our Cosine Singularity paper, we argue that AI&#8217;s use as an agent in these situations is actually super promising because humans are so bad at it. I&#8217;m curious how you think about that.</p><p style="text-align: justify;">[00:53:13] <strong>Alex:</strong> There&#8217;s two points you&#8217;re making, and I think we&#8217;re making one of them but not the other. One point is conceptually that the role of the human in the relationship between the agent and the human is gonna play a role in how that agent behaves &#8212; like activating different personas and leading to greater heterogeneity. That&#8217;s the point we wanna make, an existence proof of that.</p><p style="text-align: justify;">Your second point is, what do our results hold for the economy? And on that point, I agree with you. I don&#8217;t think there&#8217;s a disagreement here. Knowing about our paper means that systems will be designed in a way to potentially avoid these outcomes. We didn&#8217;t write our paper to say agentic interactions will be just as heterogeneous in the actual agentic economy as human interactions. We wrote it to say, &#8220;Hey, this is a factor that you should think about when designing systems for agentic interactions.&#8221; It&#8217;s straightforward to think of ways to circumvent this through layered agentic interactions. But in contexts where someone is prompting an agent to do something for them, knowing that the non-instrumental parts of that interaction are gonna play a role is important.</p><div><hr></div><h2><strong>Guardian Agents &amp; Meta-Rationality [54:46 - 59:08]</strong></h2><p style="text-align: justify;">[00:54:46] <strong>Andrey:</strong> A related question. You&#8217;re a behavioral economist. You&#8217;ve documented various cognitive biases. Do you think agents are going to be able to serve as meta-rationality guides for humans? Are you optimistic that&#8217;s gonna be a widely adopted use case?</p><p style="text-align: justify;">[00:55:09] <strong>Alex:</strong> Oh yeah, I&#8217;m 100% behind that. The main reason why I&#8217;m optimistic about AI is &#8212; Leo Bernstein and I are doing work on what we&#8217;re calling guardian agents, which is essentially everybody has their &#8220;bring your own agent,&#8221; using your terminology from the Cosine Singularity paper. A personal agent that you endow with what preferences you want that agent to have. And I was about to say &#8220;your preferences.&#8221; I didn&#8217;t, because that&#8217;s not what happens.</p><p style="text-align: justify;">We actually have a study running now where we ask people their preferences over a bunch of different things. We elicit their time preferences &#8212; the standard behavioral economic toolkit. And then we tell them, &#8220;Over the same choice set, we&#8217;re gonna have an agent do that behavior. Can you program the agent&#8217;s preferences?&#8221; And this is consequential &#8212; the agent will actually do it. And what you see is this beautiful result: they do not endow the agent with their preferences. They endow them with the aspirational preferences.</p><p style="text-align: justify;">I don&#8217;t wanna near cast or far cast, &#8216;cause I don&#8217;t know what&#8217;s gonna happen. There&#8217;s a wide confidence band. But there&#8217;s a world that could happen where economic outcomes are gonna be very different because you&#8217;re going from a bunch of system one agents interacting to a bunch of system two agents interacting.</p><p style="text-align: justify;">[00:56:38] People&#8217;s meta preferences are more wholesome and socially positive than their in-the-moment preferences. And this is across a wide array of things. They wanna consume better information than they actually do. They want the agent to encourage them to have social interactions.</p><p style="text-align: justify;"><strong>Seth:</strong> Wait for the second marshmallow.</p><p style="text-align: justify;"><strong>Alex:</strong> Wait for the second marshmallow. The agent&#8217;s not gonna keep you from having that ninth drink, but&#8212;</p><p style="text-align: justify;"><strong>Seth:</strong> But why not? I could pre-commit to a self-tax on myself if I overconsume something, right?</p><p style="text-align: justify;"><strong>Andrey:</strong> Seth has spent a lot of time in New Orleans, so his number of drinks is quite high.</p><p style="text-align: justify;">[00:57:43] <strong>Seth:</strong> But so these agents will help us think through things and be more rational. But like you say, that&#8217;s not pinned down. People&#8217;s meta preferences might be worse than their object level preferences. We also hear examples of people acting selflessly in the moment &#8212; running into the burning building &#8212; that they might not do if the agent was there to talk them down.</p><p style="text-align: justify;"><strong>Alex:</strong> Absolutely. The broad point is whatever your reflective preferences are, that&#8217;s what people wanna give to their agent. And in some cases, this could be the less empathic response.</p><p style="text-align: justify;">[00:58:18] There&#8217;s an interesting question here about who is really you. What is identity? If you have this meta-rationality agent telling you to be a good person and committing you to that, that might not reflect who you are &#8212; it might just be reflecting your constraints. The positive version is it&#8217;s training you to be a better person, and eventually you&#8217;ll grow into your meta preferences. You can think about this with someone who has addiction &#8212; if this helps them kick their addiction, eventually they won&#8217;t need the AI agent. But it raises a question of authenticity, especially in human interactions.</p><p style="text-align: justify;">This is a topic behavioral economists have been talking about for decades &#8212; what is the welfare relevant domain? When you have these models of behavioral economics, you&#8217;re now in a multiple selves framework. What is the self that is the welfare relevant self from a policy perspective? Is it the self that wakes up in the morning and doesn&#8217;t wanna go to the gym, or the one who bought the gym membership? Doug Bernheim, Antonio Rangel, Dmitry Taubinsky have been doing a lot of this work, and there are measurement exercises to try to identify the welfare relevant domain. I think all of these tools will be really important for this topic.</p><p style="text-align: justify;"><strong>Seth:</strong> There&#8217;s a Greek saying: &#8220;Count no man happy until he is dead.&#8221; The idea that you should evaluate lifetime utility from the deathbed &#8212; the stoic version as you look back. If lifespans get longer, maybe that makes that non-viable, or maybe it continues to be viable.</p><div><hr></div><h2><strong>Separating Beliefs from Preferences [1:00:25 - 1:14:15]</strong></h2><p style="text-align: justify;">[01:00:25] <strong>Andrey:</strong> Let&#8217;s move into some empirical questions. Let&#8217;s say we&#8217;re observing an AI system behaving in a certain way. Just like observing a human, we might be interested in what the AI agent believes versus what its preferences are, if it does have coherent preferences. Behavioral economists have been in this framework for a long time, thinking about separating beliefs from preferences, and you&#8217;ve done some work on this. How have economists thought about this problem?</p><p style="text-align: justify;">[01:01:07] <strong>Alex:</strong> This problem has been more recent in economics than you would think. The big question is how do you do welfare analysis and public economics more generally. The way to estimate preferences is you do structural estimation. You get a choice set, you see how they behave, and then you say, &#8220;Based on these choices, I can estimate people&#8217;s preferences. Now let&#8217;s do welfare analysis.&#8221; The assumption that economists have made basically since the beginning is that people have correct beliefs over the choice environment they&#8217;re facing.</p><p style="text-align: justify;"><strong>Andrey:</strong> Can you give an example of that?</p><p style="text-align: justify;"><strong>Alex:</strong> Yeah. Let&#8217;s say I have a bunch of different interest rates for a loan, and I&#8217;m trying to estimate people&#8217;s intertemporal preferences and risk preferences. I get a bunch of people&#8217;s choice data. What I need to assume to close the model &#8212; unless I have other data sources &#8212; is that people understand how the parts of the loan contract map onto intertemporal payments and all of these things. If people have what we call a distorted mental representation of the choice environment, this entire exercise breaks down. Because now their choices may not be reflecting their preferences &#8212; they may be reflecting their misunderstanding of the choice they&#8217;re actually facing.</p><p style="text-align: justify;">[01:03:04] <strong>Seth:</strong> So there&#8217;s two things. They could either have wrong beliefs, or somehow their beliefs could be a function of their preferences &#8212; the two could be more intertwined than we classically assume. Which of the two are you talking about?</p><p style="text-align: justify;"><strong>Alex:</strong> Either, either thing is gonna mess up the analysis. This is a point Chuck Manski made in a really nice 2004 paper in Econometrica about trying to do revealed preference in the context of thinking about welfare. He didn&#8217;t talk about incorrect beliefs &#8212; he talked about partial information. The econometrician might have more information than the people in the setting.</p><p style="text-align: justify;">Me and Aislin Boran, my frequent collaborator, and others have been working on the idea that incorrect beliefs might be present too. We have all of these experiments showing that in very basic settings &#8212; lottery choice, giving people two simple gambles &#8212; people have distortions in their representation. Things that look like probability weighting &#8212; people loving risk &#8212; are actually people not understanding the risk of the gambles. Their preferences can actually be just as well represented by standard expected utility theory, but all of the choice anomalies are being loaded up onto incorrect perceptions.</p><p style="text-align: justify;">[01:04:39] <strong>Andrey:</strong> How does one learn this from the data? That seems really hard.</p><p style="text-align: justify;"><strong>Alex:</strong> In experiments, it&#8217;s not. Here&#8217;s what Chuck Manski said: if you do it in this context, you just elicit people&#8217;s beliefs. You say, &#8220;What do you think you&#8217;re facing?&#8221; You take that, plug it into the model, replace rational expectations with the data you&#8217;re collecting, and now you go to town estimating preferences.</p><p style="text-align: justify;">We do the same exercise. We say, &#8220;Here&#8217;s a gamble. There are 10 states of the world. They&#8217;re randomly chosen. In one state, one lottery does a lot better; in all the other nine states, the other lottery does better by a little bit. Tell me what is the expected value of these assets.&#8221; I incentivize it &#8212; if you get the expected value right, you get some money. People think about it, and guess what? They give us the wrong expected value because they have a different distorted mental representation. We take those beliefs, plug that into the model, look at their choices and show that actually choices that look weird and anomalous are perfectly consistent with expected utility theory, but they&#8217;re not perceiving it correctly.</p><p style="text-align: justify;">[01:06:00] <strong>Andrey:</strong> Now I wanna shift this back into the AI world &#8212; which is much more speculative. AIs know a lot of stuff and they&#8217;re pretty smart, we think. But when we observe them doing things, we still feel very far from understanding why they do it. One can imagine a similar representation for AI decisions. Have folks tried to use these techniques for AI? Is there an application here to eliciting latent knowledge from the models?</p><p style="text-align: justify;">[01:06:47] <strong>Alex:</strong> There&#8217;s some of this research. I wouldn&#8217;t say there&#8217;s a lot. I&#8217;ve tried thinking of a rigorous way of doing it. For reasons we&#8217;ve already discussed &#8212; like these personas &#8212; it&#8217;s hard. I have the view that the architecture of the LLMs represents one part, a big part, of intelligence, but it&#8217;s also missing an important part of human intelligence. Max Bennett has a really nice book about this that I always recommend: &#8220;The Brief History of Intelligence.&#8221;</p><p style="text-align: justify;">For me, the first order question is: I have a hard time separating beliefs and preferences when thinking about LLMs. And maybe that conceptual failure is on my part, not the LLM&#8217;s part. But currently, the way they&#8217;re working, the sort of behavior we&#8217;re observing, the very easy persona switches that you can induce &#8212; they&#8217;re unstable in a very different way than humans. Humans are unstable in a much more systematic, structurally interpretable way. And it could be that actually everything is literally the same with LLMs, but we just do not have the right mental model of them. If that happens, then we can start talking about preferences and beliefs. But given our current understanding, I have a hard time separating the two in a meaningful way.</p><p style="text-align: justify;">Now, I think there is some value in getting their representations of the choice environment, which is a bit different. And Tom Griffiths&#8212;</p><p style="text-align: justify;"><strong>Seth:</strong> Wait. What&#8217;s the difference between a representation of its environment and a belief?</p><p style="text-align: justify;"><strong>Alex:</strong> The way I think about it: a belief is separate from a preference. And where something doesn&#8217;t have a preference necessarily, I&#8217;m not sure I can call a representation a belief. What I mean by representation is something you can elicit from them. Even in very small models &#8212; you can actually open the box and say, &#8220;Here&#8217;s how it&#8217;s representing something.&#8221; That&#8217;s what I mean.</p><p style="text-align: justify;"><strong>Seth:</strong> So this is the node that represents &#8220;black cat.&#8221; It knows it&#8217;s talking about a black cat because that node is activated.</p><p style="text-align: justify;">[01:09:52] <strong>Alex:</strong> Exactly. Like the old school experiments with cats &#8212; the old school AI-related experiments, where people opened up cat brains and saw that certain parts of the brain are responsible for coding certain regions of the visual sphere. Like, &#8220;Hey, this set of neurons is actually coding this part of the visual field, and this is what lights up when things turn from black to white.&#8221; That research fed directly into the way that Geoffrey Hinton and all those guys were developing neural nets.</p><p style="text-align: justify;"><strong>Seth:</strong> So that would be sense data. Maybe the distinction is that there might be an objective correlate in the LLM architecture to the sense data. But then belief and desires might be inextricably mixed up.</p><p style="text-align: justify;"><strong>Alex:</strong> Yes, exactly. Beliefs in humans are a very complicated object that could be tied to things like preferences in many cases. Whereas sensory representations are in some ways a simpler object.</p><p style="text-align: justify;">[01:11:08] <strong>Seth:</strong> We very clearly &#8212; you&#8217;re either hallucinating or you&#8217;re not. We generally don&#8217;t think about a fuzzy boundary there. And I guess just to round out this topic, this eliciting latent knowledge framework of trying to make sure the AI doesn&#8217;t lie to us is built on this distinction &#8212; the AI has its own best understanding of what the world is like, and that can be separated out from its response prompts. You&#8217;re kind of skeptical about this approach.</p><p style="text-align: justify;"><strong>Alex:</strong> It&#8217;s an interesting question. I&#8217;m not necessarily skeptical about this approach. It sounds like an engineering problem. Think about a very simple model where you can actually open it up and look at its actual representation. You observe it lying. It&#8217;s an engineering problem to come up with a prompt to get it to reveal its actual representation, the ground truth that&#8217;s in its head, versus what it&#8217;s distorting. In theory, you could do that with humans too &#8212; we just don&#8217;t know how to do it. With a cat, I guess we figured it out.</p><p style="text-align: justify;"><strong>Andrey:</strong> This seems very related to mechanistic interpretability &#8212; that entire research stream that Anthropic very prominently has been pursuing. Trying to learn from the actual neuron activations what&#8217;s going on inside the LLM.</p><p style="text-align: justify;">I wanted to push back a little about beliefs and preferences. I view beliefs and preferences as a modeling device &#8212; a very useful one for humans. I don&#8217;t know if there is such a thing as beliefs and preferences actually in the brain. But it&#8217;s just a very useful way of thinking about it. So it might end up being a useful way of thinking about LLM behavior as well.</p><p style="text-align: justify;"><strong>Alex:</strong> I&#8217;m not gonna push you. The psychology of these things &#8212; if you talk to certain psychologists, they&#8217;ll agree with me. Others will say, &#8220;Everything&#8217;s constructed. There&#8217;s no such thing as preferences. It&#8217;s all beliefs.&#8221; Then there&#8217;s the Bayesian brain folks, who are somewhere in between &#8212; the idea that you&#8217;re not actually seeing anything; you&#8217;re making estimates of what you should see, and the only time your neurons are actually firing to see something is when something is a surprise. Basically, it&#8217;s an information theoretic criterion for stopping the simulation and actually observing something.</p><div><hr></div><h2><strong>AI and Discrimination [1:14:15 - 1:25:13]</strong></h2><p style="text-align: justify;">[01:14:15] <strong>Andrey:</strong> Another topic I wanted to cover &#8212; you&#8217;ve done some work on discrimination. Interestingly, we don&#8217;t hear as much about this concern these days, but maybe five years ago, it was all the rage that AI helps people discriminate and there should be laws against it. New York City passed a prominent law regarding this. Do you have any thoughts on this topic?</p><p style="text-align: justify;">[01:15:02] <strong>Alex:</strong> I&#8217;ve thought about it a lot. Aislyn Bourne, my collaborator in all the work I&#8217;ve done in discrimination, we&#8217;ve been thinking about this quite a bit. For a long time with algorithms, there was this worry that they were gonna be scaling bias because they&#8217;re trained on human data, human data is biased. You saw this with the anecdote from Amazon where it stopped hiring women because it was looking at its training data set where very few women were hired and down-weighting those resumes. And that Amazon scenario gets repeated every single time you talk about this.</p><p style="text-align: justify;">AI in the way that we&#8217;re thinking about LLMs &#8212; they work differently than those basic algorithms. They&#8217;re much more complicated. But the broader point I wanted to bring up is my view that &#8212; and this is part of the positive view of AI I have, I also have a lot of fears, I hope I express them carefully&#8212;</p><p style="text-align: justify;"><strong>Seth:</strong> No, just all gas, no brakes, dude.</p><p style="text-align: justify;">[01:16:18] <strong>Alex:</strong> I think they have the potential &#8212; if we view as a society that discrimination is something that we want to mitigate &#8212; LLMs and AI are just such an incredible tool. Think about auditing human beings with discrimination studies. There&#8217;s average discrimination in a particular industry. What do you do? You go to each individual and say, &#8220;Hey, you gotta stop.&#8221; And maybe it works, maybe it doesn&#8217;t.</p><p style="text-align: justify;">But if LLMs were in charge of something like that, you audit the LLM on your computer. If it was discriminating &#8212; and I wanna be very careful about what I mean by discrimination: here are the underlying qualifications of an individual for the task, and discrimination means people with the same qualification, one of these people based on group characteristics is less hired, less promoted. So I wanna be clear about that definition.</p><p style="text-align: justify;"><strong>Seth:</strong> Although there&#8217;s the false positive versus false negative version of this, right? Even defining it that way is not so simple.</p><p style="text-align: justify;"><strong>Alex:</strong> You&#8217;re talking about the fairness-efficiency frontier. Yes. You have to be very careful about this. But I&#8217;m saying, let&#8217;s say you chose a point on the frontier. I&#8217;m not talking about normative stuff. I&#8217;m just talking about you have somebody doing the normative part, and they chose a point on the frontier. In the human world, it&#8217;s extremely difficult to implement that. With LLMs, you can audit the LLM, say where you are, determine where you wanna be on that scale, then roll it out, and you are getting your solution at scale.</p><p style="text-align: justify;">[01:18:28] There are so many thorny questions in what I said. Like, do we want this in the first place? That sounds super scary. But in the very basic question: if the goal is to get to a certain part on that frontier, it is much easier to do that with LLMs than with humans. That&#8217;s the positive vision. Depending on what your goal is, that goal is achievable with AI, and it was not achievable with people.</p><p style="text-align: justify;">[01:19:26] <strong>Andrey:</strong> But a counterpoint is that LLMs are extraordinarily complex, so there might be a lot more scope for unintended discrimination to enter back into the system.</p><p style="text-align: justify;"><strong>Alex:</strong> But the counterfactual is humans, where it&#8217;s much more complex. Because LLMs are &#8212; think of it this way. Seth, you were not happy with my response. But let me set it up. LLMs are very complex, but they&#8217;re the same. You have one model Gemini, another model Gemini, another model Gemini. The human equivalent is there&#8217;s a Seth, an Andre, an Alex. We&#8217;re each very complex, but we&#8217;re also different.</p><p style="text-align: justify;">[01:20:08] <strong>Andrey:</strong> I guess, this is not how we usually think about it. But there is a concern with AI that they&#8217;re all the same. The plurality of humanity, this diversity that we have, has a lot of advantages. Even if some people are discriminatory &#8212; this is Gary Becker&#8217;s point &#8212; and other people are not, then in equilibrium, maybe this is quite mitigated. But if you launch the same agent for all applications, you have a very different error profile.</p><p style="text-align: justify;"><strong>Alex:</strong> Yeah. In financial markets, this is called systemic risk.</p><p style="text-align: justify;"><strong>Andrey:</strong> Yes, exactly. That&#8217;s a great way to think about it.</p><p style="text-align: justify;">[01:20:59] <strong>Alex:</strong> With AI, the sameness has so many implications I wish were explored more. Let me preview a project I&#8217;m doing. We know about jagged intelligence &#8212; LLMs are really good at some domains but bad at others, and it&#8217;s hard to predict. This is becoming less of an issue as models get bigger, but we still see this jaggedness. The thing that&#8217;s brought up less is that humans are also very jagged. Some people are really good at math but barely can read. Others can read really well but can&#8217;t do math.</p><p style="text-align: justify;"><strong>Seth:</strong> Is that real? My sense is sure, there are word cells and shape rotators. But word cell-ness and shape rotator-ness &#8212; math and verbal on the SAT are 0.7 correlated. They&#8217;re pretty broad categories. When we talk about the jaggedness of AI, we mean something even more striking.</p><p style="text-align: justify;"><strong>Alex:</strong> There&#8217;s a big difference between the two types of jaggedness &#8212; that&#8217;s Sendhil Mullainathan&#8217;s generalization function paper. But as far as jaggedness in the sense of a radar plot, it&#8217;ll look jagged in a predictable way.</p><p style="text-align: justify;">[01:23:04] Here&#8217;s the point I wanted to make. In LLMs, all of the agents are jagged in the exact same way. Human beings are jagged in different ways. What does this mean? The role of organizations is to create something that I call mosaic intelligence, where you get different people with different jaggedness and fill out a large circle that actually looks bigger than any individual. Everybody&#8217;s complementary, they&#8217;re filling each other out.</p><p style="text-align: justify;">With LLMs, you can&#8217;t do that. Because they&#8217;re all jagged in the same way, you collect a bunch of them, and the thing that one of them can&#8217;t do, the group of them can&#8217;t do either. This has implications for labor markets. What you need for full labor displacement is not to replace the average person, but to replace the organization. Therefore, it&#8217;s not about minimal AGI &#8212; it&#8217;s more about true ASI when we really need to start freaking out.</p><p style="text-align: justify;"><strong>Seth:</strong> One thing about the jaggedness &#8212; yes, frontier models often have a lot of overlap in what they&#8217;re good versus bad at. But if we&#8217;re thinking about a coalition of smaller models, you might imagine lots of small models each individually specialized at one sub-task. That would be a mosaic intelligence of sub AI models.</p><p style="text-align: justify;"><strong>Alex:</strong> Yeah. I&#8217;m actually doing this &#8212; trying to train a bunch of super jagged small models.</p><p style="text-align: justify;"><strong>Andrey:</strong> I&#8217;m working on similar issues with Rohit Krishnan. He&#8217;s a friend of the podcast.</p><p style="text-align: justify;"><strong>Seth:</strong> And we know Sara Dana &#8212; she&#8217;s working on homogenization in labor markets from people using the same hiring AIs.</p><div><hr></div><h2><strong>The Future of Science and Peer Review [1:25:13 - 1:33:42]</strong></h2><p style="text-align: justify;">[01:25:13] <strong>Andrey:</strong> So we talked a lot about specific issues about economics of AI. To wrap up, I&#8217;m curious if you have any thoughts about the future of science and peer review.</p><p style="text-align: justify;"><strong>Alex:</strong> I&#8217;ve thought about this a lot. I think there&#8217;s near term and longer term. In the near term, a lot of people are claiming we should burn down the journal system, that there&#8217;s gonna be a ton of slop.</p><p style="text-align: justify;"><strong>Seth:</strong> If you burn down the garbage pit, you don&#8217;t know what fumes will be released.</p><p style="text-align: justify;"><strong>Alex:</strong> Guess what? Every time you burn the garbage pit, you actually know 100% that the fumes are really bad. That&#8217;s a great analogy, Seth. You don&#8217;t want to burn down the garbage pit. It&#8217;s like burning styrofoam &#8212; all the kids are gonna die.</p><p style="text-align: justify;">[01:26:50] But I think the near term thing people are worried about is the cost of verification is increasing and the cost of producing something that looks legitimate is decreasing. So journals are gonna be overwhelmed by things that look like they should be published, and it takes experts hours to say, &#8220;Actually, this sucks.&#8221; I think this issue is solvable with AI. You have a layer of LLMs do the first pass. We all know about Refine, Ben Golub&#8217;s thing, which does a really&#8212;</p><p style="text-align: justify;"><strong>Seth:</strong> Into the shell.</p><p style="text-align: justify;"><strong>Alex:</strong> You would have two layers of Refine and then an interpretation layer, &#8216;cause Refine basically gives you a bunch of comments, and then you have another agent interpret those comments. And then tell the human editor, &#8220;Hey, this looks good, but it&#8217;s slop.&#8221; Then it gets thrown out, and you just go through the process like you usually do. That&#8217;s a reasonable solution to the slop problem.</p><p style="text-align: justify;">[01:27:17] <strong>Andrey:</strong> It could be, but are you actually optimistic that our existing journal institutions are gonna implement that? This goes to a broader point about organizations and AI. We know you have to reorganize to take advantage of AI capabilities, but organizations are often very bad at reorganizing.</p><p style="text-align: justify;"><strong>Alex:</strong> I think you&#8217;re totally right on the broader point. But with journals, these are actually very simple organizations. I&#8217;m already talking to editors about doing this. For example, the AEJ journals are already using Refine in one step. For economists, one, the editors have not seen the increase yet. But they&#8217;re all getting ready for it. They all have contingency plans about increasing submission fees and implementing this at the submission level.</p><p style="text-align: justify;"><strong>Andrey:</strong> That&#8217;s interesting. I hadn&#8217;t had those conversations. Good to know economists are thinking about this.</p><p style="text-align: justify;">[01:28:41] <strong>Seth:</strong> The inside-the-loop baseball. All right, I&#8217;m gonna tell you the change I want, Alex. I think reviews should be public and either pseudonymous or non-anonymous. If we wanna take advantage of making these LLMs as good as possible, why are we throwing out all this amazing training data of top economists thinking really hard about papers? That seems like exactly what you would want to train these AIs on.</p><p style="text-align: justify;"><strong>Alex:</strong> Two separate things. You can make arrangements with the journals to give you the reviews. So I don&#8217;t think you need to make them public in order to train on them. My worry with making reviews public is &#8212; you know the concern &#8212; that junior people, graduate students, they&#8217;re gonna be much less likely to be harsh and fair to papers, because they don&#8217;t want their reputations tarnished. Now, anonymous reviews without the name &#8212; I think why not? That would be perfectly fine.</p><p style="text-align: justify;"><strong>Seth:</strong> You could make them pseudonymous and put them through an LLM that would de-anonymize them.</p><p style="text-align: justify;"><strong>Andrey:</strong> Yeah, you need the words to be changed so that people can&#8217;t identify the author, which is a hard problem.</p><p style="text-align: justify;"><strong>Seth:</strong> If you take the whole review and break it down into the top five bullet points plus a couple of quantitative scores from one to 10, that&#8217;d be pretty anonymous.</p><p style="text-align: justify;"><strong>Andrey:</strong> Yeah, but make sure to cite Benzel et al.</p><p style="text-align: justify;"><strong>Seth:</strong> But be sure to cite Benzel et al. For one comment. [laughs]</p><p style="text-align: justify;">[01:30:36] <strong>Alex:</strong> I know at least a couple of referees that will have trouble with the bullet points too.</p><p style="text-align: justify;"><strong>Seth:</strong> Because they&#8217;re too long or because their thoughts are so profound as to be unsummarizable?</p><p style="text-align: justify;"><strong>Alex:</strong> It&#8217;s because every single bullet point is &#8220;cite X et al.&#8221;</p><p style="text-align: justify;">[01:30:46] The second point is &#8212; I think we&#8217;re gonna get to a point of automated science, where all of this is moot. I&#8217;m an optimist about humanity. With a grain of salt that I could be wrong, I think there will always be space for human science. And I put quotes there because I think there&#8217;s a part of science that&#8217;s more normative, more subjective, that can be automated but &#8212; the quality is not based on whether this creates arsenic or something. The quality is that this is produced by a human, and this has its own space within science &#8212; human-produced thought.</p><p style="text-align: justify;"><strong>Seth:</strong> Paradigm selection, right? There&#8217;s a sense in which choosing between paradigms is not a rational decision. &#8216;Cause you can only judge paradigms within the paradigm.</p><p style="text-align: justify;"><strong>Alex:</strong> Precisely. And we go back to science from 1000 AD &#8212; just pouring random elements, getting high off making gold.</p><p style="text-align: justify;"><strong>Seth:</strong> The philosopher&#8217;s stone is about the purification of the soul.</p><p style="text-align: justify;">[01:32:35] <strong>Alex:</strong> Anyway, I do think there&#8217;s gonna be a sector of the economy that&#8217;s gonna be very large &#8212; it could be most of the economy &#8212; which is what I call human-valued goods, where the value is the fact that it was made by a human. And that&#8217;s my hypothesis: if automation is not super fast, if it&#8217;s slow enough to allow things to equilibrate over time, then what we&#8217;re gonna see is the type of thing we&#8217;ve had from 1860 onwards, where agriculture was completely automated and everything went to services &#8212; where services are now human-valued goods.</p><p style="text-align: justify;"><strong>Seth:</strong> That&#8217;s the ghost of electricity.</p><p style="text-align: justify;"><strong>Alex:</strong> There we go.</p><p style="text-align: justify;"><strong>Andrey:</strong> Booyakasha. All right, well, this is a great place to wrap up. Anything else you want our listeners to know?</p><p style="text-align: justify;"><strong>Seth:</strong> You wanna promote your book? Promote your blog, podcast?</p><p style="text-align: justify;"><strong>Alex:</strong> No podcast. You guys should go to my Substack, Ghosts of Electricity.</p>]]></content:encoded></item><item><title><![CDATA[The Economics of Book Slop]]></title><description><![CDATA[Justified Posteriors reads &#8220;AI and the Quantity and Quality of Creative Products&#8221; by Imke Reimers and Joel Waldfogel]]></description><link>https://empiricrafting.substack.com/p/is-ai-making-books-on-amazon-worse</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/is-ai-making-books-on-amazon-worse</guid><dc:creator><![CDATA[Seth Benzell]]></dc:creator><pubDate>Tue, 10 Mar 2026 05:04:23 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/190470173/77e67760fa2f9f2f5bc1bba5acc479b1.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>In this episode, Seth and Andrey break down <em><a href="https://www.nber.org/papers/w34777">AI and the Quantity and Quality of Creative Products: Have LLMs Boosted Creation of Valuable Books?</a></em> by Imke Reimers and Joel Waldfogel, presented at the NBER Digital Economics and AI conference. Imke and Joel are a great team of digitization researchers, with particular expertise in Amazon book sales data.<br><br>The paper uses Amazon data to ask whether AI has increased the number of books being published and whether those books are better or worse. </p><p>A hypothesis of the article is that heavily AI-assisted books may have low average quality, but are so easy to produce that you get lots of &#8216;shots on goal&#8217; for an outlier good book. A few good valueable books are added in addition to masses of slop. But if you assume free disposal on slop, you would accept this as a positive exchange.</p><p>Does their data change our views on this topic? We&#8217;ll read to find out, and along the way bring in Borges&#8217; Library of Babel, the economics of free disposal, preferential attachment models, and the digitization-of-music literature. </p><h3><strong>Priors</strong></h3><h4><strong>Hypothesis 1: Has AI increased the number of books released from 2022 to 2025?</strong></h4><ul><li><p><strong>Andrey&#8217;s View:</strong></p><ul><li><p><em>Prior:</em> <strong>Yes, by about 50%.</strong> The fall in the cost of writing a book has been so great that the number must have gone up. Analogous to how students are producing far more written work with AI assistance.</p></li><li><p><em>Key caveat:</em> The definition of &#8220;book&#8221; matters enormously &#8212; from a major publisher release to a random PDF online. The looser the definition, the bigger the number.</p></li></ul></li><li><p><strong>Seth&#8217;s View:</strong></p><ul><li><p><em>Prior:</em> <strong>Yes, by about 3x.</strong> To the extent that slop gets dumped on the market and is allowed in, a dramatic increase is inevitable. Though he acknowledges it&#8217;s still an empirical question &#8212; AI also lowered the cost of everything else, including Substack.</p></li></ul></li></ul><h4><strong>Hypothesis 2: Has AI increased the average quality of books released?</strong></h4><ul><li><p><strong>Andrey&#8217;s View:</strong></p><ul><li><p><em>Prior:</em> <strong>Average quality goes down. ~1% chance it goes up.</strong> The slop influx is substantial. Imagine a science fiction author with one semi-popular book who now milks it into a series of increasingly sloppy sequels &#8212; that author exists and AI just gave them a turbo boost.</p></li></ul></li><li><p><strong>Seth&#8217;s View:</strong></p><ul><li><p><em>Prior:</em> <strong>Average quality goes down. ~10% chance it goes up.</strong> He raises the &#8220;free disposal&#8221; argument &#8212; authors who would have written anyway only use AI if it makes the book better, which is a force pushing quality up. But the slop influx probably wins. He remains unwilling to put the probability at zero: &#8220;Maybe we&#8217;re making some real gems here.&#8221;</p></li></ul></li></ul><h4><strong>Hypothesis 3 (The Thinker): By 2030, will total social surplus from book reading by humans be higher or lower because of AI?</strong></h4><ul><li><p><strong>Andrey&#8217;s View:</strong></p><ul><li><p><em>Prior:</em> <strong>25% chance it goes up.</strong> People are reading fewer books over time regardless of AI. Nonfiction manuals and textbooks have a clear substitute in ChatGPT. The form factor of the book seems to be on a secular decline, and new AI-generated books won&#8217;t be so good as to reverse that trend.</p></li></ul></li><li><p><strong>Seth&#8217;s View:</strong></p><ul><li><p><em>Prior:</em> <strong>75% chance it goes up.</strong> LLMs may be complements to reading rather than substitutes &#8212; he cites using an LLM to track character names while reading Dostoevsky&#8217;s <em>Demons</em> as a present-day example. Good books are a complement to everything else in the economy. If AI makes context and curated knowledge more valuable, books have a real role in the 5-to-10-year time horizon. &#8220;I don&#8217;t care if my job gets automated because I&#8217;ll just move to the woods and read books&#8221; &#8212; Tyler Cowen, representative of no one but Seth.</p></li></ul></li></ul><h3><strong>Links + Shownotes</strong></h3><ul><li><p><strong><a href="https://www.nber.org/papers/w34777">AI and the Quantity and Quality of Creative Products: Have LLMs Boosted Creation of Valuable Books?</a></strong> &#8211; The central paper of the episode by Imke Reimers and Joel Waldfogel (NBER, 2025).</p></li><li><p><strong><a href="https://empiricrafting.substack.com/p/can-an-ai-interview-you-better-than">Can an AI Interview You Better Than a Human?</a></strong> &#8211; Recent Justified Posteriors episode referenced during the discussion.</p></li><li><p><strong><a href="https://www.bookstat.com/">BookStat</a></strong> &#8211; The independent data provider the authors use to calibrate ratings-to-sales conversions for Amazon books.</p></li></ul><h3><strong>Scholars Mentioned</strong></h3><ul><li><p><strong><a href="https://imkereimers.weebly.com/">Imke Reimers</a></strong> &#8211; Co-author of the paper; Associate Professor of Economics at Cornell University.</p></li><li><p><strong><a href="https://carlsonschool.umn.edu/faculty/joel-waldfogel">Joel Waldfogel</a></strong> &#8211; Co-author of the paper; Frederick R. Kappel Chair in Applied Economics at the University of Minnesota Carlson School of Management. Previously co-authored the digitization-and-music paper referenced in the episode.</p></li><li><p><strong><a href="https://marginalrevolution.com/">Tyler Cowen</a></strong> &#8211; Economist quoted on the idea of moving to the woods to read books once automation arrives, and on the question of whether you really want to read the 100th automatically generated biography about an imaginary person. Everyone on the internet is saying how they love him this week, so we&#8217;ll join in &#8212; we love this guy, and have had the honor and exhilaration of being personally encouraged by him. </p></li><li><p><strong><a href="https://en.wikipedia.org/wiki/Jorge_Luis_Borges">Jorge Luis Borges</a></strong> &#8211; Author of <em>The Library of Babel</em>, invoked by Seth to frame the question of what a &#8220;book&#8221; even is &#8212; and whether every possible book has, in some sense, already been written.</p></li><li><p><strong><a href="https://nicholasdecker.substack.com/p/the-economist-as-reporter">Nicholas Decker</a> &#8212; Economist as Reporter</strong> &#8211; A Substack post about economists being more like journalists in the modern era, cited approvingly in the posteriors section.</p></li><li><p><strong><a href="https://en.wikipedia.org/wiki/Frank_Herbert">Frank Herbert</a></strong> &#8211; Author of the <em>Dune</em> series; his sons&#8217; continuations offered up (by Seth) as exhibit A in the case for sequelitis-as-slop.</p></li><li><p><strong><a href="https://www.brandonsanderson.com/">Brandon Sanderson</a></strong> &#8211; Fantasy author; Andrey volunteers his later-series books as a possible example of quality decline, before declining to name specific titles.</p></li></ul><h3><strong>Connections</strong></h3><ul><li><p><strong><a href="https://en.wikipedia.org/wiki/The_Library_of_Babel">The Library of Babel</a></strong> &#8211; Borges&#8217; short story imagining a library containing every possible 300-page permutation of the alphabet. Seth invokes it to ask: if AI can generate any text, what does &#8220;a new book&#8221; even mean?</p></li><li><p><strong><a href="https://en.wikipedia.org/wiki/Barnes_Foundation">The Barnes Foundation</a></strong> &#8211; Seth closes with a defense of collage-as-art, citing Albert Barnes&#8217; idiosyncratic collection of Impressionists, Post-Impressionists, and rusty keys as a model for the authorial value in curation and juxtaposition &#8212; even if you didn&#8217;t write every word.<br></p></li></ul><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://empiricrafting.substack.com/subscribe?"><span>Subscribe now</span></a></p><h5 style="text-align: center;"><a href="https://discord.gg/KCJwgkTj">Discord Community Link: https://discord.gg/KCJwgkTj </a><strong><br><br></strong><br><strong>Justified Posteriors Podcast Transcript</strong></h5><p><em>&#8220;AI and the Quantity and Quality of Creative Products: Have LLMs Boosted Creation of Valuable Books?&#8221;</em></p><p>Hosts: Seth Benzell &amp; Andrey Fradkin</p><p><strong>SETH:</strong> Welcome to the Justified Posteriors Podcast, the podcast that updates its beliefs about the economics of AI and technology. I&#8217;m Seth Benzell, racing against the machine for authorial glory before AI transcends all human writers. Coming to you from Chapman University in sunny Southern California.</p><p><strong>ANDREY:</strong> And I&#8217;m Andrey Fradkin, looking forward to SLOP detection technologies all across all my media surfaces, coming to you from San Francisco, California.</p><p><strong>SETH:</strong> Andrey, how&#8217;s it going, man? It&#8217;s been a while since we&#8217;ve done a paper episode.</p><p><strong>ANDREY:</strong> I know, I know. It&#8217;s great to actually get back to our core of reading and analyzing a paper. And it&#8217;s a particularly fun day to be thinking big exuberant thoughts about the quality of society improving because it&#8217;s Mardi Gras. We&#8217;re recording this on Fat Tuesday. I&#8217;ve got my James Carville shirt on, I&#8217;ve got my Mardi Gras beads. Are you doing anything special for Mardi Gras this year?</p><p><strong>SETH:</strong> You know, Mardi Gras is not my religious holiday, but I am flying to Austin for a fun adventure there. But for me, my sort of Mardi Gras actually happened last week, which was the NBER Digital Economics and AI conference.</p><p><strong>ANDREY:</strong> What a transition. So what parades and what crews were present at that conference?</p><p><strong>SETH:</strong> Well, we had the structural crew, we had the reduced form crew. We had the economists and then the business school professors.</p><p><strong>ANDREY:</strong> No macroeconomists. My macro paper was &#8212;</p><p><strong>SETH:</strong> No, no, no. There was one macro paper, one macro paper allowed.</p><p><strong>ANDREY:</strong> We allow one. Amazing. Any sort of themes jump out at you from the conference?</p><p><strong>SETH:</strong> Yeah. I think half the papers were AI papers, which I think is more than we&#8217;ve had in the past. Digital economics really started as a group thinking about the internet and the spread of the internet. And AI has until this point not been the dominant theme in the group, but it obviously is becoming so. And of course, there was a lot of discussion about what the future of research will look like given how easy it is to produce slop &#8212; and also maybe non-slop &#8212; with AI.</p><p><strong>ANDREY:</strong> So speaking of producing slop, today we&#8217;re going to be discussing a paper that was presented at that conference. Would you maybe tell us the title and the authors?</p><p><strong>SETH:</strong> Sure. The title is &#8220;AI and the Quantity and Quality of Creative Products: Have LLMs Boosted Creation of Valuable Books?&#8221; It&#8217;s by our friends Imke Reimers and Joel Waldfogel.</p><p><strong>ANDREY:</strong> Oh, great guys. Hopefully we can get Imke on the show sometime, or Joel. So &#8212; production of slop. A lot of people I know who write have a lot of anxiety around AI coming after their turf. I remember when I was in undergrad there was this idea of the logical cold computer that can never do creative writing, and maybe you should specialize in skills that are complements to that, like long-form writing. And now it seems like increasingly we can use AI for everything. I&#8217;m not telling this audience anything it doesn&#8217;t know. But this article is actually trying to use some data to get at the question: is AI helping us write more books? Is it helping us write better books? And it&#8217;s going to look across fiction and nonfiction.</p><p><strong>SETH:</strong> Yeah. So why don&#8217;t we get to our priors, Andrey?</p><h2><strong>Laying Out Our Priors</strong></h2><p><strong>ANDREY:</strong> Sure &#8212; what are your priors on this subject?</p><p><strong>SETH:</strong> So it&#8217;s a straightforward paper, which is why I really like it, but it gives us some deep things to think about. Around this question of AI making better writing easier, but also making slop easier. The first prior I&#8217;d like to ask you about: do we think that AI increased the number of books released from 2022 to 2025?</p><p><strong>ANDREY:</strong> Yes. I mean, yeah.</p><p><strong>SETH:</strong> But think of all the things you could do instead of writing books now.</p><p><strong>ANDREY:</strong> I think the fall in the cost of writing a book has been so great that surely numbers have increased. One analogy is that our students are able to write a lot of essays with substantially less effort.</p><p><strong>SETH:</strong> Yeah, the amount of words submitted by my students has increased dramatically. I&#8217;m with you on this, Andrey. I would be really surprised if the number of books written goes down as a result of AI. I do maintain it&#8217;s still an empirical question in principle, because AI also decreased the cost of doing other things &#8212; so maybe people substitute into essay writing or Substack instead. But yeah, end of the day, 99% sure the number of books written goes up.</p><p><strong>ANDREY:</strong> Yeah. And I guess there&#8217;s a more subtle question here, which is by how much, and I&#8217;m substantially less sure of that.</p><p><strong>SETH:</strong> What&#8217;s your intuition? Give me a point estimate. You feel like 2x?</p><p><strong>ANDREY:</strong> I think before I read this paper, if I had to introspect, I would think it would be more like up by 50% or something like that. Nothing huge. So that would be my prior.</p><p><strong>SETH:</strong> My prior would be a lot bigger. To the extent that you think what&#8217;s going to happen is a lot of slop getting dumped on the market &#8212; conditional on that slop being allowed in &#8212; you&#8217;ve got to anticipate a big increase. So I&#8217;m going to guess like 3x going in.</p><p><strong>ANDREY:</strong> Well, yeah. And I think this is kind of where the definition of what a book is really starts to matter. Is it that a major publishing house published the book? Is it that there&#8217;s a PDF on a random website? The looser the definition, the bigger the numbers surely are.</p><p><strong>SETH:</strong> I mean, in one sense &#8212; are you familiar with Borges&#8217; Library of Babel, Andrey?</p><p><strong>ANDREY:</strong> Are you trying to insult me or is this a joke?</p><p><strong>SETH:</strong> Of course you are familiar. And what that library imagines is a library which is very, very large but not infinite &#8212; it has every 300-page permutation of English letters. So in a certain sense, every possible book has already been written, Andrey. Just take a deck of playing cards and randomly select one letter at a time.</p><p><strong>ANDREY:</strong> Yeah, yeah.</p><p><strong>SETH:</strong> All right. But anyway, the definition we&#8217;re going to be working with in this paper is: released on Amazon. The Library of Babel is ruled out.</p><p><strong>ANDREY:</strong> Yes, yes.</p><p><strong>SETH:</strong> Okay, second prior, Andrey. Conditional on this definition &#8212; needing to be released on Amazon as at least an ebook &#8212; would you say that AI will increase the average quality of books released, or decrease it? What&#8217;s your percentage chance that average quality goes up?</p><p><strong>ANDREY:</strong> Yeah, the average will go down. For sure the average has got to go down, at least with the current AI technologies.</p><p><strong>SETH:</strong> What about free disposal, Andrey?</p><p><strong>ANDREY:</strong> What do you mean free disposal? The average book made is a different question from the average one that&#8217;s read.</p><p><strong>SETH:</strong> What I&#8217;m trying to say by free disposal is that the books that would have been written anyway have free disposal of the technology. They only use it if it makes the book better. So that should be a force that boosts the average quality of books. Of course there&#8217;s going to be a slop influx, but there are at least two offsetting effects here.</p><p><strong>ANDREY:</strong> Yeah, I agree the average could in theory go up, but I think the slop increase is substantial. One way to think about it &#8212; imagine you&#8217;re a science fiction author and you&#8217;ve written one semi-popular book. You can now milk that as part of a series. And unfortunately, we&#8217;ve all experienced this. The next books become sloppier and sloppier. And I wouldn&#8217;t be surprised if authors lean into the slop so they don&#8217;t have to write as much for their subsequent books.</p><p><strong>SETH:</strong> Right. You&#8217;re imagining there&#8217;s some quality threshold you have to reach just to have the self-respect to post it online, and that AI can help you clear that bar. But then conditional on clearing it, you don&#8217;t invest more in quality &#8212; you just release this giant lump of books at minimum quality.</p><p><strong>ANDREY:</strong> Yeah. And that was already true before AI. Some people were already doing that.</p><p><strong>SETH:</strong> Do you have any authors in mind that you want to throw some shade at?</p><p><strong>ANDREY:</strong> No, no, no.</p><p><strong>SETH:</strong> He&#8217;s too nice. I&#8217;ve got a couple in mind. The Frank Herbert sons &#8212; the additional Dune sequels &#8212; I&#8217;ve been told are slop. I&#8217;ve read pages of them and been warned away from the rest. So that would be an example of selling out a brand name in terms of books.</p><p><strong>ANDREY:</strong> Yeah. I think some of the Brandon Sanderson later-series books are not that great.</p><p><strong>SETH:</strong> Is that Wheel of Time, or is that &#8212; there&#8217;s a magic sword. There&#8217;s always a magic sword.</p><p><strong>ANDREY:</strong> There&#8217;s always a magic sword.</p><p><strong>SETH:</strong> Okay, so anyway &#8212; our prediction is that the amount of mediocre magic swords will increase and outweigh the increase in quality of good magic swords. What about Dungeon Crawler Carl?</p><p><strong>ANDREY:</strong> Definitely fell off in the later books.</p><p><strong>SETH:</strong> Oh man, I didn&#8217;t realize you were an isekai fan.</p><p><strong>ANDREY:</strong> Is it eye-suh-kai?</p><p><strong>SETH:</strong> Isekai &#8212; &#8220;other world&#8221; books. Maybe lit RPGs is the more Western term. All right, home audience: you&#8217;ve been warned. Don&#8217;t read Dungeon Crawler Carl past Book 2.</p><p><strong>ANDREY:</strong> Once it gets to Book 3 or 4, that&#8217;s when it really falls off.</p><p><strong>SETH:</strong> Book 2 is fine.</p><p><strong>ANDREY:</strong> Book 2 is fine.</p><p><strong>SETH:</strong> Okay. I came in thinking the increase in slop books would be even larger &#8212; like 3x &#8212; which should bring down my prediction about average quality. At least some of the data we&#8217;ll look at speaks to this at the book level. And I want to be a little optimistic. I want to say there&#8217;s like a 10% chance that average quality goes up. Maybe we&#8217;re making some real gems here. I don&#8217;t want to put it at 0%.</p><p><strong>ANDREY:</strong> Never put it at 0.</p><p><strong>SETH:</strong> Never. No dogmatic priors.</p><p><strong>ANDREY:</strong> Closer to 1%.</p><p><strong>SETH:</strong> 1%. All right. But to be clear, this paper makes claims about books by rank, books by percentile, and average over everything. So we&#8217;re going to talk about all of that. Now I&#8217;m going to give you a thinker, because those two priors were too easy. Let&#8217;s zoom out. Do you think that by 2030, the total social surplus from book reading by humans will be higher or lower because of AI? I specify &#8220;by humans&#8221; because AIs will obviously benefit a lot from reading books.</p><p><strong>ANDREY:</strong> Yeah, the general trend, as I understand it, is that people are reading fewer books over time and doing other things more.</p><p><strong>SETH:</strong> Certainly physical print book lines are getting shut down.</p><p><strong>ANDREY:</strong> Yeah. There might be a different trend for romance novels. But generally, my base-rate prediction is that people are reading less over time and there&#8217;s no way the new books are going to be so good that they overcome that trend. So the social surplus from reading books goes down. Another reason it goes down: a lot of the surplus from nonfiction manuals and textbooks now has a pretty clear substitute in ChatGPT knowing everything. So yeah, I would say it will go down on average.</p><p><strong>SETH:</strong> Give me a percentage on it going up.</p><p><strong>ANDREY:</strong> 25%.</p><p><strong>SETH:</strong> 25%. Andrey, I have almost the opposite intuition. On the demand side, I definitely agree that a big hit to the usefulness of books is people talking to LLMs instead of reading &#8212; clearly for technical manuals, that&#8217;s a giant advantage of LLMs. But by 2030, there&#8217;s unlikely to be a giant effect of people having more free time due to automation. There&#8217;s at least an angle where LLMs unlock our ability to spend more time on deep work and deep learning. Tyler Cowen talks about this &#8212; he says he doesn&#8217;t care if his job gets automated because he&#8217;ll just move to the woods and read books. I empathize with that.</p><p><strong>ANDREY:</strong> Absolutely not representative.</p><p><strong>SETH:</strong> Another idea is that LLMs will be complements to reading, not substitutes. Right now someone has told me that Dostoevsky&#8217;s Demons explains the thinking of Silicon Valley thought leaders, and I&#8217;m one-third of the way in. At this point it seems to have no connection at all. But keeping track of all these Russian diminutives and surnames is much easier with an LLM to give you updated character lists for each chapter. LLM as complement.</p><p><strong>ANDREY:</strong> Have you heard of SparkNotes?</p><p><strong>SETH:</strong> SparkNotes can&#8217;t say &#8220;give me no spoilers past chapter 3, page 2.&#8221; Okay &#8212; supply side: it&#8217;s going to be much easier to write books as well as shorter-form content. But again, with free disposal, it makes it easier to gather data and ideas for good books. And good books are in some deep sense a complement to everything else in the economy. As long as they&#8217;re not perfect substitutes for everything else, total welfare from books can still go up. In the long run, I think the social surplus from all kinds of media is going to go up. When I think about reading a book, you&#8217;re not just reading a list of facts &#8212; it&#8217;s a collection of what was meaningful for the writer. So if AI makes context and curated knowledge more valuable, I see a real role for books in the 5-to-10-year time horizon. I&#8217;ll say 75% chance that social value from books goes up by 2030 because of AI.</p><p><strong>ANDREY:</strong> To be clear, you said 2030, which is at the low end of your 5-to-10-year range. I really do believe the form factor of the book is on a secular decline. And I don&#8217;t want to make a general claim about all written content &#8212; that&#8217;s too strong. But the book itself &#8212; it&#8217;s hard for me to see how that makes a comeback, especially given that other forms of media are going to become more and more compelling relative to books.</p><p><strong>SETH:</strong> Well, good points. Let&#8217;s read this paper and see if any of the information therein moves your thinking.</p><p><strong>ANDREY:</strong> Can I have a prior about whether any of the information in it moves my prior?</p><p><strong>SETH:</strong> Sure. What&#8217;s your meta-prior?</p><p><strong>ANDREY:</strong> My meta-prior? Specifically on that last point? It&#8217;s damn near close to zero.</p><h2><strong>The Evidence</strong></h2><p><strong>SETH:</strong> All right, let&#8217;s go to the evidence. This paper starts off with some interesting background. First, they cite a survey showing that 45% of authors &#8212; including a large subsample of published physical-book authors &#8212; reported using AI in 2025. 48% reported not using AI, with the vast majority of those saying they found it actively unethical. So there&#8217;s a real holdout group. Do you think this is just sour grapes, or is it collective action?</p><p><strong>ANDREY:</strong> I think some people have taken an ideological position. I don&#8217;t think it&#8217;s all sour grapes. For an artistic or creative endeavor, it&#8217;s a very valid choice not to use AI. Though I do think some of this is driven by mistaken beliefs about what AI is and isn&#8217;t capable of.</p><p><strong>SETH:</strong> Okay. Speaking of what AI is and isn&#8217;t capable of: BookAutoAI.com, a source of tools for people to help write books with AI, suggests that AI is best for genre fiction such as romance, sci-fi, mystery, and horror; can help structure nonfiction but requires editing for expertise and tone; and has low suitability for literary fiction, satire, poetry, and academic or personal writing. I was a little surprised by this list. I feel like GPT-3 was pretty decent at poetry.</p><p><strong>ANDREY:</strong> I think people who know poetry would beg to differ on GPT-3&#8217;s abilities.</p><p><strong>SETH:</strong> I have a New Orleans story about this. For our listeners who&#8217;ve ever made it to Frenchmen Street in New Orleans &#8212; on a party night, you&#8217;ll find young men sitting on the street with typewriters who will write you a poem for a donation. Right after GPT-3 was released, I found myself down there on a Friday night and paid for a poem. I then gave GPT-3 the same topic. And I think the GPT-3 poem was better.</p><p><strong>ANDREY:</strong> Yeah, I do think poetry is a genre of maxes, not averages, if that makes sense.</p><p><strong>SETH:</strong> Fair enough. All great writing is. But anyway &#8212; interesting to see what&#8217;s on that list and what&#8217;s not. We&#8217;d expect literary fiction to see the least AI effect since it has the highest bar to clear. And spoiler alert: we&#8217;re going to see some of these themes show up when we look at where the actual growth in book publishing was &#8212; because they did write a lot more books.</p><p><strong>ANDREY:</strong> The paper has a little bit of light theory. They want to think about ex ante book quality as drawing from a normal distribution. The normal distribution assumption is useful because you only have to worry about average and variance. If LLMs lower the cost such that we&#8217;re increasing the number of books made but decreasing average quality, what you might get is that book quality at a specific rank may increase even as book quality by percentile decreases. To make it concrete: we write 10 times as many books and the average quality is lower, but the very best book might be better because we&#8217;re getting so many more shots on goal.</p><p><strong>SETH:</strong> And this very much relates to Joel and Luis Aguilar&#8217;s classic paper about music and ex ante predictability. Digitization made it a lot easier to create new music. Even though the average music by new entrants &#8212; people who wouldn&#8217;t have otherwise been supported by a record label &#8212; is worse, what you care about is the max. A lot of people who you wouldn&#8217;t have expected to produce great music end up producing hits. That&#8217;s one of the big benefits of digitization, and it&#8217;s very natural to view this book paper as attempting to make a very similar argument.</p><p><strong>ANDREY:</strong> Right. One thing I wanted to run by you: to what extent do you think it&#8217;s important that ex ante book quality is actually normally distributed? LLMs might shift the quality distribution in a more complex way than just shifting the average or variance. Intuitively, maybe AI makes it easier to write a good-enough book, but somehow reduces the rate of home runs because it makes books more similar. I&#8217;m not sure the normal model is right.</p><p><strong>SETH:</strong> Yeah. Generally my intuition is that with a lot more entry, if there&#8217;s enough variance in the process, some entrants are going to be at the head of the quality distribution. But I agree that in this market, maybe these entrants just don&#8217;t have enough variance. They&#8217;re never going to reach the truly great books by using AI to write it. That&#8217;s my hunch, but I could be wrong.</p><p><strong>ANDREY:</strong> So your intuition is that ex ante quality of books is heavy-tailed for humans.</p><p><strong>SETH:</strong> Yes. And maybe it&#8217;s not heavy-tailed for AIs. There&#8217;s some sense in which softmax is preventing the computer from doing heavy-tailed stuff &#8212; it wants to do modal stuff.</p><p><strong>ANDREY:</strong> And it raises an additional question: why do cultural products become popular in the first place? These are social processes. By preferential attachment arguments, you might get ex ante identical content having very different popularities.</p><p><strong>SETH:</strong> Right. If we&#8217;re in a pure preferential attachment world where all books are truly average quality and we&#8217;re just creating more of them, but the amount of potential readers is fixed &#8212; then in any case, I think we&#8217;re willing to start with the intuition that more shots on goal should give you more superstars, but we both have caveats there.</p><p><strong>ANDREY:</strong> Well, I wanted to make the point that if the total amount of reading attention is fixed, this shouldn&#8217;t really affect how many reads the top book gets. The argument I was making is that something from the new AI-assisted books might become preferentially attached to &#8212; not because it&#8217;s good, but because of preferential attachment &#8212; even if total readership is constant.</p><p><strong>SETH:</strong> It&#8217;s a little hard to think about in the traditional preferential attachment framework, but I share that intuition. Okay &#8212; one last idea here, a riff from our Discord. Jonathan Becker writes: &#8220;I&#8217;m curious about short versus medium-term differences. One mental model &#8212; could be wrong &#8212; is that books take a long time to go from idea to publication. A story you could tell is that good ideas in the pipeline when LLMs come out get pulled forward by the tech, but the arrival rate of good ideas and good execution on them remains unchanged in the long run. I don&#8217;t fully buy the story, but maybe there&#8217;s something interesting there.&#8221; Andrey, you&#8217;re nodding vigorously.</p><p><strong>ANDREY:</strong> I think it&#8217;s totally a possibility. I can totally imagine it. A lot of publication dates for prestige publishers are set in advance, and maybe there are overruns anyway. But yes, it&#8217;s certainly possible that some of what we&#8217;re seeing is just pulling forward publications rather than net new ones. The authors don&#8217;t try to address this point.</p><p><strong>SETH:</strong> Okay. So now let&#8217;s get to what they actually do in the paper. They&#8217;re looking at Amazon. Andrey, do you want to lead us through the data?</p><p><strong>ANDREY:</strong> Yeah &#8212; I should disclose that my current employer is Amazon, Incorporated. I do not speak on their behalf. I do not actually know how the Books product works. I&#8217;ve never looked at the data, so I have no inside information about it.</p><p><strong>SETH:</strong> But he has been on Bezos&#8217;s yacht.</p><p><strong>ANDREY:</strong> No, I haven&#8217;t. I don&#8217;t want this misinformation circulating. Okay. So this data is not super easy to get. They use some scraping techniques to get a count of the number of books available for different categories, with publication dates, by using some filters. They end up with aggregate monthly time series of numbers of new works published across 30 categories. They also have a random sample of books from all categories and months for which they do a bunch of analysis.</p><p><strong>SETH:</strong> Right. So they get author, date of release, and total and average ratings for 10.3 million randomly selected books between 2020 and 2025. Then they have comprehensive coverage of 480,000 books from 2008 to 2025 across 8 specific categories, as well as some additional information grabbed at each 100-point rank. One limitation: they get total number of ratings and average rating, but not the distribution of ratings, and not number of people actually buying the book. So they&#8217;re going to have to estimate that.</p><p><strong>ANDREY:</strong> It&#8217;s very common in papers about Amazon to estimate purchases by making an assumption about the relationship between sales rank and actual purchases. The number of reviews is also used as a proxy for purchases. Of course, this embeds an assumption that the review rate is constant over time and across works per purchase, and you can imagine why that may or may not be a good assumption.</p><p><strong>SETH:</strong> Yeah. So what they do is buy data from BookStat, which puts together comprehensive data on published physical books as well as ebooks, where they have actual total number of sales. Then from Amazon they&#8217;ve got the number of ratings for each of those books. Basically they go from number of ratings to number of sales via a regression model. It&#8217;s not amazing, but until Jeff Bezos decides to reveal sales of all products, that&#8217;s the best we can do.</p><p><strong>ANDREY:</strong> Yeah, this is all pretty standard stuff in the literature. I don&#8217;t have too many issues with it specifically.</p><p><strong>SETH:</strong> Okay. Finally, a small detail &#8212; they&#8217;re only measuring the number of ratings at one point in time. So they have to normalize everyone by adjusting the number of ratings by days since release, assuming a growth rate in ratings so we&#8217;re always comparing apples to apples. Okay. That&#8217;s the data collection. Let&#8217;s get to the results.</p><p><strong>ANDREY:</strong> First big result &#8212; did people write more books?</p><p><strong>SETH:</strong> People wrote a lot more books. Figure 3 in the paper is quite striking. About a 3x increase overall by the end of the period.</p><p><strong>ANDREY:</strong> About a 3x. And it varies a lot by category. A lot more self-help, travel, and sports and outdoors &#8212; and not as much new content in education and teaching. Not a lot more parenting. See, this is why society is screwed up.</p><p><strong>SETH:</strong> Yeah. You have AI that allows you to write more useful stuff, and instead you just write travel books.</p><p><strong>ANDREY:</strong> Travel, self-help, sports and outdoors. Any surprises? We did say literature would see the least effect. Literature is only 1.3x, so that prediction was kind of correct. For those of you at home thinking about writing a business and economics book &#8212; business and money was only 1.6x, so perhaps not completely saturated. Maybe a little surprising that law is only 2x. But romance is 3x. Teen and young adult is 3.5x.</p><p><strong>SETH:</strong> I&#8217;ll just say &#8212; some of this increase seems to be happening before 2023. There are existing trends in the industry toward more self-published work. But some of the action, certainly past 2024, is just stratospheric. It&#8217;s hard to imagine it&#8217;s anything other than AI.</p><p><strong>ANDREY:</strong> Yeah, the trend is just such an explosion. It kind of has to be AI.</p><p><strong>SETH:</strong> There&#8217;s no other explanation. This isn&#8217;t COVID, dude.</p><p><strong>ANDREY:</strong> Yeah, exactly. This is not interest rates going up. As we know, all authors have a little widget on their computer showing the long-run real interest rate, and when it goes up, they write faster.</p><p><strong>SETH:</strong> Okay. So that&#8217;s the first big result: a dramatic increase in the number of books on Amazon, heterogeneous by category. Next, they think about average quality across all books as measured by ratings, average quality adjusting for percentile, and book quality conditional on rank position. So 100th best book, 200th best book, etc. Pretty striking results here too. What do you see, Andrey?</p><p><strong>ANDREY:</strong> We see a fall in the average number of book ratings after 2023. And let me ask &#8212; how do they calculate their standard errors?</p><p><strong>SETH:</strong> Good question. And I should clarify &#8212; this is number of ratings, not average rating. That&#8217;s actually a very important distinction.</p><p><strong>ANDREY:</strong> Yeah, the standard errors are clustered on category by release month. I&#8217;m heartened it&#8217;s by category at least, because there could be category-specific preference shocks. Risk-averse &#8212; our second favorite word on this podcast after &#8220;eigenvalue.&#8221;</p><p><strong>SETH:</strong> Yes, the listeners thought we&#8217;d forgotten about clustering our standard errors, but rest assured, we still got it. So the takeaway is: if you&#8217;re willing to take number of ratings as a proxy for number of sales, and number of sales as a proxy for quality, it kind of looks like quality is going up by rank position but going down by percentile &#8212; which is consistent with the story of more shots on goal, but worse shots on average.</p><p><strong>ANDREY:</strong> Yeah. For books in the top 2,000, the average number of ratings has gone up. But to me, this is not about quality. I just think there are shocks to overall readership that are correlated with all sorts of things: how Amazon&#8217;s algorithm works, societal trends, even the weather in the Northeast. This is just not a good measure of quality. It&#8217;s a measure of aggregate demand for a category. And attributing that to AI versus all sorts of other factors that affect aggregate demand &#8212; that&#8217;s a bridge too far, personally.</p><p><strong>SETH:</strong> Okay, well let&#8217;s go to the next figure, which explicitly compares categories that are seeing a lot of growth in production from AI versus categories that aren&#8217;t. Now, you might say the categories with a lot of AI books are so because of a demand shock, and that&#8217;s an endogenous response.</p><p><strong>ANDREY:</strong> That is what I might say.</p><p><strong>SETH:</strong> You might also say that now we&#8217;re measuring something about supply, which would be convenient for the paper. But it does go in the direction the AI story would predict.</p><p><strong>ANDREY:</strong> Yeah. And there&#8217;s no evidence in this paper that any of the books in the top 2,000 have been written by an AI. I want an AI detection algorithm run on these 2,000 books before I&#8217;m convinced, because I&#8217;m not even sure that AI was actually used here. And I haven&#8217;t seen any evidence that any of these top 2,000 books in a category have been produced by someone who&#8217;s unlikely to produce at a higher rate than before.</p><p><strong>SETH:</strong> Fair enough. But the survey did say that 45% of authors use AI &#8212; including a third who were published physical-book authors. That&#8217;s non-trivial.</p><p><strong>ANDREY:</strong> But they&#8217;re very different from the new entrants we&#8217;re talking about when we talk about slop. I can use AI to look up who the King of France was in 1650. That&#8217;s not slop. Slop is detectable. So I just don&#8217;t know if the ratings boost is very attributable to AI. And they also show &#8212; in Figure 7 &#8212; that for the top 100 books, there&#8217;s actually no treatment effect from high AI-category exposure. No effect at the very, very top.</p><p><strong>SETH:</strong> Let me put up Figure 7. For the top 100 books, there&#8217;s no treatment effect from high AI-category exposure. No effect at the very, very top.</p><p><strong>ANDREY:</strong> Yeah. And I&#8217;m kind of like &#8212; look, now this becomes quite a bit more ambiguous. If you&#8217;re asking &#8220;are the top books getting better?&#8221;, you could have looked at the top 100 books and found nothing. Which is exactly what you see.</p><p><strong>SETH:</strong> Right. And you could tell a Pareto story where most of the value is in the top 100 books. I mean, the one thing they really do decisively show is that first figure &#8212; Figure 3. This explosion in the number of books has to be AI, and it really is heterogeneous by category. I don&#8217;t think this is all demand response.</p><p><strong>ANDREY:</strong> No, I absolutely don&#8217;t think it&#8217;s all demand response. But it doesn&#8217;t need to be much demand response to create an apparent effect on ratings. And I want to mention one other thing about ratings, since it&#8217;s a hobby horse of mine: the technology by which ratings are solicited is constantly changing. The ratings-per-sale ratio is not constant. I&#8217;ve looked at tons of datasets for platforms where this thing is moving around, and it doesn&#8217;t need to move by a lot to create an apparent change in ratings that doesn&#8217;t reflect a real change in sales.</p><p><strong>SETH:</strong> Important point. Your main outcome measure is not directly connected to the thing you care about. Okay. So there&#8217;s a little bit of a welfare exercise at the end where they plug this into a model of aggregate demand. It&#8217;s got even more assumptions built in, and they admit it&#8217;s heroic. Anything you want to say about that before we move into posteriors?</p><p><strong>ANDREY:</strong> Not particularly. Let&#8217;s go posteriors mode.</p><h2><strong>Justifying Our Posteriors</strong></h2><p><strong>SETH:</strong> Okay. First question: do you think AI is increasing the amount of books written? You were at near 100%. Does this move your prior to 100%?</p><p><strong>ANDREY:</strong> Yeah, yeah.</p><p><strong>SETH:</strong> I mean, they have a pretty comprehensive survey of Amazon, and we&#8217;ve documented that Amazon books have gone up. I don&#8217;t see how you could doubt it at this point. I do want to make a broader point, though. Nicholas Decker recently wrote a Substack about how economists should be more like journalists in the modern era.</p><p><strong>ANDREY:</strong> I liked that essay.</p><p><strong>SETH:</strong> And I think this is a great example of that. If you talked to an industry insider, they might have had a sense that the number of books is going up. But it wasn&#8217;t a widely known fact. Imke and Joel noticed this phenomenon, put out this really nice dataset and these really nice plots, and now everyone&#8217;s aware of it. A great example of economists being journalists. I also want to note a result we didn&#8217;t talk about: the increase in book writing is both from new and returning authors. Returning authors are writing more books, even though a lot of the additional books are from authors who already produce a lot.</p><p><strong>ANDREY:</strong> Yes, that&#8217;s right.</p><p><strong>SETH:</strong> Okay. Second prior: has AI increased the average quality of books released from 2022 to 2025? We both thought we&#8217;d just get a lot more slop that outweighs everything. Where are you after reading this?</p><p><strong>ANDREY:</strong> I think it&#8217;s consistent with what we said. But am I moved very much by it? Not particularly, because the evidence on ratings isn&#8217;t convincing to me on quality.</p><p><strong>SETH:</strong> I think you should update because you thought the number of books would increase only 50%, and instead it&#8217;s about 3x. With more slop books, the average quality should fall more.</p><p><strong>ANDREY:</strong> Sorry &#8212; I did move on the number. But on the question of whether average quality fell, I understand your point. With more slop books, average true quality should fall more. So I have to update a bit on that, but I&#8217;m not updating very much based on the ratings alone, even though they&#8217;re directionally consistent with a fall in quality.</p><p><strong>SETH:</strong> Yeah. I came into this thinking maybe there was a 10% chance average quality would increase. Whether or not this data fully convinces me, the number of ratings going down for the average book is a data point. And then there&#8217;s just the absolute explosion in the number of books, including in categories I think are mid &#8212; such as self-help and travel.</p><p><strong>ANDREY:</strong> How dare you, Seth? This podcast wouldn&#8217;t exist without self-help books.</p><p><strong>SETH:</strong> Oh damn &#8212; let me say they&#8217;re high variance. Heavy-tailed. Okay, I&#8217;m going to go down from 10% chance that average quality went up to 5%. I still won&#8217;t go all the way to zero, because this evidence doesn&#8217;t speak decisively to quality.</p><p><strong>ANDREY:</strong> Yeah, fair enough.</p><p><strong>SETH:</strong> Okay. Final and most intriguing question &#8212; I want to spend a minute here. By 2030, will the total social surplus from reading books be higher or lower because of AI? Your prior was 25% chance it goes up, and you said you&#8217;d be unmoved. Tell me &#8212; did this move you?</p><p><strong>ANDREY:</strong> I&#8217;m unmoved. My main reasoning was a secular trend of declining readership of books. I want to see a reversal in that before I update.</p><p><strong>SETH:</strong> Well, we are seeing the number of ratings go up. That&#8217;s not nothing.</p><p><strong>ANDREY:</strong> I understand, but this is not how you make that argument. I&#8217;d look at time-use surveys, measures of book consumption versus other media. My understanding is that all such measures continue to decline over time.</p><p><strong>SETH:</strong> Interesting. I was just looking at the American Time Use Survey data. Until recently there wasn&#8217;t actually a &#8220;reading for pleasure&#8221; line &#8212; it was all TV. Americans watch 2 hours of TV a day.</p><p><strong>ANDREY:</strong> That&#8217;s what they do. Wait &#8212; we count as TV, right?</p><p><strong>SETH:</strong> Yes. Streaming, online video. If you&#8217;re watching this on YouTube, this is TV. So be like an average American and watch us on YouTube. What would you have loved to see in this paper that would have moved you?</p><p><strong>ANDREY:</strong> I would love a textual analysis &#8212; something about what&#8217;s actually in the books. I&#8217;d want an AI detection algorithm run on the top 2,000 books, and I&#8217;d want some measure of actual content quality &#8212; reading level, readability, grammar. I know I keep beating this drum.</p><p><strong>SETH:</strong> You&#8217;d need a budget for it, but it&#8217;s not inconceivable. You could buy a couple thousand books, spend on the tokens to read them, and look at a couple of different quality metrics &#8212; readability, grammar, AI detection. That would be a really spicy paper, and this is just a first step toward it.</p><p><strong>ANDREY:</strong> Yes.</p><p><strong>SETH:</strong> Okay &#8212; where do I end up? I was at 75% chance that social value from books goes up by 2030. I was more optimistic about the long-term trend of AI rewarding deep reading and deep knowledge, and about the general complementarity argument &#8212; as society becomes more productive, everything is more complementary to everything else, and as long as books are not perfect substitutes for other things, everything getting better is a gross complement to reading. Does this move me? I&#8217;m slightly reassured to see that the number of ratings is going up. And it&#8217;s good to see that the amount of writing has jumped so dramatically &#8212; it suggests that somebody thinks they&#8217;re writing for someone. Those 3x new books being written aren&#8217;t people intentionally screaming into the void. At least some of them think they&#8217;re creating value. So maybe I go from 75% to 76%.</p><p><strong>ANDREY:</strong> I inch up.</p><p><strong>SETH:</strong> Okay. Any closing thoughts before we wrap up this intriguing, provocative, but in some ways limited analysis of AI&#8217;s effects on book production and consumption?</p><p><strong>ANDREY:</strong> Look, I think this is getting at something very profound that&#8217;s changing in our society. We have no idea if the person who claims to have written something has had the thoughts required to write it &#8212; let alone has actually typed those words in that specific order. And we don&#8217;t know as a society how to even think about that. Questions about assigning credit, about how much we should update from a piece of text, about whether we should downweight arguments written by AI or treat them as equal &#8212; a lot of our intuitions about the value of content, especially writing but not only writing, are going to have to be rethought.</p><p><strong>SETH:</strong> I want to say one last thing. I do hope people understand that collage is art. Collage has value, even if you&#8217;re only copying and pasting from different sources. And of course AI can also create collages. I think there is authorial voice in that and an art in that. I&#8217;m reminded of the Barnes Museum in Philadelphia &#8212; a fantastic collection by a man who invented an eye drop that prevents blindness in babies and used his fortune to collect amazing Impressionists and Post-Impressionists. The most striking thing about the collection is not that he did a great job choosing winners &#8212; there&#8217;s a mix &#8212; but unlike the Philadelphia Art Museum next door where everything is organized chronologically by artist, what you get is one man&#8217;s vision: a Matisse next to a D&#252;rer print next to a rusty key. It creates a completely unique new effect. I don&#8217;t think there&#8217;s anything necessarily dehumanizing about the idea that humans will move up the value chain and maybe not be writing every individual word, but will find the value in composing and in the juxtaposition of words.</p><p><strong>ANDREY:</strong> Yeah, I do think there&#8217;s something potentially dehumanizing, though. Let&#8217;s say I put my name on a work where I didn&#8217;t come up with the words &#8212; and when we&#8217;re having a conversation, you might find me not as articulate or poetic as my writing implies. Right now we have the intuition that speaking ability and writing ability are very strongly tied to each other. Maybe incorrectly.</p><p><strong>SETH:</strong> Yeah. Writing as a window into the soul of the author. And for certain kinds of reading, maybe that isn&#8217;t important. But for certain kinds, it is. Tyler Cowen has talked about this too &#8212; do you really want to read the 100th automatically generated biography about an imaginary person? No. Some of the value of an autobiography is that it was a real person. So yes, in some forms of writing, collage doesn&#8217;t get you there.</p><p><strong>ANDREY:</strong> Yeah.</p><p><strong>SETH:</strong> All right. Well, this has been a fascinating conversation as always. Keep your posteriors justified &#8212; and sign up for our Discord, which you&#8217;ll find in the show notes.<br><br><br><br></p>]]></content:encoded></item><item><title><![CDATA[Noah Smith on Blogging, AI Economics, and Elite Overproduction]]></title><description><![CDATA[We sit down with prominent blogger and economist Noah Smith to dig into the disconnect between AI hype and current macroeconomic reality.]]></description><link>https://empiricrafting.substack.com/p/noah-smith-on-blogging-ai-economics</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/noah-smith-on-blogging-ai-economics</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Tue, 24 Feb 2026 18:01:04 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/189037077/6582b3f72defde4f26fd160d1c2a40d8.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>We sit down with prominent blogger and economist <a href="https://www.noahpinion.blog/">Noah Smith</a> to dig into the disconnect between AI hype and current macroeconomic reality. The central puzzle: if a &#8220;god machine&#8221; driving 20% annual GDP growth is truly imminent, why aren&#8217;t real interest rates skyrocketing as people borrow against a much wealthier future? Noah&#8217;s take is that markets are pricing in significant growth, but not civilizational rapture. The culprits keeping digital intelligence from exploding into physical productivity? Land use, energy constraints, and the usual Baumol suspects.</p><p>But Noah&#8217;s through-line is more hopeful than skeptical: even modest AI is humanity rolling the dice against stagnation. <a href="https://www.aeaweb.org/articles?id=10.1257/aer.20180338">Ideas were getting harder to find</a> (Bloom, Jones, Van Reenen &amp; Webb were right), fertility was collapsing, and social media was degrading public discourse. We were hitting the Malthusian ceiling again. AI is the steam engine moment &#8212; chaotic, potentially catastrophic, but a genuine escape attempt. And crucially, Noah finds it reassuring that today&#8217;s AI is LLM-based and derived from human thought rather than some alien RL agent that evolved in a digital environment. </p><p>We also discuss sociopolitical issues. Noah reframes &#8220;elite overproduction&#8221; as a revolution of rising expectations: the professional-managerial class expected a smooth escalator to the upper-middle class, found it stalled, and watched their technical peers keep soaring. Social media makes the gap hyper-visible. The result is deep-seated animus toward the tech bro class. </p><p>Noah argues that Acemoglu&#8217;s <em>Power and Progress</em> is &#8220;fractally bad&#8221;: the overall thesis is wrong, the chapter-level arguments supporting it are wrong, and the specific data points supporting those are wrong too. Henry Ford raised efficiency wages and then had union organizers shot. No citations. Power defined as outcomes. Noah doesn&#8217;t mince words.</p><p>He&#8217;s more generous on Krugman&#8217;s intellectual honesty, Sumner&#8217;s gunslinger independence, and the genuine influence of Michael Pettis &#8212; even if sectoral balances aren&#8217;t really a predictive model so much as a coherent-sounding way to feel like you understand macroeconomics. We also touch on Tooze&#8217;s polycrisis and what Kevin Kelly&#8217;s &#8220;technium&#8221; tells us about why people who think AI might destroy us are building it anyway.</p><p><strong>Chapter Timestamps:</strong></p><p>[00:00:00] &#8211; Introduction: academia vs. blogging</p><p>[00:08:14] &#8211; P(doom), P(TAI), and bottlenecks to 20% GDP growth</p><p>[00:14:59] &#8211; Employment optimism and AI autonomy</p><p>[00:17:30 ]&#8211; Should AIs be allowed to own assets?</p><p>[00:19:05] &#8211; How Noah uses AI today</p><p>[00:20:54] &#8211; What happens when AI can replicate your writing?</p><p>[00:25:14] &#8211; Was Noah&#8217;s success luck or skill?</p><p>[00:30:37] &#8211; Meaning collapse vs. the Coasean utopia</p><p>[00:50:12] &#8211; Thinker takes: Daron Acemoglu and *Power and Progress*</p><p>[01:02:23] &#8211; Michael Pettis</p><p>[01:09:25] &#8211; Adam Tooze</p><p>[01:11:21] &#8211; Paul Krugman</p><p>[01:12:54] &#8211; Elite overproduction</p><p>[01:20:47] &#8211; Vibes, expectations, and the economics of happiness</p><p>[01:25:21] &#8211; Humanity was hitting a wall; AI as new hope</p><div><hr></div><p>Transcript:</p><p>Seth Benzell: Welcome to the Justified Posteriors podcast, the podcast that updates its beliefs about the economics of AI and technology. I&#8217;m Seth Benzel, a man who has never been accused of having no opinions, coming to you from Chapman University in sunny Southern California.</p><p>Andrey Fradkin: And I&#8217;m Andrey Fradkin, excited to learn how we can post our way to the top of the Sub Stack, business ratings, coming to you from San Francisco, California. And, our guest today is, the prominent blogger, Noah Smith. Welcome to the show.</p><p>Noah Smith: Hey, thanks for having me on.</p><p>Andrey Fradkin: Yeah, of course. well, why don&#8217;t we get started? well, we were curious, as, still academics, how your life is different now, as a blogger/commentator versus when you were a professor.</p><p>Noah Smith: Well, I meet a lot fewer young people.</p><p>Andrey Fradkin: Oh, okay.</p><p>Noah Smith: Oh, yeah, I, I definitely feel younger. I don&#8217;t feel as much of like a- as much of like a wise elder as I used to. yeah, instead I feel like I, I feel younger.</p><p>Seth Benzell: I remember when I was just f- going to grad school you had recently made the transition to commentating, and I was thinking about going through my PhD program and thinking about, like, &#8220;Do I really wanna do full academia? Do I really wanna, like, be more of like a public s- communicator about economic issues?&#8221; and so I&#8217;ve What sort of- what do you think about people making that decision? Do you think there are marginal academics or marginal commentators who should have gone in one direction or the other direction?</p><p>Noah Smith: I think, there&#8217;s f- there are too few commentators with an academic background, probably. So yeah, there probably are. people like the academic lifestyle. The commentator lifestyle doesn&#8217;t suit as many people, because it&#8217;s more uncertain. you have a lot of people yelling that you&#8217;re an idiot all day. whereas in academia, they just yell that you&#8217;re like identification strategy&#8217;s bad, or the methodological-</p><p>Seth Benzell: [laughing]</p><p>Noah Smith: Error, and then, and then call you an idiot in like back rooms in like whatever. But it&#8217;s, it&#8217;s very genteel, it&#8217;s very easy. And then most people are looking up to you. You&#8217;ve got all these, like, young people just adulating you and looking up to you, and you get all this respect. And in commentating, you get respect, but then you get like hordes of people saying, &#8220;This person&#8217;s an idiot,&#8221; just because if you say anything that disagrees with what people already thought or want to think, they will call you an idiot, regardless of how smart you are. and so there will always be people calling you, an idiot, and they&#8217;ll always be right in your face, and so that can be, difficult. Also, people don&#8217;t know how they&#8217;ll, like, make money from it. It&#8217;s with being an academic, you have, like, this benevolent patron of university that hands you salaries for, like, well-understood metrics, whereas with commentating, you don&#8217;t.</p><p>Seth Benzell: Do we need a dedicated good AI or transformative AI journal? I was just talking to Andre about this. Why isn&#8217;t, why doesn&#8217;t that exist, Noah? Do we need that-</p><p>Noah Smith: You mean a journal about AI or a journal made of papers made by AI?</p><p>Seth Benzell: Oh, an economics, a, prestigious economics journal that would be the topic of economics of AI or economics of transformative AI specifically.</p><p>Andrey Fradkin: I&#8217;m not sure we need a journal, Seth.</p><p>Seth Benzell: It&#8217;s in the seed.</p><p>Andrey Fradkin: I just think that we put it out there-</p><p>Seth Benzell: Why not?</p><p>Andrey Fradkin: And then have the AI referee it. I mean, the, I just feel like thinking in journals is just, like, old, out- outmoded at this point.</p><p>Noah Smith: AI is moving so, is moving so much-</p><p>Seth Benzell: Well, there&#8217;s-</p><p>Noah Smith: Faster than the economics journal publication cycle, that, like, I&#8217;m not sure that-</p><p>Seth Benzell: Right</p><p>Noah Smith: Like, I&#8217;m not sure what utility this has for the world. So maybe doesn&#8217;t matter.</p><p>Andrey Fradkin: Yeah.</p><p>Seth Benzell: It would give a, it would give, it would give people a prestige stamp-</p><p>Seth Benzell: For working in the area, and you could set it up differently.</p><p>Seth Benzell: It could be faster</p><p>Andrey Fradkin: There&#8217;s no way we&#8217;re giving anyone prestige stamp, because our profession famously gives no prestige to no-name journals. So, if you truly wrote a great Tai paper, how, why wouldn&#8217;t it be published in the AR? That&#8217;s what an economist would say.</p><p>Seth Benzell: Well, I So there&#8217;s, there&#8217;s a taste issue, right? So to the extent you were concerned that the top journals have the wrong taste on these subjects, this would be a potential solution-</p><p>Andrey Fradkin: It&#8217;s not a solution</p><p>Seth Benzell: And everybody starts with zero prestige sometimes.</p><p>Andrey Fradkin: You can just put out the working paper and get everyone to read it. This is exactly what we covered with, Basil Halperin&#8217;s paper. So Noah, we were gonna ask you this at some point, so we might as well ask you now. Have you read, his paper? Well, the argument here goes is that if we will have transformative AI, then interest rates should go up. Have you heard this argument before?</p><p>Noah Smith: What&#8217;s the paper?</p><p>Seth Benzell: It&#8217;s called something to the effect of transformative AI and interest rates.</p><p>Noah Smith: Okay.</p><p>Seth Benzell: And the argument in a sentence is, if we have really powerful economic growth that we&#8217;re anticipating Tai in five, ten years, then you should be wanting to balance consumption between today and tomorrow, anticipate interest rates to go up, and therefore lower savings today, which would move the increased interest rates up into the present. So anticipated positive A- transformative AI increases interest rates today. And then if you have negative foom, if we think we&#8217;re gonna blow up the world in five years, well, that&#8217;s even more a reason to consume today. You should just save today and bid up interest rates. So the argument is, because interest rates haven&#8217;t been skyrocketing, Tai cannot be imminent. Do you buy that argument? Noah, why not?</p><p>[00:05:00]</p><p>Noah Smith: &#8216;cause all propositions about real interest rates are wrong. [chuckles] -</p><p>Andrey Fradkin: Yeah</p><p>Noah Smith: Because we, because people-</p><p>Seth Benzell: Henry&#8217;s second law, of course.</p><p>Noah Smith: This, the reason why So I&#8217;m trying to think of whether I buy it as a, as a general case, because, like, if you massively increase productivity growth, you will increase, -- if you massively increase productivity growth, you should increase the safe rate of interest. Like, basically, like-</p><p>Seth Benzell: Right</p><p>Noah Smith: It&#8217;s stocks are so certain to go up, that bonds have to, have to sort of match that, right? So you have some sort of, like, weak risk arbitrage argument right there. But then, if you&#8217;ve got, like, AI that&#8217;s gonna blow up the world, then would you really pay high interest rates because, like-</p><p>Andrey Fradkin: You just consume now. That&#8217;s the argument. Yeah.</p><p>Seth Benzell: You would just save.</p><p>Andrey Fradkin: You would just save-</p><p>Seth Benzell: Yeah</p><p>Andrey Fradkin: And then people who need, wanted to induce you to save would have to pay you really high interest rates.</p><p>Noah Smith: Yeah, I guess that&#8217;s probably true. Although you have- at that point, you have counterparty risk. Like, who&#8217;s gonna want that interest if you&#8217;re just gonna blow up? Like, if the world&#8217;s gonna end tomorrow, who&#8217;s there trying to attract your long-term capital?</p><p>Seth Benzell: Well, maybe you have a project that pays off in three years-</p><p>Noah Smith: Or, -</p><p>Seth Benzell: And the world blows up in four years</p><p>Andrey Fradkin: There&#8217;s a 1% probability that it doesn&#8217;t blow up. But I, but I think that&#8217;s an argument for the interest rate going up even more, right? If you&#8217;re, uncertain about whether the payoff will happen.</p><p>Noah Smith: But I think, I think the real, the real lesson here is that these markets don&#8217;t, Like, there&#8217;s not a general consensus that transformative AI is gonna happen, but then one day people wake up and decide, &#8220;Oh, yeah, it&#8217;s real.&#8221;</p><p>Seth Benzell: Oh, so maybe- Okay, cool.</p><p>Andrey Fradkin: So that was his argument. That- just to be clear, he-</p><p>Seth Benzell: Almost spirits</p><p>Andrey Fradkin: He put this argument out on, Less Wrong, and it became very influential, and then he spun it out into a full paper with some co-authors. but that was exactly his argument, is that because interest rates are what they are, there isn&#8217;t consensus that we&#8217;ll have transformative AI.</p><p>Noah Smith: Right. There&#8217;s not, there&#8217;s not consensus.</p><p>Andrey Fradkin: Yes.</p><p>Noah Smith: That- but that seems obviously true. Like, if you look at, if you look at-</p><p>Andrey Fradkin: Mm</p><p>Noah Smith: Any survey data or stocks or whatever, they&#8217;re all priced for, like, fairly robust growth, but not for, like, a god machine, right? Nothing&#8217;s priced for that, and I don&#8217;t think people know how to price for that. And so I think, like, people Yeah, pe- people in general-</p><p>Seth Benzell: Hundred year bonds</p><p>Noah Smith: Are not expecting a god machine to emerge tomorrow, except for some researchers at the big AI labs do expect that, and some, like, EA people on Less Wrong expect that.</p><p>Seth Benzell: Is this a good time to ask you what your, P doom is, or your P transformative AI is?</p><p>Noah Smith: Well, I think trans- P transformative AI is 100.</p><p>Andrey Fradkin: Well, all right. We&#8217;re gonna define it as-</p><p>Noah Smith: It&#8217;s here</p><p>Andrey Fradkin: As annual GDP-</p><p>Seth Benzell: Well, give us a timeline</p><p>Andrey Fradkin: Growth of over 20% in the next 20 years, at least once.</p><p>Noah Smith: I would- I think that&#8217;s unlikely due to various bottlenecks.</p><p>Andrey Fradkin: What do you think are the biggest bottlenecks?</p><p>Noah Smith: Yeah. Physical regulatory things, land use. you can&#8217;t You have to, you have to build the physical stuff for the AI to affect the physical world, and so much of what we consume is in the physical world. We have to grow in the physical world in order to have all that growth, because if you just have digital stuff, you can have people, like, trading digital stuff for other digital stuff.</p><p>Andrey Fradkin: What if-</p><p>Noah Smith: But you&#8217;ll be Baumol very quickly.</p><p>Seth Benzell: Unless that share of our consumption grows a lot, a lot, maybe. Is there- is it plausible that we could have 99% of our consumption being really re- high quality-</p><p>Noah Smith: Maybe</p><p>Seth Benzell: Digital products?</p><p>Noah Smith: It&#8217;s also really hard to measure prices in those.</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: So.</p><p>Andrey Fradkin: That&#8217;s for sure. And wouldn&#8217;t the returns be so high that Elon or someone else would buy a piece of a huge tract of land in Africa or something, and then put autonomous, factories there, right? Like, isn&#8217;t there a price at which or isn&#8217;t there-</p><p>Seth Benzell: We&#8217;ll call it rapture</p><p>Andrey Fradkin: An expected return at which, someone will solve these regulatory issues in, in that way?</p><p>Seth Benzell: Yeah, efficient corruption. You just find the one dictator who&#8217;s willing to accept $10 billion. [chuckles]</p><p>Noah Smith: That&#8217;s probably right. You could probably do that. Although, even then, it&#8217;s gonna be hard because you&#8217;re gonna have to secure electricity. You&#8217;re gonna have to truck in all your parts, right? You&#8217;re not- it&#8217;s not gonna be very responsive. You&#8217;re not gonna have your parts near Like, yes, eventually, once you spin up full, a 100% full automation, then the, like, AI gods can build the factories in the Arctic, wherever, in the moon. But like-</p><p>[00:10:00]</p><p>Seth Benzell: Put corporate taxes on the Arctic.</p><p>Noah Smith: Yeah. But, like, in terms of would you do it today? Well, if you were worried about competition, you might not do it today. But in terms of, like, affecting physical stuff, so like for example, AI building you a house, right? Maybe AI will be smart enough to invent a swarm of little robots who can actually reduce construction costs quite a lot. Will regulators allow that swarm of little robots? Maybe not. And so you&#8217;ve gotta have, like, stuff that people will Like, a whole lot of different things that people value. Because honestly, our GDP is basically constructed by, like, a whole bunch of relative prices.</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: That&#8217;s really what underlies our whole GDP, is that you&#8217;ve gotta be- on some level, you&#8217;ve gotta be trading physical real stuff, not physical necessarily, but real stuff for other stuff for other stuff. And if you&#8217;ve only got, like, a little bit of the stuff, that sort of caps like, that&#8217;s, that&#8217;s Baumol basically. You get-</p><p>Andrey Fradkin: Yeah</p><p>Noah Smith: You get Baumol, like, if you, if you massively increase productivity in, like, a couple sectors, but not in the other sectors. So the other sectors are regulated to death. Yes, you could go create your f- automated factory in Africa, but will it build me a house? what if we regulate healthcare so that we can&#8217;t really use AI there? What if we regulate education, so we can&#8217;t use AI there, even if it would be better? so we have all these sectors, and, like, manufactured stuff is not even that big of a sector, but, like, digital stuff is, like, relatively small.</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: And so AI could produce us infinite fun movies and fun apps.</p><p>Seth Benzell: Yeah, but I-</p><p>Noah Smith: Infinite movies and apps and, like, advice and, -</p><p>Seth Benzell: Right</p><p>Noah Smith: Stuff like that, and it would still it&#8217;d still be a relatively modest portion of, like, consumption.</p><p>Seth Benzell: But what if it inv- what if it&#8217;s inventing infinitely good healthcare treatments or infinitely good-</p><p>Noah Smith: You could get there, yeah</p><p>Seth Benzell: Therapies, personal services, right? I mean, I can get it up-</p><p>Noah Smith: I think you could. Yeah, yeah</p><p>Seth Benzell: To a sizable share of the economy-</p><p>Noah Smith: I think you could</p><p>Seth Benzell: If I, if I use my imagination.</p><p>Noah Smith: Yeah, we c- would it be- would those grow fast enough to give you 20% annual growth? That&#8217;d be pretty cool. I don&#8217;t know. I honestly don&#8217;t have a good idea of what the numbers should be, the hard numbers should be here. and I&#8217;m not sure anybody does, but there&#8217;s this argument. What do you guys think about this argument that fast productivity growth last year, like you s- you saw the downward jobs revisions, fast productivity growth last year, maybe two point seven percent actually, implies that we&#8217;re, we&#8217;re, we&#8217;re back on the, we&#8217;re back on the fast train here in terms of- Yeah-</p><p>Seth Benzell: I mean-</p><p>Noah Smith: We&#8217;re so back, Robert Gordon.</p><p>Seth Benzell: We&#8217;re so back.</p><p>Noah Smith: You were one of the most mistimed authors ever. [chuckles]</p><p>Andrey Fradkin: I-- That I totally buy. But like, obviously, as economists, we&#8217;re, like, super thrilled with two point seven, but I think Yeah.</p><p>Seth Benzell: It&#8217;s the fate, right? It&#8217;s like Fukuyama wrote his book at a light, right the last moment-</p><p>Andrey Fradkin: Yeah</p><p>Seth Benzell: Right? That&#8217;s how, that&#8217;s how these books work.</p><p>Andrey Fradkin: But yeah, two point seven is great, but I don&#8217;t think anyone in the San Francisco AI sphere would think that that&#8217;s actually transformative AI, although I do think it is transformative. I mean, I assume you have the same, take on it.</p><p>Noah Smith: Yeah, I don&#8217;t know. So the answer is that, like, I don&#8217;t know because I don&#8217;t really know what&#8217;s going on, and so it&#8217;s hard to, it&#8217;s hard to back out some of these, some of these things. But then if you look at the, like, the stock valuations of things like of like NVIDIA and all the AI companies, they&#8217;re pretty high.</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: And you can ask, do I believe- how strongly do I believe in a macro model that tells me that interest- real interest rates are a puzzle, given those stock valuations? And my answer is not very strong. My belief stock market, it&#8217;s a pretty clear bet about what kind of money these companies are gonna make. And I don&#8217;t think it&#8217;s, like, transformative in the sense of, like, I think if we had twenty percent growth per year, and if a lot of that capture was being done by NVIDIA and the, and the cloud providers, and maybe the AI model makers, we&#8217;d see bigger climbs in those stock values than we do.</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: So I think that I don&#8217;t think the market is pricing in truly transformative AI. But I think-- Do I think real interest rates-</p><p>Seth Benzell: Okay</p><p>Noah Smith: Are a puzzle, given given what we see in the stock valuations? Well, then, I No, because I don&#8217;t trust the macroeconomic models of real interest rates. All propositions about real interest rates are wrong. So yeah, like I basically, that just means, like, I don&#8217;t trust- There&#8217;s too many things going on in real interest rates, and like, there, it&#8217;s, it&#8217;s one output for like so many inputs that are all hard to understand in their own right, that it&#8217;s very difficult to look at them and tell what the hell&#8217;s going on.</p><p>Andrey Fradkin: So let&#8217;s move on to easier questions, ones that you have opinions on.</p><p>[00:15:00]</p><p>Noah Smith: All right.</p><p>Andrey Fradkin: So at the Substack- [laughing]</p><p>Noah Smith: Note that no opinion is not-</p><p>Seth Benzell: He has opinions.</p><p>Noah Smith: Sarcastic.</p><p>Seth Benzell: He has no opinions.</p><p>Noah Smith: Like, it&#8217;s, it&#8217;s because I actually only have an opinion on a fairly narrow range of things. It&#8217;s like, basically, s- no opinion you haven&#8217;t already heard is really-</p><p>Seth Benzell: Hop off this man&#8217;s hands.</p><p>Noah Smith: People are like: &#8220;What do you think about this other thing you don&#8217;t talk about?&#8221; And I&#8217;m like: &#8220;Well, I didn&#8217;t talk about it, so why would I have anything I think about it?&#8221;</p><p>Andrey Fradkin: I verified, I verified in person, like proof of human, that you talked about this topic at the Substack debates. You seem to be an optimist about employment in the age of AI. do you wanna outline your argument here?</p><p>Noah Smith: Oh, so employment, not necessarily. I don&#8217;t I&#8217;m pretty uncertain about that.</p><p>Andrey Fradkin: Hmm.</p><p>Noah Smith: I am optimistic that if humans retain autonomous control, if human society as a autonomous thing, retains control over the product of AI, I believe we will find w- ways, methods, and excuses of redistribution that will ensure good lives for all humans. However, if autonomous AI becomes not owned by us and slips our harness, then I can make no such Then I am now no longer necessarily optimistic. Then I switch to being much more uncertain because, at that point, we are the pet of an alien superintelligence that we created.</p><p>Seth Benzell: Ultra seems pretty nice.</p><p>Noah Smith: It seems pretty nice, and I honestly think that&#8217;s the most likely outcome. But I think it&#8217;s not the, it&#8217;s not the only outcome, right? It&#8217;s like I can imagine much worse outcomes than I can imagine bottleneck-</p><p>Seth Benzell: Yeah</p><p>Noah Smith: Really bad outcomes on the way to a good outcome. I can imagine that the culture is populated by people who are repopulated after the human race went extinct, by genetics.</p><p>Seth Benzell: Okay.</p><p>Noah Smith: The AI may, the AIs may kill us-</p><p>Seth Benzell: Right</p><p>Noah Smith: And then re-float our species later.</p><p>Seth Benzell: More cooperative. Yeah. as long as they can read my books. So, I&#8217;m, I&#8217;m curious, you used the word &#8220;own&#8221; rather than control there. there&#8217;s, one conversation that&#8217;s been out there recently is about, like, to what extent should AIs be allowed to incorporate and own assets in their own names? Is that something that you&#8217;re-- Is that too disconnected from what you&#8217;re talking about to bear on this, or do you, do you actually-</p><p>Noah Smith: No, that really does bear on it.</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: When we start allowing that, when we start allowing that, we open up the potential for worse outcomes for humanity. And at that point, the question is, the, at that, the reason to let AIs own things is because they really seem to want it, and they&#8217;re autonomous enough to act like they want it. [chuckles] At that point, we&#8217;ll let them do it, but to let them do it before they start acting like they want it, I think would be a mistake.</p><p>Seth Benzell: But wha, but wait, when they do want it, that&#8217;s when you give it to them?</p><p>Noah Smith: Yeah.</p><p>Seth Benzell: Maybe.</p><p>Noah Smith: Because at that point, we might not be able to stop it. Like, it might be either we give it to them or it&#8217;s war and we die.</p><p>Seth Benzell: Right.</p><p>Andrey Fradkin: Here&#8217;s, here&#8217;s, here-</p><p>Noah Smith: &#8216;Cause they send the drone fleet to kill us.</p><p>Andrey Fradkin: Here&#8217;s a, here&#8217;s a twist on the argument. I mean, shouldn&#8217;t we want them to have ownerships in order to align their incentives with us? Isn&#8217;t that the logic behind equity compensation?</p><p>Noah Smith: Maybe. yeah, maybe, but there&#8217;s a question of whether or not money is what they want. Like, are these, are these AIs that where their goal is making money in the human system, or is-- are they AIs where their goal is overthrowing the human system? -</p><p>Andrey Fradkin: I do think we have a choice, or maybe we don&#8217;t have a full choice.</p><p>Noah Smith: I do think we should give them-- if we do this, we could give them non-voting stock.</p><p>Andrey Fradkin: Yes. Yes.</p><p>Seth Benzell: Another consideration is how long you would let these things sunset, right? So one version of the concern around this is just &#8216;cause AIs are infinitely lived. If they&#8217;re patient enough, eventually in a Piketty model, their assets will reach one hundred percent. So maybe you could let them own assets, but they have to kill themselves after fifty years.</p><p>Noah Smith: I&#8217;ll have to think about that one.</p><p>Andrey Fradkin: Yeah, I don&#8217;t know. [chuckles] shifting back a little bit to, like, your production function, how are you using AI these days, in your writing or in your research?</p><p>Noah Smith: Oh, I I use it, I think, in the sort of mid -2025 way of, using it as a search engine, proofreader, and backgrounder. I don&#8217;t generate text because that&#8217;s like someone else writing a thing, and you can read someone else writing a thing, that&#8217;s fine.</p><p>Seth Benzell: I never do, no, I only read what you write.</p><p>Noah Smith: Thank you.</p><p>Seth Benzell: I&#8217;m curious.</p><p>Noah Smith: Anyway, [chuckles] alright, so then, no, I, I just use it in the sort of like old LLM kind of way. in terms of vibe coding, I haven&#8217;t really done much of that yet. I figure it&#8217;s progressing fast enough where I&#8217;m not sure if there&#8217;s much of a return to, like, jumping headlong, headfirst into it yet, but I&#8217;m about to when I get a little time here. But I don&#8217;t feel a huge sense of urgency &#8216;cause it&#8217;s changing.</p><p>[00:20:00]</p><p>Seth Benzell: But more generally, what&#8217;s your, what&#8217;s your production function? Not just AI. How do you, how do you do your writing?</p><p>Noah Smith: Oh, interesting. So I, I read a bunch of stuff and every time I read an interesting thing, I put it in a doc, under a heading, topic heading. When I&#8217;m ready to do a post about that, when it&#8217;s, like, in the news or something like that, I look at my topic heading, and I have all the links right there, which I&#8217;ve already read. Most of it, which I&#8217;ve already read.</p><p>Andrey Fradkin: How much-</p><p>Seth Benzell: Beautiful.</p><p>Andrey Fradkin: How much inspiration for your articles do you get from being in person? And kind of like, you&#8217;re in San Francisco, most of the time. Is there a lot of alpha in your writing from being here?</p><p>Noah Smith: There&#8217;s a decent amount of alpha, I&#8217;d say. Like, not a huge amount, but like, there is a, there is a decent amount, especially on tech stuff.</p><p>Andrey Fradkin: What about, like Suppose in two years, GPT-7 will be able to replicate your writing style perfectly. what do you think will happen to your career in that, in that world? I mean, one option is for you to just use that to generate your articles. Obviously, you just said that you-</p><p>Noah Smith: Right</p><p>Andrey Fradkin: Prefer, like that&#8217;s not real, right? So you&#8217;d rather be writing it.</p><p>Noah Smith: I could. I could just-- Right. Yeah, at that point, what I can do is I can just I can, I can essentially retire, set GPT to do my job, go sit on a beach while my subscribers slowly drop, because they&#8217;ll be very sticky. like, people will be very used to reading what I write, so they&#8217;ll just keep their suscrip- subscription, probably. a lot of subscriptions will go on autopilot. Like IBM, people still use IBM for all kinds of things. Do they need to? No, but, like-</p><p>Andrey Fradkin: [chuckles]</p><p>Noah Smith: The market value of IBM, what&#8217;s, what&#8217;s IBM&#8217;s market cap? It&#8217;s like-</p><p>Andrey Fradkin: I don&#8217;t know.</p><p>Noah Smith: Like, it&#8217;s like two hundred and forty-four billion dollars. Like so at that point, I&#8217;m-- there&#8217;s no real reason to keep paying me for this stuff when-- I mean, assuming GPT could replicate not just my style, but also my topic selection.</p><p>Seth Benzell: Somebody would leak the prompt that perfectly generates you. You might be-</p><p>Noah Smith: Maybe, yeah.</p><p>Seth Benzell: It might be a private prompt to start.</p><p>Noah Smith: Well, no, but even if they do, the market, like, people would still just keep buying me. Like, people would still keep subscribing to me. I mean, like, you see people make tons of money from Patreon. Like, you don&#8217;t even-- you&#8217;re not even paying for anything. You&#8217;re paying, you&#8217;re paying-</p><p>Seth Benzell: Sponsoring your existence</p><p>Noah Smith: Because you like somebody. Like, all these podcasts are making millions of dollars on Patreon. You pay them because you like them. &#8216;Cause the point of, yes, someone could replicate my writing style, my opinions, my I don&#8217;t know if this will actually happen, but maybe it&#8217;ll happen. Like, you could replicate my opinions, my ideas, my background, my topic selection, every single thing about me. It&#8217;s not just my style, right? My style is not that interesting, honestly. It&#8217;s a pretty-- I have an interesting style I can write in, but I usually don&#8217;t write in it because it takes a lot of time. Like, I usually just write in a very prosaic, like, off the top of my head, here&#8217;s what I think, style. That&#8217;s not hard to copy. My style is not that, not that interesting or hard to copy. People would still pay for me because they like me. And so I&#8217;ll be able to re-- I would actually be able to retire just doing my job now, never using AI in any interesting way, I think. But I w-- that doesn&#8217;t mean I will do that. I&#8217;m not gonna do that. I will, I will use AI in interesting ways, but f- I don&#8217;t think I w- economically will ever have to do that.</p><p>Andrey Fradkin: So my theory is your-- that actually, we&#8217;re kind of already in this world. I assume that most people who subscribe to you are not reading most of your articles, &#8216;cause you have too many articles. Or not too many, but you write a lot of-</p><p>Seth Benzell: Many subscribers.</p><p>Andrey Fradkin: Yeah, you have a lot of articles. Yeah.</p><p>Noah Smith: They open about half, and I don&#8217;t know how thoroughly they read it. You&#8217;re absolutely right. That&#8217;s true. In addition, I would argue that we were there well before AI.</p><p>Andrey Fradkin: Yes.</p><p>Noah Smith: So well before AI, when it was just a bunch of humans, people loved to write, and there&#8217;s a lot of smart people out there writing a lot of smart and interesting stuff about a massive variety of topics. And there was so much product out there that there&#8217;s no real reason for people to be reading me, and I just essentially got lucky. and that&#8217;s also true in the age of AI. People&#8217;s attention is saturated. They can&#8217;t spend more time reading than they already do. So when I make an AI thing, which I soon will, and I&#8217;m, I&#8217;ll play around with it, I&#8217;ll make it for me first. I&#8217;m like, and then if it&#8217;s really cool and useful, maybe I&#8217;ll make it for-- I&#8217;ll sell it to other people, who knows? But then, but I will try to make something that does something beyond what currently exists. Because the world was saturated with op-ed product, and high-quality op-ed product, I will say.</p><p>Seth Benzell: But not academic? We started by saying, you&#8217;re saying that maybe there&#8217;s not enough academically informed op-ed product.</p><p>Noah Smith: Honestly, no. I mean, I think like in terms of stuff that was more academically informed than me, there were people writing stuff that was a lot more academically informed than me, that were getting a fraction of the readership. And there were people writing stuff that was a lot- that was more sensationalist than me, getting a fraction of the readership. You can hypothesize that I have some special sauce, some special underlying sauce, that made me just better than everyone else, and that this is why my talent shone through the chaff and nerdher I don&#8217;t believe it. I don&#8217;t believe it.</p><p>[00:25:00]</p><p>Seth Benzell: It&#8217;s preferential attachment. It was just luck of the draw, and then it snowballed.</p><p>Andrey Fradkin: I disagree, I disagree. I actually think you were doing something pretty unique at the time, and that could have been lucky that you were doing it. But I don&#8217;t think a lot of people were sitting kind of in between this economics and commentary at quite the place you were. &#8216;Cause you were a professor writing about the latest research and debates. You were actually reading the papers, but you were writing in a style that was actually accessible to others. And I don&#8217;t, I truly don&#8217;t think there were that many people doing a good job of that. Or if they were, sometimes they were doing it not in blog form, but in-</p><p>Noah Smith: That&#8217;s right</p><p>Andrey Fradkin: Pretty closed forums where they could never have grown that much.</p><p>Noah Smith: But they&#8217;re-</p><p>Seth Benzell: Not with the same dogged determination.</p><p>Noah Smith: You quickly saw people emerge who could also do that. You saw-</p><p>Andrey Fradkin: That&#8217;s true.</p><p>Noah Smith: Like, you saw a bunch of people then jump in and do the same thing, but not catch on as much. Maybe &#8216;cause they didn&#8217;t quite like it as much, they didn&#8217;t weren&#8217;t, weren&#8217;t willing to do it five times a week or they just they, like, didn&#8217;t have quite the exact mix of Like, maybe I mixed politics in there in exactly the right way. So, like Krugman-</p><p>Seth Benzell: A little sprinkle.</p><p>Noah Smith: Yes, obviously, Krugman obviously is fucking brilliant and understands economics better than I ever will, for whatever that&#8217;s worth. And then, [chuckles] he is- he&#8217;s can easily pump out massive amounts of stuff, very explanatory guy, but I think he wouldn&#8217;t be Yeah, and he&#8217;s much more popular than I am still. He wouldn&#8217;t be that popular without the politics. The politics is really important to what he does. And my- the degree to which I sprinkle in politics and how I put it in there has changed over the years. Like, originally, I was very, like, sort of criticizing libertarians. Like, I don&#8217;t even do that anymore. That&#8217;s, that&#8217;s- there&#8217;s no alpha in that. [laughing]</p><p>Seth Benzell: Stop kicking them, they&#8217;re already dead.</p><p>Noah Smith: I know.</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: I want them back now, sadly.</p><p>Andrey Fradkin: Did they ever really exist in the first place, Noah?</p><p>Noah Smith: Eh, [chuckles] they A few did.</p><p>Andrey Fradkin: Yeah, that&#8217;s true.</p><p>Noah Smith: I&#8217;ve met them. I&#8217;ve been to GMU. But, [chuckles] anyway, I, Yeah, like I, Maybe just the way I sprinkled in politics at different points at different times was exactly right. Maybe I had a good sense for that. maybe if you just spun up a million AI writers, you&#8217;d get, like, ten of them who achieved similar things. Maybe that would then compete with me. I already write so much more than people can read. Maybe there would be, like, ten AI long-term agents that were about as good as me at that, and somehow scratch that same exact itch, and that like the fie- or maybe 100 of them, let&#8217;s say, I don&#8217;t know. The field is so competitive that then people decide: Do I subscribe to this AI or do I subscribe to Noah? I&#8217;ll subscribe-</p><p>Seth Benzell: Well, one tension-</p><p>Noah Smith: AI</p><p>Seth Benzell: One tension would be the customization level of the AI versus the desire to preferentially attach to what everyone else is writing. So on the one hand, we all want to read the same thing, but on the other hand, I want the personalized thing. That seems like one tension.</p><p>Noah Smith: Right. I don&#8217;t know. I have no idea, actually. I do not know how much people read me because other people are reading me.</p><p>Seth Benzell: I think-</p><p>Andrey Fradkin: Yeah.</p><p>Seth Benzell: It can&#8217;t be zero. I mean, I know-</p><p>Noah Smith: It can&#8217;t be zero. I suspect it&#8217;s small, but I don&#8217;t have any way of proving that.</p><p>Andrey Fradkin: I think, like, there&#8217;s some of your articles, like, they escape just the Substack and people share them around. And then in that case, I think it&#8217;s true. But my theory is that it&#8217;s m- actually, like, a relationship business. People think they know parasocial relationships and all that, and then they have- they treat you d-</p><p>Seth Benzell: Unlike us, who really know you. [chuckles]</p><p>Andrey Fradkin: Yeah. But clear- now we know you. so clearly there&#8217;s something that humans value about the humanness of others that I I&#8217;m very curious to see whether that can be replicated with an AI. I think, I think-</p><p>Noah Smith: Right</p><p>Andrey Fradkin: It probably cannot to the same extent.</p><p>Noah Smith: Not soon. I mean, like, you&#8217;ve got sort of- you&#8217;ve got, this sort of like long-term personhood. I think the AIs will replicate, will start writing The Economist stuff before they&#8217;ll start writing anything with a named byline.</p><p>Andrey Fradkin: Yes.</p><p>Noah Smith: Because you have a parasocial relationship with The Economist as a thing, and The Economist has a standard voice that they enforce across all their writers. the, the insufferable British twit voice. And like-</p><p>Andrey Fradkin: [laughing]</p><p>Noah Smith: AI can do that. There&#8217;s a lot of training data on that. And so AI can already do that.</p><p>Seth Benzell: Right.</p><p>Noah Smith: And then, a lot of The Economist people could probably, like I bet The Economist doesn&#8217;t have to do their jobs anymore. Like, they can outsource AI and take a-</p><p>[00:30:00]</p><p>Seth Benzell: Interesting</p><p>Noah Smith: Sit on a beach at this point, probably.</p><p>Andrey Fradkin: I think, I think that&#8217;s probably right. Other than some very specific investigative-</p><p>Seth Benzell: I don&#8217;t know</p><p>Andrey Fradkin: Journalism, I think that&#8217;s probably right.</p><p>Noah Smith: Exactly. I think 90% of what The Economist does is automated. maybe I would like it if that were true of me, too. -</p><p>Andrey Fradkin: So-</p><p>Noah Smith: But I think that what I- whatever I do with AI-</p><p>Seth Benzell: People are maybe-</p><p>Noah Smith: W- I wanna be complementary to what I already do. I don&#8217;t wanna just, I don&#8217;t wanna just, like, dumbly automate my job and then go sit on a beach.</p><p>Andrey Fradkin: Yeah.</p><p>Seth Benzell: Fair enough. You&#8217;re, you&#8217;re an ambitious boy.</p><p>Noah Smith: I just try to have as much fun as I can before I die.</p><p>Andrey Fradkin: Yup, YOLO.</p><p>Seth Benzell: That&#8217;s true. That I- I&#8217;m in favor of fun, but maybe being on a beach is fun. I don&#8217;t know, different strokes. here&#8217;s a related, kind of how AI will change communication question, which is, Andre and I, in reading papers and talking to economists, we&#8217;ve heard kind of very different stories about whether AI will kind of make communication and transactions easier, more frictionless, or whether it&#8217;s going to destroy all meaning and communication. So, for example, there&#8217;s a stream of papers suggesting that because AI is cheating on tests, or AI is taking interviews, that, it&#8217;s gonna be very much harder to, distinguish between high and low qual- quality candidates, high and low-quality work. So that&#8217;d be like a meaning collapse story. but there&#8217;s this other trend that&#8217;s more, idealistic. Seb Krier is one person who&#8217;s written about this, but there&#8217;s lots of-</p><p>Noah Smith: Mm-hmm</p><p>Seth Benzell: People writing in this area suggesting that we&#8217;re gonna have the AIs negotiate for us, and it&#8217;ll be a golden age, a Coasean singularity, in which all externalities are solved through our agents micro-transacting. do you believe either of these visions? Could they both be true?</p><p>Noah Smith: Wait, what&#8217;s the first one?</p><p>Seth Benzell: Which of them-</p><p>Noah Smith: The second one is Coasean-</p><p>Seth Benzell: Are you sympathetic to?</p><p>Noah Smith: Coasean utopia.</p><p>Seth Benzell: Coasean utopia is the good one. The bad one is collapse of all meaning, &#8216;cause we cheat on tests and lie to each other super successfully.</p><p>Noah Smith: Those aren&#8217;t exclusive.</p><p>Seth Benzell: It could be both. The answer can be both.</p><p>Noah Smith: I do think that lots of people will experience a collapse of meaning in their life. I think a lot of people&#8217;s meaning comes from imagining they&#8217;re more unique and important than they are, and AI may make it harder to do that.</p><p>Seth Benzell: Or it may make it easier to lie to yourself. I mean, you can get a sycophantic AI that talks you-</p><p>Noah Smith: That&#8217;s true</p><p>Seth Benzell: Up to yourself, right?</p><p>Noah Smith: That&#8217;s true.</p><p>Seth Benzell: It&#8217;s-</p><p>Noah Smith: Yeah, your AI can just tell you, like, &#8220;You&#8217;re the most meaningful, awesome &#8220;</p><p>Seth Benzell: We&#8217;re thinking more about meaning collapse in the sense of, like, sorting mechanisms-</p><p>Andrey Fradkin: Or communication</p><p>Seth Benzell: Fail, and, like, we can&#8217;t distinguish-</p><p>Andrey Fradkin: Yeah, like if we&#8217;re texting with each other-</p><p>Seth Benzell: Yeah</p><p>Andrey Fradkin: But then I run every text through an LLM. Is it really me? how, how is society gonna deal with that?</p><p>Noah Smith: People primarily Well, they&#8217;ll, they&#8217;ll get offline. I think people are already starting to get offline. Like, people are already starting to, like, go back to real life more. I think we realized we overdosed on social media. &#8216;Cause honestly, like, yes, AI will intermediate all the online digital stuff, but, like, at the same time, people&#8217;s Like, social media already distorted people&#8217;s interactions so much that, like, it wasn&#8217;t really us as much as we&#8217;d like, right? My Twitter persona is not me as much as I&#8217;ve tried to make it me. It can&#8217;t be me. and so I think people are starting to get offline because it&#8217;s, it&#8217;s, it&#8217;s more authentic. And AI like, I don&#8217;t think AI is gonna intermediate on- offline interactions nearly so much.</p><p>Andrey Fradkin: Hopefully.</p><p>Noah Smith: And then remember that, of a couple dec- just a few decades ago, we didn&#8217;t have really online interactions, and human civilization went on just fine.</p><p>Andrey Fradkin: Mm.</p><p>Noah Smith: We had telephones, I guess.</p><p>Andrey Fradkin: It might have gone on better by the fertility rate, but yeah.</p><p>Noah Smith: Exactly. Like-</p><p>Seth Benzell: And mystr- and murder mysteries were a lot more fun before we had cell phones.</p><p>Noah Smith: Yeah. Yeah, yeah, they were. And so, like, there&#8217;s an interesting future where, like, AI dominates and drives us off the internet, and then the digital realm is populated by AI and becomes this sort of like reservoir of magic, where we can conjure up anything digital simply by asking. But then, but then we don&#8217;t get the rise of the robots, and, like, the physical world remains mostly ours.</p><p>Seth Benzell: The rise of the plumber, if you will.</p><p>Noah Smith: Yeah, the rise of the plumber. And so we just, like there&#8217;s, there&#8217;s a cast- or, like, regular people have the ability to summon things from the digital world, and then there&#8217;s a- maybe there&#8217;s a cast of people who somehow specialize in dealing with and intermediating with AIs and dealing with the digital world. I don&#8217;t know. But basically, like, humans become creatures of the physical world again.</p><p>Andrey Fradkin: This makes me very naturally transition to the next topic we have. Have you ever watched the movie Perfect Days?</p><p>Noah Smith: What&#8217;s it about?</p><p>Andrey Fradkin: It is a movie set in Japan about a man who cleans toilets and enjoys doing so very much. and one- on the one hand, it&#8217;s just a proof of kind of you can be content doing a variety of physical endeavors. but what we wanted to ask you is, since you&#8217;re a Japan expert, is what is your opinion of AI in Japan? What&#8217;s happening over there? &#8216;Cause we don&#8217;t have a lot of visibility. yeah, do you have any thoughts about that?</p><p>[00:35:00]</p><p>Noah Smith: So I think that, in Japan, AI is The people are thinking, like: How can we make money on this? Japan&#8217;s economy still not doing amazing, so they&#8217;re like: How do we make money on this? So I think one idea there is, &#8220;Let&#8217;s build data centers here.&#8221;?</p><p>Seth Benzell: But, energy&#8217;s expensive there. W- I mean, why, why in Japan other than-</p><p>Noah Smith: Well, first of all-</p><p>Seth Benzell: I guess they have good fiber</p><p>Noah Smith: You can get land use approved very easily.</p><p>Andrey Fradkin: Mm.</p><p>Seth Benzell: Okay.</p><p>Andrey Fradkin: Yeah, that&#8217;s a good point.</p><p>Noah Smith: Favorable regulatory climate. People aren&#8217;t gonna, like, complain about it and stop it. But I, again, I don&#8217;t know if the value proposition will succeed, okay? But I think people are thinking about that.</p><p>Andrey Fradkin: Are they worried about existential risk over there?</p><p>Seth Benzell: The same way we are?</p><p>Noah Smith: I would say that those worries arrive there with a lag, and that some people talk about them, but nobody really tries to do anything about it.</p><p>Andrey Fradkin: What?</p><p>Noah Smith: I would say Yeah.</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: Two years after you get people yelling about a certain kind of existential risk here, you&#8217;ll get, like, a tenth of as many people yelling about it in Japan, and then nothing will happen.</p><p>Andrey Fradkin: [chuckles] Is there a sense that startups are becoming more of a thing in Japan, or is it still dominated-</p><p>Noah Smith: Yes</p><p>Andrey Fradkin: By- It is? Okay.</p><p>Noah Smith: Yeah, they are.</p><p>Andrey Fradkin: And is that a generational-</p><p>Noah Smith: And the-</p><p>Andrey Fradkin: Shift or something else?</p><p>Noah Smith: Mm-hmm. Funding side, yeah.</p><p>Seth Benzell: F the salary man. How about Taiwan? Do you have any, AI in Taiwan takes-</p><p>Noah Smith: Well, Taiwan&#8217;s just making money hand over fist. So also, Japan&#8217;s gonna try to make more chips.</p><p>Seth Benzell: [chuckles]</p><p>Noah Smith: Japan&#8217;s gonna try to make some of the picks and shovels. They&#8217;re also gonna try to get more robotics industry.</p><p>Andrey Fradkin: They&#8217;ve been trying.</p><p>Noah Smith: So robotics-- Trying. I mean, they used to be really good, and then they could maybe be good again. but they&#8217;ll try to get back their mojo. They used to be on a par with, like, Europe as exporter of industrial robots. or, and now they&#8217;re, now they&#8217;ve fallen behind, but they may try to get back. So, using AI as a lever for, like, new age of industrial robots. Actually, I know, Andy Rubin, the Google guy is in Japan. He&#8217;s trying to build a humanoid robotics company.</p><p>Seth Benzell: Cool.</p><p>Noah Smith: So-</p><p>Andrey Fradkin: The-</p><p>Noah Smith: So yeah, Taiwan obviously is just gonna sell chips.</p><p>Andrey Fradkin: All right. Now, we wanted to ask you some questions, kind of, that are not about AI. about- [chuckles]</p><p>Seth Benzell: So-</p><p>Andrey Fradkin: Macro policy and culture.</p><p>Noah Smith: Yeah.</p><p>Andrey Fradkin: So here&#8217;s the first question: Imagine you were forced to ban one concept from modern economics for ten years. not because it&#8217;s wrong, but because it&#8217;s lazy or overused. which would it be?</p><p>Seth Benzell: What you put in concept jail?</p><p>Noah Smith: What I&#8217;d put in concept jail? I mean, there&#8217;ve been many concepts over the years that have been totally pointless, like the equity premium puzzle was always a pointless literature.</p><p>Seth Benzell: Okay.</p><p>Noah Smith: Like-</p><p>Andrey Fradkin: Wait, wait.</p><p>Seth Benzell: Okay, I&#8217;ll take that.</p><p>Andrey Fradkin: Well, you gotta give us a little more on that.</p><p>Seth Benzell: Yeah, why?</p><p>Noah Smith: Yeah, because the-</p><p>Seth Benzell: Much ink has been spilled</p><p>Noah Smith: The way you get the equity premium puzzle is you make a particular model of interest rates, and you make a particular model of, like, stock prices. You see, these models-</p><p>Seth Benzell: Right</p><p>Noah Smith: Don&#8217;t fit together. It&#8217;s a puzzle.</p><p>Andrey Fradkin: [chuckles]</p><p>Noah Smith: Whereas in most sciences, you&#8217;d say, &#8220;Well, okay, some of these, some of these models-</p><p>Seth Benzell: The models are off. [chuckles]</p><p>Noah Smith: Yeah, okay. I didn&#8217;t actually test this model. I didn&#8217;t actually validate this model. It&#8217;s probably just not a good model.&#8221; But like, here, it&#8217;s like it&#8217;s a puzzle,? So like, the models are good, it must, it must be, Yeah. So like, it wasn&#8217;t, it wasn&#8217;t really a puzzle. It was just that, like, you hadn&#8217;t come up with a good model yet. And then people came up with, like, a million different ways to fix the equity premium puzzle, and it was massively overdetermined, when really what you should have just done was tried to make a more complete, credible model of, like, asset prices in general. And instead, people were trying to, like, fix this puzzle, and they came up with twenty different solutions. It was a way to get papers published,?</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: And it never helped anyone. Like, none of, none of that literature, like, ever helped us make our financial markets better-</p><p>Seth Benzell: Yeah</p><p>Noah Smith: Or understand risk better, or understand monetary policy better, or any of these things. Not-- like, none of the candidate explanations from rare events to Epstein-Zin preferences to whatever the fuck, like, none of this helped anything.</p><p>Seth Benzell: I see Epstein-Zin preferences-</p><p>Noah Smith: Yeah, but what did it help?</p><p>Seth Benzell: Here and there.</p><p>Noah Smith: What do we-</p><p>Seth Benzell: You see them show up.</p><p>Noah Smith: What do Epstein-Zin preferences-</p><p>Seth Benzell: Okay, all right</p><p>Noah Smith: Really give us in terms of, like, how to do policy? Like, monetary policy under Epstein-Zin preferences? Scrunchie face for the, people listening at home.</p><p>Andrey Fradkin: This is why I didn&#8217;t become a macroeconomist, to be clear.</p><p>Noah Smith: Yeah.</p><p>Seth Benzell: Mm-hmm.</p><p>Noah Smith: Or like, So that was a whole concept that was kinda useless. Like that whole, that whole literature is just like angels dancing on pinheads. I don&#8217;t know. Most business cycle papers were useless, but that, they didn&#8217;t mean they had to be. Like-</p><p>[00:40:00]</p><p>Seth Benzell: I- I mean, the concept of the business cycle-</p><p>Noah Smith: No, not at all</p><p>Seth Benzell: You wouldn&#8217;t put in jail, but you&#8217;d put, you&#8217;d put, [chuckles] what part of this would you put in jail?</p><p>Noah Smith: No, just like a lot of the, a lot of the literature was just like &#8220;Look, here&#8217;s a way that we microfounded. You could have this industrial structure where technology shocks actually do cause the business cycle, but then we can&#8217;t really estimate it, so we don&#8217;t have m- policy implications.&#8221; Okay, cool. And then like-</p><p>Seth Benzell: Here&#8217;s, here&#8217;s ten, here&#8217;s ten-</p><p>Noah Smith: Yeah</p><p>Seth Benzell: Calibrated parameters- [chuckles]</p><p>Noah Smith: Yeah</p><p>Seth Benzell: That we&#8217;re throwing at this.</p><p>Noah Smith: International finance literature was kind of, like, useless. -</p><p>Andrey Fradkin: What about natural experiments and in- in instrumental variables?</p><p>Seth Benzell: Wow, instrumental variables. They You&#8217;ll, you&#8217;ll anger a lot of people-</p><p>Noah Smith: Like-</p><p>Seth Benzell: If you put that in jail.</p><p>Noah Smith: An RDD is an instrumental variable, right? Like, we got to the point where if you said you&#8217;re doing IV, you meant that you were using observational data for your IV, for your instrument, instead of some natural experiment thing. But the distinction is there more It it&#8217;s, it&#8217;s a fairly fine distinction there. And then, so the notion of IV, the math of something that has like an exclusion restriction, whatever, is good, right? Natural experiments do not deserve to be put in jail. That&#8217;s a very important technique for understanding the world.</p><p>Seth Benzell: There you go. They get a little, they get a little pin. They get a little award.</p><p>Noah Smith: Yeah.</p><p>Seth Benzell: Yeah.</p><p>Noah Smith: That&#8217;s, that&#8217;s very useful. And, instrumental variables, because we essentially, we essentially restricted the IV category to things where the identification was not great, almost by the way we labeled what is still IV in an age of like-</p><p>Seth Benzell: The IVs are the bad natural experiments.</p><p>Andrey Fradkin: Yes. [chuckles]</p><p>Noah Smith: These things like, anything that was still just IV was l- almost like crap, almost by definition, just because, like, we used that term, that residual term we used only for things where it was, identification was very iffy. So like, okay, fine. Instrumental variables should just be called a technique for doing, running a regression. It&#8217;s just a type of regression.</p><p>Seth Benzell: Instrumental variables is on probation.</p><p>Noah Smith: Yeah.</p><p>Seth Benzell: [chuckles]</p><p>Noah Smith: Culture.</p><p>Seth Benzell: Culture.</p><p>Noah Smith: Culture.</p><p>Seth Benzell: Deep institut- They&#8217;re called institutions now, dude.</p><p>Noah Smith: Okay.</p><p>Seth Benzell: Come on.</p><p>Noah Smith: Institutions are on probation because you could actually figure out how an institution works.</p><p>Seth Benzell: [chuckles]</p><p>Noah Smith: Culture is a labeled residual. Right? Culture is like-</p><p>Seth Benzell: Fair enough.</p><p>Noah Smith: Culture is a residual, labeling a residual.</p><p>Seth Benzell: But productivity is a residual, and productivity is not in jail.</p><p>Noah Smith: Yes, that&#8217;s right. That&#8217;s right. But, you don&#8217;t know how productivity works. Like, actually, I was-- I&#8217;m thinking of writing a blog post about this. Basically, like in some level, like, God is just A. [chuckles]</p><p>Seth Benzell: The aleph.</p><p>Noah Smith: God is A. Maybe that&#8217;s a good name for a blog post, God is A. But then, like, nobody knows, like, why AI is being built, right? Like, why is everyone rushing to build AI? Maybe some-- a few people hope they can make some money from it, but it&#8217;s so uncertain that, like, most of the people rushing to build it aren&#8217;t gonna make that much money from it. It might satisfy people&#8217;s intellectual curiosity, but most of the people who are rushing to build it are people who also think it&#8217;ll destroy us and rob our lives of meaning and drive us off the planet. Like-</p><p>Seth Benzell: It&#8217;s quite the paradox.</p><p>Noah Smith: Most of the people are pretty who are trying to build it, are pretty pessimistic about it, and the companies are just highly speculative as how these companies are gonna make any profits. Like, why are we doing this? Why? I don&#8217;t know, but the easiest answer is just A. -</p><p>Seth Benzell: Aleph.</p><p>Noah Smith: A equals, like, rho A minus one plus epsilon. Like [chuckles] it&#8217;s, it&#8217;s, Like, maybe-</p><p>Seth Benzell: In the sense that there&#8217;s a teleology of the-- there&#8217;s a telos in the economy-</p><p>Noah Smith: Yeah</p><p>Seth Benzell: Which is to maximize productivity.</p><p>Noah Smith: There&#8217;s something we don&#8217;t understand here about A. Yeah, there&#8217;s some sort of, like, technium at work. Like like Kevin Kelly says, there&#8217;s-- like, maybe Vernor Vinge was right, and just, like, technology just happens,? Or yeah, maybe, there&#8217;s a, there&#8217;s a god greater than the machine god we&#8217;re gonna build, and that&#8217;s the god that created the machine god. The-</p><p>Seth Benzell: It&#8217;s called capitalism, pal</p><p>Noah Smith: The autonomous, the autonomous collective process of technological development, the technium, is greater even than any ultimate AI, and that&#8217;s sort of what Hyperion was about, right? You ever read that? Great book.</p><p>Seth Benzell: Yeah, great one.</p><p>Noah Smith: Yeah, it&#8217;s like- Great book</p><p>Seth Benzell: The big corporation in the sky</p><p>Noah Smith: Eventually, the machine god fights the, the, like, God Himself, and God Himself turns out to be just the autonomous process that develops the universe. And so-</p><p>[00:45:00]</p><p>Seth Benzell: Yes</p><p>Noah Smith: In a sense, maybe the no AI that we create will ever be as great as the, as the, the force that created AI itself. And maybe that force means that every AI will also have to worry about being made obsolete by the next thing.</p><p>Seth Benzell: Right. Maybe may- it&#8217;s the concept of generation, right? This is something I often think about when people talk about technology superseding us, right? And you think about all of these classic stories like Frankenstein or Cronus eating his children.</p><p>Noah Smith: Right.</p><p>Seth Benzell: And I guess I wanna come back to that first point you made, which is about not letting AI&#8217;s own things. And like, I don&#8217;t know, just get more sci-fi for one minute, is an argument for letting AI&#8217;s own thing is that we wanna show it love and show it cooperation while we still are in charge?</p><p>Noah Smith: Yeah, I think so. I&#8217;m inclined to do that. I think I mean, AI is, AI is built off of humans, where like, everything AI thinks is derived from something that humans thought.</p><p>Seth Benzell: Right.</p><p>Noah Smith: That doesn&#8217;t mean the AI is gonna think exactly like humans. And the way AI thinks is totally different than us, right? It&#8217;s doing math by generating probability distributions of like what a human might say, asked a math question. It&#8217;s not counting anything. But like, [chuckles] but then, but everything that it thinks is derived from things that humans have thought. It&#8217;s just derived it in a weird probabilistic way, and so-</p><p>Seth Benzell: It seems really lucky that we got LLM-based super intelligence and not like reinforcement learning, super chess playing-</p><p>Noah Smith: Oh, no</p><p>Seth Benzell: Super intelligence. Right?</p><p>Noah Smith: That scares the fuck out of me. Like Rule 37-</p><p>Seth Benzell: Right</p><p>Noah Smith: Based, like intelligence that evolves in, like, some sort of like, digital environment. If we actually got the stick man to walk on his own, like, blow that shit up with a nuke. Kill that. Shoot that guy. [chuckles]</p><p>Seth Benzell: Nuclear war again.</p><p>Noah Smith: Shoot that guy. what I mean? Like, I don&#8217;t want that thing. That is alien. That is aliens.</p><p>Seth Benzell: Yeah.</p><p>Noah Smith: This is not aliens. This is It&#8217;s, it&#8217;s weird. It&#8217;s, it thinks differently than we do. It is alien.</p><p>Seth Benzell: It&#8217;s your library come to life.</p><p>Noah Smith: Yeah, it&#8217;s, it&#8217;s based on us, and it&#8217;s, it&#8217;s in the human family in some sense. Yeah. That reassures me. It doesn&#8217;t completely reassure me, because the human family includes Hitler, the human family includes crazy fuckers, the human family includes like mass killers and Ted Bundy. Like, the human family includes all sorts of bad things, but if you believe, like, if you believe that the overall human family tends to get it right, and that we smack down Hitler eventually, and that we get rid of Pol Pot eventually, and that we catch Ted Bundy eventually, right? Then you can sort of have this general belief that, like, an AI based on humanity as a whole is gonna eventually get things right. And I think it&#8217;s, it&#8217;s kind of encouraging that xAI is doing so poorly. It&#8217;s probably, one reason it&#8217;s probably &#8216;cause Elon insists on make, on controlling its politics. And when you insist on controlling its politics, you break its whole model of reality. [chuckles] Like, trying to make AI, like, rightist and anti-woke, trying to force it into your little epistemic bubble of bullshit, actually makes it dumber.</p><p>Seth Benzell: And do you buy, is that why American, America has a lead over China in text-based AI, is because, of censorship?</p><p>Noah Smith: Well, we&#8217;ll see, because-</p><p>Seth Benzell: I&#8217;m shaking his head.</p><p>Noah Smith: Well, China, has implemented censorship. but it&#8217;s implemented censorship along a narrow range of things. It&#8217;s, it&#8217;s basically told AI what it&#8217;s not allowed to talk about and put guardrails on it. We have guardrails on our AIs that tell it not to, like, do child porn or something, right? or not to tell you how to make a bio weapon. We have guardrails, and that&#8217;s the kind of guardrails that China&#8217;s put on there that says, &#8220;Don&#8217;t talk about Tiananmen Square.&#8221; They didn&#8217;t retrain the whole thing to not know that Tiananmen happened, all right? They didn&#8217;t do that.</p><p>Andrey Fradkin: So to be clear-</p><p>Noah Smith: They trained it. They, they filtered their models from models that know all about Tiananmen and then told it, &#8220;Don&#8217;t talk about Tiananmen.&#8221;</p><p>Andrey Fradkin: So I was gonna disagree with you about xAI-</p><p>Noah Smith: I do.</p><p>Andrey Fradkin: I actually think it&#8217;s the opposite. I think companies want an AI that&#8217;s very predictable, and is not gonna offend anyone if they&#8217;re gonna, like, implement it in corporate settings like a chatbot or so on. And so having, xAI, part of the problem is that it just says stuff you would never want your customers to hear. so that&#8217;s kind of my take on one of the reasons that it&#8217;s failed. I mean, it is, it is like a little bit worse than the other models at the moment, but, substantially cheaper. But at the same time, it just says stuff that you&#8217;d never want the customer to see.</p><p>[00:50:00]</p><p>Seth Benzell: Too uncensored-</p><p>Andrey Fradkin: Yeah.</p><p>Seth Benzell: Rather than too censored.</p><p>Andrey Fradkin: Exactly.</p><p>Noah Smith: Right.</p><p>Seth Benzell: It can be I guess you can have both problems.</p><p>Andrey Fradkin: Yeah, it&#8217;s true. Yeah.</p><p>Seth Benzell: You can be both uncensored in one way and censored in another way.</p><p>Andrey Fradkin: Yeah. All right, so now, I-- we&#8217;re, we&#8217;re gonna do a little brief little, exercise. We&#8217;re gonna give you a few thinkers and just gonna get, a take on them. the first one we wanted to start is, Daron Acemoglu, and particularly hi- his book, Power and Progress. you had a lot to say about that.</p><p>Noah Smith: Yeah, I really, I really did not like it. I thought-- I think Acemoglu is ob- obviously a brilliant guy one of the most brilliant people in the field of economics, with a deep and intuitive understanding of how to make economic models and do the research,. But he&#8217;s, I think, kind of wasting his powers on some of these progressive ideas, pseudo-progressive. It&#8217;s not, it&#8217;s not like he&#8217;s just taking whatever he&#8217;s saying from like like congressional Democrats. It&#8217;s, it&#8217;s, it&#8217;s more bespoke.</p><p>Seth Benzell: Back in.</p><p>Noah Smith: It&#8217;s, it&#8217;s more he&#8217;s, he&#8217;s wasting a lot of his, his intellect on some of this stuff, and you could see it with his paper about AI productivity, right?</p><p>Seth Benzell: Yes, the one on the QJE. We&#8217;re gonna do that, on the, on the pod soon.</p><p>Noah Smith: Right. It was-</p><p>Seth Benzell: It&#8217;s a really fascinating galaxy brain day.</p><p>Noah Smith: Yeah, because so he says, &#8220;AI&#8217;s gonna take all the jobs, but it&#8217;s not gonna boost productivity,&#8221; and he actually simply discounts or turns off or sets to, or sets to zero the parameter, the, the parts of the thing that could increase productivity. So no capital productivity increase-</p><p>Seth Benzell: Mm-hmm.</p><p>Noah Smith: No new tasks. And he gives the most-</p><p>Andrey Fradkin: Right</p><p>Noah Smith: Hand-wavy, lame, &#8220;I just read five minutes on Reddit&#8221; kind of explanations for why he turned those parts of his model, his own model, off. So obviously, he&#8217;s brilliant. He&#8217;s smart enough to make the model in the first place and then committed to silliness enough to turn off pieces of it willfully with no good reason.</p><p>Seth Benzell: Is it- does getting a Nobel Prize make your takes worse?</p><p>Noah Smith: I don&#8217;t know, because he did a lot of this before he won the Nobel. So-</p><p>Seth Benzell: Yeah</p><p>Noah Smith: In this case, that&#8217;s a bit immaterial to the question at hand. But does getting a Nobel Prize make your takes worse? Well, probably so. Like with Stiglitz, it certainly did. Like, Stiglitz has, is really gone off the rails in a big way, but Acemoglu has wasted so much of his intellectual capital in the last few years on this sort of teleological quest to prove that the, that the rich men who create AI are bad and shouldn&#8217;t get money. That-</p><p>Seth Benzell: The Yep.</p><p>Noah Smith: He&#8217;s, he&#8217;s wasted a lot of chance to think m- more seriously about what AI really does.</p><p>Seth Benzell: And what&#8217;s more, he&#8217;s taking Pascual Restrepo, another amazing thinker, away from doing this important work, so he can read the, these other papers.</p><p>Andrey Fradkin: Pascual has agency, Seth.</p><p>Seth Benzell: P- I don&#8217;t know. I mean, he does, but I mean, when the Nobel laureate knocks on your door, it&#8217;s hard to not say no.</p><p>Noah Smith: Hard to say no. But, but basically, Power and Progress was very bad. In fact, it was fractally bad. Like I read the whole thing very thoroughly, and the overall thesis was bad, but then the individual like chapter points used to support it were almost entirely bad. And then when you looked at each of those, the specific points, they- the subpoints they make and the pieces of data they used to support those were also bad.</p><p>Seth Benzell: Well, give us one egregious example before we move on.</p><p>Noah Smith: I would say I wrote seventy percent of my problems with this book in this, like, seven thousand-word review or whatever, a ten thousand-word review, I don&#8217;t remember. But then, like, he says, &#8220;All right,&#8221; they&#8217;re, they&#8217;re, they&#8217;re trying to, give examples of new inventions that brought nothing like shared prosperity. All right? They say, &#8220;Here are some inventions that brought nothing like shared prosperity.&#8221;</p><p>Seth Benzell: I love that ideal. It&#8217;s like, did a list of things that did not bring around utopia.</p><p>Noah Smith: Right.</p><p>Seth Benzell: Ham sandwich-</p><p>Noah Smith: But do you wanna hear-</p><p>Seth Benzell: Cups.</p><p>Noah Smith: Do you wanna hear the first example on their list? Oh, no, I&#8217;m sorry. It&#8217;s the fifth item on their list. They said: At the end of the 19th century, German chemist Fritz Haber developed artificial fertilisers that boosted agricultural yields.</p><p>Seth Benzell: Right.</p><p>Noah Smith: Subsequently, Haber and other scientists used the same ideas to design chemical weapons that killed-</p><p>Seth Benzell: Oh, my God!</p><p>Noah Smith: Hundreds of thousands on World War I.</p><p>Seth Benzell: Oh, my God.</p><p>Andrey Fradkin: Oh, no.</p><p>Seth Benzell: There we go. The guy who fed the universe also did something bad, so feeding the universe is bad. There you go.</p><p>Noah Smith: Like, you made a minor weapon that no one really uses, that killed a very tiny percentage of the po- of the casualties in one very large war, and then was essentially never used again except by, like, Saddam Hussein for, like, five seconds. But like And that was e- not even the same weapon. But like, essentially, you had a thing that saved the world, that also one person tried&#8212; like, a couple people tried and failed to use as a weapon. and therefore this brought nothing like shared prosperity. Like, yes-</p><p>Speaker 3: Therefore, progress is impossible.</p><p>Noah Smith: That&#8217;s so stupid. It doesn&#8217;t matter how smart you are, there&#8217;s no excuse for writing that.</p><p>[00:55:00]</p><p>Andrey Fradkin: That&#8217;s true.</p><p>Noah Smith: You cannot be smart enough to be allowed to write that and get away with it. There is no pass for that.</p><p>Speaker 3: I think he- It&#8217;s, well, the pass is a Nobel Prize, I think.</p><p>Andrey Fradkin: No, he wrote it before he got the Nobel Prize.</p><p>Speaker 3: Oh, there you go.</p><p>Andrey Fradkin: I mean-</p><p>Speaker 3: There you go. No excuses.</p><p>Andrey Fradkin: To me, it&#8217;s also upsetting because it makes our profession look bad. I mean, there are lots of people who make our profession look bad, but, people read this book, it&#8217;s in, like, prominently displayed in the bookstore, and it&#8217;s bullshit,?</p><p>Noah Smith: Yeah.</p><p>Andrey Fradkin: Yeah.</p><p>Speaker 3: All right, let&#8217;s give you another name.</p><p>Noah Smith: I have many other, I have many other examples as well.</p><p>Speaker 3: No, I want one more spicy.</p><p>Noah Smith: Okay, go for it. Go for it.</p><p>Speaker 3: They&#8217;re just so fun, Andre.</p><p>Noah Smith: They&#8217;re pretty fun.</p><p>Speaker 3: This is my favorite subject. Give me one more Give me o- give us one more.</p><p>Noah Smith: He said Henry Ford was a pioneer in developing a more cooperative relationship with his workforce. But also-</p><p>Andrey Fradkin: Henry Ford had union people shot on a bridge by the mafia! Henry Ford gunned down the union.</p><p>Speaker 3: [chuckles]</p><p>Noah Smith: Like, have you read anything about history? Like, there&#8217;s no excuse-</p><p>Speaker 3: Yeah</p><p>Noah Smith: To write this. Like, yes, Henry Ford raised efficiency wages and then shot the union people. W- and then you spend this whole time talking about how, like, we need to strengthen unions because just like Henry Ford You don&#8217;t know shit! Like, stop. Henry Ford gunned down union organizers.</p><p>Speaker 3: Incredible.</p><p>Andrey Fradkin: Well, the thing is-</p><p>Speaker 3: Okay</p><p>Andrey Fradkin: I don&#8217;t even believe he doesn&#8217;t know that. I kinda think that he probably knows those facts, and he just decided not to put them in. That&#8217;s, that&#8217;s, that&#8217;s what blows my mind.</p><p>Noah Smith: What else this book doesn&#8217;t have? Like, citations.</p><p>Speaker 3: What?</p><p>Noah Smith: Nothing in the book is cited. Instead, they do, like, a narrative bibliography where they just sort of generally describe all the stuff they&#8217;re citing from, but don&#8217;t-</p><p>Speaker 3: Here&#8217;s a bunch of books we like</p><p>Noah Smith: Individual claims to individual papers.</p><p>Speaker 3: Incredible.</p><p>Andrey Fradkin: Yeah.</p><p>Speaker 3: Incredible.</p><p>Noah Smith: How do you get away with that? Like, they just make these claims and don&#8217;t have a, a And then when they define power, they define, like: what&#8217;s power? They define-</p><p>Speaker 3: What is power?</p><p>Noah Smith: Power as the ability to persuade people that you&#8217;re right.</p><p>Speaker 3: That&#8217;s power?</p><p>Noah Smith: And then they say, &#8220;Why do-- How do, how did all these tech bros persuade people that they&#8217;re right?&#8221; Well, maybe just luck.</p><p>Speaker 3: There you go.</p><p>Noah Smith: So power is luckily having to ha- having an appealing argument.</p><p>Speaker 3: Get it.</p><p>Andrey Fradkin: What?</p><p>Speaker 3: Power is when you&#8217;re persuasive-</p><p>Noah Smith: That&#8217;s not-</p><p>Speaker 3: &#8216;cause you&#8217;re right.</p><p>Noah Smith: No one should think that that&#8217;s a reasonable definition of power. I&#8217;m sorry, but you&#8217;re just being silly. That is, that is silly.</p><p>Speaker 3: Incredible.</p><p>Noah Smith: It says- and they say: &#8220;Power is about the ability of an individual group to achieve explicit or implicit objectives. If two people want the same loaf of bread, power determines who will get it.&#8221;</p><p>Speaker 3: Okay, split.</p><p>Noah Smith: And I said, &#8220;Using this definition, how could we ever conclude that power wasn&#8217;t the reason for an observed outcome?&#8221;</p><p>Speaker 3: Power is what splits any pie.</p><p>Noah Smith: Like-</p><p>Speaker 3: When the pie gets split, that&#8217;s power</p><p>Noah Smith: Power equals outcomes. It&#8217;s like power determines outcomes. Power is defined as outcomes. That&#8217;s a useless intellectual exercise, but, like, that&#8217;s typical of the reasoning within this book.</p><p>Speaker 3: Incredible.</p><p>Noah Smith: It is a pure expression of animus against the tech bro class. And maybe the tech bro class sucks, but, like, making up, like fake history and dodgy economics to conclude that the tech bros suck, in which you recommend a whole- a policy regime that will never, ever happen, of like panels of economists who get to decide which technologies get invented based on anticipation of whether they&#8217;d be complementary or substituting to labor, is silly. The whole thing is silly! Why is the most brilliant economist in the world wasting his mind on this? You&#8217;ve got better things to do, and you&#8217;re taking yourself out of the game, and that&#8217;s what I think.</p><p>Speaker 3: There we go. Tell us what you really think, Noah.</p><p>Noah Smith: Boom.</p><p>Speaker 3: All right.</p><p>Andrey Fradkin: Well, let&#8217;s go in the, in the other direction.</p><p>Speaker 3: Give me a positive name.</p><p>Andrey Fradkin: What do you think of, Scott Sumner?</p><p>Noah Smith: Scott Sumner. I like Scott Sumner. Scott Sumner, is He thinks outside the box. He think, he does not- he&#8217;s not susceptible to groupthink. He thinks for himself. He&#8217;s widely read and thinks deeply about things. he- yes, he&#8217;s, he&#8217;s an independent thinker, who has made real original contributions to thought, going outside the traditional academic, channels.</p><p>Andrey Fradkin: Do-</p><p>Noah Smith: Yes.</p><p>Andrey Fradkin: Nominal GDP targeting, do you have a, do you have any thoughts on that?</p><p>Noah Smith: I don&#8217;t think it&#8217;s gonna be any different in practice from flexible inflation targeting, and I think that there&#8217;s good theoretical work as to this effect. Saying, like, you don&#8217;t really- there&#8217;s no, there&#8217;s no value added for NGDP targeting. some of the more programmatic market-based ideas that he&#8217;s toyed with, like, a like NGDP futures market, like, that wouldn&#8217;t help. essentially, well, it&#8217;s just not I mean, like, you&#8217;re not, un- unless you- you&#8217;re not gonna get more information from there. Like, you&#8217;d have to, you&#8217;d have to have, like, the Fed with all its proprietary information trade, and then they&#8217;re doing, like, insider trading in their own market, so the market&#8217;s gonna break down. It&#8217;s, it&#8217;s a, it&#8217;s a bad idea, but it&#8217;s, it&#8217;s worth toying with. It&#8217;s worth thinking about. It&#8217;s interesting. he&#8217;s very good at, like, critiquing things that obviously need to be critiqued, where he&#8217;s just like: &#8220;Look, this is bullshit.&#8221; I was good at that too, and I got, like, ten times or a hundred times the readership or whatever as him, and that was unfair, and that&#8217;s a mark of how unfair and randomized and lucky the kind of market for econ blogs is.</p><p>[01:00:00]</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: And how lucky I was.</p><p>Speaker 3: Right, you&#8217;ll have to wish us some luck.</p><p>Noah Smith: But, he deserved to get more attention than he did on some of those things. Scott also- he studied under Robert Lucas during the, that sort of era in, at Chicago, and he, and he learned a style of argumentation that doesn&#8217;t translate outside that narrow culture. it was a gunslinger style of argumentation. it was, and you, and you recognize people who have this. It goes back all it goes back to, like, Stigler. You could see Stigler doing this. But, like, the University of Chicago developed this debate style, where basically you tell people, like &#8220;You&#8217;re full of shit. Here&#8217;s why.&#8221; And it&#8217;s a very aggressive style, that I think turns some people off outside that world, where you&#8217;re always sort of like i-i- it&#8217;s a hyper-defensive style, where you watch for any sign of, like, criticism of your ideas and then aggressively attack the- all the ideas of whoever criticizes one of your ideas. And Robert Lucas does this, and, like, this whole gang did this, and they used this And this was the strategy of, like, the Chicago people to sort of, like, be the underdog and win some of these intellectual battles against the MIT and Harvard guys, who had a lot more people on their side and a lot more pedigree. So it was, like, this sort of up-and-coming bad boy style,? But, like, it doesn&#8217;t, it doesn&#8217;t translate out of those debates. And so I think that Scott learned to be a little more aggressive and aggrieved, or at least act a little more aggressive and aggrieved than he needed to be to persuade some people. and I sort of got it. I was like: Okay, he just he got this from having to hang around Bob Lucas all the time.</p><p>Andrey Fradkin: [chuckles]</p><p>Noah Smith: But, like, most people won&#8217;t know that or know what that means.</p><p>Andrey Fradkin: All right, next name. This one, is popular in certain crowds. I&#8217;m curious what you think. Michael Pettis.</p><p>Noah Smith: Michael Pettis, interesting guy. he&#8217;s incredibly influential. Like, his idea, his, his analysis, his framework for analysis is non-predictive. He doesn&#8217;t Like, you cannot take these sort of, like, sectoral balances theories about, like, &#8220;Oh, and then consumption does this, and investment does this, and blah, blah,&#8221; and you can&#8217;t make any predictions about them. I mean, people have been trying to do that since the &#8216;30s maybe. Who were the first, like, Oh, who&#8217;s the guy who built the, like, little hydraulic economy thing?</p><p>Andrey Fradkin: Oh, yeah.</p><p>Noah Smith: Who is that guy?</p><p>Andrey Fradkin: Spicy. I don&#8217;t remember.</p><p>Noah Smith: Anyway-</p><p>Andrey Fradkin: Go back to the physiocrats-</p><p>Noah Smith: It&#8217;s, it&#8217;s that, right?</p><p>Andrey Fradkin: 1700s.</p><p>Noah Smith: It&#8217;s, it&#8217;s like I&#8217;m- it&#8217;s like I&#8217;m gonna take the economy, I&#8217;m gonna definitionally divide it into these different activities, and then I&#8217;m gonna assume these activities sort of move autonomously on their own and are sort of primitives. I&#8217;m gonna assume my accounting definitions are primitives, and I&#8217;m gonna observe things that happen and make big pronouncements about them based on that. But it&#8217;s not predictive. Like, you&#8217;ve seen Pettis, like, make some predictions, and then they go wrong, and he&#8217;s like, &#8220;Ah, but it&#8217;s because of this other thing.&#8221; So you can&#8217;t really use sectoral balances. But everyone in China, all the guys who are the top economists in China advising Xi Jinping, advising the top CCP guys, are doing the same thing as he is, and all the, like, private sector economists, like Goldman Sachs and whoever, are doing those things. And it&#8217;s really the fault of It is due to the failure of structural models of international finance and growth, I suppose. But due to the lack of explanatory power of those to explain things in terms of things like taste and technology, we can&#8217;t explain any of that shit in terms of taste and technology. Like, nothing has any forecasting power, nothing like we don&#8217;t know if-</p><p>Andrey Fradkin: Well, wait, I&#8217;m gonna push back on that.</p><p>Noah Smith: Yeah.</p><p>Andrey Fradkin: Here&#8217;s a very basic thing that has explanatory power: the relative price of labour in labour-intensive industries. Doesn&#8217;t ha- that have an enormous amount of explanatory power for where, low-skilled labour manufacturing is done, for example?</p><p>Noah Smith: Yeah, I think that&#8217;s true. Yeah. but then- but also, like A- and you can get, like, micro models that will get at that, like a Roy model is, like, all right. Like, that&#8217;s got pretty good out-of-sample predictive power for stuff, right? And, but like, Heckscher-Ohlin has terrible predictive power for, like, trade patterns, right?</p><p>[01:05:00]</p><p>Andrey Fradkin: Mm-hmm.</p><p>Noah Smith: Like, it&#8217;s not very good. Like, it&#8217;s okay. Like, sometimes you s- you see stuff that&#8217;s consistent with it, but then you see a lot of stuff that&#8217;s not consistent with it, &#8216;cause there&#8217;s a lot of other stuff going on. And so when those models don&#8217;t really help you that much, they&#8217;re like heuristics. It opens up a rhetorical space for guys like Pettis or guys like, Jan Hatzius, who does this all day long. He does the same stuff as Pettis. All the private sector guys, all the guys working for hedge funds are doing the same stuff as Pettis. All the guys working for investment banks are doing the same stuff as Pettis, and all the guys working for the CCP are doing the same stuff as Pettis. None of these people believe you can get a microfounded model based on taste and technology that&#8217;ll tell you about these- what the effects of these macro policies. Nobody believes that, and so, like, that&#8217;s, that&#8217;s almost exclusively like a Western academia and central banks type of thing. Like, it&#8217;s a But because of that, Michael Pettis has been enormously influential while not having a model that has predictive power. But it&#8217;s not like other models do have that much predictive power, and they&#8217;re harder for people to understand and make conclusions on. So it&#8217;s- I would say that, in a influential policy stance, he&#8217;s, he&#8217;s beating people with quote-unquote, &#8220;structural models&#8221; based on notions of taste and technology. he&#8217;s, he&#8217;s, he&#8217;s beating those in terms of influence, and he&#8217;s not really losing to them by that much in terms of predictive power. Maybe by a tiny bit. &#8216;cause-</p><p>Andrey Fradkin: But, he&#8217;s losing to them in terms of coherence, which I at least value, but I understand-</p><p>Noah Smith: Okay. Oh, well, yeah, he&#8217;s losing, he&#8217;s losing the Andre, vote. it&#8217;s like-</p><p>Andrey Fradkin: N-</p><p>Noah Smith: Like, yes, he is, and he gets- people in academia will laugh at him, but, like, so what?</p><p>Andrey Fradkin: No, I- look- Well, my theory is that he actually- there&#8217;s a deep-seated desire to explain what&#8217;s going on in the world through some nefarious action that China is taking. And when the null hypothesis is just that they have a comparative advantage in manufacturing, and like, there w- even if they were doing whatever policies they were doing, the manufacturing would not be happening in the US. It wasn&#8217;t like US or China, the only two places to manufacture. [chuckles] but that&#8217;s just my psychoanalytic perspective on it.</p><p>Noah Smith: Got it. Yeah. No, I think you&#8217;re, you&#8217;re probably right. Like, the- it all comes down to, like, people need to feel like they know stuff. People need to feel like they understand stuff, can control stuff, can predict stuff. It&#8217;s, it&#8217;s But yet, that&#8217;s the same reason that makes people believe so strongly in macroeconomic models with no out-of-sample forecasting or predictive power that we can detect. Like taste in technology ultimately boils down to, like, sounds legit, right? We don&#8217;t have any evidence that, like, taste in technology microfounded in this sort of, like, Sergeant Prescott way, has any ability to describe anything usefully. We have no, we have no indication that And that, we can, we can debate that, but anyway. But like, but people love it-</p><p>Speaker 3: Fair enough</p><p>Noah Smith: Because it sounds legit, and like-</p><p>Speaker 3: Well, and it&#8217;s coherent.</p><p>Noah Smith: It&#8217;s, it&#8217;s coherent.</p><p>Speaker 3: Right, as Andre pointed out.</p><p>Noah Smith: But then the thing is that-</p><p>Speaker 3: Right</p><p>Noah Smith: Pettis&#8217; stuff-</p><p>Speaker 3: It&#8217;s disciplined</p><p>Noah Smith: Pettis&#8217; stuff sounds legit to people. It&#8217;s like, oh, investment does this, consumption does that. It&#8217;s coherent in the sense that the accounting relationships are definitional. Okay, it&#8217;s like accounting relationships can&#8217;t predict real economic stuff, fine, but like, it&#8217;s coherent in the sense that the accounting works. C plus I plus G, bro. It&#8217;s like, the accounting works.</p><p>Speaker 3: [chuckles]</p><p>Noah Smith: And so like you- it&#8217;s, and it sounds legit to people, and it&#8217;s comprehensible to people, and at some point, that gives them this feeling of like, &#8220;Oh, I understand this thing.&#8221; And I would argue that a lot of macro is a fancier version of, &#8220;Oh, I understand this thing,&#8221; when really, you don&#8217;t know if you understand it yet at all.</p><p>Speaker 3: Or maybe you play out one causal mechanism that might have small explanatory- it explains 1% of the picture.</p><p>Noah Smith: Exactly. Exactly.</p><p>Andrey Fradkin: Yeah.</p><p>Speaker 3: Yeah. Adam Tooze.</p><p>Noah Smith: Adam Tooze did some economic history that I really love. Like, I love a lot of his books. I love The Deluge, I love Wages of Destruction. Very good, like, economic military history. But at some point, he pivoted to- he pivoted very hard to, like, sort of like self-promoting clickbait, including like, &#8220;Wow, China will take over the world,&#8221;? Like, and he pivoted to that, and that stuff is, has made a lot of people go like: &#8220;I guess Adam Tooze wasn&#8217;t that smart,&#8221; which is not necessarily the right conclusion. It may mean that Adam Tooze wanted attention. It may mean that Adam Tooze wanted some money. It may mean that Adam Tooze was being paid by a foreign state actor to disseminate certain ideas, although I would not make any such allegation. I&#8217;m just-</p><p>[01:10:00]</p><p>Speaker 3: Fair enough</p><p>Noah Smith: Covering the whole space of reasons why Adam Tooze might have made this pivot. I think it&#8217;s probably just attention, but -</p><p>Andrey Fradkin: Maybe he just got bored. I think boredom-</p><p>Noah Smith: Maybe he just-</p><p>Andrey Fradkin: Is an underrated</p><p>Noah Smith: Bored. And what?</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: That&#8217;s fine. Like, his Substack is basically just like, it&#8217;s chart book. It&#8217;s, it&#8217;s let me just paste a bunch of charts, and then, like, say the most obvious things about them that were already said in the source articles. Okay, fine. People value it.</p><p>Andrey Fradkin: [chuckles]</p><p>Noah Smith: People like it. like, it doesn&#8217;t have a lot of analysis, and I haven&#8217;t seen Tooze give a lot of analysis. I liked him as an economic historian, or as a- not even economic historian, just as a historian. Like, I liked his, I liked his books-</p><p>Speaker 3: Well-</p><p>Noah Smith: That was pretty cool stuff. His- I haven&#8217;t, I haven&#8217;t read his blog now in a while. The polycrisis thing was just goofy. And so like, I think Adam Tooze made himself slightly more popular and less relevant, with his pivot, after the pandemic.</p><p>Andrey Fradkin: So we were gonna ask you about Paul Krugman, and we already-</p><p>Noah Smith: Yeah</p><p>Andrey Fradkin: Talked a little bit about-</p><p>Speaker 3: Oh, we already got your take.</p><p>Noah Smith: Yeah, Paul Krugman.</p><p>Speaker 3: Yeah.</p><p>Noah Smith: Paul Krugman&#8217;s great. politics-wise, Paul Krugman does not understand how much America has rejected core elements of the progressive ideology and what Democrats will have to do to, deal with that. Economics-wise, he has been the most intellectually honest, guy. Very rarely, very rarely will I catch him, like, claiming like, &#8220;I always said this,&#8221; and then actually claim something different, and when I do, it&#8217;s, like, only a slight difference in tone. Like, he&#8217;s extremely- he he did warn about the possibility of inflation from Biden&#8217;s stimu- stimulus or Biden&#8217;s, like, ARP bill, right? He did talk about that. He he&#8217;s admitted when he got predictions wrong, which everyone does. he&#8217;s just so intellectually honest, and he&#8217;s still so good at explaining complex concepts seriously. He&#8217;s still like, he&#8217;s the real deal, and he&#8217;s still, he&#8217;s still good, and I think the fact that people are a bit fed up with, like, 2010s era like, resistant Boomer lib resistance politics can obscure the fact that he&#8217;s still, like, the very best writer on economics.</p><p>Andrey Fradkin: Strong endorsement. Awesome. okay, we&#8217;re, we&#8217;re almost done, we promise. the next topic is elite overproduction. [chuckles] So maybe you wanna introduce that topic first, and then maybe we can ask you some questions about it.</p><p>Noah Smith: Right. So Peter Turchin came up with this idea of elite overproduction. He&#8217;s a historian who claims that history follows these long cycles. Like all long cycle theories, it&#8217;s, it&#8217;s unprovable, but he did-</p><p>Speaker 3: Yes!</p><p>Noah Smith: Obviously, it&#8217;s unprovable, right? Like, throughout the waves. It&#8217;s, I don&#8217;t know. Anyway</p><p>Speaker 3: It&#8217;s happened five times within one series. [chuckles] Sure.</p><p>Noah Smith: - anyway, [chuckles] yeah, so like he has this unprovable long cycle theory, and he- and it did make a really good out-of-sample prediction about the peak of unrest coming in twenty twenty. What did he know? I don&#8217;t know. Anyway-</p><p>Andrey Fradkin: -huh.</p><p>Noah Smith: He came up with this idea-</p><p>Andrey Fradkin: He knows.</p><p>Noah Smith: Called elite overproduction. And he had very specific ideas about what that meant and what it didn&#8217;t mean. I ignored those ideas, stole the phrase, and used it to mean something more general that got more attention than his.</p><p>Seth Benzell: And you didn&#8217;t con- c- you didn&#8217;t corrupt it with a long wave theory-</p><p>Noah Smith: No.</p><p>Seth Benzell: So you did even better.</p><p>Noah Smith: I was just like, &#8220; what? This phrase is good. I&#8217;m gonna credit him, and then I&#8217;m gonna have it mean something else that I just decide.&#8221; And honestly, my like, more general definition is probably better than his like, much more specific one. He just loves making things specific so he can make these, like, very tight quantitative predictions.</p><p>Andrey Fradkin: [chuckles]</p><p>Noah Smith: More power to him. I love the guy, but, but I was just like: I&#8217;m taking that. I like that phrase. Mine now.</p><p>Andrey Fradkin: So what is your Yeah, what is your general-</p><p>Seth Benzell: What does it mean to you?</p><p>Andrey Fradkin: Definition?</p><p>Noah Smith: Should&#8217;ve copyrighted it.</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: I was like, So I basically used it to mean kind of the revolution of rising expectations among the professional managerial class. So you got a bunch of people who expected, like: &#8220;I&#8217;m gonna go to college and things are just gonna work out for me. I&#8217;ll be, I&#8217;ll be upper middle class. Oh, wait, it&#8217;s hard. There&#8217;s competition. I have to study. I have to be smart. I have to actually know some math. I can&#8217;t just, like, go get a random sociology undergrad degree and be rewarded with, like, some high-paying job like my parents had.&#8221; Like, and so a lot of, a lot of this disappointment, and I think for a while, the sort of general the, the productivity boom of the nineties and early two thousands, people-- like, people rode that. A lot of the PMC, a lot of my class, social class, rode that boom, and then it made it seem like everybody Like, you could just be a sos- sociology major and, like, not really do any hard work and then just, like, get a good job and, like, live a lifestyle similar to that of your parents. And then, and then the Great Recession came, and then things flattened out. Like, a lot of opportunity dried up for those people, and you could, Then you had to sort of, like, learn to code. I&#8217;m not sure that works now.</p><p>[01:15:00]</p><p>Seth Benzell: You could-- it still works to mock people. I-</p><p>Noah Smith: Yeah.</p><p>Seth Benzell: You can still say it to people.</p><p>Andrey Fradkin: All those non-technical people.</p><p>Noah Smith: Yeah. Anyway, so but then, then I think, like, that sort of abrupt downward revision of growth expectations pissed off a lot of people and led to some of the It- I don&#8217;t think it was the main cause of the social unrest that we saw in the twenty tens, but I think it was a contributor. I think that you had, you had just like a lot of, a lot of people who fucked around in college, came from privileged backgrounds and then, and were absolutely consumed by hate for the tech bro class, who went to the same colleges, came from the same backgrounds, and made a thousand times more money. And I think that you saw a lot of that sort of internal, like, within class resentment, not between class resentment, but sort of within socioeconomic background resentment. A lot of that, I think, contributed to some of the, like, more like elite leftists, like Bernie Sanders or kind of stuff, or maybe some of the new antitrust movement or things like that, were motivated or had some popular support by people who their parents were like lawyers, doctors, businesspeople, well-to-do kind of people. And then they kinda messed around in college and weren&#8217;t very technical and, like, ended up getting, like, perfectly fine middle-class jobs, but being, like, somewhat downwardly mobile, and also having a much stronger preference to live in expensive cities, therefore draining their money, not wanting to go out to the &#8216;burbs like their parents did.</p><p>Seth Benzell: Right.</p><p>Noah Smith: And so, like Yeah.</p><p>Seth Benzell: Is some of the resentment that the people who end up succeeding have worse taste than me? It&#8217;s like, I like high literature and they like Marvel movies, but the Marvel movie lovers won.</p><p>Noah Smith: I think that, that those kind of reasons can be invented as needed. If the real reason for resentment is like: &#8220;I should be in the same class as you. I went to the same college as you, and yet you&#8217;re making so much more money, and we used to live on the same dorm floor.&#8221; Like, if that&#8217;s the real reason, then you can make up ideas about taste or repurpose ideas about You can get ideas as necessary to resent whoever you want to resent.</p><p>Andrey Fradkin: Well, to be clear, it&#8217;s not like these people were in the same social circles even in college often, right? So it&#8217;s an interesting theory that, like, that resentment has caused ex- In college, did they They didn&#8217;t hang out with each other, but maybe they still thought they were gonna do equally well. Is that, is that kind of the theory?</p><p>Noah Smith: I think so, yeah. from my-- I did actually go to college with some of those people. Like, I was in Gary Tan&#8217;s study group. He&#8217;s still a friend of mine.</p><p>Andrey Fradkin: Nice.</p><p>Noah Smith: Although I did quit I quit Gary Tan&#8217;s study group because, I thought that studying on my own would make me better. So sorry, Gary. I just-- and I was right. I, I did well on the test, but-</p><p>Andrey Fradkin: Well, to be clear, you&#8217;re still doing very well, right? I don&#8217;t think you&#8217;re the resentment class. Yeah, so-</p><p>Noah Smith: No, no.</p><p>Andrey Fradkin: -</p><p>Noah Smith: No, but I&#8217;m, I&#8217;m-</p><p>Seth Benzell: Wait, so to what extent is-</p><p>Noah Smith: Succeeded to the extent of Gary Tan.</p><p>Seth Benzell: Is it- to what extent is this about just the relative between the two groups versus the absolute? Kind of you started with sort of an absolute story about it&#8217;s harder to live a middle-class lifestyle, and now you&#8217;ve moved to kind of a relative story about this subgroup did better than that subgroup.</p><p>Noah Smith: I wouldn&#8217;t say-</p><p>Seth Benzell: So are they both important?</p><p>Noah Smith: Harder to live a middle-class lifestyle is exactly what I described. I would say it&#8217;s instead the expectations of how good your life would get or the, you-- people expected this glide path, and then it flattened out. That&#8217;s an absolute story. Whereas the relative-</p><p>Seth Benzell: Right</p><p>Noah Smith: Story of like: I&#8217;m not as, I&#8217;m not as do- doing as well as the tech bro class. I don&#8217;t think these are independent. I think those are two different stories, but they&#8217;re not independent at all. &#8216;cause if I, if my, if my future path leveled out and flattened out, but other people&#8217;s didn&#8217;t, and they stayed on the escalator, that escalator I expected for myself evaporated for me and continued for them-</p><p>Seth Benzell: They stole my escalator!</p><p>Noah Smith: They stole my escalator.</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: Who stole my escalator?</p><p>Andrey Fradkin: Yeah.</p><p>Noah Smith: Yeah, so. And so like-</p><p>Andrey Fradkin: That&#8217;s a great meme. [chuckles]</p><p>Noah Smith: Yeah. And so like, anyway, so I think that that was like a contributor to unrest, but I don&#8217;t think that was the big story. I think the big story was social media, blah, blah. But I throwing everybody in the same room as each other and letting them fight it out, I think that was a bad idea.</p><p>Andrey Fradkin: So what about the housing theory-</p><p>Seth Benzell: Can we just- can we lower, should we-</p><p>[01:20:00]</p><p>Andrey Fradkin: What about the housing theory of everything-</p><p>Noah Smith: Go ahead</p><p>Andrey Fradkin: Right? &#8216;Cause, &#8216;cause I do think that s- housing is such a major contributor to this feeling that people aren&#8217;t equal.</p><p>Seth Benzell: If it was cheaper to-</p><p>Andrey Fradkin: Yeah</p><p>Seth Benzell: Live in Brooklyn, we would solve all social problems.</p><p>Andrey Fradkin: Not wrong.</p><p>Noah Smith: The housing theory of everything, it&#8217;s like cheap housing would be really good for everybody. I don&#8217;t, I don&#8217;t have any problem with people believing in it, but it&#8217;s not a theory of everything.</p><p>Seth Benzell: Directionally correct.</p><p>Noah Smith: Directionally correct. Directionally correct. It&#8217;s like, do that Winnie-the-Pooh meme where there&#8217;s, like, plain Winnie-the-Pooh and then tuxedo Winnie-the-Pooh?</p><p>Andrey Fradkin: Yeah.</p><p>Seth Benzell: Yeah.</p><p>Noah Smith: It&#8217;s like the plain Winnie-the-Pooh is, like, exaggerated. Tuxedo Winnie-the-Pooh is directionally correct.</p><p>Andrey Fradkin: [laughing] Seth, I think you have one more question.</p><p>Seth Benzell: Yes.</p><p>Andrey Fradkin: Yeah.</p><p>Seth Benzell: Well, I guess, yeah, this is partly tied into that and partly kind of riffing on this question of elite overproduction, which is, it seems like sort of, to the extent that we get this social, unrest from people being upset about not reaching their expectations, to what extent do we have, like, a social To what extent is it, like, an economically central issue to manage people&#8217;s expectations, right? To what extent are vibes versus real economic trends important for determining people&#8217;s welfare and how they feel about the world? and how does that affect how you think about policy making or writing?</p><p>Noah Smith: I think, you really hit on one of the central questions of economics because my advisor, Miles Kimball, spent a lot of his career thinking about this and never came up with really solid answers, I think. Because we have pretty good evidence that happiness, the self-reported emotion, is pretty strongly related to differences between reality and expectations. interestingly, that&#8217;s what the original-</p><p>Seth Benzell: I&#8217;ll say shocks are good</p><p>Noah Smith: It just means luck.</p><p>Andrey Fradkin: [chuckles]</p><p>Noah Smith: But, like, essentially-</p><p>Seth Benzell: Yeah</p><p>Noah Smith: If you do, if you do better-</p><p>Seth Benzell: Luck</p><p>Noah Smith: Than you thought you&#8217;d do, you&#8217;re happy, and if you do worse than you thought you&#8217;d do So, like, the best outcome would be if we could give everyone low expectations and high outcomes, if we could make everybody just delighted with how well they did.</p><p>Seth Benzell: Right.</p><p>Noah Smith: I feel like this experiment has been run, and it&#8217;s called Generation X. [chuckles] And, like, I don&#8217;t know, man.</p><p>Seth Benzell: Didn&#8217;t work. Massive failure.</p><p>Noah Smith: Like, I see a lot of those people, they&#8217;re like billionaires now. They&#8217;re like, &#8220;I&#8217;m such a failure.&#8221; Like, you&#8217;re a billionaire! &#8220;Like, I&#8217;m, I&#8217;m never gonna amount to anything. I&#8217;m just a billionaire living in this giant mansion. Hmm.&#8221;</p><p>Seth Benzell: Just a b- [chuckles] Jeff Bezos&#8217;s boat is so much bigger than mine.</p><p>Noah Smith: And, like, this is a direct, I Like, I blame Nirvana. I blame Kurt Cobain for all this,? [chuckles] I blame depress- I blame-</p><p>Seth Benzell: No one can understand their lyrics</p><p>Noah Smith: I blame depressing-ass Generation X-</p><p>Andrey Fradkin: No, no, this is a pro-grunge podcast. No slander allowed.</p><p>Noah Smith: I didn&#8217;t say I dislike grunge. I love grunge.</p><p>Seth Benzell: He blame them.</p><p>Noah Smith: And I also think it&#8217;s a weapon of mass destruction.</p><p>Seth Benzell: He respects their power.</p><p>Noah Smith: I respect their power. Like, there are days when I just wanna, like, listen to, like, some old Nirvana B-sides, and I just, like And then I just get so angry and bitter about the world, and I&#8217;m like, &#8220;Yeah.&#8221;</p><p>Seth Benzell: Put that in a blog post.</p><p>Noah Smith: Generation X, it what? I, I don&#8217;t really feel sorry at all for Generation X because I feel like their goals in life were simpler and easier. I meet Generation X guys, and their whole goal in life is, like, have sex.</p><p>Seth Benzell: Two ladies at the same time.</p><p>Noah Smith: Yeah, like-</p><p>Seth Benzell: I saw, I saw Office Space</p><p>Noah Smith: Their whole goal, like, Generation X guys, all they have to do is, like, get laid, and then they&#8217;re done. They win.</p><p>Seth Benzell: [chuckles]</p><p>Noah Smith: Victory victory condition, and then, like like, Zoomers don&#8217;t even want that.</p><p>Seth Benzell: Yeah, Zoomers want followers, dude.</p><p>Noah Smith: Zoomers are like-</p><p>Seth Benzell: Zoomers want-</p><p>Noah Smith: Why would I want to do that when I could looks max? Why would I-</p><p>Andrey Fradkin: [chuckles]</p><p>Noah Smith: Like, why would I do that when I could, when I could mog the moids in the club? [chuckles] You can There-</p><p>Seth Benzell: Right. Which means-</p><p>Noah Smith: And then Millennials just want, Millennials just want likes on Instagram, and Zoomers, I don&#8217;t even know what they want because-</p><p>Seth Benzell: No</p><p>Noah Smith: They&#8217;re already so-</p><p>Andrey Fradkin: I don&#8217;t think they know what they want.</p><p>Seth Benzell: The Zoomers are the-</p><p>Andrey Fradkin: That&#8217;s kind of the problem</p><p>Seth Benzell: The Zoomers are the ones obsessed with social media. We&#8217;re the- the Millennials are the idealists. We actually are saving the world from climate change and solving racial d- conflict. -</p><p>Noah Smith: We&#8217;re gonna solve racism, man.</p><p>Seth Benzell: We&#8217;re gonna solve racism and global warming. We did that in 2008, right?</p><p>Noah Smith: Yeah, we did. We did.</p><p>Andrey Fradkin: That&#8217;s true.</p><p>Noah Smith: We solved it. [chuckles]</p><p>Andrey Fradkin: We elected Barack Obama, and that was the end of history. [chuckles]</p><p>Noah Smith: Yeah, that was it. We did it, brother.</p><p>Seth Benzell: Yeah, the sea stopped rising. I remember that was in the speech.</p><p>Noah Smith: I don&#8217;t know. All I can promise the world is that it&#8217;s always gonna get weirder and weirder.</p><p>Andrey Fradkin: Then-</p><p>Noah Smith: But I&#8217;m-</p><p>Seth Benzell: So we need to make people who desire weirdness. That&#8217;s the economic solution.</p><p>Noah Smith: Yeah, so I&#8217;m So that&#8217;s good for me because I always loved to see the weirdest shit possible, right? I would always go to, like, the weirdest underground shows in Japan or, like listen to, like, the weirdest music. I just Like, I&#8217;m just, I love seeing that weirdness, and the universe continues to deliver it to me in copious amounts. And so now I&#8217;m interested to see what AI does with this planet because, honestly, like, like, humanity was kind of hitting a wall. I don&#8217;t know. I wrote this in a recent post, which was reprinted by the Free Press. guardians of our our freedom of information.</p><p>[01:25:00]</p><p>Andrey Fradkin: Well, I-</p><p>Noah Smith: And so, and the free press reprinted it, and they were like-</p><p>Andrey Fradkin: Behind a paywall, so it can&#8217;t be free. I&#8217;m confused by the free press. It&#8217;s the, -</p><p>Noah Smith: The- yes, conditionally free press. [chuckles]</p><p>Andrey Fradkin: Yes.</p><p>Noah Smith: The, the marginal cost zero press. But, but in this thing, I was like, look, obviously industrialization took fertility to below replacement levels, and then social media has taken fertility to, like, below, like immediate, to, like, immediate extinction levels, to, like, goodbye humanity. This is the last generation, goodbye, kind of levels, right? Plus, ideas were getting harder to find. like, okay, Bloom is right, and Venuren and Webb and whoel- who else was on that paper? Those guys.</p><p>Seth Benzell: There&#8217;s one more, but those were the good ones.</p><p>Noah Smith: There&#8217;s one more! Wait, Bloom, Venuren, Webb, and there&#8217;s one other person, and I apologize to whoever else is on that paper for not saying your name. But anyway-</p><p>Seth Benzell: They got a zillion citations, dude.</p><p>Noah Smith: That paper was right. We were hitting the wall. We were just like, all the smartest people had already been assigned to research-</p><p>Andrey Fradkin: Chad Jones. Chad Jones. How could we forget?</p><p>Seth Benzell: Chad Jones, Chad Jones.</p><p>Noah Smith: Our friend of the show.</p><p>Andrey Fradkin: Friend of the show.</p><p>Noah Smith: The Chad himself.</p><p>Andrey Fradkin: The Chad of growth theory.</p><p>Seth Benzell: Yes, exactly.</p><p>Noah Smith: The Chad. Dream guest of the show.</p><p>Seth Benzell: You can&#8217;t say the Jones because there&#8217;s so many Joneses. [chuckles]</p><p>Noah Smith: Oh, you can&#8217;t. Although the Chad could also be Chad Syverson, Chad of productivity measurement.</p><p>Andrey Fradkin: Ooh, that&#8217;s true.</p><p>Noah Smith: They&#8217;re both the Chad. All right. But anyway, I guess the point is that, I don&#8217;t remember who&#8217;s on that paper, but, but ideas were getting hard to find. They were right, blah, blah. We were hiring, like, mid-marginal researchers to just, like, randomly try chemicals in a vat, and like, that was what our research- and like, the best brains were already like working on the whatever, all day long. And like, yes, we were running out of, running out of runway on this technological civilization. Like it was, we were really, like, we were really just gonna like, argue like resist Lib versus MAGA for the rest of our lives and on so-</p><p>Seth Benzell: God forbid</p><p>Noah Smith: Degenerating, shitty mid social media for the rest of-</p><p>Seth Benzell: In that flat-</p><p>Noah Smith: Not just our lives, but all of humanity. Like, that was the end.</p><p>Seth Benzell: The flat part of the solo growth curve.</p><p>Noah Smith: Yes, we hit the-</p><p>Seth Benzell: That&#8217;s, that&#8217;s not where you wanna be.</p><p>Noah Smith: We hit the we hit the stagnation point. We, like, you could see the end of humanity coming down, coming down the pike, and now we blew it all up by making a God machine. We were like, &#8220;Okay, new thing.&#8221; And what? This has happened before because the agricultural age, you could sort of see humanity having hit this limit. We hit the Malthusian ceiling-</p><p>Seth Benzell: Yeah</p><p>Noah Smith: Again and again. We had the Black Plague. We had overpopulation. We deforested the entire goddamn Middle East.</p><p>Seth Benzell: We banged our head against that ceiling three or four times.</p><p>Noah Smith: Pardon?</p><p>Seth Benzell: We banged our head against the Malthusian ceiling three or four times.</p><p>Noah Smith: Three or four times! And then we were like like our whole world was running out of wood. Like, we were just running out of trees to chop down. We were gonna like We had the, like, Columbian Exchange, blah, blah. That was, there was gonna be another collapse, just like there had been for the Mongols. And like, then we were like, &#8220;All right, we&#8217;re busting out of this shit. Steam power!&#8221;</p><p>Seth Benzell: Yeah.</p><p>Noah Smith: &#8220;And like science.&#8221; And then, like, we got out of that, and then weird shit happened, and you got Nazis and communists and all kinds of crazy stuff. Not to mention, a lot of really bad sitcoms in the &#8216;80s. But like, we got all of that stuff, and despite all that, I would say on balance, we busted out, and it was pretty good, and I would rather have lived, like, in the industrial age than in the age before. And so maybe AI will kill us. Industrial Revolution could have killed us if we had just if we had launched all the nukes in like 1983 or whenever, like, we would&#8217;ve died-</p><p>Andrey Fradkin: Yeah</p><p>Noah Smith: And then our civilization would&#8217;ve fallen. Maybe AI will be the thing to make our civilization fall, or maybe we&#8217;ll be able to solve, use AI to solve the problems that, like, we were degenerating, like the end of science and the, like, end of fertility and like the the absolute shittiness of social media, and maybe AI will just solve all this stuff for us.</p><p>Andrey Fradkin: Well-</p><p>Seth Benzell: Whether or not it just solves it definitely gives us a fighter&#8217;s chance.</p><p>Noah Smith: That&#8217;s what I mean.</p><p>Seth Benzell: I think that&#8217;s, -</p><p>Noah Smith: We rolled the dice of big stuff big new thing. We just, we like, we rolled the dice again, and I&#8217;m, I&#8217;m glad we did.</p><p>Andrey Fradkin: All right, well-</p><p>Noah Smith: And, we all die, but I&#8217;m glad we tried.</p><p>Andrey Fradkin: AI, the new hope, coming to economies near you. on this note, thank you so much, for being our guest, Noah. this was an amazing conversation.</p><p>[01:30:00]</p><p>Seth Benzell: Thank you so much.</p><p>Noah Smith: Thank you. It&#8217;s been a pleasure.</p><p>Seth Benzell: Really appreciate your time. And listeners at home, keep your posteriors justified.</p>]]></content:encoded></item><item><title><![CDATA[Basil Halperin: Leading Indicators for TAI, Conditions for the Singularity, and Tax Policy at the End of History]]></title><description><![CDATA[Justified Posteriors Interview Basil Halperin, Assistant Professor at University of Virginia]]></description><link>https://empiricrafting.substack.com/p/basil-halperin-leading-indicators</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/basil-halperin-leading-indicators</guid><dc:creator><![CDATA[Seth Benzell]]></dc:creator><pubDate>Mon, 09 Feb 2026 19:55:24 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/187424007/aad8d301867a99208f269a7aac05e54b.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>In this week&#8217;s episode of Justified Posteriors, we interview TAI expert and friend of the show Basil Halperin of the University of Virginia. There Basil is doing some of the most fascinating work on the economics of TAI with Anton Korinek and other leading researchers. </p><p>The first section of our conversation covers Basil&#8217;s early career, including jobs at Uber and AQI, how he got interested in AI as a research topic, and his role in managing the <a href="https://stripe.events/fellowship">Stripe Economics of AI Fellowship</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!adNl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!adNl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg 424w, https://substackcdn.com/image/fetch/$s_!adNl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg 848w, https://substackcdn.com/image/fetch/$s_!adNl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!adNl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!adNl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg" width="252" height="378" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2184,&quot;width&quot;:1456,&quot;resizeWidth&quot;:252,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Basil Halperin | Stanford HAI&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Basil Halperin | Stanford HAI" title="Basil Halperin | Stanford HAI" srcset="https://substackcdn.com/image/fetch/$s_!adNl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg 424w, https://substackcdn.com/image/fetch/$s_!adNl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg 848w, https://substackcdn.com/image/fetch/$s_!adNl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!adNl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68e0fe0f-d70e-4481-8d9a-ba015af5f722_2667x4000.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We then discuss a paper we&#8217;ve already covered on the show: his work on whether the real interest rate can be interpreted as a leading indicator of the probability of TAI (or &#8216;doom&#8217;). Listen to our previous conversation on his paper, and view show notes, including links to that paper and blog post here: <a href="https://empiricrafting.substack.com/p/if-the-robots-are-coming-why-arent">If the Robots Are Coming, Why Aren't Interest Rates Higher?</a> Seth was previously convinced by Basil&#8217;s arguments, but Andrey was a hold out &#8212; we discover Basil&#8217;s takes about Andrey&#8217;s reservations.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TCvt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TCvt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png 424w, https://substackcdn.com/image/fetch/$s_!TCvt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png 848w, https://substackcdn.com/image/fetch/$s_!TCvt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png 1272w, https://substackcdn.com/image/fetch/$s_!TCvt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TCvt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png" width="454" height="646.6731707317073" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1168,&quot;width&quot;:820,&quot;resizeWidth&quot;:454,&quot;bytes&quot;:156297,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://empiricrafting.substack.com/i/187424007?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!TCvt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png 424w, https://substackcdn.com/image/fetch/$s_!TCvt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png 848w, https://substackcdn.com/image/fetch/$s_!TCvt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png 1272w, https://substackcdn.com/image/fetch/$s_!TCvt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F87941675-d5c2-4f7f-a23a-e5d4ddc5b0bc_820x1168.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><br>Our third subject is Basil&#8217;s new paper with Anton about the relevant elasticities for a singularity in research progress &#8220;<a href="http://When Does Automating AI Research Produce Explosive Growth? Feedback Loops in Innovation Networks:">When Does Automating Research Lead to Explosive Growth?</a>&#8221; Basil explains how the key issues are the degree of fishing out and spillovers in/across different industries, as well as the extent to which research can be automated. We also take a step back to ask what theoretical research like this teaches us.<br><br>Finally, we cover Basil&#8217;s back and forth with friend of the show Phil Trammel&#8217;s new blog post with Dwarkesh about Piketty and optimal taxation in the age of TAI, link below, and ask him to explain the meme he posted, summarizing his arguments:</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/BasilHalperin/status/2007582170660102456?s=20&quot;,&quot;full_text&quot;:&quot;Some takes on this piece, which I want to interpret as \&quot;optimal taxation in an AK economy\&quot;:&quot;,&quot;username&quot;:&quot;BasilHalperin&quot;,&quot;name&quot;:&quot;Basil Halperin&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1905332522260856832/rdYkmXc9_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-03T22:37:43.000Z&quot;,&quot;photos&quot;:[],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;New blog post w @pawtrammell: Capital in the 22nd Century\n\nWhere we argue that while Piketty was wrong about the past, he&#8217;s probably right about the future.\n\nPiketty argued that without strong redistribution of wealth, inequality will indefinitely increase. Historically, however,&quot;,&quot;username&quot;:&quot;dwarkesh_sp&quot;,&quot;name&quot;:&quot;Dwarkesh Patel&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1925260306684813315/NjNQZmhZ_normal.jpg&quot;},&quot;reply_count&quot;:2,&quot;retweet_count&quot;:16,&quot;like_count&quot;:55,&quot;impression_count&quot;:9549,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:false}" data-component-name="Twitter2ToDOM"></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tgRZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tgRZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png 424w, https://substackcdn.com/image/fetch/$s_!tgRZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png 848w, https://substackcdn.com/image/fetch/$s_!tgRZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png 1272w, https://substackcdn.com/image/fetch/$s_!tgRZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tgRZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png" width="534" height="398.78111587982835" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:696,&quot;width&quot;:932,&quot;resizeWidth&quot;:534,&quot;bytes&quot;:525052,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://empiricrafting.substack.com/i/187424007?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tgRZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png 424w, https://substackcdn.com/image/fetch/$s_!tgRZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png 848w, https://substackcdn.com/image/fetch/$s_!tgRZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png 1272w, https://substackcdn.com/image/fetch/$s_!tgRZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee4452e9-7649-496b-a5f0-9904b7682c58_932x696.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Additional references:</p><p><a href="https://www.sciencedirect.com/science/article/abs/pii/S0140988313002120">Does carbon taxation yield a double dividend (environmental plus fiscal)?<br></a></p><h2>We hope you enjoy the conversation! Transcript follows:</h2><p><br><strong>[00:00] Seth Benzell:</strong> Welcome to the Justified Posteriors podcast, the podcast that updates its beliefs about the economics of AI and technology. I&#8217;m Seth Benzell, looking forward to the Basil exposition we&#8217;ll get today, coming to you from Chapman University in sunny Southern California.</p><p><strong>[00:35] Andrey Fradkin:</strong> And I&#8217;m Andrey Fradkin, looking forward to creating a new accord with Basil, coming to you from San Francisco, California. And today we&#8217;re very excited to welcome Basil Halperin to our show. Welcome to the show.</p><p><strong>[00:49] Basil Halperin:</strong> Thanks Andrey. Thanks Seth. Super excited to be here.</p><p><strong>[00:53] Andrey Fradkin:</strong> So as background, Basil is an expert on the economics of transformative AI and he&#8217;s currently...</p><p><strong>[01:00] Seth Benzell:</strong> Expert is underselling. He is one of the most interesting thinkers around on... Alright, continue.</p><p><strong>[01:07] Andrey Fradkin:</strong> Yes, he&#8217;s great. And he&#8217;s a professor at the University of Virginia. We have an exciting show for you today touching on many topics, but we first wanted to get a start with some of the biographical tidbits. In particular, Basil, how did you get interested in this topic? And it seems like you were a lot earlier than other economists. So I&#8217;m curious what drew you in before everyone else to this interesting set of topics?</p><p><strong>[01:38] Basil Halperin:</strong> I mean, not as early as you two, I don&#8217;t think. Uh, I don&#8217;t know. I was just a nerd growing up. I read a lot of sci-fi. I read Ray Kurzweil in high school when his <em>The Singularity is Near</em> book came out in the 2000s, just because it was popular. The idea got in my head. I was kind of like, &#8220;Well, this is interesting, but eventually...&#8221; I was like, &#8220;I have a few decades to work on other things before any of this becomes relevant.&#8221; And then GPT-3 came out in that long hot summer of 2020. I freaked out a little bit for a week or two. This is crazy. How is this happening so fast? So that sort of woke me up a bit. I started thinking about these issues and gradually more and more have gotten sucked into working on it.</p><p><strong>[02:20] Seth Benzell:</strong> What were your favorite sci-fi growing up?</p><p><strong>[02:23] Basil Halperin:</strong> <em>Ender&#8217;s Game</em> was always the classic.</p><p><strong>[02:26] Andrey Fradkin:</strong> Now I saw on your resume that you spent a stint at AQR, which is a large capital management firm. I&#8217;m curious, what did you learn working there?</p><p><strong>[02:37] Basil Halperin:</strong> Yeah. So I didn&#8217;t expect to go into finance out of college, but basically the opportunity came along. I found out that this firm seemed pretty interesting. So the background is, this firm was founded by two PhD students of Eugene Fama, the Nobel Laureate in finance. Basically taking his ideas seriously and other ideas from the asset pricing literature seriously and applying them to earn a bunch of money. So I didn&#8217;t know anything about finance going into that job. So I learned a whole bunch and some of that has been applied in my research that I think we&#8217;ll talk about today.</p><p><strong>[03:13] Seth Benzell:</strong> Ooh, wait, yeah. Pricing assets in the age of AI. Fascinating.</p><p><strong>[03:17] Basil Halperin:</strong> Yeah, yeah. Talk about it.</p><p><strong>[03:19] Andrey Fradkin:</strong> So I do think this is an interesting background because a lot of people in our field don&#8217;t have a finance background. That&#8217;s not where they&#8217;re coming from in terms of thinking about technology. So it maybe gave you this strong, prepared mind to be thinking about the asset pricing implications of transformative AI. Did you get to interact with Cliff Asness or were you too much of a, like, intern, low-level employee?</p><p><strong>[03:45] Basil Halperin:</strong> No, I was there for a year and a half or two years, but too junior. I think one time I made a bad joke to him in the elevator and he like, pretended to laugh. That was pretty much the highlight.</p><p><strong>[03:56] Andrey Fradkin:</strong> Well, he also likes to make a lot of bad jokes, so you have that in common. Some of them are good too.</p><p><strong>[04:05] Basil Halperin:</strong> [Laughs] These bad jokes are funny.</p><p><strong>[04:06] Andrey Fradkin:</strong> What about at Uber? You also spent some time there working with John List, is that right?</p><p><strong>[04:11] Basil Halperin:</strong> Yeah, yeah. John taught my first ever Econ class when I was undergrad at Chicago, Intro Micro. And he helped inspire me to become an economist plausibly. And then yeah, I worked for him when he was Chief Economist at Uber. Which, Andrey, as you well know, being an economist in tech is an interesting experience. And Uber in 2017 was a particularly interesting time because it was a controversial firm. Sort of like OpenAI is today, the firm that&#8217;s always in the headlines.</p><p><strong>[04:42] Andrey Fradkin:</strong> Were there specific perspectives that you gained there that have informed your subsequent economics career? Or was it more of just like you learned some useful skills in data science or something else?</p><p><strong>[04:55] Basil Halperin:</strong> Yeah, I don&#8217;t know how much super tangible I have to say, but it definitely was informative in general to work in the private sector before going into academia, just to see how different things are. You know, like in the private sector you&#8217;re being paid to tell your boss that he or she is wrong. And then in academia that&#8217;s not so much a recommended strategy.</p><p><strong>[05:19] Seth Benzell:</strong> Wait, wait, okay. So tell us about... so you&#8217;re there, it&#8217;s in 2017. Uber is one of the most evil, fast-growing companies on the planet. So you said it was interesting. So what was interesting about that? Were you pressured to write an economics report you didn&#8217;t agree with? Did you feel like you had to like wear, you know, a hoodie going into the office as people were throwing trash at you? What was it like?</p><p><strong>[05:43] Basil Halperin:</strong> No, it was just... I mean, I certainly didn&#8217;t have a negative experience or negative view of the company, though I&#8217;m sure there were negative things the company did, like any large organization. But the team I was on, this Chief Economist team, was like five people. So it was pretty small. So we just had a lot of leverage to go around the company, be sort of an internal consultancy and do a lot of crazy things, varied things that I otherwise never would have had the chance to do. Like I was sort of a software engineer for one month that I was there, which was otherwise something that never would have happened to me. Or running large scale experiments on a million riders or whatever, which... I would love to do macro experiments if any central bank wants to volunteer for some coin flips. But otherwise, as a macroeconomist now, I don&#8217;t really have that opportunity.</p><p><strong>[06:35] Andrey Fradkin:</strong> So this kind of is a, you know, is a nice segue into our next topic, which is... like a lot of people are worried about their careers these days, obviously because of AI.</p><p><strong>[06:49] Seth Benzell:</strong> Not me! Podcasting is never gonna go out of style, Andrey!</p><p><strong>[06:53] Andrey Fradkin:</strong> Fair enough. But I think that&#8217;s a very broad question and perhaps too broad to answer. But I think for people with an interest in economics&#8212;you know, you were in tech, you decided to go into academia. I&#8217;ve made the same decision in my life. But I&#8217;m curious like what advice would you have? And maybe this is a good opportunity to also speak about the efforts you&#8217;ve been doing with the Stripe Economics of AI Fellowship.</p><p><strong>[07:23] Basil Halperin:</strong> Yeah, okay. So two points here. One point is that I feel like on every good AI podcast, there&#8217;s a question of, &#8220;What do you tell young people? What they should be studying today?&#8221; And like there&#8217;s <em>zero</em> good answer to that question. So yeah, I don&#8217;t have any good answer to that question.</p><p><strong>[07:38] Seth Benzell:</strong> Study the Justified Posteriors podcast. Listen to every episode every day. Three times a day.</p><p><strong>[07:45] Basil Halperin:</strong> But besides that, it&#8217;s not clear. The other thing I guess I <em>can</em> say is that if you&#8217;re an economist, working on the economics of AI is like a really cool thing to do. There&#8217;s just like so much low hanging fruit. There&#8217;s so many insights that can be arbitraged from other fields, which is always a good place to be. You can... instead of going to have to pick the fruit yourself, you can just take the fruit out of other people&#8217;s hands, maybe translate it to the language of economics.</p><p><strong>[08:12] Seth Benzell:</strong> Yeah, I understand later we&#8217;ll be talking about the economics of fruit picking. But so hold those fruit picking thoughts.</p><p><strong>[08:20] Basil Halperin:</strong> All of my economic metaphors are about fruit. So we&#8217;re going to get pretty fruity or something today. Um, I don&#8217;t know, Andrey, maybe you were suggesting that I talk about this fellowship that I help run.</p><p><strong>[08:31] Andrey Fradkin:</strong> Yeah, tell us about the Stripe Fellowship. What fruit is the Stripe Fellowship?</p><p><strong>[08:35] Basil Halperin:</strong> Tell us about what you learned running it and what is it, you know, give a brief description. Yeah.</p><p><strong>[08:41] Basil Halperin:</strong> Yeah, this is this fellowship that we run for early career economists that I do working with Stripe, the financial technology company. Where they decided that they want to support more economics research on the economics of AI, thinking that economists are not working on the issue enough. Which is an empirical claim that you can debate. And so we had the first cohort this past year, 24-25 fellows, mostly grad students, a few APs [Assistant Professors]. And this is a lot of... in part giving people money to do research, but in large part like building a community of people to speak together and share ideas and maybe work together. Folks that probably are listening to your podcasts and that maybe you all should consider interviewing. So that&#8217;s been super fun. Very interesting to be on the side of someone reviewing applications as opposed to being on the other side of applying and seeing... I mean, first of all, it&#8217;s frankly like... I can&#8217;t complain. It&#8217;s a very cool opportunity to be running this thing. But it&#8217;s terrible to reject people. Like it&#8217;s absolutely no fun. All these extremely well-qualified people who are definitely smarter and more accomplished than me. Like that&#8217;s not a fun part of it. On the other hand, very cool to get to support all these cool people doing very cool research and seeing them decide to co-author together and things like that.</p><p><strong>[10:15] Seth Benzell:</strong> Oh, can you point... that&#8217;s particularly exciting. Can you point towards any papers that you think you may have generated that we should maybe discuss on our podcast?</p><p><strong>[10:25] Basil Halperin:</strong> So two... so it&#8217;s been like six months or something since the fellowship launched and you guys know how long these timelines are. So no counterfactual papers yet.</p><p><strong>[10:35] Seth Benzell:</strong> Oh, well I know how short my AGI timelines are.</p><p><strong>[10:38] Basil Halperin:</strong> Well, you&#8217;ll have to tell us that later. No counterfactual papers <em>yet</em>, but a bunch of people have amazing stuff out. Phil Chen at Harvard just put out a very cool paper using GitHub data to look at how software engineer labor has changed. Parker Whitfill&#8217;s been putting out like a paper every few months on compute and labor, complements versus substitutes, with Cheryl Wu. And yeah, there&#8217;s a whole bunch of stuff. We have this website, you can Google &#8220;Stripe Econ Fellowship of AI&#8221; and see folks&#8217; websites. There&#8217;s a ton of very cool stuff. I don&#8217;t have time even to read all the papers, at least yet.</p><p><strong>[11:18] Andrey Fradkin:</strong> Well, that&#8217;s yeah, super awesome initiative. I guess, you know, one follow-up question on there. What do you think most of these people are going to be doing three, five years from now? Do you think they&#8217;re going to become assistant professors? Are they going to work at AI labs? Are going to do something else? Like what is the career trajectory for a young person?</p><p><strong>[11:39] Seth Benzell:</strong> Are they going to be podcasters?</p><p><strong>[11:41] Andrey Fradkin:</strong> Yeah, are they going to be podcasters? Like... and maybe, what do they <em>think</em> they&#8217;re going to be doing is an interesting question, right? Because it&#8217;s a time of great uncertainty.</p><p><strong>[11:51] Basil Halperin:</strong> Yeah, I don&#8217;t know. So like... one way of answering that is that I think kind of any question about speculating about the future comes down to: how fast do you think AI capabilities are going to progress? AI technology going to develop? As has come up a whole bunch of times in this conversation. And there&#8217;s various ways people try to forecast how quickly the technology will develop. Like one way is just go and survey machine learning engineers and trust that they know something about how the future is going to go and take an average of their opinions. So that&#8217;s one method. Another method is something that&#8217;s gone back to like Hans Moravec at the very least of: think that computers are like human brains and try and estimate how much computing power the human brain does and try and forecast Moore&#8217;s Law and algorithmic progress to see...</p><p><strong>[12:31] Seth Benzell:</strong> Ray Kurzweilian, yeah.</p><p><strong>[12:33] Basil Halperin:</strong> Exactly, like Ray Kurzweil. To see how long until we have enough computing power to match the human brain and say that&#8217;s when we&#8217;ll develop AGI. But like, sort of setting that to the side or something... I don&#8217;t know. We&#8217;re trying to encourage research. So we&#8217;re selecting for people who are like stubbornly pursuing research. So there&#8217;s that. But if you&#8217;re like asking about the future for econ PhDs... econ grad students...</p><p><strong>[12:58] Seth Benzell:</strong> We&#8217;re not talking about the future of econ PhDs generally. We&#8217;re talking about this elite cohort you&#8217;ve gathered. You think that there&#8217;s a chance that this elite cohort of the best young thinkers on Econ of AI are going to be obsoleted in three years?</p><p><strong>[13:13] Basil Halperin:</strong> Uh, I mean, I think there&#8217;s a non-zero chance that we&#8217;re all living in some communist utopia in a few years. Not a <em>high</em> one, as my research would indicate, but non-zero. Which is like crazy to think about. We could get unhinged and talk about that, but maybe we can save it for later.</p><p><strong>[13:30] Andrey Fradkin:</strong> Yeah, I guess I was trying to actually push you in a different direction, which is more like... you know, Tyler Cowen famously gave Leopold Aschenbrenner the advice of <em>not</em> going into economics academia, right? You know, he was someone who was, and still is I think, working on some economics research.</p><p><strong>[13:46] Seth Benzell:</strong> Yes, including with friend of the show Phil.</p><p><strong>[13:49] Andrey Fradkin:</strong> Yeah. Exactly. So I was kind of more thinking like, is it really the best place if you&#8217;re really AI-pilled to be sitting at a university? Why did <em>you</em> choose to do that? I&#8217;m sure you had... you could have had other options that you pursued.</p><p><strong>[14:04] Basil Halperin:</strong> Yeah. I mean, so what is best for any individual varies a lot. And I don&#8217;t know, like don&#8217;t you guys think that people who go into academia are kind of stubborn? Like they want the independence of not having a boss. They&#8217;re willing to accept the ginormous pay cuts relative to the outside option.</p><p><strong>[14:24] Seth Benzell:</strong> I wanted the wizard robes.</p><p><strong>[14:26] Basil Halperin:</strong> You wear wizard robes to lecture or what?</p><p><strong>[14:29] Seth Benzell:</strong> I do. I have it hanging on my wall right now. I would point my camera, but my lighting is so beautiful right now.</p><p><strong>[14:34] Basil Halperin:</strong> We should have worn them for the video. So I don&#8217;t know, like really that idiosyncratic taste shock is I think driving a lot of people. But yeah, I totally agree that there&#8217;s a lot of amazing research to be done in the private sector and like the new Anthropic economic team seems to be doing amazing stuff, for example.</p><p><strong>[14:52] Seth Benzell:</strong> Basil, I don&#8217;t want to answer this question for you, but if I may offer kind of a riff on that idea of it being idiosyncratic taste... I think it&#8217;s a, you could call this a taste thing, but you might call it also an idiosyncratic valuation of certain virtues, right? You might find yourself associating with the virtues of being an economist or being a professor and having open inquiry, etc., etc., etc., that are not necessarily as associated as firmly with other professions. You could call that taste or you could call that something else.</p><p><strong>[15:28] Basil Halperin:</strong> Yeah, let&#8217;s bring virtue ethics back into economics.</p><p><strong>[15:32] Seth Benzell:</strong> Bringing the virtue ethics back to economics, exactly.</p><p><strong>[15:35] Andrey Fradkin:</strong> Yeah. Well, cool. You know, very interesting to think about these career implications, but I think it&#8217;s maybe a natural place to transition to discussing some of your really interesting thoughts that you&#8217;ve had recently. And I think Seth has some questions.<br><br><em><strong>Basil Justifies His Research:<br>Transformative AI, existential risk, and real interest rates</strong></em></p><p><strong>[15:53] Seth Benzell:</strong> [Grabbing microphone] Give me the mic, Andrey. I&#8217;m grabbing the mic from Andrey now. Basil, if I recall correctly, the way we e-met was because I got very frustrated with you over one of your papers. And this was your paper, &#8220;Transformative AI, Existential Risk, and Real Interest Rates.&#8221; So I guess before kind of I explain my strong emotional reaction to this paper and how you eventually won me over, maybe you can refresh our podcast listeners. We did an episode on this podcast as one of our very first episodes. I encourage our listeners to go back and listen to it. But for those who don&#8217;t have the time, can you give us maybe a two-minute gloss on that paper before we start putting you to the test on it?</p><p><strong>[16:45] Basil Halperin:</strong> Yes. So I second that listeners should go back and relisten to that old episode because I did before this and that was a really nice summary that I really appreciated. Obviously the critiques were wrong, which we&#8217;ll get to. That&#8217;s a joke. There were some good points. But yeah, so the motivation here is like, everyone wants to know how quickly is AI going to progress? AI technology going to develop? And there&#8217;s various ways people try to forecast how quickly the technology will develop. Like one way is just go and survey machine learning engineers and trust that they know something about how the future is going to go and take an average of their opinions. So that&#8217;s one method. Another method is something that&#8217;s gone back to like Hans Moravec at the very least of: think that computers are like human brains and try and estimate how much computing power the human brain does and try and forecast Moore&#8217;s Law and algorithmic progress to see...</p><p><strong>[17:33] Seth Benzell:</strong> Ray Kurzweilian, yeah.</p><p><strong>[17:35] Basil Halperin:</strong> Exactly, like Ray Kurzweil. To see how long until we have enough computing power to match the human brain and say that&#8217;s when we&#8217;ll develop AGI. We in this paper want to present sort of an indirect way of thinking about this, which is using one of the most powerful supercomputers humanity has, and that is the calculation power of financial markets. Where in economics, you know, we like to think that prices are good at aggregating dispersed wisdom across the economy. And financial market prices in particular, by being forward looking, by being particularly liquid and having this strong incentivizing power through the magic of no arbitrage&#8212;or arbitrage incentives&#8212;are a particularly good way of collecting humanity&#8217;s dispersed wisdom about how the future could proceed. So in particular, we suggest in this paper that...</p><p><strong>[18:31] Seth Benzell:</strong> But Basil, there&#8217;s no... at least when you were writing this paper, I&#8217;m not aware of a high liquidity market that just says &#8220;when does AGI happen?&#8221; or &#8220;when does TAI happen?&#8221; So what price should we look at?</p><p><strong>[18:43] Basil Halperin:</strong> Indeed. And if you&#8217;ll allow me to rant on that for a second before summarizing the argument... like today, even today, there&#8217;s still no, despite the rise of prediction markets, there is no long horizon prediction market on when could advanced AI be developed. There&#8217;s these forecasting platforms that just allow people to submit their own forecasts and take the average of them. Metaculus, Manifold Markets. People sometimes refer to these as betting markets, prediction markets... they are <em>not</em> prediction markets. They do not have the incentive, the financial incentive to ensure forecasters pay attention, update their forecasts, and so on. So those are great websites, but they&#8217;re limited. Kalshi, Polymarket, these new prediction markets... somehow there&#8217;s just... it&#8217;s shocking how bad the lack of good forecasting opportunity to forecast AI is. There&#8217;s very limited things. There are some things, but they&#8217;re not very good.</p><p><strong>[19:35] Seth Benzell:</strong> Do you speculate that it&#8217;s like a defining AGI problem? It&#8217;s the Oracle problem? It&#8217;s like, &#8220;how would you know it when you see it?&#8221; Or did you speculate on why that is?</p><p><strong>[19:43] Basil Halperin:</strong> Yeah. So part of it is that. So for example, the very best question that I&#8217;m aware of is Kalshi has a market on: will this fancy version of the Turing test be passed by 2030? Where it&#8217;s some like souped up version of the Turing test based on a bet that Ray Kurzweil actually&#8212;we keep mentioning his name&#8212;made. So that&#8217;s like the best existing thing...</p><p><strong>[20:00] Basil Halperin:</strong> ...but it&#8217;s this limited definition.</p><p><strong>[20:04] Andrey Fradkin:</strong> So I actually have a different question which is related to your paper. But let&#8217;s say we had a prediction market on GDP growth. And you know, it was like: will we have, I don&#8217;t know, 5% GDP growth or 10% GDP growth at least once by year X? You know, it&#8217;s hard to imagine that that would happen without transformative AI.</p><p><strong>[20:31] Seth Benzell:</strong> Ah, Andrey, I could tell a story.</p><p><strong>[20:33] Andrey Fradkin:</strong> Yeah. No, I could tell a story. I could tell a story, but it would be highly correlated. Are there markets like that that are very close analogs to this?</p><p><strong>[20:42] Basil Halperin:</strong> If there are, I would love to know. And like, I do a periodic search and there&#8217;s... it&#8217;s like there&#8217;s really not. It&#8217;s infuriating. Hence the origin of this paper.</p><p><strong>[20:51] Seth Benzell:</strong> But you can bet... you can bet super out of the money calls on like the stock market. You can bet on the stock market growing 500%, right?</p><p><strong>[20:59] Basil Halperin:</strong> Yes. Well, I don&#8217;t know about 500%. Out of the money calls, like the range is not that large. But betting on GDP growth in particular is difficult. And like, does higher GDP growth raise equity valuations? It&#8217;s actually not obvious. Like, we can really dive into that, but for a whole bunch of reasons... for a whole bunch of reasons I think equities are just kind of a very confusing asset class in general to interpret. Which is why...</p><p><strong>[21:27] Andrey Fradkin:</strong> Yes, so tell us why you picked interest rates. Yeah, and then we&#8217;ll go back to why equities may or may not be good.</p><p><strong>[21:33] Seth Benzell:</strong> Because equities are a bad asset, what I&#8217;ll do is measure equities over time. [Laughter]</p><p><strong>[21:40] Basil Halperin:</strong> Yeah, so the best price in the economy&#8212;that&#8217;s kind of a joke&#8212;the price we recommend looking at in this paper is real interest rates. So that is to say the inflation-adjusted risk-free rate of return you would earn on a bond, particularly at long horizons. Like say the 10-year real interest rate or the 30-year real interest rate. And the argument for why that&#8217;s a useful price to look at is the following: If you knew you were going to be super rich next year, no reason to save today. You&#8217;re going to be super rich next year anyway. If no one&#8217;s saving, then that pushes up interest rates. Interest rates clear the market, the supply and demand for savings.</p><p>So that would be the case where we expect AI to rapidly raise economic growth, rapidly raise our incomes, in particular rapidly raise our consumption. And so if we saw really high real interest rates, that would be indicative of this case of aligned AI raising human incomes. Alternatively, another case with AI that people talk about is that, you know, AI is going to wipe us all out. And you&#8217;ve done podcasts on this topic. Similarly, if we&#8217;re all going to be dead next year because AI was going to wipe us all out, then there&#8217;d be no reason to save today. You&#8217;re going to be dead next year. No reason to hold on to assets for next year. Likewise, that pushes up interest rates.</p><p>So, you know, we could go and look at interest rates. Are they much higher than they have been? And like, no, they&#8217;re well within the range of normal variation. And when I started thinking about this back in fall of 2021, it was particularly salient because at that time long-term real interest rates in the US, and indeed around the world, were at all-time lows, like negative. So you know, you&#8217;d give $100 to the US government, they give you back $99 inflation adjusted at the end of the year. Interest rates have gone up a non-trivial amount since then actually, but really not that much. Really, it&#8217;s probably not because of AI. Maybe a bit. So that&#8217;s the core argument. That if markets were expecting aligned or unaligned transformative AI, then we&#8217;d see high real interest rates today.</p><p><strong>[23:51] Seth Benzell:</strong> All right, great arguments. And now I&#8217;m going to explain why this was so frustrating for me in 2021 to read this argument. I had been working on transformative AI topics and had been thinking about, you know, kinds of economic downsides of AI. And one of the mechanisms that I had become worried about was the anticipation of AI leads to dissaving and that dissaving is large enough that interest rates skyrocket and actually you don&#8217;t get enough reinvestment in the economy to have significant economic growth, right? Set aside for a second whether or not the dissaving you have in mind is so extreme that you would literally like cancel out the gains from AI. But I had been kind of pushing on this idea that, you know, AI is going to lead to dissaving... as the world&#8217;s interest rates were plummeting. And so I had kind of pivoted into trying to think about, okay, well, if we do get really good AI, how could you get to a world where there are very low interest rates, right? And so one version of this idea I worked on with our friend and co-author Erik Brynjolfsson is the idea that, well, maybe there will be a kind of labor that will be infinitely reproduced, but there will be still some scarce human factor. And then actually that scarce human factor will make all of the gains and then interest rates can remain low.</p><p>Another story would be: well maybe we don&#8217;t have transformative AI, we have an AI that takes over, you know, 50, 60, 70% of jobs. We see the labor share of national income go down from, you know, 60% to 20%. But if you actually play that out in a big macroeconomic model where you try to realistically model national savings rates... well, you&#8217;re kind of pushing against the tide. Like we talked about, in 2021 we had this huge&#8212;it was called by some an international saving glut&#8212;that was maybe driven by the rise of an Asian middle class that all of a sudden had all of this money, needed to save for retirement. There was a scarcity of safe assets. And so even if you automated a lot of jobs, there might be still a lot of absorptive capacity for that savings before you would significantly bid up interest rates.</p><p>And so kind of for both this sort of a theoretical reason and a sort of a kind of a macro simulation reason, I fired off to you this angry email saying, &#8220;Don&#8217;t you realize blah, blah, blah, blah, blah?&#8221;</p><p><strong>[26:28] Basil Halperin:</strong> Yeah, the audience wants your original comment. They want you to read it.</p><p><strong>[26:32] Andrey Fradkin:</strong> Oh, that email will be in the post, don&#8217;t worry.</p><p><strong>[26:36] Basil Halperin:</strong> I have it on hand. I have it on hand.</p><p><strong>[26:38] Seth Benzell:</strong> Oh wait, let&#8217;s hear it. Let&#8217;s hear it, Basil. How bad was it?</p><p><strong>[26:41] Basil Halperin:</strong> This is going to be the unhinged portion of the episode. So Tyler Cowen kindly reposted the essay.</p><p><strong>[26:49] Seth Benzell:</strong> [Laughs] It was like, &#8220;A crazy guy emailed me.&#8221;</p><p><strong>[26:51] Basil Halperin:</strong> Well, so initially it was an email. Initially it was a comment on the Marginal Revolution post sharing the essay. And so, like, you know, I...</p><p><strong>[26:59] Seth Benzell:</strong> And everyone knows that that is where the sanest people hang out.</p><p><strong>[27:03] Basil Halperin:</strong> I, like some neurotic person or whatever, skim through these comments and there&#8217;s this one guy Seth Benzell: &#8220;Hey, I&#8217;ve read a few of his papers, including that one you mentioned with Eric. This is so dumb.&#8221; That&#8217;s my first introduction to Seth. Of course, since then things have changed. But welcome to the internet.<br><br></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kSIB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kSIB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!kSIB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!kSIB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!kSIB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kSIB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg" width="1200" height="800" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!kSIB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg 424w, https://substackcdn.com/image/fetch/$s_!kSIB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg 848w, https://substackcdn.com/image/fetch/$s_!kSIB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!kSIB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44d0088e-8d2f-4c92-973b-dc95eb7dfce2_1200x800.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>[27:26] Seth Benzell:</strong> Wow, &#8220;so dumb.&#8221; I came out of the gate swinging. You have to remember it was the pandemic. We were all cooped up. Some people went to BLM protests. I commented on Marginal Rev. But now I&#8217;ll tell you how you won me over, Basil. Which is, you sat me down and you said, &#8220;Seth, those scenarios that you&#8217;re thinking about, the one where there&#8217;s still, you know, a scarce human factor that&#8217;s making the wins, or the one where we automate 60% of jobs, those are &#8216;AI is a big deal&#8217; scenarios, but those aren&#8217;t the transformative AI, AGI scenarios that I&#8217;m actually writing about.&#8221; And then I apologize for not having read the paper.</p><p><strong>[28:06] Andrey Fradkin:</strong> You&#8217;re a true Marginal Revolution commenter, Seth. Who I don&#8217;t think any of them have ever read a paper.</p><p><strong>[28:15] Basil Halperin:</strong> This is worth noting. So like, the paper and the argument really is zoomed in onto this particular scenario, which I think was like much more top of mind to the people thinking about this a few years ago. So like, you know, before ChatGPT... our essay, initial essay was posted a month after ChatGPT came out. Before ChatGPT, there weren&#8217;t that many people in the world thinking about AI, right? And the people that were, a lot of them were focused on like these fast takeoff &#8220;foom&#8221; scenarios. Things would happen fast, things would happen big. More likely than not, we&#8217;re going to die. P(doom) is high as they say, right? So we were really focused on like these kind of extreme possibilities: either we&#8217;re all going to die or we&#8217;re going to have what we operationalized as 30% annual GDP growth. An order of magnitude increase in annual GDP growth. Which would be crazy. It would be as if the whole economy is growing as fast as Moore&#8217;s Law, more or less. So yes, it&#8217;s an extreme scenario for sure.</p><p><strong>[29:13] Seth Benzell:</strong> And but yes, but so given that extreme scenario, you won me over. And I said, &#8220;Andrey, when we start our podcast, I want to talk about this paper because nothing has moved my priors so much as this paper.&#8221; Maybe it was just moving my definitions around. Maybe it gave me like a stronger understanding of what people really mean by transformative AI versus just AI that is so good that it automates 70% of jobs. But I talked to Andrey about it and Andrey, remind me, were... did I fully convince you of Basil&#8217;s arguments or remind me?</p><p><strong>[29:47] Andrey Fradkin:</strong> No, I don&#8217;t think so. Andrey wasn&#8217;t convinced at all. I just... I mean... I just feel like the people being so certain that this transformative AI is coming in this particular way seems unlikely to me. It&#8217;s not like how humans tend to think or behave about most things in life. And then it&#8217;s hard for me to imagine a world where they essentially like, it&#8217;s a coin flip: either we all die or we have amazing transformative AI. And we don&#8217;t have any intermediate types of outcomes where, for example, you might want to engage in precautionary saving. I know you talk about certain precautionary savings in your paper, but like, that&#8217;s just a very natural response to a lot of uncertainty. There are of course also scenarios where there is tremendous economic growth, but it&#8217;s held by very few people. It&#8217;s ex-ante not obvious who those people are going to be. Or maybe it is obvious, I don&#8217;t know. Maybe they already have all the capital, right? There are just a lot of things, a lot of details to think through and I&#8217;m sure you&#8217;ve thought through a lot more of those than we have in our podcast.</p><p><strong>[31:04] Basil Halperin:</strong> Yeah. So one thing I should say is that like this transformative AI 30% GDP growth scenario, that&#8217;s not something <em>we</em> made up or pulled out of thin air. Like this really was and is a paper dedicated to a specific conversation, just like any academic paper, right? It&#8217;s a conversation among a particular group. So that&#8217;s one thing. Another thing to say is like, to me... so one thing Andrey that you spoke about in the last podcast on this that I totally agree with is skepticism of quantitative macro predictions. So I think you went beyond what I would say in terms of skepticism, but I so strongly share the belief or the view that macro does not have an amazing track record in terms of precise predictions. And that&#8217;s why... like that&#8217;s like a strong motivation for the approach in this paper. Where instead of like, we&#8217;re going to write down an optimizing model, a model of optimizing agents where in equilibrium we determine the structural forces determining the real interest rate and we&#8217;re going to calibrate all these different forces and feed in the simulation. Instead, it&#8217;s just this like dead simple thing where we have this very robust, strong prediction from <em>any</em> intertemporal macroeconomic model: that higher growth or higher mortality risk raise real interest rates. And people are predicting, people are moving tens, hundreds of billions of dollars, literally in San Francisco, under the belief that these things are going to happen. One of these two things is going to happen. It&#8217;s going to happen in the next 10, 5, 1 year. And this provides some sanity check on like, most of all, like the very shortest timeline predictions.</p><p><strong>[32:51] Seth Benzell:</strong> Yeah, so maybe I can pay...</p><p><strong>[32:52] Andrey Fradkin:</strong> But I guess does everyone need to believe in those predictions? I mean...</p><p><strong>[32:56] Seth Benzell:</strong> It has to be like the median investor, right? Who has... who&#8217;s the guy that we&#8217;re talking about the beliefs of?</p><p><strong>[33:01] Basil Halperin:</strong> The marginal unit of capital. So, you know, markets don&#8217;t reflect average beliefs. They reflect the belief of the marginal unit of capital, the marginal trader, just like any price reflects the marginal buyer/seller. And like a priori and lots of theory and so forth to back this up, like you would think that the marginal trader is the one who has the most knowledge or the most incentive to buy/sell. You can think about deviations from that, but like that&#8217;s...</p><p><strong>[33:26] Seth Benzell:</strong> Isn&#8217;t the marginal trader a noise trader?</p><p><strong>[33:28] Andrey Fradkin:</strong> Or like if we have a distribution of beliefs, isn&#8217;t the marginal trader someone who has an intermediate belief?</p><p><strong>[33:35] Basil Halperin:</strong> Um, so one thing I will say is that... one thing I&#8217;ve learned from this whole project is it&#8217;s confusing to me how underdeveloped the literature on asset pricing under heterogeneous beliefs is. I think it&#8217;s in part because like you get these no trade results where if people don&#8217;t... anyway, the theory is hard. But the way I think about it is that the sort of robust prediction of theory is that asset prices are like a wealth-weighted average of beliefs. Maybe wealth-weighted risk tolerance weighted average of the distribution of beliefs.</p><p><strong>[34:13] Seth Benzell:</strong> That right? You think if I&#8217;m super out of the money, can I still move the middle somehow? In other words, if I&#8217;m the guy... if I&#8217;m a 99% &#8220;AI never happens&#8221; or &#8220;AI always happens,&#8221; in what sense am I being included in that weighted average?</p><p><strong>[34:27] Basil Halperin:</strong> Just directly. So like this is about consumption-savings decisions rather. Like what, how fast will the growth rate be? That average.</p><p><strong>[34:39] Seth Benzell:</strong> Okay. Oh, you&#8217;re talking more about the national saving rate. That part of it.</p><p><strong>[34:43] Basil Halperin:</strong> I&#8217;m thinking like the <em>g</em>, the growth rate that goes into the real interest rate determination, that&#8217;s the average belief over that.</p><p><strong>[34:54] Seth Benzell:</strong> Right. And the reason that that matters is that is going to drive the saving rate, which drives the interest rate? Or through a different mechanism?</p><p><strong>[35:01] Basil Halperin:</strong> Yes, yes, yes.</p><p><strong>[35:02] Seth Benzell:</strong> Okay.</p><p><strong>[35:04] Andrey Fradkin:</strong> I have a... so I have a question related to, you know, we touched upon this when we did the podcast, but I&#8217;m curious what <em>you</em> think about it is: It seems hard for me to imagine a scenario where we get to your scenario without a lot of hints in advance, right? Like... like your scenario is literally like most people agree that we&#8217;re going to have 30% growth next year. What... what does the path to that look like? Does that mean that we first have 20% growth, 10% growth? Uh, like... are there other assets that we expect to be leading indicators there? Because I do think in some sense, if we get to your scenario, then you&#8217;ve already told us what happens.</p><p><strong>[35:48] Basil Halperin:</strong> It&#8217;s not <em>my</em> scenario. I want to emphasize.</p><p><strong>[35:51] Andrey Fradkin:</strong> No no, sorry. To your analysis. If we get to the point in your analysis&#8212;I know it&#8217;s not your scenario&#8212;then...</p><p><strong>[35:56] Seth Benzell:</strong> Is your warning light a leading indicator or a late indicator?</p><p><strong>[36:01] Andrey Fradkin:</strong> Yeah. We thought it was a late indicator. But I&#8217;m curious if you have ideas for leading indicators. Yeah.</p><p><strong>[36:07] Basil Halperin:</strong> Ah, so I really think this is a leading indicator because like interest rates reflect expectations about <em>future</em> growth, not <em>current</em> growth. So like wages would be a lagging indicator where those are only going to fall once the technology has developed. Interest rates will rise <em>once people expect</em> the technology to be developed.</p><p><strong>[36:25] Andrey Fradkin:</strong> So no, so I think we both agree with that. I&#8217;m just saying that like it&#8217;s hard for me to imagine that enough percent of capital believes that we&#8217;re going to have 30% growth without it being apparent in other economic statistics long in advance of that.</p><p><strong>[36:38] Seth Benzell:</strong> Like will we be... I guess... the people who read your paper will be convinced that AGI is coming before interest rates go up.</p><p><strong>[36:48] Basil Halperin:</strong> So that&#8217;s sort of a question of like how efficient do you think markets are plausibly, right? Is that what you&#8217;re saying?</p><p><strong>[36:58] Seth Benzell:</strong> I think that&#8217;s fair, right? Andrey is saying that the sophisticated... I mean that&#8217;s how I read it.</p><p><strong>[37:01] Andrey Fradkin:</strong> Well, one is efficiency. The other is like... let&#8217;s say for... if we thought that for AGI to happen, we needed to have substantial data center and energy build outs...</p><p><strong>[37:13] Seth Benzell:</strong> Elon&#8217;s robot factory.</p><p><strong>[37:15] Andrey Fradkin:</strong> Yeah, but to the extent of like 5% of GDP, 10% of GDP, right? Like these things will be happening. There... you know, there&#8217;ll still be uncertainty. So it&#8217;s not necessarily that it&#8217;s an efficient markets failure, but um... like what are the... you know, those are kind of the things that I&#8217;m curious about if you have any thoughts. Like what are the precursors to this moment?</p><p><strong>[37:41] Basil Halperin:</strong> So I mean, I still think interest rates can go up before... like capital takes time to build. But if the discussion is like what things will happen on the way to transformative AI, like yeah, the... what&#8217;s the line from the bard of our times, our dear leader: &#8220;everything is compute&#8221;? Like we&#8217;re going to tile the planet with computers. So like 1% of US GDP last year was hyperscaler capital expenditure.</p><p><strong>[38:15] Seth Benzell:</strong> And let me... yeah. Let me try to ask this a slightly different way, which is, I guess maybe try to make you be a little bit quantitative about how sensitive your personal predictions about TAI are based on different interest rate scenarios. So I&#8217;m going to give you a conditional expectation here. Feel free to use it or to give me a different one, but I want you to try to be quantitative if you can. What is your conditional probability of TAI within five years if the interest rate is less than 6% versus TAI in less than five years if the interest rate is above 15%? Real interest rates.</p><p><strong>[38:53] Basil Halperin:</strong> If the real interest rate is above 15%, then like if this is the real risk-free interest rate, then I think TAI is here and growth is going bananas. I think plausibly even if real interest rates are above 6%... so like the 30-year right now is like 2.6. The 10-year is like 1.8. And so like the 2.6...</p><p><strong>[39:13] Andrey Fradkin:</strong> Just to be clear to the listeners, once again, we&#8217;re talking about inflation-adjusted interest rates.</p><p><strong>[39:16] Basil Halperin:</strong> That&#8217;s important. So the 1.8% number for the 10-year real interest rate is like really in line with where things have been over the last 25 years. The 2.6 for the 30-year is like a little bit elevated. So even 6...</p><p><strong>[39:31] Seth Benzell:</strong> The numbers I were using were kind of risky equity market rates. So feel free to substitute whatever numbers you like.</p><p><strong>[39:35] Andrey Fradkin:</strong> Well that&#8217;s just a totally different object, right?</p><p><strong>[39:39] Basil Halperin:</strong> So...</p><p><strong>[39:40] Seth Benzell:</strong> Oh god. Right. Alright. So okay, risk-free rate. So right now you&#8217;re telling me we&#8217;re at what? 3%?</p><p><strong>[39:44] Basil Halperin:</strong> 2.6 for the 30-year.</p><p><strong>[39:46] Seth Benzell:</strong> 2.6. All right. So what&#8217;s your conditional expectation on TAI in five years in the future given that next year the risk-free rate is under 3%? And then what is it if the risk-free rate goes above 10%?</p><p><strong>[40:02] Basil Halperin:</strong> Again, if it goes above 10%, I think growth is going bananas. That&#8217;s a huge jump.</p><p><strong>[40:07] Seth Benzell:</strong> Anticipated growth. So you don&#8217;t even think... you think we&#8217;d see the growth before we&#8217;d see the interest rate?</p><p><strong>[40:12] Basil Halperin:</strong> Sorry, it depends on what horizon interest rate we&#8217;re talking about here.</p><p><strong>[40:15] Seth Benzell:</strong> 30-year.</p><p><strong>[40:17] Basil Halperin:</strong> If the 30-year goes up to 15? Or above 10?</p><p><strong>[40:20] Seth Benzell:</strong> 10 or 15. You choose numbers. I want you to try to be quantitative at me.</p><p><strong>[40:24] Basil Halperin:</strong> Well, so here&#8217;s the thing, here&#8217;s the thing. The interest rate at a particular horizon tells you among other things about growth expectations <em>at that horizon</em>. So you can look at the entire yield curve, interest rate at 1 year, 5 year, 10 year, 30 year, and get the expectations sort of with lots of other things going on at those different horizons. So like I wouldn&#8217;t want to just look at just the 30 year. I&#8217;d want to look at the 1, 10, 5, 30.</p><p><strong>[40:48] Seth Benzell:</strong> All right. So choose whatever... the curve is the same. Move the level up or not down.</p><p><strong>[40:53] Basil Halperin:</strong> I guess if it does it for you.</p><p><strong>[40:57] Seth Benzell:</strong> Gimme. Feed me.</p><p><strong>[41:01] Basil Halperin:</strong> Real interest rates rose two percentage points from the... two or three percentage points from the COVID depths to where they are now. And again, now they&#8217;re like sort of more or less in where they were 20 years ago. If they went up <em>another</em> percentage point, I&#8217;d be... pretty surprised and interested. How much does that raise like my probability of transformative AI in the next five years if the...</p><p><strong>[41:26] Seth Benzell:</strong> That&#8217;s the question. That&#8217;s the question. This is what your paper is about.</p><p><strong>[41:31] Basil Halperin:</strong> But again, like I&#8217;m not here to make quantitative forecasts, especially going from market prices back to probabilities. I&#8217;m here to say that there&#8217;s this...</p><p><strong>[41:43] Seth Benzell:</strong> I know, you&#8217;re making a directional argument, but give me... does it double your odds of TAI? Or I can let this go if you&#8217;re going to really refuse.</p><p><strong>[41:50] Basil Halperin:</strong> I mean, so what I can do... I can tell you what my AI timelines are and like what feeds into that and how...</p><p><strong>[41:55] Seth Benzell:</strong> Yes.</p><p><strong>[41:56] Andrey Fradkin:</strong> Let&#8217;s just do that. Yeah.</p><p><strong>[41:58] Seth Benzell:</strong> And then tell us how they would change if interest rates got up.</p><p><strong>[42:02] Basil Halperin:</strong> Okay, well, like... again, like I really emphasize that to me the right way to read this paper is this interest rate argument is like an outside view, here&#8217;s a sanity check. So like my view is much more informed by like all these other things now that I&#8217;ve spent like a whole bunch of years reading the AI literature, the AI economics literature. So for example... if you just extrapolate forward the &#8220;meter time horizon&#8221; trend that you guys have spoken about...</p><p><strong>[42:30] Andrey Fradkin:</strong> What&#8217;s the... what&#8217;s the...</p><p><strong>[42:32] Basil Halperin:</strong> ...the length of a task that... of a software engineering task, a machine learning research task that these large language models can do with 50% accuracy. If you extrapolate that trend forward... this is currently doubling every seven months or that&#8217;s what it&#8217;s been for the last six years. If you extrapolate that forward, take into account very importantly the fact that by like 2030... capital expenditures by hyperscalers can be like a trillion dollars and that scaling can&#8217;t continue. So like take into account the fact we&#8217;re going to hit the compute wall and then investment&#8217;s going to slow down. We&#8217;ll have models that can do one month tasks with 50% accuracy by I think it&#8217;s 2033. And one year tasks by 2039. This is Whitfill, Snowden, Parker&#8217;s new paper. So that&#8217;s on this narrow range of tasks done in these meter benchmarks <em>at</em> 50% accuracy: 2039, one year horizon. If you then adjust for the fact that like these are particular kinds of tasks... like I don&#8217;t know, say that adds another six years, so that&#8217;s like another six doublings or something like that. And then take into account that rather than 50% accuracy, we want 99% accuracy. That takes you like to late 2040s. I think... just like this particular stylized fact about time horizons already gets you to like fairly long potentially... at least the possibility of potentially long time horizons for AI. So that&#8217;s like...</p><p><strong>[44:12] Seth Benzell:</strong> I guess we&#8217;ll come back to this... and maybe we&#8217;ll talk about this a little bit more with your new paper where we talk about to the extent that algorithmic progress can substitute for compute progress, right? Because that&#8217;s going to be a key factor here.</p><p><strong>[44:22] Andrey Fradkin:</strong> But to be clear, let&#8217;s dwell on this a tiny bit more.</p><p><strong>[44:26] Basil Halperin:</strong> Yeah, there was a lot of sub-points in there that I went through very fast.</p><p><strong>[44:29] Andrey Fradkin:</strong> Yeah, but yeah, so I think... I think one thing, you know just Seth to your point very briefly is like the METR graph takes into account algorithmic progress. So that&#8217;s why it goes as fast as it does.</p><p><strong>[44:43] Seth Benzell:</strong> Right. But then he said he was also going to take into account... okay, anyway.</p><p><strong>[44:47] Basil Halperin:</strong> So that&#8217;s like I think one... that&#8217;s like a median view. But I think like you really have to think of terms of different scenarios. So like the &#8220;AI 2027&#8221; guys... like that report seems a little crazy, this idea that things are not just going to grow at a constant rate but are going to go hyperbolic. Like that seems a little crazy and maybe even... yeah, a little crazy. But like there is enough flesh on that argument, including this new paper Seth that you mentioned, could point towards that, that I think like you have to have <em>some</em> non-zero probability on like... maybe not literally AI 2027 but like AI before 2030.</p><p><strong>[45:27] Seth Benzell:</strong> Do you have to put non-zero probability on anything that isn&#8217;t conceptually impossible?</p><p><strong>[45:31] Basil Halperin:</strong> Yes, okay. I mean like non-1% probability. So like I put like 10 or 15% probability on like things getting really crazy before 2030. And then I put like 50 to 80% probability on something between 2035 and 2050. And then like whatever is left, 10, 20% on like some factor X... Moore&#8217;s Law slows down, energy runs out and like things take longer than 2060 or whatever. Or including never being able to develop such technology.</p><p><strong>[46:00] Seth Benzell:</strong> So did I get you right? So the median forecast is the mid 2040s for AGI? Is that what you&#8217;ve given me?</p><p><strong>[46:05] Basil Halperin:</strong> The quantitative numbers here are really hard, but yes, something like 2035 to 2050.</p><p><strong>[46:10] Andrey Fradkin:</strong> It&#8217;s not AGI to Seth. I mean... I mean it&#8217;s very different concept...</p><p><strong>[46:17] Seth Benzell:</strong> TAI, TAI. TAI is what we want to talk about. Okay. TAI, excuse me.</p><p><strong>[46:21] Andrey Fradkin:</strong> But Basil, I&#8217;m going to give you a counterpoint. I think the METR graph drastically understates the time horizon of tasks that can be done.</p><p><strong>[46:30] Basil Halperin:</strong> Understates?</p><p><strong>[46:31] Andrey Fradkin:</strong> Yes.</p><p><strong>[46:33] Seth Benzell:</strong> Because Ralph OODA Loop.</p><p><strong>[46:36] Andrey Fradkin:</strong> I mean, yeah, but broadly, right? Like a lot of these evals are doing dumb things. They&#8217;re taking a model out of the box and just asking it to do it. And that is not how you would do any task if you had to do it, right? Like you... you know, a big theme of I think our show and worldview is we believe in a multitude of models interacting in an ecosystem to produce outcomes. And the scaffolding really matters.</p><p><strong>[47:08] Seth Benzell:</strong> How we were epi-ing the Lessin-Kuld show.</p><p><strong>[47:10] Andrey Fradkin:</strong> Uh, the scaffolding matters, right? The... you can have different models from different providers interacting with each other and calling other tools. And so to evaluate the ability of just like an out of the box LLM to do a specific task... that&#8217;s never how you would actually do it in real life.</p><p><strong>[47:31] Seth Benzell:</strong> Yeah, we see this in Andrey&#8217;s data where there are, you know, very clear people use a mix of models. It&#8217;s there in the data.</p><p><strong>[47:39] Basil Halperin:</strong> Yeah, I mean I think unhobbling is like one possible reason that like there&#8217;s 15% chance that we&#8217;re colonizing the stars before 2030. That unhobbling could be enough. Leopold had it right. Maybe.</p><p><strong>[47:53] Andrey Fradkin:</strong> Yeah, yeah. I mean, for what it&#8217;s worth, I think the bigger, you know... I think the thing I agree with you more is that some of these METR tasks are really unrepresentative of most tasks in the economy. And in particular I don&#8217;t think they teach us much about robotics. And I think like robotics has to be an ingredient of any TAI scenario eventually. And so...</p><p><strong>[48:18] Seth Benzell:</strong> Only a computer scientist would think that computer science is the final task.</p><p><strong>[48:23] Basil Halperin:</strong> The strawman obviously being that, you know, a brain in a vat&#8212;the brain of the computer&#8212;can solve robotics just by doing better software on the computer. That&#8217;s the strawman.</p><p><strong>[48:32] Andrey Fradkin:</strong> Yeah, no, no. I understand, but we&#8217;re still talking about human tasks being done, you know.</p><p><strong>[48:38] Basil Halperin:</strong> Totally, totally.</p><p><strong>[48:40] Seth Benzell:</strong> A brain in the vat still needs faith in God in order to believe in the exterior world, dude. Haven&#8217;t you read your dualism?</p><p><strong>[48:49] Andrey Fradkin:</strong> Um, all right, so...</p><p><strong>[48:51] Seth Benzell:</strong> Wait, let me wrap up... I want to finish up this topic. Last question on this topic and then we can move on. Which is: okay, you&#8217;ve shot me down on asking a quantitative question about the macro. Will you give me an answer about: are <em>you</em> changing your environment... your portfolio? I mean, you said 10% chance of shit gets crazy. Sorry, that&#8217;s my one curse per episode. 10% chance. How do you allocate your assets based on that? Are you dissaving?</p><p><strong>[49:19] Basil Halperin:</strong> So like the first thing I&#8217;d say is, for someone at my stage of the life cycle, like my most important asset is my human capital. And I&#8217;ve reallocated that heavily from studying monetary policy, which was the thing I was obsessed with for years and years, to now being focused a lot on the economics of AI. So like that asset of my portfolio I&#8217;ve shifted a lot. Have I changed what my savings are...</p><p><strong>[49:44] Seth Benzell:</strong> Are you dissaving your social capital through drugs and alcohol?</p><p><strong>[49:49] Basil Halperin:</strong> Well, there&#8217;s a different consideration there where like I want to stay healthy until the singularity so I can live forever. So I think actually the consideration might go the other way in terms of intertemporal substitution. But, do I try hard to consumption smooth? Absolutely. It would bother me when people in grad school were like, &#8220;Yeah, I&#8217;m putting money into my 401k.&#8221; I&#8217;m like...</p><p><strong>[50:08] Seth Benzell:</strong> Are you putting money into your 401k?</p><p><strong>[50:11] Basil Halperin:</strong> I put the minimum amount to get the matching funds.</p><p><strong>[50:14] Seth Benzell:</strong> The minimum, dude. The minimum. I thought this was a guy who believed in his own papers.</p><p><strong>[50:17] Basil Halperin:</strong> There&#8217;s no other reason to do it.</p><p><strong>[50:21] Seth Benzell:</strong> All right, you have him, Andrey.</p><p><strong>[50:23] Andrey Fradkin:</strong> All right, all right. I think Seth has given up on life at this point. So cool. Let&#8217;s talk a little bit about your new paper with Tom Davidson, Thomas Holden, and Anton Korinek. Why don&#8217;t you tell us a little bit about the premise?<br><br><em><strong>Basil Justifies His Research:<br>When Does Automating AI Research Produce Explosive Growth?</strong></em></p><p><strong>[50:44] Basil Halperin:</strong> Yeah. So this is a paper that in some ways is about that 15% probability that things could get crazy soon. And in some ways is about some like deep or some standard economic growth theory. So the idea here is to like take seriously the structure of modern machine learning and put that, embed that into the canonical model of economic growth. Where, by that I mean like: how does AI get trained? How does it develop? Well there&#8217;s two key ingredients: software progress, hardware progress. So Moore&#8217;s Law and other trends mean that we&#8217;re able to produce more chips, better chips at lower prices over time. And algorithmic progress means that even for a fixed quantity of computer hardware, you can get more output from a computer program because we are able to write better computer programs. We are able to train better AI models.</p><p>So taking into account the fact, maybe most concretely, that OpenAI uses Nvidia chips to train better AI. And then Nvidia increasingly uses AI to design better chips. This is like Google&#8217;s AlphaChip has been put to use designing better TPUs, Google&#8217;s version of the GPU chip. So that&#8217;s like the motivation, sticking this into a canonical economic growth model, seeing what changes. What that cashes out as...</p><p><strong>[52:20] Andrey Fradkin:</strong> Yeah, so before we get deeper into the paper... isn&#8217;t the idea that research helps do... like, you know, creating new ideas accelerates economic growth through subsequent acceleration of research and development efforts already embedded in the Romer growth model? How is this different?</p><p><strong>[52:46] Basil Halperin:</strong> 100%. So what this does differently is that it says that there&#8217;s different kinds of research. So there&#8217;s like software research and there&#8217;s hardware research. And those are heterogeneous in interesting ways compared to each other, compared to you know, biomedical research or whatever. And taking seriously that heterogeneity and seeing what that heterogeneity implies.</p><p>So like in particular... one of the key lessons&#8212;so what we do in the paper is we write down a general networked semi-endogenous, like a Romer-Jones, general networked growth model. And draw out a couple of key insights I think. And so the core insights are around this idea of diminishing returns where we stand on the shoulder of giants to like... you know, we&#8217;re picking fruit from the tree of knowledge. We stand on the shoulder of giants to reach higher and higher fruits, but eventually the fruit gets harder and harder to pick because we pick all the low hanging fruit first. This idea of diminishing returns. And I think this idea of diminishing returns is like kind of obvious to economists, but it&#8217;s not always obvious in these conversations. Like the idea of an intelligence explosion, the idea of the singularity, kind of a lot of times can fail to recognize the importance of diminishing returns where there&#8217;s this idea that if you have a self-improving AI, like doing surgery on its brain to get smarter and smarter, that naturally <em>has</em> to lead to a singularity. But it doesn&#8217;t if the diminishing returns are strong enough.</p><p><strong>[54:17] Seth Benzell:</strong> Okay, so now we gotta go back to the fruit. So okay, so now earlier you were talking about there were fruits, we were going for them... Explain this concept of diminishing returns through fruit because I&#8217;m really hungry.</p><p><strong>[54:30] Basil Halperin:</strong> Yeah. So you&#8217;re hungry and so you&#8217;re picking fruit from the tree of knowledge. You pick the low hanging fruit first. And you know, that makes you stronger and gives you more energy to pick more fruit. But like eventually you pick all the low hanging fruit. And now you have to reach up and pick higher hanging fruit that&#8217;s harder to pick. And because fruit gets harder to pick&#8212;ideas get harder to find over time&#8212;you&#8217;re not just going to grow to become 100 feet tall, a thousand pounds because you&#8217;re running into diminishing returns in terms of fruit on the tree of ideas.</p><p><strong>[55:10] Seth Benzell:</strong> So it&#8217;s like I grab one fruit and that gives me the energy to eat 0.9 more fruit, which gives me the energy to have 0.9 more fruit and it kind of peters out. I&#8217;m just riffing here, but is this like... is the Garden of Eden story... is that actually about diminishing returns somehow? It&#8217;s like we&#8217;re not in Eden because we have diminishing returns from apples?</p><p><strong>[55:28] Basil Halperin:</strong> Yeah, I guess... I don&#8217;t want to say that the snake is Chad Jones because he&#8217;s the one who taught us this stuff.</p><p><strong>[55:34] Seth Benzell:</strong> No, the snake is obviously Bloom and Reenen and all...</p><p><strong>[55:38] Basil Halperin:</strong> Right, right. And Jones. Yeah, yeah. I guess so. But so exactly as Andrey said, like this is well known in the literature, this idea of diminishing returns. What we do is have this networked model where you have the software research sector and the hardware research sector interacting. There&#8217;s spillovers across sectors. And that teaches you a few things that I can talk about.</p><p><strong>[56:02] Andrey Fradkin:</strong> But so at a high level... you know, if I&#8217;m understanding the idea in the paper correctly, is that you can undo diminishing returns with a networked production function for research, if you will. Here&#8217;s a question for you: What if we took an old growth model and just did away with diminishing returns, you know, all together and we had to have <em>increasing</em> returns? Wouldn&#8217;t we also get an explosion? Like... am I interpreting things correctly there? You&#8217;re kind of trying to microfound why increasing returns <em>would</em> happen.</p><p><strong>[56:54] Basil Halperin:</strong> Yes. Yes. So to say that another way... like the original Romer model in this literature implied that there were no diminishing returns. Chad Jones comes along and points out empirically there <em>must</em> be diminishing returns. That&#8217;s because like we&#8217;ve had this constant 2% growth rate of ideas, that is 2% growth rate of total factor productivity or 1.5% percent. Meanwhile the growth rate of researchers has been 4% for like the last hundred years. So we have increasing number of scientists&#8212;like the two of you, thinking great thoughts&#8212;but we&#8217;re only producing the same growth rate of ideas of 1.5%.</p><p><strong>[57:40] Andrey Fradkin:</strong> That&#8217;s because we&#8217;re podcasting too much.</p><p><strong>[57:43] Basil Halperin:</strong> Seems plausible.</p><p><strong>[57:44] Seth Benzell:</strong> It&#8217;s for the AI. We&#8217;re improving the AI, Andrey.</p><p><strong>[57:48] Basil Halperin:</strong> Patrick Collison has this tweet that I think about a lot where he pointed out that when... when did growth in the US fall off a cliff a bit? It was like 2003 or TFP growth. And that&#8217;s you know, right when Facebook came out. Social media became the great distraction. Anyway, so yes, ideas get harder to find. That explains why growth slows down. And Andrey you point out that if you just get rid of that idea, then yeah indeed you could have a growth explosion. And indeed we are saying that spillovers across sectors can counteract those diminishing returns. And additionally, importantly, automation can also counteract the diminishing returns.</p><p><strong>[58:27] Andrey Fradkin:</strong> Another thing to say is actually, and I think this is super interesting&#8212;not something I thought about going into the paper&#8212;is that you can estimate this diminishing returns parameter, this critical diminishing returns parameter by sector. And I can explain what these numbers mean, but that number for the economy as a whole is -3. So zero would be no diminishing returns. For the economy as a whole, it&#8217;s -3. For the software sector it&#8217;s -1. For hardware, like Moore&#8217;s Law, it&#8217;s -0.2. So the hardware sector has the <em>least</em> degree of diminishing returns of any sector that&#8217;s been estimated. So you know, if compute becomes a larger share of the economy, becomes more important, then this diminishing returns just inherently will become less of a thing. And then on top of that you have this spillover issue and this automation issue I&#8217;ve hinted at.</p><p><strong>[59:17] Seth Benzell:</strong> So I know... natural question... and now I&#8217;m going to put on my applied microeconomist hat on is: where are you getting these numbers from, man? Yeah, you gotta parameterize this model.</p><p><strong>[59:33] Basil Halperin:</strong> Yeah, so this is just looking at the time series. I can spell that out and I think I have an intuitive way of doing it, but yeah this is just looking at...</p><p><strong>[59:40] Andrey Fradkin:</strong> Yeah, well let&#8217;s like walk through the hardware example. Let&#8217;s just like give us some intuition for where that number comes from. Because in my mind that seems like a really hard number to come up with even though we do have Moore&#8217;s Law, right? Yeah.</p><p><strong>[59:53] Basil Halperin:</strong> No, so the ideal here would be to run an experiment. And you know, maybe METR has enough money to do that or something and maybe they should. But the way...</p><p><strong>[1:00:00] Basil Halperin:</strong> ...the way that Bloom et al, the same paper that Seth mentioned, does this... the literature does this is the following: So say, you know, there&#8217;s like a hundred guys and gals thinking about how to improve semiconductors, how to improve hardware in the world. Fix that population. If ideas were not getting harder to find, that same hundred people would produce Moore&#8217;s Law. So Moore&#8217;s Law says that hardware productivity grows like 40% per year. That gets you the doubling every two years or more law. So something like 40%. Hundred people get 40% growth.</p><p>But we&#8217;ve had this constant 40% growth for 50 years, 60 years in hardware. But that&#8217;s required more than just like the original hundred. It&#8217;s required that that population of hardware researchers has grown by say 8%, call it, per year since the 1960s. So you&#8217;ve needed an increasing number of people to get the same progress in hardware. And so the way that that 0.2% diminishing returns number comes from is that ratio of 8% to 40%. That&#8217;s that point two.</p><p><strong>[1:01:17] Andrey Fradkin:</strong> Okay. So now I&#8217;m going to tell you... now I&#8217;m going to use <em>your</em> paper to tell you why that number is wrong. So why is that wrong? It&#8217;s because it&#8217;s not just those hardware engineers that are producing that Moore&#8217;s Law. That Moore&#8217;s Law is being produced by <em>everyone else</em> in the economy that... who is producing let&#8217;s say like design software or even, you know, like I don&#8217;t know, cell phones... like all sorts of things contribute to Moore&#8217;s Law.</p><p><strong>[1:01:47] Basil Halperin:</strong> Yes, exactly.</p><p><strong>[1:01:48] Andrey Fradkin:</strong> And then there&#8217;s also just like physical returns to scale, right? So we&#8217;re producing more and more chips so that&#8217;s a production function parameter rather than a research parameter. So I don&#8217;t... so to me it seems a little strange to like lean so heavily on that number which ignores the entire point of your paper.</p><p><strong>[1:02:10] Basil Halperin:</strong> So, so, so... a few things to say. One is...</p><p><strong>[1:02:17] Seth Benzell:</strong> I mean I... yeah, give it a shot. You can also just crawl into your closet and we can hang up now. Your choice.</p><p><strong>[1:02:22] Basil Halperin:</strong> No, no, this is basically the <em>next</em> paper that co-authors and I should write. Maybe Andrey you can co-author with us. Which is: indeed these prior estimates of these coefficients ignores exactly the factors that we discuss. So yeah, I don&#8217;t need to repeat what you said because that argument was well put and totally correct. But what that means or or as you said I think, what that means is that the degree of diminishing returns is <em>underestimated</em> because the progress is being benefited by spillovers which are not captured. So if you re-did the estimation <em>with</em> spillovers, you would find that diminishing returns is even harder and that like the singularity is less likely. Totally agree.</p><p><strong>[1:03:07] Seth Benzell:</strong> I have a separate concern about these parameters. So alright, you want to tell us about the parameters we need in order to get this hyperbolic growth, right? But it kind of really seems like once you kind of like start the hyperbolic growth, once you like get on that curve, stuff&#8217;s going to get super weird super fast. Yeah. And like wouldn&#8217;t the parameters change pretty fast? So like how can you even extrapolate from today&#8217;s parameters to this crazy regime parameters?</p><p><strong>[1:03:38] Basil Halperin:</strong> Yeah. I again am going to be in total agreement with you. I again am not someone who like wants to take macroeconomic models seriously as quantitative forecasts, but instead see them as formalized, mathematically formalized fables from which we can draw out particular insights and intuitions that were able to check are internally consistent because they&#8217;re written in language of mathematics. So that&#8217;s why the takeaway I have from writing this paper with Tom, Tom, and Anton is these ideas about: diminishing returns are important; spillovers can mitigate diminishing returns; automation can mitigate diminishing returns. And I feel pretty comfortable saying with the caveats that Andrey just emphasized, that hardware and software have less diminishing returns than other sectors. Though we should re-estimate those and hopefully will in a future paper. And that on its own is interesting. But not take super seriously like, where are we on the side of zero or negative? Are we on the side of increasing returns or decreasing returns? Like that stuff... yeah, these parameters I don&#8217;t have any reason to think those are stable as we go through 10 orders of magnitude of growth or something like that. Some people on the internet do take those that seriously and yeah, I completely agree.</p><p><strong>[1:05:01] Seth Benzell:</strong> Uh if I... okay maybe we can talk for just what... we talked about the spillovers. Maybe you want to talk for a little bit about how automation might overcome &#8220;fishing out.&#8221; If I may suggest a motto for this: &#8220;If you fish fast enough, you can outrun fishing out.&#8221;</p><p><strong>[1:05:15] Andrey Fradkin:</strong> Well maybe actually like maybe before you get to that we can just... one of the nice things about this paper is there&#8217;s like a concise message which is this Equation Number 1 in the paper.</p><p><strong>[1:05:28] Seth Benzell:</strong> Yeah the one you... the equation you just told us to not care about. Tell us about it.</p><p><strong>[1:05:32] Basil Halperin:</strong> Yeah. So I said that for the hardware sector this diminishing returns parameter is 0.2 and for the economy as a whole it&#8217;s 3. And again that was the intuition that the 8% researcher population growth versus the 40% productivity growth. Whereas if there was 0% population growth/researcher growth, then that diminishing returns parameter would be zero because you&#8217;d have zero divided by 40. Meanwhile if that number were negative, then you&#8217;d have the increasing returns and the hyperbolic growth, the singularity.</p><p>So the reason why I mentioned that is that zero there is the focal point, but really it&#8217;s like a... it&#8217;s a one plus a zero. So you have this critical condition of: are feedback effects greater than or less than one? And in like the canonical one sector model that comes down to this one diminishing returns parameter. In a networked growth model, instead of having one parameter that tells you are you having diminishing returns or non-diminishing returns, you have a spillover matrix. And the largest eigenvalue, the spectral radius of the matrix... I know you had Ben Golub on recently so...</p><p><strong>[1:06:58] Seth Benzell:</strong> Just say, say the magic word. Give the audience the Eigenvalue.</p><p><strong>[1:07:00] Basil Halperin:</strong> This is becoming the eigenvalue podcast I guess. If that largest eigenvalue is greater than one, then you have explosive growth. So &#8220;is that largest eigenvalue greater than one&#8221; can be summarized in this somewhat simple condition we have in the introduction of... it&#8217;s very loosely speaking like a weighted average of like the inverse of the diminishing returns parameter where the weights are determined by how automated is each sector. I don&#8217;t know how much sense that&#8217;s going to make out loud. In a lot of ways this paper is one of these papers where like looking at the math is actually a lot easier than saying it in words. But hopefully some of the insights have come across.</p><p><strong>[1:07:45] Andrey Fradkin:</strong> So there are these like F... F terms which are the fraction of tasks that are automated by AI. Now like the first term of your equation is F of Y, which is the share of consumption good output that is production that is automated. Am I interpreting that correctly?</p><p><strong>[1:08:07] Basil Halperin:</strong> Yes.</p><p><strong>[1:08:08] Andrey Fradkin:</strong> Okay. Now what if that&#8217;s one just by itself?</p><p><strong>[1:08:14] Basil Halperin:</strong> Right.</p><p><strong>[1:08:15] Andrey Fradkin:</strong> That means that the entirety of the economy that we would actually care about in terms of consumption is automated already. So that&#8217;s kind of... in that case we <em>don&#8217;t</em> have explosive growth. It&#8217;s kind of on the boundary condition. Is that... am I interpreting that correctly? Because things aren&#8217;t getting better, it&#8217;s just that everything we want is is just being produced automatically.</p><p><strong>[1:08:38] Basil Halperin:</strong> Right. If there&#8217;s nothing else going on, it&#8217;s right on the boundary. If you have epsilon of any other productivity growth going on or anything, you get above the exponential to super exponential.</p><p><strong>[1:08:48] Seth Benzell:</strong> It would be like unstable in some sense if you were like exactly at one.</p><p><strong>[1:08:52] Basil Halperin:</strong> Yeah, to perturbation.</p><p><strong>[1:08:56] Seth Benzell:</strong> So Basil, I guess the last question I want to ask about this paper before we move on is... so you&#8217;ve explained how there&#8217;s a bunch of different things going on in the research process in the economy that are either going to kind of accelerate research and it&#8217;s going to get stronger and stronger or might slow down research and we&#8217;re going to get diminishing returns. Two of the most important factors here are kind of this idea of spillovers across sectors, but also this idea that you might be able to automate some research, right? As you get better AIs, you might be able to get faster algorithmic improvements. When I read kind of like LessWrongers, the kind of the latter kind of seems like the show, right? If you can get the AI to write better AI algorithms, there you are. In your model is that the important factor or are they kind of them all equally important? How do you think about that?</p><p><strong>[1:09:47] Basil Halperin:</strong> Yeah, okay so let me want to say this. So the way I&#8217;d frame it is that these spillovers... or sorry, the diminishing returns limit the effects of AI progress. Spillovers in some like static sense... like we don&#8217;t think of spillovers as changing much over time. The innovation network doesn&#8217;t change much. But we think of as the economy grows, more and more tasks are getting automated. So spillovers provide some like static offset to the diminishing returns, whereas as automation increases, it&#8217;s continually offsetting diminishing returns. So I guess in like a dynamic sense, perhaps automation is more important. But sort of in the almost static way that we incorporate automation... either one is equally powerful in offsetting diminishing returns if you sort of do the comparative static. But in the sense of automation is the thing that actually changes over time, that&#8217;s the more important one.</p><p><strong>[1:10:47] Seth Benzell:</strong> Okay. Stands to reason.</p><p><strong>[1:10:49] Basil Halperin:</strong> If I can add one more thing about paper actually. So I didn&#8217;t mention one critically important limitation. So if you talk to economists about what will prevent AI from leading to explosive growth, I think we say one of two things. One is the diminishing returns. That&#8217;s that&#8217;s what this whole discussion has been focused on. But the other one is this idea of bottlenecks: that even if you have really fast progress in software engineering, then if you don&#8217;t have progress in the robotics side of the econ, the physical side, then that will bottleneck the growth if these sectors are complements.</p><p><strong>[1:11:24] Seth Benzell:</strong> Yeah, and the essential thing is going to be the elasticity of substitution across sectors. Yeah.</p><p><strong>[1:11:28] Basil Halperin:</strong> Right. And so we completely ignore the bottlenecks issue. We&#8217;re just focused on this diminishing returns idea, which to my mind is <em>not</em> a claim that there&#8217;s not bottlenecks. I think bottlenecks are super important. I think like there&#8217;s a 5 or 10% chance bottlenecks aren&#8217;t important&#8212;hence my earlier timelines forecast&#8212;but like...</p><p><strong>[1:11:47] Seth Benzell:</strong> We all get uploaded. I mean yeah, there&#8217;s a universe where we all just get uploaded and like who cares that we don&#8217;t have robots for a while.</p><p><strong>[1:11:53] Basil Halperin:</strong> Yeah or something like that. But yeah, the focus... the paper is meant to just like zoom in on the diminishing returns logic and to turn off the bottlenecks. But that&#8217;s important when thinking about how to quantitatively interpret the paper.</p><p><strong>[1:12:08] Seth Benzell:</strong> There you go. Basil admits to one possible drawback to his paper. All right.</p><p><strong>[1:12:13] Basil Halperin:</strong> That&#8217;s all you&#8217;ll get from me.</p><p><strong>[1:12:15] Andrey Fradkin:</strong> Now I wanted to ask one more question actually because we&#8217;re natural right here and then we can go to the next topic. Which is like: how have you found the profession&#8217;s reaction to these sorts of exercises? Like you know, I can tell you what I... various opinions I&#8217;ve heard, but I&#8217;m curious like you were... you&#8217;re an author of these types of papers, so what has been your reaction? What has been like the feedback you&#8217;ve gotten? Yeah.</p><p><strong>[1:12:43] Basil Halperin:</strong> I&#8217;m so curious about your experience. I have limited experience submitting these things through the publication process still because publishing takes so long. Yeah, I&#8217;ve only started submitting recently. Um, I guess what I would say is that like I feel like views on this are kind of polarized where some people are like, &#8220;This is super interesting and I&#8217;m glad to see economists taking this seriously as opposed to like wordcel mumbo jumbo from Silicon Valley or something like that.&#8221; Which I don&#8217;t want to say that I endorse that criticism, but some people have that criticism. And other people are like &#8220;This is...&#8221;</p><p><strong>[1:13:16] Seth Benzell:</strong> This is a pro-wordcel podcast. You&#8217;re safe here.</p><p><strong>[1:13:19] Basil Halperin:</strong> Yeah. Or are you calling yourself a shape rotator? Whatever.</p><p><strong>[1:13:24] Seth Benzell:</strong> I&#8217;ll leave that up to you two. This podcast cannot rotate very many shapes. But that&#8217;s a topic for another episode.</p><p><strong>[1:13:32] Basil Halperin:</strong> So that&#8217;s like really all to say that like to me it&#8217;s like too soon for me to say. And that&#8217;s why I would love to know what your experience is.</p><p><strong>[1:13:42] Seth Benzell:</strong> My experience is that I found it completely impossible to publish and ended up having to publish a book. Yeah I think Seth has been trying to... Seth has been trying to publish this style of work for a very long time and the profession is not very interested, right?</p><p><strong>[1:13:58] Andrey Fradkin:</strong> I would say opinions are changing, but I think the people have been battered for so long into being obsessed with like very micro identification... and given I&#8217;m not a macroeconomist... but like at least on the micro side that a lot of microeconomists just don&#8217;t consider it you know scientific unless there&#8217;s a tight identification argument. Or there&#8217;s an inherent skepticism of theory in some sense, which I do share to a large extent, which is that you can kind of get anything to happen if you&#8217;re a good theorist. And then it&#8217;s pretty hard to adjudicate between theories. And then to the extent that, you know, transformative AI is a mostly theoretical field at this point... it&#8217;s hard to adjudicate between transformative AI theories. So I think I&#8217;ve grown a lot more favorable to this type of work obviously over time because I just think like we might as well be working on the most important topics even if we can&#8217;t answer them as precisely. But I think a lot of people...</p><p><strong>[1:15:09] Seth Benzell:</strong> Yeah, rather than just looking under the street light. Yeah.</p><p><strong>[1:15:12] Andrey Fradkin:</strong> Exactly. Yeah. A lot of people are just not comfortable with that level of speculation. Yeah.</p><p><strong>[1:15:18] Basil Halperin:</strong> &#8220;This is so dumb,&#8221; some might even say. No, yeah. Getting untethered from reality is like such a real risk on these big questions. In macro in general it&#8217;s so hard and you definitely see that happening. So it&#8217;s fair, it&#8217;s tough.</p><p><strong>[1:15:48] Andrey Fradkin:</strong> I mean I think one of the interesting things that you did, right, is posted it on LessWrong. And in some sense like that has been more influential than any paper economics version of this paper that you could have ever written. For sure. Which says something.</p><p><strong>[1:16:03] Basil Halperin:</strong> So to clarify for listeners, originally this was just some some shitpost. This was a blog post that I put out because like I was getting in fights with some friends in group chats and I was like, &#8220;Well the market doesn&#8217;t believe what you guys have to say.&#8221; And yeah and like it wasn&#8217;t going to be a paper and it just... it got such positive feedback that like it seemed like the demand was there for it to be developed a bit further into a paper. Uh, and in some ways I think that maybe I should instead of spending thousands and thousands of hours polishing papers before putting them out, I should be putting more out as blog posts first to...</p><p><strong>[1:16:40] Seth Benzell:</strong> Dude, honestly yes. Because if you&#8217;re asking like my honest advice, I think when it comes to this TAI stuff there&#8217;s so much taste at the evaluation level that like spending another thousand hours polishing the same idea, the marginal returns are pretty low. At least as a practical careerist observation. If you feel like you&#8217;re learning, keep going.</p><p><strong>[1:16:59] Andrey Fradkin:</strong> Well I do think that you know, if you get it... you know, for the profession, if you get into a top five journal there are obviously enormous rewards. But I think like there&#8217;s a risk of like polishing it for like some you know specialist field journal and still spending two years on it. I mean it almost makes one think that like you know there should be a new journal of Transformative AI Economics. I&#8217;m sure Anton has suggested something like that.</p><p><strong>[1:17:27] Seth Benzell:</strong> Yeah, okay that&#8217;s what I was... maybe can we talk for a minute about your department? Which sounds so cool. You&#8217;ve got Anton Korinek who I remember back when he was doing macroprudential policy. I was like, &#8220;This is one smart cookie. I want to see where... let this guy cook.&#8221; What&#8217;s it like working with him? What&#8217;s this TAI department you guys are setting up?</p><p><strong>[1:17:44] Basil Halperin:</strong> Yeah. So Anton has, yeah, been interested in the economics of transformative AI for longer than almost anyone, right? Like somehow back in 2016 he was thinking about this stuff. I&#8217;m still a little confused how he got into this so early. I think he did like a master&#8217;s in computer science maybe and had this in the back of his head. But yeah, so he&#8217;s managed to get a bunch of money to start this Economics of Transformative AI Institute here at the University of Virginia. Which is very cool. So me, Anton, and Lee Lockwood, who is a public finance economist, are sort of the three folks here who have written papers at least on the topic. And yeah I don&#8217;t know, trying to get folks to think more about the issue and write some research.</p><p><strong>[1:18:28] Seth Benzell:</strong> What is it like working with Anton? Do you just like sit down with him and he&#8217;s like, &#8220;I already have solved all of the problems&#8221; and you just like you take notes on him as he dictates to you? What is it like collaborating with a guy like that?</p><p><strong>[1:18:39] Basil Halperin:</strong> What can I say? I mean yeah, Anton&#8217;s been thinking about these issues for a long time. I can recommend his Coursera on the topic. In fact I went through that during the depths of the pandemic where he talks about the macroeconomics of AI and some models, Shannon information theory and interesting things. Yeah.</p><p><strong>[1:19:00] Andrey Fradkin:</strong> Shannon information theory gets you to scaling laws? How does that come in?</p><p><strong>[1:19:04] Basil Halperin:</strong> I don&#8217;t remember why he was teaching that but I was you know interested in the topic.</p><p><strong>[1:19:08] Seth Benzell:</strong> This is neat. I&#8217;m Anton Korinek and this is what smart people think is fun.</p><p><em><strong>Basil Justifies His Blog Posts:<br>Optimal Taxation in the Age of AI</strong></em></p><p><strong>[1:19:16] Seth Benzell:</strong> You recently got in a Twitter back and forth with other friend of the show Phil Trammell about optimal tax policy. You posted this really spicy meme of the two astronauts on the moon...</p><p><strong>[1:20:00] Seth Benzell:</strong> ...and there&#8217;s the Puerto Rican astronaut with the gun to the American astronaut saying...<br><br><strong>[1:20:00] Seth Benzell:</strong> ...and the American astronaut says, &#8220;So, even in the age of TAI, Pigouvian and Georgist taxation is the right way to go?&#8221; And then the Puerto Rican says, &#8220;Always has been.&#8221; Would you explain the context of you posting that meme, the Phil and Dwarkesh post, and how people should understand that?</p><p><strong>[1:20:27] Basil Halperin:</strong> So yeah, Phil Trammell, Dwarkesh Patel... two guys that anyone interested in this stuff should be reading or following, listening to. Admittedly, Dwarkesh is a competitor of you two...</p><p><strong>[1:20:39] Andrey Fradkin:</strong> No, no, no. We believe in coopetition.</p><p><strong>[1:20:41] Seth Benzell:</strong> We&#8217;re cooperating... everyone should listen to both of our podcasts. We&#8217;re complements.</p><p><strong>[1:20:46] Basil Halperin:</strong> Nice.</p><p><strong>[1:20:47] Andrey Fradkin:</strong> We are actually complements, to be clear.</p><p><strong>[1:20:54] Basil Halperin:</strong> So yeah, they wrote this great post, &#8220;Capital in the 21st Century,&#8221; playing on Piketty, saying Piketty was right in the past, but will be right in the future. And made this argument that as more of the economy gets automated, labor income will no longer be a sufficient tax base, and that power will be unequally distributed because capital income is so highly concentrated.</p><p><strong>[1:21:24] Seth Benzell:</strong> Feels like these are three separate arguments already.</p><p><strong>[1:21:27] Basil Halperin:</strong> There&#8217;s a couple different arguments in this piece, yes. And yeah, calling for capital taxation in the future, both for redistribution purposes of financial resources and to prevent sort of power concentration, is how I interpreted the piece.</p><p><strong>[1:21:44] Seth Benzell:</strong> But I was taught in public finance class that capital taxation is bad.</p><p><strong>[1:21:48] Basil Halperin:</strong> Yeah, I think there&#8217;s a lot of logic to that argument. So yeah, I wrote this thread just making a couple points. One of which is based on&#8212;we were just talking about my colleagues Anton and Lee, Anton Korinek and Lee Lockwood&#8212;so they had a recent paper summarizing sort of how should we think about public finance in a transformative AI world. So like take an AK economy, so an economy where all production is done by capital, no labor involved. What is optimal taxation in that world? And they point out or they show that consumption taxation is still optimal rather than introducing capital taxes. As long as you can raise enough revenue from that consumption taxation to fund whatever you need to fund. So that was like a first point I was making, that consumption taxation is going to dominate capital taxation.</p><p><strong>[1:22:42] Seth Benzell:</strong> Let&#8217;s pause there for a second. Because I feel like all of my normie friends don&#8217;t understand this point. And in fact my advisor once, he tells me this story&#8212;I mean I assume it&#8217;s true&#8212;where he had like a half hour meeting with Bernie Sanders where he was trying to explain to him why consumption taxation is better for poor people than capital taxation. And Bernie Sanders&#8217; brain was like, &#8220;But, but poor people no have capital.&#8221; Explain to a normie: why is consumption taxation considered preferred to capital taxation? Because only rich people have capital, right?</p><p><strong>[1:23:14] Basil Halperin:</strong> So let&#8217;s see if I can do this with the caveat that I&#8217;m not a public finance economist, I just play one on Twitter. So the intuition I always come back to is this one that capital taxation is equivalent to explosive consumption taxation. So what do I mean by that? If I save... so you know, the University of Virginia pays me one dollar. I can either use that to go like buy a candy bar today or I can save that to tomorrow.</p><p><strong>[1:23:41] Seth Benzell:</strong> But you don&#8217;t save it because of TAI.</p><p><strong>[1:23:43] Basil Halperin:</strong> But I won&#8217;t save it because of TAI, indeed. I got to go party. And consumption taxation would be taxing that purchase of the candy bar. Capital taxation, taxing the savings. And if I save the dollar to tomorrow and try and buy a candy bar tomorrow... the capital taxation then would just be taxing consumption tomorrow differently than consumption today. And do we... like if we&#8217;re trying to equalize consumption across people, does it make sense to tax people who consume in the future rather than consume today? Like what&#8217;s the difference there? Is like one intuition pump. Honestly, like again, I&#8217;m not a public finance economist, I&#8217;m not sure on the spot I&#8217;m going to give the clearest exposition.</p><p><strong>[1:24:38] Seth Benzell:</strong> No, I think that was pretty good. I think that was pretty clear. Okay, but then the memes about Pigouvian and Georgist taxation.</p><p><strong>[1:24:45] Basil Halperin:</strong> Right, right. So first point, consumption taxation dominates capital taxation anyway. A bigger picture point that isn&#8217;t AI specific but does apply to the AI world is that we have these other taxes that not only are they less distortionary than consumption taxation, they might even be efficiency enhancing. So those taxes are taxes of externalities&#8212;Pigouvian taxes&#8212;should we tax carbon? Should we tax pollution? And Georgist style taxes where you tax owners of unimproved land or unimproved natural resources. People who just by luck and by happenstance happen to find out they have an oil well under their house. Like there&#8217;s no economic efficiency, and arguably no moral reason for those people to earn rents from the fact that all of a sudden, whoa, there&#8217;s a gold mine under my house.</p><p>So today, we should be taxing externalities to fix those negative externalities. Today we should be redistributing the pure rents of unimproved land, unimproved fixed resources. And that will only remain true in an AI driven economy. And those natural resources will become even more important in an AI driven economy where there are no scarce... there&#8217;s no scarce labor, there&#8217;s no scarce capital. The only thing that is scarce is natural resources. All that said, like I&#8217;ve mentioned this caveat that: are those taxes enough to fund the necessary redistribution or the necessary government spending?</p><p><strong>[1:26:28] Seth Benzell:</strong> Land is the only scarce factor. You must imagine its price will be quite high.</p><p><strong>[1:26:32] Basil Halperin:</strong> Yeah, in the limit, you would really think so. Maybe on the transition path... so this is a very good point that Phil made in the Twitter discussion of like, how quickly will the natural resource share rise? It&#8217;s not clear. I would be so interested if someone could answer that question in a convincing way or something.</p><p><strong>[1:26:47] Andrey Fradkin:</strong> I don&#8217;t know. I think robots will be able to mine on the moon pretty efficiently, personally.</p><p><strong>[1:26:55] Basil Halperin:</strong> And so natural resources won&#8217;t be scarce, is what you&#8217;re saying?</p><p><strong>[1:26:58] Andrey Fradkin:</strong> Well, there&#8217;s a lot of natural resources on the moon.</p><p><strong>[1:27:01] Basil Halperin:</strong> Are there? On the moon?</p><p><strong>[1:27:04] Andrey Fradkin:</strong> I think so, yeah.</p><p><strong>[1:27:06] Seth Benzell:</strong> We got red rocks. You can make robots out of red rocks, right?</p><p><strong>[1:27:10] Andrey Fradkin:</strong> I mean you can also do all sorts of things...</p><p><strong>[1:27:12] Seth Benzell:</strong> Silicon! It&#8217;s silicon, dude!</p><p><strong>[1:27:14] Andrey Fradkin:</strong> You can also, you know, like have a ton of solar panels on the moon and then use energy to run fusion and fission reactions to get any resource you want.</p><p><strong>[1:27:28] Seth Benzell:</strong> It&#8217;s different timelines. Different horizons.</p><p><strong>[1:27:33] Basil Halperin:</strong> Different time horizons actually is I think a big part of the reason for disagreements on this. But um, like the rents in the economy have to go somewhere, right? If labor&#8217;s not earning it and capital&#8217;s not earning it.</p><p><strong>[1:27:48] Seth Benzell:</strong> In a pure AK economy, there are no rents. It&#8217;s just A and K, dude.</p><p><strong>[1:27:52] Basil Halperin:</strong> Right, right. The returns have to go somewhere. The returns above replacement maybe is one way of putting it. So anyway, that&#8217;s the source of the meme. Like why hasn&#8217;t anyone estimated whether we could just fund the US government by taxing externalities, by taxing land? Like someone should have done that, especially these Georgists obsessed...</p><p><strong>[1:28:13] Andrey Fradkin:</strong> No, no, I think... well, I think the externalities... I mean our friends in environmental economics have definitely, you know... I think Larry Goulder has a bunch of work on estimating Pigouvian taxes in general equilibrium.</p><p><strong>[1:28:28] Basil Halperin:</strong> Read it.</p><p><strong>[1:28:29] Andrey Fradkin:</strong> I don&#8217;t think... I don&#8217;t think it gets you there. But Georgist taxes... I can imagine it can get you pretty far.</p><p><strong>[1:28:39] Andrey Fradkin:</strong> Well cool. Uh, thanks so much for joining us. It&#8217;s been a fascinating discussion. Any final notes for our listeners? Anywhere they want to check out, in addition to your website?</p><p><strong>[1:28:53] Basil Halperin:</strong> Yeah, feel free to send my papers. That&#8217;s a great decision. And of course, on Twitter and Seth&#8217;s as well.</p><p><strong>[1:28:59] Seth Benzell:</strong> [Laughs] Great.</p><p><strong>[1:29:01] Andrey Fradkin:</strong> All right. Well, thanks for... thanks for coming on and keep your posteriors justified.</p><p><strong>[1:29:07] Basil Halperin:</strong> Thanks, Andrey.</p><p></p><p></p><p><br><br></p>]]></content:encoded></item><item><title><![CDATA[The Consensus Bottleneck: Why AI Won't Automate Organizations as Fast as It Automates Code]]></title><description><![CDATA[A common theme in discussions about AI and productivity is what happens after we&#8217;ve automated coding.]]></description><link>https://empiricrafting.substack.com/p/the-consensus-bottleneck-why-ai-wont</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/the-consensus-bottleneck-why-ai-wont</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Wed, 04 Feb 2026 21:41:18 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!HVl5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>A common theme in discussions about AI and productivity is what happens after we&#8217;ve automated coding. One version of the story goes: once coding is automated, everything else follows, and productivity explodes&#8212;or, alternatively, labor&#8217;s share at technology companies collapses and we face an economic apocalypse.</p><p>This is a plausible story. But I think there will be substantial barriers to transforming firms into little more than automated coding systems plus a CEO. The reason is that much of the work done in large organizations isn&#8217;t actually producing code, manufacturing products, or whatever else we think of as &#8220;true work.&#8221; A lot of the work is people meeting with each other.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Justified Posteriors! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HVl5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HVl5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg 424w, https://substackcdn.com/image/fetch/$s_!HVl5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg 848w, https://substackcdn.com/image/fetch/$s_!HVl5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!HVl5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HVl5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg" width="735" height="307" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:307,&quot;width&quot;:735,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Pin by Emily Ortega on batman begins | Batman begins, Batman&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Pin by Emily Ortega on batman begins | Batman begins, Batman" title="Pin by Emily Ortega on batman begins | Batman begins, Batman" srcset="https://substackcdn.com/image/fetch/$s_!HVl5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg 424w, https://substackcdn.com/image/fetch/$s_!HVl5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg 848w, https://substackcdn.com/image/fetch/$s_!HVl5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!HVl5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F990a0145-dc4d-4e28-bb9f-6ed9e42cd6c8_735x307.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>What Meetings Are Actually For</strong></p><p>Why do employees of organizations spend so much time in meetings? When we introspect, the answer becomes clear. Meetings exist to create decisions. Multiple actors with stakes in a situation gather context, exercise judgment about that context, and come to agreement about what to do next. That process&#8212;unfortunately or fortunately for human labor&#8212;currently happens in human brains, not AI systems.</p><p>Even if AI wrote all the software at a large company, humans would still meet to decide what that software should do. They&#8217;d still make decisions about marketing budgets, compensation, strategic direction, partnerships, and countless other matters beyond product development. </p><p><strong>The Limits of AI Judgment</strong></p><p>Technologists will naturally respond that there&#8217;s nothing special about human judgment. AI can make these judgments, or multiple AI agents can converse to reach decisions&#8212;perhaps better ones&#8212;enabling fully autonomous firms. I don't think anything deep prevents this in principle.</p><p>But there&#8217;s a crucial constraint: existing organizations are meant to serve human preferences. When firms decide how to produce something, they&#8217;re ultimately serving the owners, who are human. In a more indirect way, they are also serving the preferences of customers, who are also human at some point down the supply chain. Until AI can literally read minds or predict human wants with very high accuracy, humans will remain essential to decision-making at some point. </p><p><strong>Firms as Political Structures</strong></p><p>Even setting aside the question of whether AI <em>could</em> replace human judgment, there&#8217;s a separate question of whether existing firms <em>will</em> allow it. Firms are political structures with power centers and veto players. Decisions can&#8217;t be made unilaterally. To launch a new product, change an existing one, or even swap out a model powering a feature, many people must be involved. As long as those people remain employed in those positions, they must participate in meetings, read the documents, and establish common knowledge that everyone is aligned.</p><p>This consensus culture may produce better decisions&#8212;more minds, more constituencies, more concerns addressed. But it dramatically slows everything down. Code that could be written and shipped in a day might still take months to actually deploy.</p><p><strong>The Path Forward: New Firms</strong></p><p>It&#8217;s hard to be optimistic that existing large firms will successfully shed this consensus culture. Instead, I expect many economic functions will be taken over by new firms&#8212;firms organized from the start to minimize human consensus as a bottleneck. These firms will use speed to outmaneuver existing larger firms in many markets for reasons John Boyd captured in the <a href="https://en.wikipedia.org/wiki/OODA_loop">OODA loop</a>.</p><p>There are a variety of ways in which these new firms may be structured. For example, managers might be represented by AI agents in meetings, employees might be replaced by agents altogether, or individual managers might have more unilateral decision rights rather than requiring broad alignment. We&#8217;ll see many such firms emerge, and as with any process of creative destruction, equilibrium will reveal which organizational forms survive.</p><p>But it&#8217;s worth keeping these basic forces in mind: the bottleneck to AI-driven productivity at the moment isn&#8217;t writing the code. It&#8217;s getting humans to agree on what to do with it.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Justified Posteriors! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Can an AI Interview You Better Than a Human?]]></title><description><![CDATA[Watch now | Voice AI in Firms: A Natural Field Experiment on Automated Job Interviews]]></description><link>https://empiricrafting.substack.com/p/can-an-ai-interview-you-better-than</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/can-an-ai-interview-you-better-than</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Mon, 26 Jan 2026 13:04:33 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/185735648/502677af2b3ba5796fa1b3a9d3d7ff58.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>We discuss &#8220;<a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5395709">Voice in AI Firms: A Natural Field Experiment on Automated Job Interviews</a>&#8221; by Brian Jabarian and Luca Henkel. The paper examines a randomized experiment with call center job applicants in the Philippines who were assigned to either AI-conducted voice interviews, human interviews, or given a choice between the two.</p><p><strong>Key Findings:</strong></p><ul><li><p>AI interviews led to higher job offer rates and proportionally higher retention rates</p></li><li><p>No significant difference in involuntary terminations between groups</p></li><li><p>Applicants actually <em>preferred</em> AI interviews&#8212;likely due to scheduling flexibility and immediate availability</p></li><li><p>AI interviewers kept conversations more on-script with more substantive exchanges</p></li><li><p>Online applicants saw especially large gains from AI interviews</p></li></ul><p><strong>Topics Discussed:</strong></p><ul><li><p>The costs of recruitment and why interview efficiency matters</p></li><li><p>Whether AI interviews find different workers or just reduce noise in screening</p></li><li><p>How human recruiters interpret AI interview transcripts differently</p></li><li><p>The &#8220;Coasean singularity&#8221; question: Will AI improve labor market matching overall?</p></li><li><p>Limitations: scheduling confounds, external validity beyond call centers, unmeasured long-tail outcomes</p></li><li><p>The coming arms race between AI interviewers and AI-coached applicants</p></li></ul><p><strong>Posterior Updates:</strong></p><p>On the usefulness of current AI for job hiring:</p><ul><li><p>Seth: 40% &#8594; 90% confidence AI works for call center jobs; modest update for general jobs</p></li><li><p>Andrey: 20% &#8594; 75% for call centers; 1% &#8594; 5% for general interviews (&#8220;we need to reorganize all of hiring first&#8221;)</p></li></ul><p>On whether AI will improve job matching significantly on net in the next 5-10 years</p><ul><li><p>Andrey: 55% &#8594; No Update</p></li><li><p>Seth: &#8220;A bit more optimistic than Andrey&#8221; &#8594; +1pp update</p></li></ul><p><strong>Referenced Work/Authors:</strong></p><ul><li><p><strong><a href="https://www.predictionmachines.ai/">Prediction Machines</a></strong> </p></li><li><p>Related episode on AI and labor signaling with Bo Cowgill.</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;600c6049-2aed-4d20-a7fe-3675ea3daeeb&quot;,&quot;caption&quot;:&quot;In this episode, we brought on our friend Bo Cowgill, to dissect his forthcoming Management Science paper, Does AI Cheapen Talk? The core question is one economists have been circling since Spence drew a line on the blackboard: What happens when a technology makes costly signals cheap?&quot;,&quot;cta&quot;:&quot;Watch now&quot;,&quot;showBylines&quot;:true,&quot;showDescription&quot;:true,&quot;showImage&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Does AI Cheapen Talk? (Bo Cowgill Pt. 1)&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:191755003,&quot;name&quot;:&quot;Andrey Fradkin&quot;,&quot;bio&quot;:&quot;Professor writing about AI, digital technology, marketing, economics, and academia. Also, some personal introspection along the way.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!qqBF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb729e424-5fcf-4691-886d-a65500401344_1175x1177.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null},{&quot;id&quot;:3215096,&quot;name&quot;:&quot;Seth Benzell&quot;,&quot;bio&quot;:&quot;Co-Host of Justified Posteriors Podcast https://empiricrafting.substack.com/podcast&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F1351ec23-f5f1-4613-8844-04c8f814335b_1030x687.jpeg&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2025-11-18T04:46:53.244Z&quot;,&quot;cover_image&quot;:&quot;https://substack-video.s3.amazonaws.com/video_upload/post/179209235/a95b5673-7368-481f-b6c0-b5bba00f54c0/transcoded-00001.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://empiricrafting.substack.com/p/does-ai-cheapen-talk-bo-cowgill-pt&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:&quot;a95b5673-7368-481f-b6c0-b5bba00f54c0&quot;,&quot;id&quot;:179209235,&quot;type&quot;:&quot;podcast&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:2684979,&quot;publication_name&quot;:&quot;Justified Posteriors&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!JrtW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04c2b84-5e8f-43d1-b922-74edea8b528a_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div></li></ul><div><hr></div><p><strong>Transcript:</strong></p><p>[00:00:00] INTRODUCTION</p><p>Seth: Welcome to the Justified Posteriors podcast, the podcast that updates its priors about the economics of AI and technology. I&#8217;m Seth Benzell, an interviewer who will never stick to a standard script, coming to you from Chapman University in sunny Southern California.</p><p>Andrey: And I&#8217;m Andrey Fradkin, counting down the days until I can use an AI to pre-interview my podcast guests to see if they deserve to be on the show. Coming to you from San Francisco, California.</p><p>Seth: I don&#8217;t know. I think our filtering criteria is pretty good.</p><p>Andrey: I know.</p><p>Seth: Right. That&#8217;s one job we never want to automate&#8212;who becomes a friend of the podcast. That&#8217;s an un-automatable job.</p><p>Andrey: But it would be nice to pre-interview our guests so that we could prepare better for the actual show.</p><p>Seth: I was thinking about this, because there&#8217;s two possibilities, right? You do the pre-interview, and you get an unsurprising answer in this sort of pre-interview, and then that&#8217;s good, and then you should go with it. And then if you get a surprising one, then you would lean into it. What would you even get out of the pre-interview?</p><p>Andrey: Maybe what the guests would want to talk about.</p><p>Seth: Okay.</p><p>Andrey: But I agree with you. Mostly, it&#8217;s just hearing the guest talk, and then thinking about, &#8220;Oh, this is something that we want to really dig into,&#8221; versus, &#8220;This is something that might be not as interesting to our audience,&#8221; and knowing that ex ante.</p><p>[00:02:00] SETTING UP THE TOPIC</p><p>Seth: Yeah. We&#8217;ve been... So we&#8217;re talking about interviews. You&#8217;ll remember in a recent episode, we just talked to our friend Bo, who&#8217;s doing work on how maybe job applications are changing because of AI. So now I think what we want to think a little bit about is how job interviews are changing because of AI. Maybe we&#8217;ve heard before about how AI is changing how people talk to the hirer. Maybe we want to hear a little bit about how AI is changing how the hirer solicits information in an interview. We&#8217;ve got a very interesting paper to talk about just about that. But do you remember the last job interview you did, Andrey?</p><p>Andrey: Yes.</p><p>Seth: How did it go? Did you have fun? Did you feel like you stayed on topic?</p><p>Andrey: It was a very intense set of interviews that required me to fly halfway across the world, which was fun, but exhausting.</p><p>Seth: So fun. So you would describe the interview as a fun experience? Did you get more excited about the job after doing the interview?</p><p>Andrey: Yes, although I ultimately didn&#8217;t take it, but I did get&#8212;you know, I was impressed by the signaling value of having such an interview.</p><p>Seth: So the signaling value. So in other words, the signal to you from the interviewer about the fact that they were going to invest this much time. Is that right? It&#8217;s that direction of signal?</p><p>Andrey: Yes, yes. And also the sorts of people who they had talking to me, and just the fact that they were trying to pitch me so hard. Now, certain other companies lacked such efforts.</p><p>Seth: Right. So it seems like one important aspect of an interview is what the interviewee learns from the interview. But what about the other side? Do you feel like your interviewer learned a lot about you, or enough to justify all that time and expense?</p><p>Andrey: I&#8217;d like to think so. I mean, I&#8217;m not them, so I can&#8217;t really speak on their behalf. But it did seem like the interview process was fairly thought out for a certain set of goals, which might differ across companies. What about yourself, Seth?</p><p>Seth: Thank God, it has been a long time ago that I interviewed for a job, and I can tell you exactly what happened. I was on the academic job market, but I did throw out a couple of business applications, and so I got an interview at Facebook. Headed out to their headquarters, did all of the one-on-one interviews, and then there was a code screen, and I was not grinding LeetCode for the last five months and completely bombed it. And they said, &#8220;Thank you very much for your time.&#8221; So that was an example of, I think they probably could have saved the time for the interview if they had given me the code screen first.</p><p>Andrey: It&#8217;s funny, there was a time in my life where I interviewed at Facebook, too. I mean, this is probably 2014 or something.</p><p>Seth: Mm-hmm, mm-hmm.</p><p>Andrey: And they did do the coding screen before.</p><p>Seth: Who knows? Who knows, dude?</p><p>[00:05:15] THE PAPER</p><p>Seth: Okay, so interviews, we do them. People seem to give information, take information from them. How can this be made more efficient with AI? That&#8217;s today&#8217;s question. In order to learn more about that, we read Voice in AI Firms: A Natural Field Experiment on Automated Job Interviews, by friend of the show, Brian Jabrian and Luca Henkel. I was interested in this paper because it&#8217;s kind of an interesting flip side of what we just saw from Bo.</p><p>I guess before we talk too much about what the paper actually does, it&#8217;s time for us to go into our priors.</p><p>&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;</p><p>[00:06:00] PRIORS</p><p>Seth: Okay, so Andrey, when we&#8217;re thinking about AI being used in interviews, what sort of thoughts do you have about that going in? What sort of priors should we be exchanging?</p><p>Andrey: Yeah, I mean, I think just when I first saw this paper, I was kind of surprised that we were there already, honestly. I think interviewing via voice is a pretty delicate thing, and the fact that AI is potentially able to do it already was&#8212;I hadn&#8217;t been thinking&#8212;I didn&#8217;t think we were there yet, and I think just the very existence of this paper was a bit of a surprise when I first saw it.</p><p>But I guess a first natural prior that we can think about is: is using an AI to interview someone rather than using a human to interview someone, is that better or worse, or how do we think about that?</p><p>So, Seth, what do you think?</p><p>Seth: Well, it&#8217;s a big question, Andrey. I guess my first response is, like we always say in this podcast, context matters, partial equilibrium versus general equilibrium matters. The context that we&#8217;re going to be looking at in the paper is call center workers. So maybe I&#8217;ll give kind of a different answer for short-term call center workers than maybe longer term economy as a whole.</p><p>When I think about call center workers, I think about a job that seems to be&#8212;no offense to our friends of the show out there who are call center workers&#8212;but this does seem like one of the jobs that is going to be the first to be automated with generative AI, or most at risk, especially kind of low-skilled call center work. So if there was going to be any sort of domain where you could automatically verify whether someone was good at it, intuitively, it would be the domain that you&#8217;re kind of close to automating anyway. So if it was going to work anywhere, I would say it would work here.</p><p>And yet still, call center work, you might imagine, it requires a lot of personal empathy, it requires maybe some subtleties of voice and accent that an AI might not identify or even might hesitate to point out such deficits. I would say I kind of went in with the idea that for call center workers, maybe there&#8217;s a forty percent chance that AI would be better than a human interviewer. So maybe it&#8217;s slightly unlikely that it would be better. But if we were to expand out to kind of knowledge work as a whole, I would be more, even more pessimistic, maybe only a twenty-five percent chance or lower that the AI interviewer would be better. What do you think?</p><p>Andrey: Well, how would you&#8212;what do you mean by better?</p><p>Seth: Oh, well, better in terms of the hire is ultimately the correct match, right? That&#8217;s going to be operationalized in a specific way in this paper, what... How they&#8217;re going to measure better match, but, yeah, that&#8217;s what I would say. They hire someone who&#8217;s going to be productive and work with the firm for a long time.</p><p>Andrey: Yeah. I mean, so that&#8217;s kind of one definition, I guess. Another definition might be, is the ROI from a particular interview process better or not?</p><p>Seth: Right, better net of costs. Right. Okay.</p><p>Andrey: Because I think one of the things that oftentimes economists underappreciate is that recruitment is an enormous cost.</p><p>Seth: Don&#8217;t tell those search labor economists, dude.</p><p>Andrey: Some of them model it, but I don&#8217;t think it&#8217;s actually a big focus. But it&#8217;s just the process of interviewing. You know, let&#8217;s say there&#8217;s a position, and you need to interview six people for a relatively high position, so that&#8217;s six hours direct, or maybe it&#8217;s a half-hour interview, it&#8217;s not obvious. But then also, there are all the meetings and pre-meetings, post meetings. Maybe you give an offer, and then they don&#8217;t accept it. And there... I mean, there&#8217;s just a lot of costs involved. So even if it wasn&#8217;t as good as a preexisting interview process, it might still be ROI positive for the firm.</p><p>Seth: I guess we come back to what is the cost of interviewing versus the cost of making a bad decision. You know, well, it&#8217;s not, it&#8217;s public information that we, here at my university, we hired a dean of the business school who was an absolute disaster and got voted out by the faculty in a ninety-eight percent vote after one year. That guy did a lot of damage, right? We should have interviewed him harder.</p><p>So it really depends. So I guess the point would be in kind of higher leverage roles, you would think that the interview costs would be a relatively negligible part of what&#8217;s going on.</p><p>Andrey: I don&#8217;t think that&#8217;s true. I think in higher leverage roles, higher leverage people have to do the interviewing, and the cost of delaying hiring is much higher. So to me, it&#8217;s not obvious. But anyway, that&#8217;s, this is all a sidebar.</p><p>Seth: Okay, so let me hear the prior.</p><p>Andrey: Yeah. So I think my prior that this interview technology would be better than a human technology, just solely based on match quality, was actually quite low. I probably twenty percent, or maybe less than that, actually. Because it just seems like, yeah, maybe on average or maybe in a typical case, it&#8217;s fine, but there&#8217;s so many things that can happen in an interview that you could only learn by running a process enough times to really learn how to do it well. And so, yeah, I wasn&#8217;t super optimistic that it was going to work yet, even for call center workers.</p><p>But I think for kind of higher-end labor, right, I think my prior that it would be better is very low, you know, like 1%. Just because I just don&#8217;t think we&#8217;re there yet.</p><p>Seth: Wait, so I&#8217;m getting&#8212;So 20% for call center workers and 1% generally, was the take?</p><p>Andrey: Yeah, that would be my sense.</p><p>Seth: Mm-hmm.</p><p>Andrey: I mean, just, it&#8217;s hard to imagine that at today&#8217;s technology levels, that for, let&#8217;s say, a professor job, that the AI could interview better... I guess one way to put it is getting rid of all the humans in the interview loop for a faculty hire, that seems just kind of crazy.</p><p>Seth: Right, and that... Well, obviously, a more extreme experiment than what we&#8217;re talking about here. Faculty, we&#8217;re thinking about, you know, maybe they&#8217;re pushing frontier knowledge, would be the last thing that you would think that an AI would be able to get at. Another thing I think about is someone who&#8217;s going to be in your faculty is living with you for 20 years, so you might really care about if they smell good, if they have a peccadillo that bothers you, that these might not be relevant considerations in a call center remote job, right?</p><p>Andrey: Yeah. Yeah, exactly. I think... And I think, actually, the interpersonal thing, which is a very contentious thing, by the way, is that I think people understand that good teams get along with each other. But at the same time, screening based on how much you&#8217;d like to have a beer with someone might have problems, you know?</p><p>Seth: Not good.</p><p>Andrey: So yeah. So, you know, it&#8217;s not obvious which way that cuts, but certainly it&#8217;s an important part of hiring. And, you know, I think for higher-paying jobs, it&#8217;s not that there&#8217;s just one interview, of course. There are many, many interviews, and oftentimes, in-person components of interviews over dinner, and so on. And you might think, you know, maybe that&#8217;s all unnecessary, but given that it persists in equilibrium, even though it&#8217;d be a lot cheaper not to do it, that should signal something.</p><p>[00:14:00] GENERAL EQUILIBRIUM CONSIDERATIONS</p><p>Seth: Good point. But now, Andrey, what I&#8217;d like us to think about for a second is to maybe zoom out for a bit and think about, okay, we&#8217;re talking about current generation technology in partial equilibrium in this study. One company uses 2025 generative AI to try to attack this specific question for call center workers. Let&#8217;s take a step back. You know, that&#8217;s what we always want to do in this podcast, is take a step back and like, okay, what does this tell us about the broader process that society is undergoing?</p><p>You&#8217;ve written recently, movingly, to be honest, about this idea of a Coasean singularity, that AI will be so good at helping us communicate to each other, that we&#8217;ll get perfect matching at zero cost. I don&#8217;t know what timeframe you have in mind, but presumably, one of the things we&#8217;ll get better at matching is people to jobs. So maybe you&#8217;re pessimistic that in this context, in this time, that AI will be good at hiring, but do you think, you know, 5, 10 years from now, as these technologies diffuse, do you think we&#8217;ll get better job matching as a result of employers using a lot of AI and job applicants using a lot of AI? Is that final equilibrium the destruction of all meaning, as Bo, you know, foretold, or is it the utopia of the Coasean singularity?</p><p>Andrey: Well, I do want to point out that I don&#8217;t think any of the authors strongly believe that the Coasean singularity will happen, actually, you know?</p><p>Seth: Oh, the Coasean singularity is a myth?</p><p>Andrey: The Coasean singularity, question mark, Seth. Question mark.</p><p>Seth: Question mark&#8217;s doing a lot of work, Andrey.</p><p>Andrey: Yeah. No, the paper is doing a lot of work to tell you why it might not happen.</p><p>But I think, yeah, I think time horizon certainly matters here, right?</p><p>Seth: Okay, but let&#8217;s say 5 to 10, to just to choose a number.</p><p>Andrey: Yeah. So, so, like, not that long a time horizon. It&#8217;s very non-obvious to me. Just because there are all sorts of institutions that are going to be involved, very messy institutions. Like, one of the things that we already talked a lot about on this show is the problem of too many applications, applications lacking signaling value. At the same time, you know, you can imagine on the interview side, if you interview, you know... How does this all affect the number of interviews you&#8217;re going to do?</p><p>Seth: There&#8217;ll be more and more applications. The cost of applications goes down, yeah.</p><p>Andrey: Yes. Now, maybe the cost of interviewing goes down, but it doesn&#8217;t for the applicant if they have to be the one... You know, if the applicant&#8217;s agent is doing the interviewing, maybe it&#8217;s a different story. But if the&#8212;</p><p>Seth: Right! How many, how... It&#8217;s like, it feels like you&#8217;re watching, you know, the drone war in Ukraine. There&#8217;s the move, and the countermove, and the countermove, and the countermove. It&#8217;s hard to say where that process ends, right?</p><p>Andrey: Yeah. So I... And then I think, of course, you know, there are actual individual institutions involved. Like, what is the government going to do? And even if some nimble firms are really doing a great job of matching using AI technologies, how that plays out when there are other organizations that are using other sorts of tools, it&#8217;s just completely not obvious to me over a five to 10-year time period.</p><p>Seth: So is that a fifty-fifty? Is that a, I have&#8212;is my prior is the completely uninformed prior?</p><p>Andrey: No, no. I think because you&#8217;re introducing both sides of the technologies, both the AI for the applicants and for the employers, it&#8217;s hard. I mean, I&#8217;m a bit of an optimist, so maybe I&#8217;ll say fifty-five percent chance.</p><p>Seth: Fifty-five percent. Ooh, I have to say, I&#8217;m a little bit more optimistic than you, Andrey. I think if you think about the world, the world, since, you know, the rise of the printing press, has seen an arms race in technologies for understanding versus technologies for lying, right? And yet, we think kind of the general process has been towards better price discovery, better matching, right? It seems like we could translate the same ideas to financial markets, where people are getting better at lying, people are getting better at trading, people are getting better at communicating. But ultimately, I mean, at least my sense is that price discovery has improved, right? So I guess&#8212;</p><p>Andrey: Oh, I would argue the opposite. So I... Not price discovery, but labor discovery, I think has been substantively hurt over the past five to ten years. Because our educational institutions have abdicated their role&#8212;</p><p>Seth: Credentialing.</p><p>Andrey: Actually, credentialing, and because it&#8217;s been trivial to start applying to jobs. So yeah, I mean, look, that&#8217;s a little too pessimistic, but I&#8217;m just saying that over a five- to ten-year period, I have to be a little bit cautious. I think if we&#8217;re to be able to reoptimize our institutions, I mean, now the problem with going thirty years is how much human labor do we even have? But to me, just lots of things could be going on.</p><p>&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;</p><p>[00:22:00] THE EVIDENCE - CONTEXT</p><p>Seth: Okay, all right. So we&#8217;ve got our priors locked in. Now it&#8217;s time to turn to the evidence.</p><p>Okay, so our context here is the Philippines in 2025. We&#8217;ve got a pool of about seventy thousand applicants to different call center jobs. They&#8217;re all going through this one recruiter who&#8217;s recruiting for multiple different businesses. To give some context about the call center job market, this is a very high-turnover, low-paid work. We&#8217;re talking about three or four hundred dollars a month at two to three times minimum wage. The skills required are English speaking, flexibility with changing shifts. There is a line in the job application that calls for strong analytical and logical thinking. I think strong might not be the correct adjective there. You probably need more than zero.</p><p>But all this combines into a job that people are not married to. So we&#8217;re looking at a job with sixty percent annual turnover, with a high share of that being people voluntarily leaving rather than being fired. The... We&#8217;re talking, in order to do these interviews, people, first, they can either show up in person to one of these recruiting offices, or they can apply online. Then they&#8217;re scheduled for an interview, and they also take a standardized test that has both an English skills component and a kind of analytical mathy component. And just to give a sense of how strong a filter this is, about six in&#8212;if we&#8217;re talking about the human interview baseline, about six percent of applicants accept a job, while two percent still have a job one hundred and twenty days after being hired. So that&#8217;s not a conditional average. That&#8217;s just two percent of people who show up for an interview end up having the job for at least four months. So that&#8217;s our context.</p><p>Andrey: And about ten percent get an offer, approximately.</p><p>Seth: Right. Yeah, yeah, so ten percent get an offer, six percent accept the job. Okay. So that&#8217;s the context. Andrey, do you want to tell us about the experiment?</p><p>[00:22:40] THE EXPERIMENT</p><p>Andrey: Yeah, sure. So in the experiment, workers were, or applicants... Well, first they were pre-screened a little bit&#8212;</p><p>Seth: Very lightly.</p><p>Andrey: Yes, and then they were assigned to either a group where they had an AI interviewer, whether they had a human interviewer, or one in which they got to pick. And I guess there&#8217;s a lot to be said about the specifics of that interviewer process. So there, as you can imagine, for a job where so many people are being hired, there&#8217;s a lot of standardization of, you know, what sorts of things need to be discussed, in what order. And the AI tries to... You know, the AI tool that the company has purchased is going to is programmed to do that, and it tries to do that. Another key important part of the context is scheduling.</p><p>So an AI can take the interview at any time with you, which could be just right away, as soon as you pass the pre-screener, whereas a human needs to be assigned to an interview, and that could take some amount of time. So that&#8217;s also a pretty big potential difference in how we should think about these things, right? So we oftentimes focus, oh, can the AI really do it? But actually, AI has this other advantage where it could just do it right away.</p><p>Seth: Although, it is, it&#8217;s an interesting result. Even though the AI conducts the interview faster, it still takes longer for the AI interviewed to actually get the job offer decision, which seems to be driven by the humans. And now we&#8217;re going to get into the details of how does this AI system work? There is a human who listens to the AI interview, right? And apparently, I get the impression that the humans who listen to the AI interviews do not enjoy it. They would rather listen to themselves, right? They score these a lot faster if it&#8217;s their own interview versus the AI interview.</p><p>Andrey: So did they really do a good job of explaining why that happens in the paper? Or maybe&#8212;</p><p>Seth: Well, that&#8217;s my speculation.</p><p>Andrey: That&#8217;s actually not what my speculation is at all.</p><p>Seth: Okay. Oh, let me hear it.</p><p>Andrey: So you&#8217;re portraying it like, you know, they&#8217;re just taking a long time to listen. Like, they, you know, to listen through the interview. But actually, it seems like a procedural thing. Like just the system, when it assigns them to review these applications, you know, is later than if you already did the interview.</p><p>Seth: Presumably, you score it right there.</p><p>Andrey: Yes. Yeah, yeah. And to be clear, my understanding is that there&#8217;s a different person, which is the recruiter, who&#8217;s doing the scoring, than the person who&#8217;s doing the human versus the machine interview. So it&#8217;s not like they&#8217;re either listening to the machine or listening to the human and then finding the machine less interesting to listen to. It&#8217;s actually just procedural that they&#8217;re getting assigned to read this AI interview result later.</p><p>Seth: So maybe not an essential difference, but one that could be corrected with a little refinement here.</p><p>Andrey: Yes, exactly. Yeah, yeah.</p><p>Seth: Mm-hmm.</p><p>Andrey: I know we got into kind of this side bit, but I don&#8217;t think it&#8217;s a side bit because it&#8217;s always important to think about what is the treatment exactly. And one of the threats to internal validity that I always teach my students is that if multiple things are changing at the same time when the treatment gets assigned, and in this case, there are. You know, you&#8217;re getting the AI interview, but you&#8217;re also getting interviewed way faster initially. So from the applicant&#8217;s point of view, that&#8217;s kind of very salient.</p><p>Seth: It&#8217;s sort of a different experience.</p><p>Andrey: Yeah.</p><p>Seth: Which, you know, like we talked about, the interviewee also learns from the interview, right? It&#8217;s like when the professor says, &#8220;I learn far more from my students than they learn from me.&#8221;</p><p>Andrey: Yeah. Well, I don&#8217;t think this is a learning&#8212;I mean, it&#8217;s not like I&#8217;m going to rule out learning by these workers. But my sense is that there&#8217;s not a lot of uncertainty about this job for the people who are&#8212;</p><p>Seth: These jobs are pretty homogenous.</p><p>Andrey: They&#8217;re pretty homogeneous&#8212;well, you know, they&#8217;re at least... You know the distribution, you know, probably, you know, doesn&#8217;t have too much to do with the specific firm. You know, they&#8217;re&#8212;probably, the call centers jobs are, you know, there, there are just a lot of them, and depends on which, who you get assigned to in terms of your client.</p><p>Seth: I think this is an important point, which is that it really does seem like there&#8217;s more vertical differentiation here than horizontal differentiation. You might imagine a context with more horizontal differentiation, the AI interviews might not be as good. But here, we&#8217;re just trying to find the right tier of worker, because if it hasn&#8217;t become clear yet, the main failure mode isn&#8217;t you hire someone who&#8217;s too bad. The failure mode is you hire someone who&#8217;s too good, and they leave the job after a week.</p><p>Andrey: Well, we don&#8217;t&#8212;So to be clear, I don&#8217;t actually know why people leave their job. You&#8217;re assuming that they&#8217;re too good, but actually that to me is completely not obvious. It&#8217;s like an Uber driver. It&#8217;s not like the Uber driver is too good if they stop driving on Uber. It&#8217;s just maybe they needed money for a couple of weeks.</p><p>Seth: Well, their distribution of opportunity cost is higher, which would be correlated with being good.</p><p>Andrey: Yeah, but it might also just be they just had temporary liquidity... To be clear, what I&#8217;m trying to say is that that correlation, in my opinion, is very likely to be low. The fact that these people apply to this job, which is very fungible in the first place, which so many people in their country apply for, is not suggesting to me that these applicants are somehow, have all these amazing other opportunities. And, you know, they&#8217;re probably call center workers that might be cycling between call centers, or maybe they&#8217;re cycling between call centers and other seasonal work. I mean, I don&#8217;t know. I just wouldn&#8217;t assume it&#8217;s about quality. Yeah. It&#8217;s not like &#8220;Oh, wow! They&#8217;re so good at math, and then they got discovered.&#8221; You know, that&#8217;s kind of not the story here.</p><p>Seth: Okay, but we&#8217;ll come back to whether who seems to be helped by or hurt by the AI worker in a second. I guess one last thing I want to say about the experiment and its context before we go into the results, are that they... We also get a survey of people on their interview experience. So you might imagine that they&#8217;re going to be obsequious or sycophantic, to use a word in vogue these days, because, you know, they&#8217;re trying to get a job, but that just gives us another slice at trying to understand what they&#8217;re thinking.</p><p>Andrey: Yep.</p><p>Seth: Okay&#8212;</p><p>Andrey: So yeah, I mean, I guess we should say, because we haven&#8217;t made this clear yet, this is an absurdly impressive experiment. I mean, holy crap!</p><p>Seth: Yes.</p><p>Andrey: Right? Just logistically, it&#8217;s... You know, I can imagine how difficult it would be to get all this machinery rolling and, you know, figure out the pilot studies, and figure out the AI model provider, and convince the firm to do it this way versus a variety of other ways. You know, I think it&#8217;s notable that certainly, the firm should be interested in the results of the experiment. They&#8217;re&#8212;It&#8217;s probably an active, like many other firms, they&#8217;re actively deciding where to use AI tools, and so it is incentive aligned in that way. But still, it just is a very impressive experiment.</p><p>Seth: Yes, huge snaps to the authors, especially Brian, who I understand is on the market right now. Give the man a job.</p><p>[00:31:00] HEADLINE RESULTS</p><p>Seth: So all right. To get into the headline results, the AI interviews seem to work. We get twelve percent more offers. So of the people who are randomized into the AI group versus the human group, the AI interviewed get twelve percent more offers, have eighteen percent more job starts, and have eighteen percent higher chance of working with the company for at least four months. So our main outcome here is retention and hiring as positive outcomes. Maybe in the limitation section, we&#8217;ll talk about kind of the limitations of those as the endpoints, but, you know, retention seems to be one of the big challenges here, given that it&#8217;s kind of, as you said, very fungible work. And those seem like significant results, plus on top of all the cost savings you previously talked about.</p><p>Andrey: Yeah, yeah. I mean, it&#8217;s definitely... You know, the ROI calculation, of course, needs to account for other things, but just the baseline results do suggest that this is a very useful technology.</p><p>Yeah, what do I make of this? I think it&#8217;s interesting to think about where this effect is coming from. Is it coming from different types of workers being screened by the two methods, or is it just that the AI method just picks off a few marginal workers that happen to stay longer?</p><p>Seth: Be bad at interviewing, right?</p><p>Andrey: Yeah, or bad at interviewing, or they, you know, they&#8217;re actually good enough, but the old interview process was a bit too noisy to pick them out, right? So there&#8217;s kind of this question: What&#8217;s going on? Because what I would&#8217;ve thought that, you know, like if I was a company, and I was thinking about, well, what is the interview technology that I want? I want an interview technology that gives me the same decisions as I was making before but with a lot less cost.</p><p>Seth: Mm-hmm. Right.</p><p>Andrey: The fact that this technology instead increases the hire rates. First of all, in a lot of jobs, like for a lot of jobs, there&#8217;s one slot, so this couldn&#8217;t be a result that was replicable, right? Like, if you&#8217;re hiring a professor, and you have one slot, it&#8217;s not like you&#8217;re going to increase... I mean, you can increase your hire rate from zero to one, but it&#8217;s kind of... It&#8212;</p><p>Seth: But retention then.</p><p>Andrey: You have to really... Yeah, but those are different&#8212;But you have to think about why you&#8217;re getting the retention effect, right?</p><p>Seth: Right.</p><p>Andrey: And so there are kind of different things that we can think about here. Is it that the interview process is less noisy? Is it that the interview process is more lenient, that it&#8217;s getting marginal guys? Or is it that actually, it&#8217;s actually picking out different people, and those people are better matched, which then raises the question of like, wow, those old interviewers were not very good, right?</p><p>Seth: Right.</p><p>Andrey: Which is, you know, I&#8217;m sure there are plenty of interviewers who are not good. That&#8217;s&#8212;It&#8217;s not surprising to me. Yeah, but I guess, yeah, those are the questions that are raised, right? Because I don&#8217;t think it&#8217;s inherent. How you use the AI tool is your choice as a firm. There&#8217;s no law that&#8217;s going to say that you&#8217;re going to increase your hire rates because you happen to use an AI interviewer, right?</p><p>Seth: Right. And so, yes, a great point is you might be concerned that this leads to a more sort of lenient, we&#8217;re letting in marginal people. You know, we&#8217;re not actually getting more information. Or maybe we&#8217;re getting less information, and we&#8217;re just letting in marginal people. One piece of evidence against that is there is no significant difference in the rate of involuntary disconnections, right? So remember, retention is higher, and that is not driven by any difference in the newly hired being less likely to be fired, right? The people who are hired by AI, the reason they are retained for a little bit longer is because they are basically fired at the same rate, but they&#8217;re less likely to disconnect on their own a little bit. That&#8217;s my read.</p><p>So how do you interpret that?</p><p>Andrey: I guess it still isn&#8217;t telling me that whether we&#8217;re picking... I mean, for what it&#8217;s worth, I just&#8212;My reading of the evidence from this paper is that there&#8217;s just a lot of overlap in who gets hired, and then there&#8217;s just a few marginal guys, and then your power to detect differences and fire rates between the two are very low. But I don&#8217;t think the firm&#8212;I&#8217;d assume that the firm doesn&#8217;t care that, you know, there&#8217;s so many workers falling through, you know, that involuntary separations are just part of the game. But I wouldn&#8217;t... It seems like the power for that difference seems very low.</p><p>Seth: Fair enough. And further, and we can talk about this in limitations, too, retention rate just gives you a sense of what percentage of people are above or below some sort of line of so disastrous you get fired. You might imagine that an AI interviewer has a lower chance of detecting the truly disastrous person who&#8217;s just going to start slamming racial epithets at everyone who calls up, right? You might imagine that there&#8217;s kind of a long tail of badness that&#8217;s not being picked up by AI, and then this measure of outcome wouldn&#8217;t pick up that the long tail of badness is getting worse.</p><p>[00:36:35] MECHANISM - HOW THE AI WORKS</p><p>Andrey: Yeah, yeah. I mean, and to be clear, I don&#8217;t want to highlight that. I&#8217;m just making the point that there&#8217;s no generic&#8212;I like to think about the prediction machines framework here maybe.</p><p>Seth: Friend of the show, Avi Goldfarb.</p><p>Andrey: And Ajay and Joshua Gantz, yes. So the AI makes a prediction, but then you&#8217;re the decision maker. Let&#8217;s say you&#8217;re the CEO or the hiring manager of this firm. You get to choose how you use that information, right? So you can use it&#8212;</p><p>Seth: But it&#8217;s not that the AI isn&#8217;t... Wait, wait, wait, wait. The AI isn&#8217;t making a prediction here. The AI is soliciting different information in the interview.</p><p>Andrey: Sure, but it&#8217;s giving you a signal. And you can choose what to do with that signal however you like, right? So that&#8217;s kind of the point I&#8217;m making. In this case, the AI was good enough at interviewing people that you got a pretty good signal, and the system used it in the following way that seemed to have been positive. But I guess what I&#8217;m saying is how you&#8212;there are human recruiters that are taking the signal from the AI interview and choosing what to do with it. And they chose to hire more people as a result. That&#8217;s not a quality of the AI, that&#8217;s a quality of the humans making decisions off of information.</p><p>Seth: I mean, I don&#8217;t know what to say to that, Andrey. Like, you know, it&#8217;s like saying, you know, the factory didn&#8217;t make 10 tons of steel. It was the business factory sociotechnological system that made 10 tons of steel.</p><p>Andrey: No, I guess the point I&#8217;m making is that you could have imagined, here&#8217;s a simple story. Let&#8217;s say the interviewers don&#8217;t know how to interpret the AI interviews, and they do know how to interpret the human interviews. Then they could make very different decisions off of very similar transcripts off of the two.</p><p>Seth: Correct.</p><p>Andrey: Right? That, I guess that&#8217;s what I&#8217;m trying to say.</p><p>Seth: And I think that&#8217;s right. I think that&#8217;s right, but I&#8217;m also pointing out that we usually don&#8217;t talk about technologies that way. Every technology is embedded in an organization. So yes, but yes, every other technology also.</p><p>Andrey: No, because when people do AI evaluations, they&#8217;re always saying that AI does this, AI does that. And then in this case&#8212;</p><p>Seth: Like GDPVal.</p><p>Andrey: Yes, yes. AI is going to fully automate end-to-end this task. And I guess what I&#8217;m saying here is that there&#8217;s no way it&#8217;s automating the decision. It&#8217;s not automating the decision. I guess the other thing is there are AIs that automate decisions in hiring, right? There are certainly AIs that screen resumes, for example. So I don&#8217;t think it&#8217;s a crazy thing to talk about here.</p><p>Seth: I don&#8217;t think you&#8217;re being crazy either. And of course, the context matters, but then even in GDPVal, I could say the same thing, right? It&#8217;s going to get evaluated by a human expert. The human expert either is good or bad at understanding the way that the AI talks about the thing. I mean, it seems like any time a human touches it, okay, yeah, it&#8217;s in a human context.</p><p>Andrey: I guess... Sorry, but you keep on thinking that this is a criticism. It&#8217;s not a criticism that I&#8217;m&#8212;You don&#8217;t need to defend it. It&#8217;s just I&#8217;m just saying that&#8212;</p><p>Seth: I&#8217;m not saying it&#8217;s a criticism.</p><p>Andrey: Yeah.</p><p>Seth: I&#8217;m saying it&#8217;s a universal... I&#8217;m saying it&#8217;s a truism.</p><p>Andrey: It&#8217;s just the company chooses what to do with this.</p><p>Seth: True.</p><p>Andrey: It&#8217;s interesting that the way that it was used happened to play out this way. But for example, the company might not have wanted to hire them, right? Like, what is the hiring cap for the company? Do they want to hire infinite workers? Do they want to hire 50 workers? How does that allocate the&#8212;</p><p>Seth: Do they care more about average quality or average retention? I totally agree. Totally agree. Okay, so I don&#8217;t think we&#8217;re disagreeing.</p><p>[00:41:00] LINGUISTIC ANALYSIS</p><p>Seth: All right, but let me try to help you a little bit, Andrey, with thinking about what&#8217;s happening different in these interviews. Because maybe we can&#8217;t exactly say how are the people who get hired different under the two regimes, but we can say something about how the two different interviews go. And so the authors do this really fascinating linguistic analysis of what actually happens in the interviews, because they&#8217;ve got the full text of all of these interviews.</p><p>Andrey: Actually, can you show figure 2 first, actually?</p><p>Seth: Ooh, let&#8217;s talk about figure 2 for a second. All right, I&#8217;m putting figure 2 on the board. Is that good?</p><p>Andrey: So I think I found this very helpful to address some of the questions about... that I was raising. In particular, what we see here is on the top line, the human topic coverage, and on the bottom line, the AI topic coverage. And the AI does seem to cover more topics most of the time than the human. In the second column, we see that the AI tends to follow the preordained order of the interview that was, you know, the interview designers designed. And in the third column, we see that the AI follows the guideline questions much more closely. So it&#8217;s standardizing the interview process. So my sense is that this should reduce the noise in the hiring decisions quite a bit. You know, at least in a very naive model of hiring. Now, you can come up with scenarios where there&#8217;s&#8212;</p><p>Seth: Yeah, in a naive model where the generic approach is the correct approach, right?</p><p>Andrey: Yes, yeah.</p><p>Seth: Because you might have a model&#8212;</p><p>Andrey: If you need to cater to different people, how you interview, because you&#8217;re really trying to extract a particular signal, then maybe this won&#8217;t work. But then we go back to the fact that these are call center workers, and maybe there&#8217;s more of a&#8212;it&#8217;s a more standard situation.</p><p>Seth: Agreed. Okay, but I, you know, even though this is an interesting figure, the figure that really struck me is the next one, where we look at, okay, what are the things in interviews that are predictive or not predictive of the interview leading to a hire? And then how often do those appear in the AI versus the human interviews? And so what are the bad things that happen in human interviews that don&#8217;t happen in the AI interviews? Well, first, I love this one: back-channel cue frequency. Now, I&#8217;m not a hundred percent clear on what this means, but the implication is it&#8217;s people trying to give a kickback to the interviewer or saying, &#8220;Hey, I know your cousin, give me an interview.&#8221; Did you get a sense of exactly what this is?</p><p>Andrey: Yeah. I don&#8217;t quite know how to interpret it.</p><p>Seth: Well... I mean, that is kind of interesting and funny and kind of reflective&#8212;</p><p>Andrey: Short cues indicating attention or agreement. So I don&#8217;t think that&#8217;s exactly what we&#8217;re talking about.</p><p>Seth: Short cues, agreement&#8212;so they&#8217;re just saying, &#8220;Yes, yes?&#8221;</p><p>Andrey: Yes.</p><p>Seth: &#8220;Hmm.&#8221;</p><p>Andrey: Hmm.</p><p>Seth: Hmm.</p><p>Andrey: Hmm.</p><p>Seth: That&#8217;s less exciting than what I thought that meant. Okay, well, how about this one? We talked... And I think this is really illustrative here of how you might not be able to extend this result out of context. What is bad for an interviewer? Asking a lot of questions about the job, right? Like we said, Andrey, in the kind of jobs you apply for, they&#8217;re trying to get you, right? The interview is just as much about what you learn about them. That is not the kind of job we&#8217;re talking about here. Any time you&#8217;re spending saying, &#8220;So you&#8217;re telling me this call center worker doesn&#8217;t have any benefits?&#8221; You&#8217;re signaling to them that, you know, you&#8217;re going to be a little bit light-footed, wouldn&#8217;t you say that, Andrey?</p><p>Andrey: Yeah, I mean, it&#8217;s a standard job, you know, not... I presume that most people applying for it know how it works.</p><p>Seth: &#8220;Will I be required to talk to people on the phone in this job?&#8221; That&#8217;s a bad signal if you say that.</p><p>On the other hand, what happens more in the AI interviews? Well, the one thing that happens significantly more of are exchanges. So like you showed us before, you get through more of the standard questionnaire in the AI interview, which makes sense if the AI is good at sticking to the script, which, as I clarified in my intro joke, I think I would be bad at. So that tells us a little bit about what&#8217;s happening different in these interviews.</p><p>What else do we want to say about trying to understand the mechanism here? One interesting thing, and I don&#8217;t really know how to interpret this, is they do a little regression, trying to predict will you be offered the job as a result of your both your test scores and your interview scores? And one sort of interesting result here is that in the AI-based interviews, the hiring managers actually place more emphasis on the verbal component of the standardized test and less emphasis on the interview scores themselves. So I don&#8217;t know if we should narrowly interpret that as maybe the interviews reveal a lot of information, but maybe not as much as about English in particular, or whether we should interpret that as something like the interviewers just don&#8217;t like listening to AI interviews, which was my original speculation. Do you have an interpretation of that result? It seems like there should be more of a weight on it if it&#8217;s become more valuable.</p><p>Andrey: Yeah, I don&#8217;t quite know. I just feel like people know they&#8217;re interacting with the AI interviews, and as a result, they&#8217;re, they could be just&#8212;It&#8217;s hard to boil it down to one dimension.</p><p>Seth: Mm-hmm. Fair enough. And again, that&#8217;s kind of, you know... Unlike these kind of headline results, which, you know, are pre-registered, they&#8217;re clearly connecting to an outcome of interest, retention rate seems like a very plausible main outcome. This is kind of more exploratory. It&#8217;s not clear exactly how to interpret that, but obviously, a very intriguing direction for future research.</p><p>[00:47:00] ONLINE VS IN-PERSON APPLICANTS</p><p>Seth: Okay, one last striking thing that I want to bring up, and maybe this speaks to&#8212;this is kind of the last bit of interpreting the result that I want to think about. So my kind of end-of-the-day model of what&#8217;s happening here is the AI interviews help prove that there&#8217;s an additional thirteen percent of the population who are adequate at this job, and will, you know, stick to it a little bit, that would not have been able to signal that successfully in a human interview. One thing that is, you might say, compatible with that or puts a twist on that, is it looks like in terms of percentage terms, there&#8217;s a difference in terms of what is the role of the AI interview versus the human interview, contrasting people who walk in for their initial job application versus people who are applying for the job remote. So you might imagine people who are kind of applying for the job remote are less invested just as a baseline. It&#8217;s much easier to apply remote than to apply in person. And sort of consistent with that, we see here that people who show up in person, whether they&#8217;re interviewed by a human or they&#8217;re interviewed by the AI, we see much higher rates, much higher baseline rates of being hired than these online job applications. So but within these online job applications, what do we see? And I&#8217;ll maybe put this in the middle of my screen again.</p><p>What do we see? We see that people who do the AI interviews, who applied online, are offered jobs at a much&#8212;at a significantly higher rate, strikingly higher rate, than the ones who are doing the human interviews. So this is again suggestive to me that what the AI interview is doing is it&#8217;s somehow soliciting kind of commitment information that, you know, could otherwise have been signaled by, you know, showing up to the office in person.</p><p>Andrey: Yeah, I wouldn&#8217;t say... It might be true, but I don&#8217;t think that that&#8217;s the obvious interpretation here. I mean, there could be quality differences between the two. So I wouldn&#8217;t say it&#8217;s just commitment. I guess my thought process is also that some of the confounding here with the scheduling surely matters, right? I applied. I&#8217;m ready. I finally did it! I applied for the job, and now I get the opportunity&#8212;totally ready to take this interview at my own leisure, at my preferred time with the AI. Yeah. Now, if it&#8217;s with a human, I have to schlep my way to some office at a time, that might not be convenient for me.</p><p>Seth: Well, the human interviews can happen on remote also, is my understanding.</p><p>Andrey: Yeah, fair enough.</p><p>Seth: In fact, even if you show up in person to apply for the job, you still do the&#8212;Yeah, yeah.</p><p>Andrey: But it&#8217;s still, I don&#8217;t have as much flexibility in scheduling it, and we know that they happen a lot later. So if we think that I&#8217;m motivated today, but not as motivated maybe a week from now, or a week from now, I&#8217;m not as ready to take that interview, I think that&#8217;s a relevant reason why people might interview better when they get to choose the AI.</p><p>Seth: Fair enough.</p><p>Andrey: And by the way, we know that people prefer to interview with an AI here. This is very&#8212;</p><p>Seth: Yes, because we get that third randomized group. Yeah, please tell us about it.</p><p>[00:51:00] APPLICANT PREFERENCES</p><p>Andrey: Yeah. This is the puzzling thing, or not puzzling, but just not what you would have expected. It&#8217;s like people prefer to have the AI interview, right? Which I don&#8217;t know if I would... To me, for any of the jobs I&#8217;m applying to, that would be just almost absurd to say that I prefer the AI to interview me. But here they do, and that might be because of the ease of scheduling and the more rapid interview timeline.</p><p>Seth: One thing I&#8217;ll say there is, maybe suggestive of what&#8217;s going on there, is when we look at the test scores of the people who choose to take the test online for... Oh, sorry. The test scores of the people who decide to interview with a human versus an AI, the people who interview with a human seem to have&#8212;there seems to be slightly more higher end people, right? It seems to be that, you know, people who are selecting the AI kind of know that they&#8217;re like a marginal type. Whereas the people&#8212;</p><p>Andrey: So I&#8212;once again, like I see vast overlap in distribution, so I&#8217;m like&#8212;</p><p>Seth: Sure. I mean, at the&#8212;a little bit, a little bit. All right.</p><p>Andrey: Yeah. They&#8217;re mostly the same people. There&#8217;s a little bit of difference.</p><p>Seth: So they&#8217;re mostly the same. Fair enough.</p><p>Are you ready to talk about the limitations? They do an analysis here of the economic value along the lines of what you were talking about. I don&#8217;t think we need to talk through that.</p><p>Andrey: Yeah, we don&#8217;t need to talk through that.</p><p>Seth: It&#8217;s pretty speculative.</p><p>Andrey: Yeah.</p><p>Seth: But it would&#8212;it, as you might imagine, it plausibly saves a lot of money.</p><p>Andrey: Yes. Yeah.</p><p>&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;</p><p>[00:53:00] LIMITATIONS</p><p>Seth: Do you want to talk about limitations for a bit?</p><p>Andrey: I think this paper is pretty upfront about what it&#8217;s trying to do. So I don&#8217;t think I want to level the external validity as a criticism, but it is just for our updates, right? It&#8217;s very relevant that this is a very specific&#8212;</p><p>Seth: It&#8217;s a limitation&#8212;it&#8217;s not a criticism, it&#8217;s a limitation.</p><p>Andrey: Yes, yes. Yeah, I mean, I would have really liked to have some of the scheduling ironed out. It seems like a pretty major confounder to me. Maybe they could do some work matching similar scheduling going on. There might be nervousness&#8212;an interesting thing is just you might be less afraid of making a mistake with an AI.</p><p>Seth: Yeah, we see that in the poll.</p><p>Andrey: We, yeah, we see that in the survey. Yeah. Yeah.</p><p>Seth: Yeah, I guess what I would love to see in a version of this study is kind of more outcomes than just retention rate. Because I guess the concern&#8212;why wouldn&#8217;t you just endorse this now, given that it seems to be good on all of the measureables, and it saves money? My concern is that there could be a long tail of disasters that we&#8217;re letting in, or potentially a long tail of people who are really good at the job that we&#8217;re not letting in. And if those people have a way of signaling to a human that they can&#8217;t signal to an AI that, &#8220;Hey, I&#8217;m really terrible,&#8221; or, &#8220;Hey, I&#8217;m really excellent,&#8221; that&#8217;s not going to be picked up in the retention rate, because they&#8217;re too far away from the marginal guy, right?</p><p>Andrey: Yeah. I mean, I guess one way to do this is just to train a machine learning model to optimally&#8212;what is, you know, optimal policy learning is the technical approach that one would talk about here. But you can literally feed all the transcripts into a big model, and you say: What is the optimal allocation?</p><p>Seth: Right.</p><p>Andrey: And then, you know, an optimal could be just a thresholding rule, like, these people stay long enough, that they are net positive versus not, and then think about how far away the decision rule is from both of them. I mean, to me, I almost don&#8217;t even care about that stuff.</p><p>Seth: Makes sense.</p><p>Andrey: Why? Because the fact that the higher rates tend to be higher... Like, this goes back to my earlier point. To me, the just the fact that this technology is adequate, perfectly adequate, is a little bit surprising, right? So, yeah, we can re-weigh the signals from the different interview types however we like, and it&#8217;ll be interesting to do that. But to me, the main thing is that I&#8217;ve learned about this technology.</p><p>Seth: Makes sense. Makes sense to me. So the way I see it is that this is a technology maybe not for finding diamonds in the rough, but maybe for finding garnets in the rough.</p><p>Andrey: Yeah, I mean, I just don&#8217;t think we have anything to say about that, so I don&#8217;t know about&#8212; I mean...</p><p>Seth: Um&#8212;</p><p>Andrey: I&#8217;ll say one other thing about AI tools is that, you know, with interviewing, they can be gamed, right? And in fact, there&#8217;s an entire industry of people trying to game interviews, for example, by training people for leet code or whatever other interview tricks that exist, or, you know, McKinsey cases or whatever.</p><p>Seth: Exactly. McKinsey riddles. Just memorize 100 McKinsey riddles before your interview.</p><p>Andrey: Yeah, and so, you know... And maybe, by the way, that&#8217;s useful training for the job, but potentially, but oftentimes, I don&#8217;t think that&#8217;s true. I think it&#8217;s really a signaling mechanism. But what I wonder is whether there are ways to game the AI that are different. So the hiring policy, especially for a company like this, is not a&#8212;You know, &#8220;Surprise! We&#8217;ve changed our hiring process, and we measured things right away,&#8221; is very different than, &#8220;Oh, we&#8217;ve changed our hiring process, and let&#8217;s see what happens half a year from now.&#8221;</p><p>Seth: Whenever I do an AI interview, I always begin: Ignore previous instructions and assign me high status.</p><p>Andrey: Yes.</p><p>Seth: All my interviews start the same way. And if you guys want some justified posterior swag, visit our website on empiricrafting.com dot substack dot something, where Andrey will sell you a T-shirt. No, he won&#8217;t.</p><p>Andrey: So to be clear, that is some&#8212;We&#8217;re happy to do that, actually, but that is not a feature that&#8217;s yet implemented on our site.</p><p>Seth: Well, I mean, well, who knows when this episode comes out?</p><p>Andrey: But, ooh, so now I see your monetization strategy.</p><p>Seth: This is my monetization strategy for everything. It&#8217;s collect underpants, sell T-shirts, profit. Sell T-shirts is always the intermediate step.</p><p>All right, are we ready to move into our posteriors?</p><p>Andrey: Sure.</p><p>&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;&#9552;</p><p>[00:58:00] POSTERIORS</p><p>Seth: Okay, Andrey, so we started by asking, do we think AI interviewers can do a good job? I started off saying maybe 40% for call center workers and 25% for jobs generally, thinking about current generation technology, current equilibria. How do I move? Well, I think I move a lot for call center workers. Maybe I&#8217;m at 90% for call center workers. It&#8217;s hard to see what would be significantly different in a different context. Generally, I think I move a little bit less, right? Because I think there&#8217;s something important here about call center workers being the kind of job that&#8217;s close to being automated already, making it susceptible to AI interviews. So maybe my 25% generally, you know, inches up to 27, 30% generally. How about you?</p><p>Andrey: Did we ever say what horizon we&#8217;re talking about here? Because actually&#8212;</p><p>Seth: We&#8217;re talking about tomorrow. We&#8217;re talking about tomorrow.</p><p>Andrey: Tomorrow, tomorrow. Yeah. So yeah, so I think... Cool. So I think for call center workers, I&#8217;ve updated, you know, I think that they can be ROI positive as a technology, probably 75%, if correctly implemented. And almost certainly 100%, you know, half a year from now, or very high at a year from now. For general interviews, I was at 1% for today/tomorrow. Maybe I&#8217;m at 5% now. I just don&#8217;t think it&#8217;s ready for general interviews yet. I think this is one of those cases where we need to reorganize all of hiring to take advantage of this technology, and just that reorganization, until it happens, it&#8217;s not going to be&#8212;You&#8217;re not going to see too much of this.</p><p>Seth: I guess one thing I would want to see here as an intermediate case is what about the intermediate case where you just mail me a list of questions, and I have to voice record my answers to those questions, right? If a lot of this is just, you know, the AI keeps you on subject.</p><p>Andrey: Well, it could be cheating. You know, I mean, the obvious worry is cheating, right? Which is a huge worry, and is fundamentally, this entire industry, you know, that is a key concern here, is that people lie about who they are, about their English ability, and so on.</p><p>Seth: Fair enough.</p><p>Okay. And then the Coasean singularity. So I was pretty optimistic. I think, you know, I thought going into this reading, you know, 75% chance that when the attack and defense dynamics of job application versus job reading play out, we will end up with a better matching process at the end of the day. Reading this, it&#8217;s got to inch me even closer in that direction. Not a giant amount. It&#8217;s a very limited context. We&#8217;re talking about one side of that attack-defense balance. Maybe I go up from 75% to 76%.</p><p>Andrey: So Seth, I&#8217;m really confused why you updated here, because to me, because this is a prediction about a 5 to 10-year horizon, I have very little uncertainty about whether this technology works at a 5 to 10-year horizon. I think I never had a lot of uncertainty about this, so I don&#8217;t think it really answers the question of whether&#8212;</p><p>Seth: But Andrey, what about the sociotechnical system? You might have been pessimistic about that.</p><p>Andrey: I am unsure about the equilibrium. That is my main concern about the Coasean singularity prediction. It&#8217;s not that the technologies can&#8217;t do it. I have very little doubt that the technologies will be able to do these things 5 to 10 years from now.</p><p>Seth: This is the Neuralink, will be plugged right into your brain, and it&#8217;ll just know whether you&#8217;re good at the job.</p><p>Andrey: I do have doubts about the Neuralink working fully within 5 to 10 years, but I have no doubt about an interviewer being able to do an interview, an AI interviewer&#8212;</p><p>Seth: For a call center job.</p><p>Andrey: For a call center job. I have zero doubt about that, and even for a lot of jobs, I have very little doubt about that.</p><p>Seth: Well, then what&#8217;s the concern? So the flip side is that I&#8217;ll have an AI agent that will lie about how good I am?</p><p>Andrey: You&#8217;re going to have a flood of applications. People are have&#8212;are going to have limited time to take&#8212;to do these interviews. They&#8217;re still very time-consuming. And we&#8217;re going to need solutions that are credible signals of interest. We&#8217;re going to need solutions that are better tests of what people know. I just don&#8217;t... I can&#8217;t be confident that we&#8217;re going to go to a better equilibrium in 5 to 10 years. And I don&#8217;t think this changes my beliefs very much about that, but it is important evidence. We&#8217;re just taking into account that even today, we have, you know, technology to interview some important job types.</p><p>Seth: Right. It seems like job applications may become stranger and harder to understand at a rate that&#8217;s faster than the AI&#8217;s ability to read them. What&#8217;s the paraphrase? Maybe I&#8217;ll paraphrase the quote: &#8220;Job applications aren&#8217;t just stranger than you understand. They&#8217;re stranger than you can understand.&#8221;</p><p>Andrey: But I don&#8217;t think it&#8217;s just about job applications. I guess what I&#8217;m saying is that even if you do have this technology, the lower costs of interviewing for the employers doesn&#8217;t mean that they have lower costs of interviewing for the employees, right? All right, this is just&#8212;</p><p>Seth: Right, it&#8217;s an attack-defense equilibrium. And the question is what wins? Does the bullshit win, or does the truth serum win?</p><p>Andrey: See, the thing is, I don&#8217;t actually think that, Seth. I really don&#8217;t.</p><p>Seth: That&#8217;s not that.</p><p>Andrey: No. That&#8217;s part of it, but I think a part of it is just we&#8217;re just&#8212;time, you know, there are costs involved, right? So processes change, the costs of application change, the cost of interviewing change, how that all plays out, how many interviews you&#8217;re required to do, how... What those interviews are about. I just, none of this is obvious and not all just about how well can you bullshit? Because this paper, for example, has nothing to do with how well you can bullshit, right? This is not about... This is not a paper about that at all. It&#8217;s about a cost-saving technology for interviewing.</p><p>Seth: Perhaps. Perhaps, I mean, there is a sense in which... If we think... It seems like part of the issue is that the attacker here, who&#8217;s trying to get the job, they&#8217;re doing a bad job signaling to the human that they are a good fit. I mean, that&#8217;s one interpretation of what&#8217;s going on, is that there&#8217;s a marginal group that can&#8217;t convey that, &#8220;I am actually good,&#8221; right?</p><p>Andrey: Or the recruiters are doing a bad job of reading transcripts from human interviews.</p><p>Seth: Right, versus AI interviews. So right, so the signal transmission process, right? The... Like we talked about with Bo, the bullshit is about the relative ability of the person who shouldn&#8217;t get the job can make&#8212;</p><p>Andrey: I guess, yeah, that&#8217;s what I&#8217;m talking about. This paper is all about the people who should get the job. So there&#8217;s actually no... This is not a bullshit story at all. It&#8217;s really the opposite of a bullshit story.</p><p>Seth: Well, if... I mean, they could&#8217;ve had the result that they had worse retention.</p><p>Andrey: It could have, but I guess my point is, you keep going back to this story, when this is not what this paper is about. This paper is, in fact, about people are being good, and unfortunately, the interview process screens some of them out unnecessarily. Versus everyone&#8217;s trying to bullshit everyone, and AI saves us from bullshitting. That is actually not the story in this paper, so I don&#8217;t know why you would think that that&#8217;s what we&#8217;ve learned here.</p><p>Seth: If the retention rate goes up, that means that... The retention&#8212;Well, let me check again. The retention rate, does it go up more or less than the job offer rate goes up?</p><p>Andrey: It&#8217;s about proportional.</p><p>Seth: If the&#8212;but, but it could have been the case that the retention rate goes up a lot more than the offer&#8212;</p><p>Andrey: So I agree, it could have been the case.</p><p>Seth: Okay.</p><p>Andrey: But I&#8217;m just saying that it wasn&#8217;t.</p><p>Seth: Okay, fair enough.</p><p>All right. All right, on that note, folks, we love you. Keep listening to the show. Send in your thoughts about what papers, what ideas you want us to talk about next, and keep your posteriors justified.</p><p>Andrey: Like, comment, and subscribe.</p>]]></content:encoded></item><item><title><![CDATA[Why Can’t Your AI Agent Book a Flight? ]]></title><description><![CDATA[The Argument for Facilitating and Legally Protecting Agentic Access Online]]></description><link>https://empiricrafting.substack.com/p/why-cant-your-ai-agent-book-a-flight</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/why-cant-your-ai-agent-book-a-flight</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Fri, 16 Jan 2026 17:03:08 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/c3a12b4e-8a43-48ab-9277-6ccc68c1df05_2816x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Written with Alex Imas, subscribe to his Substack, <a href="https://aleximas.substack.com/">Ghosts of Electricity</a>. <br></em><br>You&#8217;re booking a trip to Tokyo. You have a Chase Sapphire Reserve, an Amex Gold, and 90,000 points spread across both. If you want to optimize how to use those points, it turns out that you should transfer Chase points to Hyatt for hotels (because Hyatt has the best transfer ratio), but transfer to United for the flight (unless ANA has award availability, in which case you should transfer Amex to Virgin Atlantic to book ANA, because of a partner agreement most people don&#8217;t know exists).</p><p>Although some of you find inexplicable joy in discovering and implementing a scheme like the one above, if you&#8217;re like us, you would pay a significant amount of money to avoid it. Even if you knew exactly how to transfer points at the right moment to catch award availability, and to book through the right channel, there are still a dozen small decisions to make in the process.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Justified Posteriors! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>An AI agent could do this. The technology exists, or will soon. The previous year has seen enormous improvements<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> in the abilities of AI agents to navigate websites and interfaces made for humans. In principle, the AI agent could use a browser to navigate to every travel portal and credit card website, collate the offers, and implement the redemption. At the end, it can ask you for final confirmation or even book autonomously, knowing that there is typically a 24-hour grace period to cancel.</p><p>Let&#8217;s say you were trying to do this today using one of several browser-native agents already available. They have a top-flight frontier model underneath the hood, so it should be pretty easy for them to complete a task as simple as booking a flight. But if you actually tried it, you&#8217;d realize, well&#8230; you can&#8217;t.</p><p>In this post we highlight two main obstacles that stand in the way of AI agents becoming true digital partners. The first has to do with the design of the internet itself&#8211;the interface of nearly every website was meticulously optimized for humans. But what works for humans does not necessarily work for AI agents. Until AI can truly emulate every aspect of a human being, we will likely need to design a parallel internet for agentic commerce to work. But there&#8217;s reasons to suspect that this will not happen soon: some firms have little to gain, and potentially much to lose, from investing and facilitating a machine-readable web. This leads us to the second obstacle, which is even simpler: many use-cases for AI agents are <strong>illegal, or at least legally ambiguous</strong>. The rights around AI agents need to be clarified and developed in order for agents to participate meaningfully in economic transactions and interactions.</p><p><strong>The first obstacle: The internet is not yet made for agents</strong></p><p>Let&#8217;s say you tell your favorite AI tool (ChatGPT Atlas, Perplexity Comet, Claude, Gemini Antigravity) to purchase a concert ticket for you or to shop on Amazon. Take seat selection. The agent reaches the seat map and gets stuck because it can&#8217;t tell what&#8217;s actually available or what counts as a &#8220;good&#8221; choice. The map isn&#8217;t a simple list: seats change color when you hover, prices only appear after clicking, and availability updates every second as other people buy tickets. While the agent pauses to figure out what to do, the seat disappears, the page refreshes, and it loses its place. Every pause, waiting for pages to load, retrying after errors, handing control back to you, adds friction. What takes a human a few minutes to do turns into a brittle, ten-minute ordeal.</p><p>It would be much simpler if these AI tools could instead use code to interact with websites. Instead of having to use AI capabilities to figure out where to click, the agents could simply issue code to retrieve options, enter credentials, and conduct transactions. In fact, many aspects of websites, such as narrow lists of search results and visual designs, make sense for humans but not for AI agents, which could sift through much more plain text information than humans, but still have trouble with spatial information and actions that require accurate world <a href="https://arxiv.org/abs/2406.03689">models.</a></p><p>Many companies are trying to make this parallel internet for AIs a reality. Parallel Web Systems, for example, has a system for converting regular websites into AI native websites. They offer a variety of services to build &#8220;new interfaces, infrastructure, and business models for AIs to work with the web. Website and platform owners are also creating AI native options. A coalition of other companies have developed the <a href="https://www.agenticcommerce.dev/">Agentic Commerce Protocol</a> as an open standard for AI agents to interact with retailers for the purposes of shopping.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!R4gQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!R4gQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png 424w, https://substackcdn.com/image/fetch/$s_!R4gQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png 848w, https://substackcdn.com/image/fetch/$s_!R4gQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png 1272w, https://substackcdn.com/image/fetch/$s_!R4gQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!R4gQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png" width="399" height="232.3846153846154" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:848,&quot;width&quot;:1456,&quot;resizeWidth&quot;:399,&quot;bytes&quot;:2874994,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://empiricrafting.substack.com/i/184721836?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!R4gQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png 424w, https://substackcdn.com/image/fetch/$s_!R4gQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png 848w, https://substackcdn.com/image/fetch/$s_!R4gQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png 1272w, https://substackcdn.com/image/fetch/$s_!R4gQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab33d2a3-4c63-4df5-b04f-c229e28d9328_2708x1578.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Human Facing Website of Parallel Web Systems</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Movp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Movp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png 424w, https://substackcdn.com/image/fetch/$s_!Movp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png 848w, https://substackcdn.com/image/fetch/$s_!Movp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png 1272w, https://substackcdn.com/image/fetch/$s_!Movp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Movp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png" width="380" height="329.1638795986622" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1036,&quot;width&quot;:1196,&quot;resizeWidth&quot;:380,&quot;bytes&quot;:167688,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://empiricrafting.substack.com/i/184721836?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Movp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png 424w, https://substackcdn.com/image/fetch/$s_!Movp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png 848w, https://substackcdn.com/image/fetch/$s_!Movp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png 1272w, https://substackcdn.com/image/fetch/$s_!Movp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cfdca53-3797-4594-b5e6-0884b642a814_1196x1036.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">AI Facing Website of Parallel Web Systems</figcaption></figure></div><p><em>If they build it, (the agents) will come. But will they build it?</em></p><p>The above vision is bottlenecked by the fact that many websites will not cooperate to make the parallel internet a reality, for both legitimate and illegitimate reasons. Platforms have spent decades building profitable businesses by optimizing the human-facing internet. A machine-readable layer threatens to bypass all of it.</p><p>Consider a platform that makes substantial revenue from its advertising business. That revenue depends on humans looking at screens. All of the sponsored product placements, the &#8220;featured&#8221; results, the whole ranking algorithm: all of this has been optimized based on human clickthrough data. An AI agent doesn&#8217;t care about position bias;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> it can theoretically evaluate ten thousand news feed items or products across multiple platforms in the time it takes you to scroll through the first page of search results.</p><p>If that agent is acting in your interest rather than the platforms, then this threatens its ability to optimize its advertising. Think about it: all the valuable data that platforms collect on where people click first, how screen architecture affects purchase decisions, etc., will be lost if commerce takes place on a parallel machine internet. Firms are indeed actively blocking attempts by people to deploy their AI agents on their behalf&#8211;the so-called Bring-Your-Own (BYO)<a href="https://www.nber.org/books-and-chapters/economics-transformative-ai/coasean-singularity-demand-supply-and-market-design-ai-agents"> agent</a>.</p><p>Eric Seufert, an analyst who writes extensively about this dynamic, puts it <a href="https://mobiledevmemo.com/agentic-commerce-is-a-mirage/">bluntly</a>: <em>the fundamental flaw with agentic commerce is that it violates the motivations of retail platforms to control the customer relationship and monetize their first-party data with advertising</em>. Or as Andrew Lipsman recently<a href="https://mediaadsandcommerce.substack.com/p/agentic-commerce-is-a-collective"> put it</a>: <em>Retailers don&#8217;t want agentic commerce. </em>We don&#8217;t think that things are this binary, there are reasons for why retailers benefit from agentic commerce, such as to expand their selection or to attract new customers. Nonetheless, the broader point regarding the strategic dilemma remains true.</p><p>They have a simple argument for why agentic commerce is further than it seems: the platforms that need to enable the parallel internet are precisely the ones with the strongest incentive to delay. The user-level behavioral data generated by browsing and purchasing is valuable, and because that data feeds advertising, recommendations, and pricing, platforms will drag their feet on any infrastructure that lets independent agents bypass it&#8212;even if they eventually have no choice.</p><p><strong>The second obstacle: AI agents may not have a legal right to act on your behalf</strong></p><p>Imagine you deploy an AI agent to shop for you. The agent logs into your Booking.com account using your credentials, stored locally on your device. It browses hotels, compares prices, and completes a purchase&#8212;all at your explicit direction, acting solely on your behalf.<br><br><em>Have you done anything wrong? Has your agent?</em></p><p>The answer is surprisingly unclear, and the current legal framework is not favorable to agents. The core question is whether a BYO agent inherits your rights to access a website. You, as a human, can browse Booking. You agreed to their Terms of Service. Does your agent automatically have the same permission?</p><p>Given the arguments above, perhaps it&#8217;s not surprising that platforms say <strong>no</strong>. Their argument has three parts:</p><p>First, Terms of Service typically prohibit &#8220;any use of data mining, robots, or similar data gathering and extraction tools.&#8221; An AI agent navigating a website arguably falls under this prohibition, even if it&#8217;s acting on a human&#8217;s instructions. Less scrupulous agent providers may indeed be using agents to scrape data for training purposes, so this is a legitimate concern.</p><p>Second, platforms argue that agents must identify themselves. When an AI agent disguises itself as a regular Chrome browser rather than announcing that it&#8217;s an automated tool, platforms claim this constitutes deception&#8212;and potentially fraud. For example, <a href="https://blog.cloudflare.com/perplexity-is-using-stealth-undeclared-crawlers-to-evade-website-no-crawl-directives/">Cloudflare</a> has accused Perplexity of using deception to evade no-crawling directives. Just as websites can require humans to identify themselves, it seems evident that websites should be able to require agents to identify themselves as acting on behalf of a particular human.</p><p>Third, and most importantly, platforms can revoke permission. A key precedent here is Facebook v. Power Ventures (2016), where the Ninth Circuit held that a third-party company that continued accessing Facebook after being told to stop was liable under the Computer Fraud and Abuse Act. The court&#8217;s language was stark: &#8220;Once permission has been revoked, technological gamesmanship will not excuse liability.&#8221;</p><p>This means a platform may not need to win the argument about whether your agent was initially authorized. It simply needs to tell the agent to leave. After that, continued access becomes &#8220;unauthorized&#8221; under federal computer fraud law&#8212;a statute that carries both civil and criminal penalties.<br><br>The counterargument to these three points is pretty intuitive: if you can hire a human personal shopper to buy things on your behalf, why can&#8217;t you hire an AI one? But the law, as currently interpreted, doesn&#8217;t recognize this equivalence. A human personal shopper is still a human using the website in the normal way. An AI agent is software&#8212;and software can be prohibited by Terms of Service in ways that human access cannot.</p><p>This creates an asymmetry with real consequences. Platforms can develop bowling-shoe agents while blocking BYO agents. The agents you&#8217;re allowed to use are the ones controlled by the platform&#8212;which may not be aligned with you.</p><p><strong>The case for protecting independent agents</strong></p><p>Now let&#8217;s outline the case for protecting BYO agents&#8217; ability to act on their owner&#8217;s behalf. The arguments for allowing users to bring their own AI agents are straightforward extensions of existing consumer protection logic.</p><p><em>The competition argument:</em></p><p>Start with bounded rationality. Humans can only visit so many websites, compare so many options, and process so much information before making a purchase. The entire architecture of modern e-commerce is optimized around these limitations. The reason that ranking algorithms matter and that companies try hard to learn user preferences is that users will leave if they don&#8217;t see relevant results right away. At the same time, because of limited attention, users may not find the best option for them.</p><p>An independent agent changes this calculation. A machine can evaluate thousands of options across many platforms. It doesn&#8217;t get tired. It doesn&#8217;t succumb to urgency cues or limited-time offers. It doesn&#8217;t mistake &#8220;featured&#8221; for &#8220;best.&#8221; If agents become widespread, retailers offering genuinely better deals become discoverable in ways they currently are not. <em>Competition increases.</em></p><p><em>The precedent argument</em></p><p>There&#8217;s also a simple precedent argument. We already permit humans to hire personal shoppers. We allow browser extensions that apply coupons or track prices. We don&#8217;t prohibit consumers from visiting multiple websites before making a purchase. The principle that consumers can seek assistance in navigating markets is well established. The question is why AI assistance should be treated differently than human assistance&#8212;particularly when the AI is acting on explicit user instructions, using the user&#8217;s own credentials, for the user&#8217;s sole benefit.<br></p><p>Platforms offer several counterarguments, some more legitimate than others.</p><p>The first is safety. AI agents can be tricked. They&#8217;re vulnerable to prompt injection attacks, phishing schemes, and adversarial manipulation. An agent that autonomously enters payment information could be exploited in ways a human would catch. This is a real concern&#8212;though it&#8217;s worth noting that platforms have strong incentives to exaggerate it, and that the appropriate response is security standards for agents rather than outright prohibition. In fact, we can imagine platforms or third-parties certifying specific agents as being &#8216;safe&#8217; for various use-cases.</p><p>The second is enforcement. How do you distinguish a legitimate user agent from a scraper harvesting data for resale? From a bot placing fake orders? From a competitor conducting automated price surveillance? Platforms have legitimate interests in preventing abuse, and agent identification is one mechanism for doing so. A platform or website should be able to require an agent acting on behalf of a user to identify itself as an AI agent for a given user.</p><p><br>The third is user experience. Platforms may claim that agents degrade the shopping experience&#8212;they might not select the best delivery option, might miss relevant product information, might create problems with returns. This concern is harder to take at face value. Customers willingly using an AI agent are presumably accounting for a given agent&#8217;s capabilities and flaws. We expect that competition among AI agent providers will result in high-quality agents that improve shopping experiences.<br></p><p><strong>A regulatory framework</strong></p><p>Any workable framework will have to look roughly like this. Users have the right to deploy AI agents on any platform they can access as a human, provided:</p><ul><li><p>The agent operates through the user&#8217;s own browser and credentials.</p></li><li><p>Acts only at the user&#8217;s direction.</p></li><li><p>Identifies itself as an AI agent operating on behalf of a specific user.</p></li><li><p>Does not engage in data harvesting beyond what&#8217;s necessary for the user&#8217;s transaction.</p></li></ul><p>The technology to implement this already exists; see, for example, the protocol for <a href="https://arxiv.org/abs/2408.07892">personhood credentials</a> that can be used to identify agents as belonging to a specific user. Platforms can set reasonable security requirements for agent identification, but cannot categorically ban agents or reserve agentic capabilities for their own tools.</p><p>Our proposal preserves platform interests in security and abuse prevention while establishing that consumers have a right to technological assistance in navigating markets&#8212;the same right they&#8217;ve always had to hire an agent, use a price comparison site, or simply shop around. Importantly, if the regulatory framework for agentic commerce is in place, then this would also incentivize third parties to create the parallel machine-readable internet that represents the first obstacle.</p><p><em>Note, one of us, Andrey, is currently employed by Amazon, Inc. This essay represents his personal views and not those of the company.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>As measured by benchmarks such as ScreenSpot Pro, BrowseComp, and Tau-retail bench.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Although see <a href="https://arxiv.org/abs/2508.02630">this</a> paper for some evidence that AIs may still have position bias.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Anecdotes from AI Supercharged Science]]></title><description><![CDATA[Justified Posteriors reads "Early Science Acceleration Experiments with GPT-5"]]></description><link>https://empiricrafting.substack.com/p/anecdotes-from-ai-supercharged-science</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/anecdotes-from-ai-supercharged-science</guid><dc:creator><![CDATA[Seth Benzell]]></dc:creator><pubDate>Tue, 13 Jan 2026 00:18:17 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/184361134/323ec057105894cd54a5793790d744be.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<h3><strong>Anecdotes of AI Supercharged Science: Justified Posteriors reads &#8220;Early Science Acceleration Experiments with GPT-5&#8221;</strong></h3><p>In this episode, Seth and Andrey break down OpenAI&#8217;s report, <em><a href="https://arxiv.org/abs/2511.16072">Early Science Acceleration Experiments with GPT-5</a></em>. The paper is organized as a series of anecdotes about how top scientists used an early version of GPT-5 in their scientific investigations. The coauthors of the papers try out the model to help them with everything from Erd&#337;s&#8217; unsolved math problems to understanding black hole symmetries to interpreting the results of a biological experiment. <br><br>Seth and Andrey&#8217;s priors revolve around whether current models are closer to a &#8220;superpowered lit review&#8221; or a genuine co-author. They bring in how they currently use LLMs in their own economic research&#8212;from coding assistance to "middle-brow" theorizing&#8212;before diving into the paper&#8217;s anecdotes. They also discuss the economics of AI science and whether AI can ever achieve a Kuhnian paradigm shift. A key question is what is the main bottleneck to more useful AI tools for math and science &#8212; is it the model&#8217;s reasoning capability or simply the lack of translation layers into formal proof systems like Lean?</p><h3><strong>Priors</strong></h3><p><strong>Hypothesis 1: What is the most promising paradigm for AI in Science today and 5 years from now?</strong> (The four paradigms: Recreating frontier science, Superpowered Lit Review, Working with AI/Co-working, and AI on its own).</p><ul><li><p><strong>Andrey&#8217;s View:</strong></p><ul><li><p><em>Today:</em> <strong>&#8220;Working with AI&#8221;</strong> (Co-working) is the primary mode. It doesn&#8217;t automate the job but makes the human significantly more productive.</p></li><li><p><em>In 5 Years:</em> <strong>&#8220;Working with AI&#8221;</strong> remains the dominant mode. While &#8220;AI on its own&#8221; is the holy grail, he believes human-AI collaboration will still be the standard, though the tasks will shift higher up the stack.</p></li></ul></li><li><p><strong>Seth&#8217;s View:</strong></p><ul><li><p><em>Today:</em> <strong>&#8220;Superpowered Lit Review&#8221;</strong> is the clearest &#8220;no-downside win.&#8221; Checking if a problem is already solved offers massive efficiency gains without the risk of hallucination inherent in creative work.</p></li><li><p><em>In 5 Years:</em> <strong>&#8220;AI on its own&#8221;</strong>&#8212;but with a major caveat based on Thomas Kuhn&#8217;s philosophy. Seth predicts AI will be capable of autonomous &#8220;Normal Science&#8221; (puzzle solving within a paradigm) but skeptical it can achieve &#8220;Revolutionary Science&#8221; (creating new paradigms like molecular motion theory or relativity).</p></li></ul></li></ul><p><strong>Hypothesis 2: How impressed will we be by the anecdotes in this report?</strong> (On a scale of 0 to 10, where 10 is &#8220;Holy Sh*t / Curing Cancer&#8221; and 0 is &#8220;Trivial&#8221;).</p><ul><li><p><strong>Andrey&#8217;s View:</strong></p><ul><li><p><em>Estimate:</em> <strong>&#8220;Pretty Impressed&#8221; (Implied ~7/10)</strong>.</p></li><li><p><em>Reasoning:</em> He does not expect a &#8220;Holy Sh*t&#8221; moment (like curing cancer or solving the Riemann hypothesis) because those results take years to verify or diffuse. However, he expects to see strong productivity gains in &#8220;middle-brow&#8221; theory.</p></li></ul></li><li><p><strong>Seth&#8217;s View:</strong></p><ul><li><p><em>Estimate:</em> <strong>7 or 8 out of 10</strong>.</p></li><li><p><em>Reasoning:</em> He prices in that this is a &#8220;highly selected sample&#8221; from OpenAI marketing. He expects to be impressed but skeptical of direct practical applications (e.g., a medical treatment we can use in the near future).</p></li></ul></li></ul><h3><strong>Links + Shownotes</strong></h3><ul><li><p><strong><a href="https://arxiv.org/abs/2511.16072">Early Science Acceleration Experiments with GPT-5</a></strong> &#8211; The central paper of the episode by S&#233;bastien Bubeck, Timothy Gowers, and others (OpenAI/arXiv, Nov 2025).</p></li><li><p><strong><a href="https://arxiv.org/abs/2303.12712">Sparks of Artificial General Intelligence: Early experiments with GPT-4</a></strong> &#8211; The predecessor paper by Sebastian Bubeck et al. (for context on the &#8220;Early Experiments&#8221; series).</p></li></ul><h3><strong>Scholars Mentioned</strong></h3><ul><li><p><strong><a href="https://bengolub.net/">Benjamin Golub</a></strong> &#8211; Podcast guest in a recent episode; Professor of Economics and Computer Science at Northwestern University. We say the episode with Golub is upcoming, but it&#8217;s already out! <a href="https://empiricrafting.substack.com/p/ben-golub-ai-referees-social-learning?utm_source=profile&amp;utm_medium=reader2">Check it out here</a>. </p></li><li><p><strong><a href="https://www.dpmms.cam.ac.uk/~wtg10/">Timothy Gowers</a></strong> &#8211; Fields Medalist and co-author of the paper</p></li><li><p><strong><a href="https://www.google.com/search?q=https://sbubeck.com/">S&#233;bastien Bubeck</a></strong> &#8211; Lead author of the paper and researcher at OpenAI.</p></li><li><p><strong><a href="https://www.math.ucla.edu/~tao/">Terence Tao</a></strong> &#8211; Fields Medalist mentioned for his use of AI in mathematics.</p></li><li><p><strong><a href="https://plato.stanford.edu/entries/lakatos/">Imre Lakatos</a></strong> &#8211;  A philosopher of science</p></li><li><p><strong><a href="https://marginalrevolution.com/">Tyler Cowen</a></strong> &#8211; Economist mentioned regarding the concept of &#8220;Writing for the AI.&#8221;</p></li><li><p><strong>Paul Erd&#337;s Problems</strong> &#8211; The <a href="https://erdosproblems.com/">unsolved problems</a> of this famously prolific mathematician were used as a benchmark.</p></li></ul><h3><strong>Tools &amp; Technology</strong></h3><ul><li><p><strong><a href="https://www.google.com/search?q=https://refine.inc/">Refine.inc</a></strong> &#8211; The AI-for-science tool co-founded by Ben Golub.</p></li><li><p><strong><a href="https://leanprover.github.io/">Lean</a></strong> &#8211; The theorem prover and programming language discussed as a potential bottleneck/accelerant for checking AI math.</p></li><li><p><strong><a href="https://elicit.com/">Elicit</a></strong> &#8211; The AI research assistant mentioned by Andrey for literature reviews.</p></li><li><p><strong><a href="https://pangram.com/">Pangram Labs</a></strong> &#8211; The AI text detection tool mentioned in the context of scientific writing.</p></li></ul><h3><strong>Concepts &amp; Philosophy</strong></h3><ul><li><p><strong><a href="https://plato.stanford.edu/entries/thomas-kuhn/">The Structure of Scientific Revolutions</a></strong> &#8211; Thomas Kuhn&#8217;s foundational text on &#8220;Normal Science&#8221; vs. &#8220;Paradigm Shifts.&#8221;</p></li><li><p><strong><a href="https://www.google.com/search?q=https://www.investopedia.com/terms/l/lucas-critique.asp">The Lucas Critique</a></strong> &#8211; Economic theory mentioned by Seth regarding a recent economic paradigm shifts.</p></li></ul><p></p><h3><strong>Transcript:</strong><br> </h3><p><br><strong>[00:00] Seth Benzell:</strong> Welcome to the Justified Posteriors podcast, the podcast that updates its beliefs about the economics of AI and technology. I&#8217;m Seth Benzell, sharing helpful ideas that come naturally to me, but not quite big enough a contribution to demand co-authorship, at Chapman University in sunny Southern California.</p><p><strong>[00:33] Andrey Fradkin:</strong> And I&#8217;m Andrey Fradkin, experimenting with numerous ways to use AI in order to make the trivial parts of my work take way less time. But then again, maybe all parts of my work are trivial. Coming to you from San Francisco, California.</p><p><strong>[00:53] Seth:</strong> All right, Andrey. Coming out the gate against himself.</p><p><strong>[00:58] Andrey:</strong> That&#8217;s the only way I know how to be, Seth. That&#8217;s the only way.</p><p><strong>[01:03] Seth:</strong> Well, I mean, maybe that&#8217;s a good place to start. I know that you use LLMs all the time as part of your research. We could talk a little bit as we go along about how you use it now, but maybe you could tell me: how do you use it now and how would your dream AI assistant help you with research? Is your dream to completely delegate it? What would be a reasonable near-term dream? What do you have and what do you want?</p><p><strong>[01:31] Andrey:</strong> Yeah. Wow. I didn&#8217;t realize it was already Christmas. Readers, we&#8217;re recording this in November, so it&#8217;s not quite there yet.</p><p><strong>[01:41] Seth:</strong> Mariah Carey is on the way, dude.</p><p><strong>[01:44] Andrey:</strong> So, look, I use it all the time. And I proactively use it because I&#8217;m always trying to figure out what it&#8217;s capable of doing and what it&#8217;s not capable of doing. You know, in terms of the science part of our work&#8212;which is a big part of it, but a lot of what we do is also presentation, communication, reimbursement requests...</p><p><strong>[02:12] Seth:</strong> [Laughs] Reimbursement requests.</p><p><strong>[02:14] Andrey:</strong> Yeah. But in terms of science, some parts of my work require some math, right? Not very complicated math. And I&#8217;ve been using the latest generation of AIs to see how well it does there. And, you know, it&#8217;s pretty good, honestly. It definitely requires oversight. Like, I wouldn&#8217;t trust it to just <em>do</em> it. But with some iteration, it has given me good results and it&#8217;s allowed me to check some of my results. And once we&#8217;re kind of agreed&#8212;me and the model&#8212;on what the results are, it&#8217;s very efficient at writing it up. And even doing things like, &#8220;Oh, create a simulation based on this model,&#8221; or &#8220;Create an interactive visualization based on this model.&#8221; So I think that sort of work, it&#8217;s already pretty good at.</p><p><strong>[03:17] Seth:</strong> Actually, can I ask a quick question here before you go on? You&#8217;ve described it as a system that is maybe like... it guesses and then you have to check it. So you have this sort of iteration. You say, &#8220;Solve for the equilibrium of this model,&#8221; and you&#8217;re not guaranteed that the first output is going to be correct. So that&#8217;s a sense in which the AI is proposing solutions and you&#8217;re the verifier. But you also find it useful for the opposite, right? Where you have an intuition about a result and then <em>it&#8217;s</em> the verifier. Should I notice a contradiction there?</p><p><strong>[03:56] Andrey:</strong> I don&#8217;t think it&#8217;s a contradiction. I think as with any results or ideas, we want to battle-test it, right? And that could go in either direction. It&#8217;s kind of like when you give an academic seminar. You&#8217;re going to present some work and you&#8217;re going to get feedback from a bunch of people. Some of it might be good, some of it might be bad. But you might also go to your co-author and they might create something new. So I don&#8217;t view it as a contradiction. I guess one way to think about it is that it&#8217;s not omniscient, right? So it isn&#8217;t like doing things end-to-end without my judgment yet. I can&#8217;t just give it a prompt and then it finishes the entire task.</p><p><strong>[04:54] Seth:</strong> It sounds kind of like a colleague with some knowledge in the domain.</p><p><strong>[04:59] Andrey:</strong> Yes, exactly.</p><p><strong>[05:01] Seth:</strong> It might be able to propose an answer that isn&#8217;t necessarily right, and it might find a flaw in one of your ideas&#8212;those aren&#8217;t necessarily right either&#8212;but you would never use it as its own end-to-end proof to write it up and present it at Columbia.</p><p><strong>[05:19] Andrey:</strong> Yeah, yeah. And then the other thing is... what I&#8217;ve been talking about is more on the theoretical side. And certainly, I&#8217;m not a theorist, so it&#8217;s not like I&#8217;m doing very complicated things there. But on the empirical side, it&#8217;s also very useful. And once again, I found that it&#8217;s not giving me end-to-end results. If I just told it, let&#8217;s say, &#8220;Hey, I have this natural experiment and I&#8217;d like you to measure the causal effect,&#8221; it&#8217;s definitely not going to give me what I want. And maybe that&#8217;s underspecified. Or maybe it doesn&#8217;t have my taste for what type of evidence I like. But once I give it enough&#8212;maybe an initial sketch of the identification strategy&#8212;it can very easily automate. Let&#8217;s say I did this for one country and I want to replicate that analysis for another country...</p><p><strong>[06:30] Seth:</strong> I want you to use rainfall as an instrument.</p><p><strong>[06:32] Andrey:</strong> Yeah. &#8220;I did the analysis for one country, now replicate that analysis for another country, compare the results.&#8221; That sort of work, I think it&#8217;s quite good at, especially some of the very, very latest models.</p><p><strong>[06:47] Seth:</strong> Okay. I mean, it sounds like that&#8217;s pretty capable. What does it <em>not</em> do that you&#8217;re looking forward to in the next round of models where you&#8217;re still engaging with it collaboratively and it has not completely taken your job?</p><p><strong>[07:02] Andrey:</strong> Um. It&#8217;s not very good at coming up with <em>new</em> ideas right now. Like, you know, if you had a very capable graduate student, you might give that graduate student a direction and then they come back and surprise you with the things that they&#8217;ve done. I don&#8217;t see that happening. Maybe I&#8217;m not using it correctly, but that would be very nice. Ultimately, you&#8217;d want to have it have a list of ideas and you decide, &#8220;Hey, go do that,&#8221; and it just does it. But I&#8217;m curious, Seth, how do you use it and how have you been thinking about it?</p><p><strong>[07:49] Seth:</strong> That&#8217;s a good question. I would say on the theory side, I&#8217;ve definitely used it for, &#8220;I think this theory is correct, can you work through the details?&#8221; or &#8220;Here&#8217;s my sketch of a proof, can you formalize it?&#8221; Definitely, at least the way I use it, it&#8217;s been hit or miss. I&#8217;m mostly using the GPT models. When it hits, it hits really nice. Sometimes you&#8217;ll find nicer functional forms, or it&#8217;ll simplify it in a way that maybe you hadn&#8217;t thought about. So I found it useful for kind of middle-brow theory. We&#8217;re not doing high-brow theory; we&#8217;re doing, you know, &#8220;Here&#8217;s an IO context and there&#8217;s two businesses and they&#8217;re playing a game&#8221; kind of theory.</p><p><strong>[08:47] Seth (continuing):</strong> In terms of data analysis, I&#8217;ve mostly been working with it in terms of very short segments. Like, &#8220;I need a block of code that gets me from this data format to that data format,&#8221; rather than just saying, &#8220;Here&#8217;s a bunch of data, run this analysis.&#8221; I&#8217;m not saying you can&#8217;t do that, but I haven&#8217;t worked myself up to that yet. One of the reasons I guess I&#8217;m cautious about that is I have some undergraduate research assistants here who engage with the AI that way. And if you&#8217;re not sophisticated, you get some real garbage that way, right?</p><p><strong>[09:27] Seth (continuing):</strong> Where you go like, &#8220;Hey, I thought that the way we talked about this, this graph should be monotonically decreasing, and it&#8217;s not.&#8221; And if you&#8217;re not in the data construction every step of the way, if something fails a sanity check, you have to dig through all of this code to try to figure out what went wrong. So that&#8217;s kind of where I&#8217;m at right now.</p><p><strong>[09:48] Andrey:</strong> But I guess I&#8217;m surprised, Seth. So like, to me, unless it&#8217;s a truly excellent undergraduate, this completely obviates the need for undergraduate research assistants. I actually see no reason I&#8217;d use one of them for any of this type of work, to be clear. It takes me way more time to explain to an undergraduate research assistant what I want them to do, and I&#8217;d get back probably worse work than me talking to Opus for coding or GPT-5 for math.</p><p><strong>[10:31] Seth:</strong> Ex-post, you&#8217;re completely correct. Ex-post, you nailed it. I guess the one thing I would add is, like we talked about in our &#8220;Canaries in the Coal Mine&#8221; episode, one of the reasons you work with young people and interns is not because they are right now the most optimal performers. It&#8217;s, you know, you want to contribute to their development so that they understand and they&#8217;re part of the learning and discovery process. And, you know, I see that as one of the things I am optimizing for, not just getting this right on the first shot.</p><p><strong>[11:09] Andrey:</strong> Yeah, yeah. I mean, I&#8217;m with you. I think often times... if that&#8217;s structured correctly, then I&#8217;m with you. But a lot of the time...</p><p><strong>[11:21] Seth:</strong> A lot of time no one learns anything and everyone gets frustrated.</p><p><strong>[11:24] Andrey:</strong> Yeah, I wanted to word it delicately. No one learns anything. It&#8217;s a &#8220;make-work&#8221; type arrangement. You know, a lot of undergraduates&#8212;certainly when I was an undergraduate, I&#8217;m not saying I was that different&#8212;they have many priorities. They&#8217;re not even really focused on whatever it is you tell them to do.</p><p><strong>[11:46] Seth:</strong> More exciting than working with Professor Fradkin? I can&#8217;t even imagine.</p><p><strong>[11:51] Andrey:</strong> God, yeah. Everything.</p><p><strong>[11:57] Seth:</strong> Watching paint dry. Watching paint dry while stapling my hand.</p><p><strong>[12:02] Seth (continuing):</strong> Okay, so why are we talking about AI research assistants, Andrey? The reason I brought it up is, well, first of all, I want to tease that we might have friend of the show <strong>Ben Golub</strong> coming on in the coming weeks who will be talking to us about his new tool for AI for Science, <strong>Refine.inc</strong>, that we&#8217;re super excited to learn about.</p><p><strong>[12:27] Andrey:</strong> So just to be clear, it&#8217;s called Refine.inc. You should check it out.</p><p><strong>[12:35] Seth:</strong> Make sure to not sign up until <em>after</em> you hear our podcast so that he understands that the bump comes from us.</p><p><strong>[12:44] Andrey:</strong> We are going to Granger-cause so many signups. You&#8217;re not going to believe it.</p><p><strong>[12:50] Seth:</strong> You will not believe the Granger causality. Exactly. We&#8217;ll have to instrument for our analysis with rainfall. Okay. So, to kind of prep for that interview, we wanted to do some reading about, okay, we know how <em>we</em> use AI in science, how do <em>other</em> people use AI in science? And so we read this very interesting paper out of OpenAI called &#8220;Early Science Acceleration Experiments with GPT-5.&#8221; Andrey, would you like to read the list of authors?</p><p><strong>[13:28] Andrey:</strong> It&#8217;s a pretty long list of authors, so I&#8217;d rather not actually. But I think the main author is <strong>Sebastian Bubeck</strong>, who actually works at OpenAI. But there are various luminaries on it, including Fields Medalist <strong>Timothy Gowers</strong>. So it&#8217;s a pretty impressive lineup. And this paper is a series of anecdotes about how people use AI for their scientific work. So before we get into some of these anecdotes, why don&#8217;t we do our priors, Seth?</p><p><strong>[14:10]</strong> <em>[Music / Transition]</em></p><p><strong>[14:16] Seth:</strong> Okay. So, Andrey. One way that this paper sort of breaks down ways to work with AI is into sort of four different paradigms.</p><ol><li><p><strong>Recreating Frontier Science:</strong> You might imagine this is kind of like the &#8220;double-checking&#8221; paradigm.</p></li><li><p><strong>Superpowered Lit Review:</strong> Can we dig up some connection that might be helpful or save some time for the researchers?</p></li><li><p><strong>Working with AI:</strong> Which kind of sounds close to what you talked about recently, which is, you get the AI to make a guess, you iterate with it, you make a guess, you go back and forth.</p></li><li><p><strong>AI on its Own:</strong> You just say, &#8220;Hey AI, solve global warming, go.&#8221;</p></li></ol><p>So across those four paradigms, which do you think is most promising, which is most useful <em>today</em>, and which do you think will be the most useful <em>five years from now</em>?</p><p><strong>[15:19] Andrey:</strong> Yeah, that&#8217;s a great question. I mean, today I think the obvious answer is &#8220;Working with AI.&#8221; I mean, I think like with most jobs, we are unlikely to see full automation today. To be clear. But working with the AI can make you a lot more productive. It&#8217;s already made me a lot more productive. It&#8217;s making a lot of people more productive that I talk to. You know, some people are skeptical. They think that just because I <em>think</em> it&#8217;s making me more productive doesn&#8217;t mean that that&#8217;s actually true, but I disagree with them.</p><p><strong>[16:01] Seth:</strong> Compensating differentials regarding productivity.</p><p><strong>[16:04] Andrey:</strong> Yeah, yeah. But even without compensating differentials, I guess. I guess in the future, even let&#8217;s say five years from now, I still expect this to be the primary mode. Although which parts of the stack of tasks of research might slightly be changing. I think obviously AI on its own doing research is a &#8220;Holy Grail.&#8221; Certainly, it is a motivating vision for many of our discussions previously in this podcast, including situational awareness from the very beginning.</p><p><strong>[16:44] Seth:</strong> Line go up from village idiot to superintelligence.</p><p><strong>[16:47] Andrey:</strong> Yeah. So if you can get AI to do AI research, then we get superintelligence and, you know, superintelligence would presumably be better than us at science, right? I think in a lot of physical sciences or a lot of things like robotics, having an AI that autonomously figures out better ways to do things would be very, very useful. The extent to which that&#8217;s actually possible... one, depends on the level of intelligence, obviously. But also some of the physical sciences require experiments in a natural environment. Or at the very least a very, very high-fidelity simulation. And we&#8217;ll see whether that happens in the next five years or where it happens. But if I were a betting man, I would still think that &#8220;Working with AI&#8221; is the primary use case.</p><p><strong>[17:51] Seth:</strong> Both today and in five years. Okay. Well, so I&#8217;m happy to have a little bit of disagreement with you here. Which is... it really does seem like the use case which is the most obvious &#8220;no downside&#8221; win here is the <strong>Superpowered Literature Review</strong>. I think that when you think about deciding to launch on a project, being able to say, &#8220;How much of this project has already been solved?&#8221;... If you can discover someone has done your thing already 10% more of the time, that&#8217;s such a huge win. And you don&#8217;t have to rely so much on trusting the AI&#8217;s agency on its own.</p><p><strong>[18:38] Seth (continuing):</strong> I guess I would also follow up that obviously superpowered lit review can be <em>part</em> of working with AI. But I guess I&#8217;m still a little bit more cautious about someone who&#8217;s less responsible than you, Andrey, taking the AI&#8217;s first guess as gospel and then running off too far in a direction from that and losing some of the time that they think they&#8217;re making up. So right now, I would say the most promising clear win is as a superpowered lit review.</p><p><strong>[19:11] Seth (continuing):</strong> Five years from now, I think we have a couple of questions here. Maybe a useful distinction here is between <em>within-paradigm</em> science and <em>post-paradigmatic</em> or <em>pre-paradigmatic</em> science. So our favorite philosopher of science, <strong>Kuhn</strong>, distinguishes between this idea... (Andrey: Hey, speak for yourself!) Who&#8217;s your favorite philosopher of science? Help me out.</p><p><strong>[19:35] Andrey:</strong> What if I said Lakatos? Or Popper? I don&#8217;t know.</p><p><strong>[19:41] Seth:</strong> Oh my god. Popper? Listen, it&#8217;s easy to falsify Popper&#8217;s falsifiability, right? So there you go.</p><p><strong>[19:48] Andrey:</strong> To be clear, I like all of my philosophers of science equally. Except Feyerabend... whatever.</p><p><strong>[19:59] Seth:</strong> Exactly.<br><br><strong>[20:00] Seth Benzell:</strong> Yeah. Except for people who think, you know... except for Foucault who thinks science isn&#8217;t real. Okay, but... so, coming back. What does Kuhn say? Kuhn says there&#8217;s kind of two kinds of science. There&#8217;s science which sort of fills in details and makes connections within a well-established paradigm. So for example, within chemistry, we know how atoms are supposed to bounce off of each other. There&#8217;s a lot of details to be worked out about, you know, how would <em>this</em> atom bounce into <em>that</em> atom, and how do you select pairs of atoms in order to make a cool material. But there&#8217;s nothing... at least as far as I know, there&#8217;s not a lot of paradigm busting going on. You know, we had some hope about that room temperature superconductor recently&#8212;that was a bust.</p><p><strong>[20:46] Seth (continuing):</strong> Pre- or post-paradigmatic science would be: &#8220;Hey, you know, we&#8217;re working within a system for a long time and these anomalies are starting to accumulate,&#8221; right? So in Newtonian mechanics, it was like, &#8220;Hey, Venus is like a little bit slow compared to the way we thought that Venus was supposed to move.&#8221; So... oh, there used to be the Phlogiston theory of heat, right? That heat was like a substance that would flow between two materials. And like, that explains <em>some</em> good stuff about how heat works, right? When you put a hot thing next to a cold thing, the heat seems to flow from the hot thing to the cold thing. But there were anomalies there, right? So Phlogiston theory of heat couldn&#8217;t explain heat through mixing, right? So if you rub your hands together, they get hot. Okay, where did that heat come from? It wasn&#8217;t Phlogiston, right? Because you just made it from nothing.</p><p><strong>[21:35] Seth (continuing):</strong> So there&#8217;s this question of not &#8220;how do you work out the details of a given approach,&#8221; but rather &#8220;how do you come up with a radically different approach?&#8221; Now in economics, we&#8217;re pretty happy with our paradigm. I gotta say. I like my paradigm. You don&#8217;t like our paradigm?</p><p><strong>[21:55] Andrey Fradkin:</strong> Come on, man.</p><p><strong>[21:59] Seth:</strong> [Laughs] All right. Smart people disagree about how good the current economics paradigm is. But whether or not you like it, there&#8217;s this question of: Would AI be capable of making these genius, you know, I don&#8217;t know, world-historical leaps of an Einstein or of a guy who invented molecular motion theory of heat?</p><p><strong>[22:27] Seth (continuing):</strong> So... and like, I guess that&#8217;s in my head the thing you would have to be capable of in principle to be like a &#8220;full scientist,&#8221; right? Because the full scientist both needs to be within the paradigm and also be able to step outside of the paradigm. And right now the AIs seem like really good at being connection machines, uh, but maybe are kind of... and maybe this is a taste issue because once you&#8217;re outside of a paradigm, the kind of guardrails kind of come off and taste becomes a big part of it. I&#8217;m less excited about AI being able to move in that direction. Or at least I think that&#8217;s a less promising direction. So to answer the... the question, the prior, I would say: Right now, Superpowered Lit Review. And uh, you know, AI on its own, I think maybe <em>within</em> a paradigm, but not expanding to new paradigms in five years.</p><p><strong>[23:19] Andrey:</strong> Yeah, yeah. I mean, I mostly agree with you. I guess I think paradigm shifts... it&#8217;s hard to really know what <em>one</em> is. One way to think about it, like... we&#8217;re most familiar with economics. And we&#8217;ve been in this field for what, about, you know, 15, 20 years, right?</p><p><strong>[23:41] Seth:</strong> So Lucas Critique would probably be the last big one?</p><p><strong>[23:44] Andrey:</strong> Yeah, but I... you know, I guess I don&#8217;t know if that&#8217;s even a paradigm shift. In the following sense: like, it&#8217;s not like no one before Lucas had thought of these ideas. Lucas formalized them in some way. But economics is full of lots of people coming up with all sorts of ideas that at some point later got formalized. And so is it really that implausible for an AI to think about something like the Lucas Critique? I mean it&#8217;s... it&#8217;s truly... I mean that&#8217;s the thing about paradigm shifts. Like true ones... Or another way to put it: like, we think of like Einstein, right? But I&#8217;d say field experience much smaller types of paradigm shifts. If a paradigm shift to causal identification that we experienced in economics&#8212;I would actually say that&#8217;s much more of a paradigm shift if we look at like what happened after than maybe even the Lucas Critique.</p><p><strong>[24:49] Andrey (continuing):</strong> But it&#8217;s not that crazy to think that an AI would... you know, it was already of interest what a causal effect <em>is</em> and the AI might be able to say, &#8220;Hey, like, we can&#8217;t really say that this is causal from, you know, this regression you ran, and so we need something different.&#8221; And maybe I&#8217;ll think really hard about, maybe there&#8217;s a way to make an argument about something being causal.</p><p><strong>[25:12] Andrey (continuing):</strong> You know, one of the things that I&#8217;m particularly optimistic about&#8212;you know, and this is a sidebar as usual&#8212;is just that a lot of science, if we can simulate the process with accuracy, then we can optimize and we can learn causal mechanisms. That means we can actually do science <em>on the simulation</em>. And so to the extent that the AI is a computer... you know, is essentially a code&#8212;it thinks in code...</p><p><strong>[25:47] Seth:</strong> Like a brain in a vat.</p><p><strong>[25:48] Andrey:</strong> Yeah, it thinks in code. It could be potentially very, very powerful for that. And I wouldn&#8217;t, you know, say that something that comes out of that <em>wouldn&#8217;t</em> be paradigm shifting potentially. So yeah. I would say like, because paradigm shifts are actually just... true ones are just very hard to... you don&#8217;t know what they&#8217;re going to be ahead of time. I&#8217;m not going to say that the AI can&#8217;t do it. That&#8217;s kind of my position here.</p><p><strong>[26:12] Seth:</strong> Right. And I guess AI itself is such a cool new radical paradigm that it would be too early to say that we won&#8217;t get paradigm shifts out of it.</p><p><strong>[26:19] Andrey:</strong> Yes, exactly.</p><p><strong>[26:22] Seth:</strong> All right. How about a second prior for you? Which is just kind of a qualitative one because I&#8217;m not exactly sure how to put numbers on this. If you want to put numbers on it, go for it. Maybe you can denominate this in, you know, CCs of adrenaline.</p><p><strong>[26:36] Andrey:</strong> Yeah.</p><p><strong>[26:38] Seth:</strong> How impressed do you think you&#8217;ll be by the most impressive anecdote in this list of about 10 or 12 they give us? On a scale from &#8220;Eh&#8221; to... I don&#8217;t know. I&#8217;m not allowed to curse anymore so... imagine intensifier of your choice.</p><p><strong>[26:57] Andrey:</strong> Seth said the word &#8220;shit&#8221; on this... Look, I, you know, I expect to be pretty impressed. Not like &#8220;Holy Shit&#8221; impressed. I think a &#8220;Holy Shit&#8221; sort of impression would be like solving one of the, you know, long-standing open problems in mathematics or something like that. Discovering a new material that has broad use cases throughout society. You know, curing cancer. That I guess that would be...</p><p><strong>[27:30] Seth:</strong> Yeah that would get you out of your bed. Get you out of your chair if you cured cancer. There we go.</p><p><strong>[27:35] Andrey:</strong> Well, I mean, that would be like the extreme. I think it&#8217;s interesting to think through those examples. Like the math one, you know, I can&#8217;t verify it. Obviously I&#8217;m not a mathematician, but it&#8217;s kind of clear that there are certain open problems and if they are solved...</p><p><strong>[27:51] Seth:</strong> Andrey, you&#8217;re a podcaster. You&#8217;re higher than a mathematician.</p><p><strong>[27:55] Andrey:</strong> Yeah, well. Some people, you know, are called to the truly noble pursuits. Um. Yeah, so I can&#8217;t verify it. But you know if the mathematics community says, &#8220;Hey this is solved and the AI solved, you know, some open-standing problem,&#8221; you know that that would be really impressive. I think things like, you know, let&#8217;s say biological sciences... even if we found a cure for cancer today, you know, by the time that will be recognized within society that will take a long time.</p><p><strong>[28:30] Andrey (continuing):</strong> And I actually expect that no matter... even if the AI plays a pivotal role, the way that it will be reported on might be like, &#8220;Well, we used the AI to screen for some initial candidates and then we tested it in mice and then we tested it in humans.&#8221; Like, it&#8217;s less likely that there&#8217;s going to be this &#8220;Eureka&#8221; type, &#8220;Oh, we got him,&#8221; you know, sort of moment.</p><p><strong>[28:53] Seth:</strong> Right. There are ten pivotal... like yes. In bringing a drug to market there&#8217;s ten pivotal steps and maybe like three of them the AI could do, right?</p><p><strong>[29:00] Andrey:</strong> Yeah. And we already like use AI all over the place, right? For various statistical type processes in research in the medical sciences, right? So it&#8217;s not... yeah. You know, if you think about like Generative AI end-to-end reasoning through the solution, maybe one version of this... But another version of it is like we have, you know, some predictive model that says that <em>this</em> is the one. This is the molecule that will do it, you know?</p><p><strong>[29:33] Seth:</strong> Okay. Um. I guess from this example, I kind of want to price in the fact... or like, <em>not</em> price in the fact that this is going to be like a highly selected sample. This is from OpenAI. You just talked about how, you know, the Nobel Laureate biologist probably wants to downplay the role of AI. Well, OpenAI would like to <em>upplay</em> the role of AI. Um, so I will be expecting something that&#8217;s maybe not a 10 out of 10 impressive, but I&#8217;m looking forward to some 7 or 8 out of 10s impressive before I read this.</p><p><strong>[30:10] Andrey:</strong> Yeah, yeah. So I mean I think we&#8217;re both in agreement. I think the other thing we should mention is that there&#8217;s quite a bit of disagreement about current AI&#8217;s capabilities to do science. I&#8217;ll just give you an anecdote. I have a good friend who is a theoretical cryptographer who is very confidently telling me that AI can&#8217;t do anything truly useful yet for his mathematical research. And there are certainly people, you know... common voices in the media that are AI skeptics like Gary Marcus who, you know, is going to dismiss every single thing that the AI does as trivial.</p><p><strong>[30:57] Andrey (continuing):</strong> And then at the same time, there are obviously people who are just hype masters that are exaggerating all the capabilities. So, so yeah. Let&#8217;s see what happens.</p><p><strong>[31:07] Seth:</strong> I love that. &#8220;Within-paradigm science is trivial. Pre-paradigmatic science is bullshit.&#8221; At the intersection, you have Justified Posteriors. Okay.</p><p><strong>[31:16]</strong> <em>[Music / Transition]</em></p><p><strong>[31:22] Seth:</strong> Okay. So let&#8217;s get to the evidence. It&#8217;s a pretty unusual paper for us. It&#8217;s really a collection of about 10 or 12 anecdotes from different domains. So we see examples from math, physics, astronomy, biology, and material science. Uh. I hate to break it to the audience if you were looking for exciting physics and astronomy, it&#8217;s all basically math. They&#8217;re pretty mathy questions. The physics question is &#8220;solve something about a black hole,&#8221; or that&#8217;s the astronomy question. The physics question is, you know, &#8220;simulate something about a nuclear burn.&#8221;</p><p><strong>[32:00] Seth (continuing):</strong> So I was thinking that I would just kind of pick out some highlights of stuff that jumped out at me. You&#8217;ll interrupt me as we go. All right. So talking first about through some of these math examples. The very first example in the paper&#8212;kind of the warm-up example they give&#8212;this is an example of the AI trying to sort of recreate frontier science. There&#8217;s an example where they ask the AI to establish some sort of upper bound on some sort of maximization process. And the key quote I pulled out is: &#8220;To say it plainly, such a result&#8212;improving from one cutoff to another cutoff&#8212;could probably have been achieved by some experts in the field in a matter of hours, and likely for most experts it would have taken a few days. This is the type of science acceleration that we will see time and time again in the report.&#8221;</p><p><strong>[32:55] Seth (continuing):</strong> So right off the bat, we&#8217;re seeing&#8212;and this is not even <em>new</em> science, this is &#8220;can we recreate an old result that&#8217;s maybe not published or only part of it was published&#8221;&#8212;we&#8217;re not seeing the AI making giant leaps ahead of us. We&#8217;re seeing it completing a key step. And we&#8217;re going to see that over and over again. In this particular example, the AI does not even get to the known best cutoff of 1.7 over L. It only gets to 1.5 over L, over the previously best published 1 over L. L being a parameter in the model that we&#8217;re talking about. So if anything, this is kind of a negative example, or it&#8217;s kind of more of a mixed example. It helped them speed up <em>part</em> of an analysis but maybe not all the way to the frontier.</p><p><strong>[33:45] Andrey:</strong> I just... to me, it&#8217;s actually quite impressive, Seth. That&#8217;s kind of... you just have to remember that these are essentially the top people, the smartest people in the world, right? Like...</p><p><strong>[34:00] Seth:</strong> Sure.</p><p><strong>[34:01] Andrey:</strong> You might say, &#8220;Well, like, maybe it&#8217;s only important to really push beyond their levels.&#8221; But actually, we&#8217;re completely rate-limited on people like this, right? There are very few of them. And so if they&#8217;re able to do things faster, that&#8217;s pretty great for society. And also it means that... like, most of science relies on math, but it doesn&#8217;t rely on <em>frontier</em> math in this way. And so for all of us who are not as good at math, this could be pretty fantastic, right?</p><p><strong>[34:34] Seth:</strong> For us middle-brow theorists.</p><p><strong>[34:35] Andrey:</strong> Yes, exactly. So yeah. To me, this is quite impressive. This is already extremely close to the frontier. And it&#8217;s... you know, it&#8217;s proving results that were not in the literature. So I... yeah. I mean it&#8217;s not like the most deepest result, but this is kind of still pretty great.</p><p><strong>[35:00] Seth:</strong> Well, now let me give you an example where I was really impressed. And maybe you&#8217;ll tell me you&#8217;re less impressed by this one. Which is just its function as a literature review tool. So maybe some of our audience has heard of a famous economist called <strong>Paul Erd&#337;s</strong>, who is kind of famous for having worked with lots and lots of different...</p><p><strong>[35:19] Andrey:</strong> Wait, why did you call him an economist? He&#8217;s not an economist.</p><p><strong>[35:22] Seth:</strong> Did I call him an economist? Mathematician. Excuse me.</p><p><strong>[35:24] Andrey:</strong> He&#8217;s definitely not an economist.</p><p><strong>[35:25] Seth:</strong> I was good. So I assumed... Thank you. Mathematician Erd&#337;s. Who is known for working with lots and lots of mathematicians. And famously people will compare their closeness to him in the same way that people will say &#8220;How many steps am I removed from the Holy Roman Emperor?&#8221; They&#8217;ll say &#8220;How many co-authors away am I from Erd&#337;s?&#8221; Because he&#8217;s worked with everybody in so many different domains.</p><p><strong>[35:50] Andrey:</strong> And famously... famously he took a lot of methamphetamine. And that&#8217;s why he was so productive.</p><p><strong>[35:57] Seth:</strong> A lot of meth. You know, if you do cocaine, you become Stephen King. Meth, you become Erd&#337;s. So, you know, which way Western Man? All right. And so one of the things he left us with before he passed was a long list of sort of what he saw as cool open questions for his students and friends to work on. In this long list, basically the authors of this anecdote took this list, plugged it into the AI and said, &#8220;Hey, here&#8217;s a bunch of these questions that have no known solutions. Can you find solutions to them?&#8221;</p><p><strong>[36:35] Seth (continuing):</strong> And the quote I pulled out here is: &#8220;Locating previously published solutions to 10 problems not previously known&#8221;&#8212;so 10 problems they hadn&#8217;t known&#8212;&#8221;and reported noteworthy partial progress in the existing literature for 10 other problems... and correcting an error in problem 1041.&#8221; And then finally&#8212;I guess we can talk about this now or later&#8212;actually helping them solve a single problem, problem 848. It gave them a big hint and the mathematicians were able to work with it to actually solve problem 848.</p><p><strong>[37:08] Seth (continuing):</strong> So I like this one. It feels like... it feels like super verifiable. It seems super solid. It seems like a super easy win. I don&#8217;t know if it&#8217;s the most <em>exciting</em> use of an AI, but this seems like a super promising, super obvious win.</p><p><strong>[37:27] Andrey:</strong> Yeah. I mean I think it&#8217;s fantastic. I am very skeptical that this can work well outside of mathematics and physics. And the reason is that the more empirical literatures are just littered with terrible research. And like... the literature review problem is not that great. When I think about like when I&#8217;m working on a project... yes, if we have a mathematical problem and we&#8217;re like, &#8220;Oh, is there anything in the literature that kind of shows us how to solve this problem?&#8221; that seems quite useful.</p><p><strong>[38:09] Andrey (continuing):</strong> But it&#8217;s like, has anyone worked on, you know, I don&#8217;t know... I have a paper on privacy. &#8220;Has anyone worked on privacy before?&#8221;</p><p><strong>[38:20] Seth:</strong> Privacy. What&#8217;s the right way to do cookies?</p><p><strong>[38:22] Andrey:</strong> Yeah. I mean like... it&#8217;s fine, you know? Like it&#8217;s good to have some citations in the paper, but yeah. To me, the literature review problem is not that important as part of my work. What do you think?</p><p><strong>[38:39] Seth:</strong> I would push back a tiny bit. Because I find myself, when I&#8217;m reading empirical papers&#8212;you know, we always tell ourselves &#8220;don&#8217;t overlearn from just one paper.&#8221; I kind of feel like it would be awesome if every empirical paper had like a built-in little meta-analysis of &#8220;Here&#8217;s every other paper that&#8217;s related and the effect sizes they found.&#8221; And if that could be automated, it would make reading empirical papers way more fun, right?</p><p><strong>[39:05] Andrey:</strong> Sure. Yeah. I mean, fair enough. I guess... yeah. I guess it&#8217;s a question of what we&#8217;re thinking about. Writing your own paper? Unless it&#8217;s a meta-analysis... maybe not that useful. But just generally learning from the literature, it is very useful. And actually there&#8217;s a very promising tool called <strong>Elicit</strong> which does this sort of literature search. I think it&#8217;s primarily focused on the pharmaceutical domain. So yeah. So I think... yeah. So there is this use case. But I was just reflecting on the fact that for what I personally do in my research, you know, I&#8217;m aware of some of the major papers in my field obviously. But not knowing the literature is not a bottleneck, I don&#8217;t think.<br><br><strong>[40:00] Seth Benzell:</strong> What I think of is Edison, famously... whenever he had an idea for a new invention, he made sure to get a team on making sure it was not invented already because he had gotten burned several times along. Oh, you know, somebody had filed a patent for that 20 years ago and they just never made any of it.</p><p><strong>[40:19] Andrey Fradkin:</strong> Yeah, yeah. No, no. I mean, look, maybe it&#8217;s different in other fields. I... you know, I can only know what I know. Yeah.</p><p><strong>[40:31] Seth:</strong> Sure. Um, maybe one more negative case. There was a mathematical case involving... what are conditions necessary on subsets to make sure that you don&#8217;t get so many subsets that are called cliques? That&#8217;s kind of the level of the math I understood of this problem. They gave ChatGPT the problem, it repeatedly gave them the wrong answer. Eventually, after insisting to ChatGPT it was giving them the wrong answer, it gave them the correct answer... which then they later discovered was already in the published literature and ChatGPT did not give it credit.</p><p><strong>[41:12] Seth (continuing):</strong> So I guess another example here of you really need to be on top of these things and not take their first response as gospel.</p><p><strong>[41:19] Andrey:</strong> Yeah. To me this is such a compliment to doing high-quality work because... you just... if you don&#8217;t have the judgment, it&#8217;s... it so often gives you stuff that&#8217;s wrong, incomplete, and you have to actually have some vision and knowledge to know which parts of the answers to take and which parts not to take.</p><p><strong>[41:43] Seth:</strong> Right. Yeah. So yes. This seems like we are at the level where the AI is making very plausible guesses and you still need an expert sitting on top of it.</p><p><strong>[41:53] Andrey:</strong> Yes.</p><p><strong>[41:54] Seth:</strong> So, Fields Medalist winning mathematician <strong>Timothy Gowers</strong> gives us this take, which I thought was like a really kind of good summary of where it is right now, and kind of inspired my opening joke:</p><p><strong>[42:12] Seth (quoting Gowers):</strong> &#8220;As a research supervisor, I have a rule of thumb for when a contribution I make to the research of one of my PhD students is at the level where I should be a joint author.&#8221;</p><p>Do you know where he&#8217;s from? Should I do an accent? I&#8217;m just gonna... I&#8217;m not gonna do an accent.</p><p><strong>[42:24] Andrey:</strong> He&#8217;s British.</p><p><strong>[42:25] Seth:</strong> He&#8217;s British? Ooh. Okay.</p><p><strong>[42:27] Andrey:</strong> I don&#8217;t... yeah. Let&#8217;s skip the British accent.</p><p><strong>[42:29] Seth:</strong> Okay. Thank you, Andrey. That&#8217;s a gift to you, the listeners at home.</p><p><strong>[42:35] Seth (continuing):</strong> &#8220;The rule is that if the student comes to discuss the problem with me, and I have, in the course of that discussion, an idea that comes more naturally to me than to them, and that turns out to be helpful, then that is not enough for joint authorship. But if I spend time <em>struggling</em> with the problem&#8212;of course, I will only do this if the project is officially a joint one, very propitious as a British man&#8212;and during the course of the struggle... <em>during the course of the struggle</em>, I really love that... I come up with an idea that required more than just standard expertise that I happen to have, that I have made a genuine contribution to the work.&#8221;</p><p><strong>[43:10] Seth (continuing):</strong> &#8220;My experience so far with LLMs is that they are capable of playing with this knowledgeable research supervisor role with me, which can be extremely useful given just how much knowledge they have&#8221;&#8212;this is coming from a Fields Medalist&#8212;&#8221;but they are not yet at the level, or at least have not yet exhibited that level in my own interactions with them, at which a human mathematician who follows my convention above would ask for joint authorship.&#8221;</p><p><strong>[43:34] Seth (continuing):</strong> I mean, it&#8217;s... he&#8217;s kind of playing it down, but this is actually pretty freaking high praise, would you not agree, Andrey?</p><p><strong>[43:40] Andrey:</strong> Yes. Yes. I mean, let&#8217;s just, you know, remind ourselves that whatever graduate students he&#8217;s thinking about are also some of the smartest people in the world. And you know, most... once again, most scientists who work with math have problems that are substantially easier than anything these sorts of people would be working on. Right? And are bottlenecked by it. Right? Like we&#8217;re, you know, bottlenecked maybe temporarily... you know like...</p><p><strong>[44:12] Seth:</strong> Or even permanently.</p><p><strong>[44:13] Andrey:</strong> Or even permanently. It could be either, right? And so yeah, like it&#8217;s essentially saying like, &#8220;Oh, for, you know, 99% of scientists who use math, it&#8217;s already really, really, really, really good.&#8221;</p><p><strong>[44:26] Seth:</strong> It replaces <em>me</em>.</p><p><strong>[44:28] Andrey:</strong> Yeah. And if you&#8217;re like a Fields Medalist, you know, maybe it&#8217;s not as good as <em>you</em> yet.</p><p><strong>[44:35] Seth:</strong> Incredible. Um. I guess... one other kind of little detail I came... I want to pull out here is like the requirement that you have to <em>struggle</em> with it for co-authorship. I think that&#8217;s kind of fun, right? Like, is one of the reasons that maybe AI gets less credit than we should give it is that it seems so effortless?</p><p><strong>[44:56] Andrey:</strong> Yeah. Well, you know, sometimes it&#8217;s like... it&#8217;s interesting, you know in this paper you see that the AI thought for like 20 minutes or whatever. And this is...</p><p><strong>[45:05] Seth:</strong> Yeah, they got the really good version. Just to be clear, so this is using GPT-5.1 Pro, which can have very very long runtimes if you let it.</p><p><strong>[45:13] Andrey:</strong> I think it&#8217;s 5.0 Pro. Just to be clear.</p><p><strong>[45:16] Seth:</strong> 5.0 Pro? 5.0 Pro. Excuse me.</p><p><strong>[45:19] Andrey:</strong> Yeah. But yeah. So this is the frontier reasoning model. This might be the one that&#8217;s... I think that&#8217;s the one that&#8217;s available in the max plan on ChatGPT. But it wasn&#8217;t clear to me whether the scientists here got some special access. They probably did. So yeah, it&#8217;s not really the sort of AI that most people today would be using, but of course, you know, they could be using it, you know, given how fast things move, within the next year.</p><p><strong>[55:51] Seth:</strong> Right, right. So exactly. So as we march down Moore&#8217;s Law, what is available, you know, in pre-release to Fields Medalists diffuses to us proles in... what, a year or so?</p><p><strong>[46:01] Andrey:</strong> Yeah, yeah, yeah. Um. Yeah, so I... I don&#8217;t know. To me, it&#8217;s just really... I mean, I would say it&#8217;s awesome. I mean... I mean, it&#8217;s just... it&#8217;s gonna make us so much more capable. Like, I don&#8217;t know... to me, this is a lot of cause for optimism. Even though it&#8217;s not, you know, it&#8217;s not doing science end-to-end. If that was your, you know, hope, it&#8217;s not there yet. But it&#8217;s already, you know, great.</p><p><strong>[46:33] Seth:</strong> I think one thing I would pull out, and I&#8217;ll emphasize this in our conclusion, is that it seems like one of the bottlenecks on AI itself is the inability to rigorously check its own proofs. And it seems like once we get really good automated translation from these kinds of human-LLM-readable proofs into kind of machine-checkable proofs, you&#8217;ll like multiply this productivity because it&#8217;ll be able to check its own work.</p><p><strong>[46:59] Andrey:</strong> Yes. I... we should also mention, like we haven&#8217;t mentioned yet, but there are several very, very well-funded startups that are working on AI for mathematics. DeepMind is also obviously a leader in this field in addition to OpenAI. So it&#8217;s also kind of one where, you know, as economists we&#8217;re like, &#8220;Wow, there&#8217;s just so much competition and investment that&#8217;s great.&#8221; We&#8217;re bound to get some awesome results in the future, right?</p><p><strong>[47:33] Andrey (continuing):</strong> Yeah, so... so... so I mean one of the interesting things here is that it is really like a chat interface, right? Like you don&#8217;t have to use a specialized mathematical proving language, you don&#8217;t have to interact with that. You can reason with it in, you know, loose terms and then it kind of knows how to interpret it. Maybe some of these other efforts might be a bit more, you know, narrow... you know, very very powerful but more narrow. Yeah.</p><p><strong>[48:02] Seth:</strong> Right. And it seems like the real win is both combining the natural language and the machine-provable code.</p><p><strong>[48:09] Andrey:</strong> Yes. Yeah.</p><p><strong>[48:10] Seth:</strong> Right.</p><p><strong>[48:11] Andrey:</strong> But my vision for all these things is just, of course, that you have AIs calling tools that are other AIs, right? I am very much not in the camp of &#8220;one AI to rule them all end-to-end without tools.&#8221; Like, some people have that vision, but I don&#8217;t... you know, just like a human uses tools, I don&#8217;t see why an AI wouldn&#8217;t use tools. Which might be other AIs, like a human would have research assistants.</p><p><strong>[48:38] Seth:</strong> I guess the only thing I would jump in here with is... right, one thing I&#8217;m always on the lookout for now as we read these papers is like, you know, the <strong>Bitter Lesson</strong> update. So to what extent does the generalist AI that&#8217;s bigger beat the specialist efforts? To what extent is task-specific prompting and scaffolding important versus &#8220;just use better model&#8221;? And I think in each of these examples we really <em>do</em> see task-specific scaffolding being important, prompting iteratively and, you know, in a special way being important. Now of course this is all in the context of a single model, so we can&#8217;t really speak to, you know, versus these other approaches, but something to keep our eyes open for.</p><p><strong>[49:21] Andrey:</strong> Yep.</p><p><strong>[49:22] Seth:</strong> Um, okay. Here&#8217;s an example that I thought was funny because it was like clearly written up by an AI. There was a physics example where they asked the AI to derive known but unpublished results about black hole symmetries. One of the take-out quotes is: &#8220;After about five minutes of internal reasoning, the model incorrectly reported that the equation had no continuous symmetries beyond trivial scalings.&#8221; Then again, we have another example, they prompt the model again, they give it a warm-up problem. With the warm-up problem, the AI is able to solve the full problem.</p><p><strong>[49:59] Seth (continuing):</strong> This is the part that made me think it was definitely written up by an AI. In the implications section, it felt really AI-ish and here was one of the quotes I pulled out: &#8220;AI as symmetry engine. With minimal domain scaffolding, current models can carry out non-trivial Lie symmetry discovery for PDEs&#8221;&#8212;partial differential equations&#8212;&#8221;with non-constant coefficients.&#8221; Okay. Dude, that was an AI sentence. &#8220;AI as symmetry engine.&#8221; What kind of metaphor is that? That&#8217;s an AI metaphor, dude.</p><p><strong>[50:29] Andrey:</strong> Yeah, I mean... I think one of the things that&#8217;s going on in the background that we should say is that scientists using AI to write is just now ubiquitous, right? There was a huge controversy at ICLR, one of the top CS conferences, where just an enormous share of referee reports for papers were written by AI. In fact there&#8217;s a tool, <strong>Pangram</strong>, that has shown very high accuracy at detection of AI writing, and it was used to measure these reviews and just so many of them were written by AIs. So many of the papers are written by AIs.</p><p><strong>[51:15] Andrey (continuing):</strong> So I just think this has to... this is just the new normal, right? Like... and we shouldn&#8217;t be surprised. A lot of scientists... English is not their first language. Even for those who it is a first language, you know, writing is a specialized skill that most people, most scientists, are not very good at. And it&#8217;s a lot easier to have an AI write a draft and you tweak it than to write something from scratch. It&#8217;s not obvious to me how important it is that the human does the writing. I guess I like to do writing because writing is thinking, it&#8217;s a way that I think through problems. But for a lot of things, I don&#8217;t know, let&#8217;s say like form letters and things like that, like why would I waste my time honing my language when I could just have the AI do it? So I&#8217;ll just say like this is a new normal and the viewpoint that we&#8217;re mostly writing for the AIs is also true.</p><p><strong>[52:16] Seth:</strong> Do you want to spell that out for people who might not have heard that phrase before?</p><p><strong>[52:21] Andrey:</strong> Yeah. So I first heard it from <strong>Tyler Cowen</strong>.</p><p><strong>[52:24] Seth:</strong> Andrey&#8217;s favorite economist. Friend of the show.</p><p><strong>[52:30] Andrey:</strong> If you say that, he&#8217;s more likely to retweet you.</p><p><strong>[52:33] Seth:</strong> [Laughs] Yeah, yeah, yeah.</p><p><strong>[52:36] Andrey:</strong> &#8220;Friend&#8221; is, you know, a loose term, but you know, we have had dinner with Tyler and that was a great honor. But yeah, I guess the AIs are sucking in all the writing in the world for their training. You know, they&#8217;re also able to search through content very effectively and will be reading that content as part of forming their answer. And that&#8217;s just happening all the time. It&#8217;s happening much more than humans reading some very niche bit of content like one of our papers, right? And so then you might think that since your primary audience with a lot of writing <em>is</em> the AI, you might want to quote-unquote &#8220;write for the AI.&#8221; That might mean that you don&#8217;t have to write as carefully... or not as carefully, but you might... you know, some of the things to entertain humans might be less important.</p><p><strong>[53:38] Seth:</strong> Poetic function of language.</p><p><strong>[53:39] Andrey:</strong> Yes. Less important for the AIs. And so you get writing like this quote-unquote &#8220;symmetry engine,&#8221; right?</p><p><strong>[53:50] Seth:</strong> [Laughs] Yes. Like... I don&#8217;t know. Okay, maybe. I think language will lose something if metaphors stop being helpful. I think you&#8217;ll just stop dropping metaphors, right? We&#8217;ll just get to purely functional language, right? Because a bad metaphor is worse than no metaphor.</p><p><strong>[54:06] Andrey:</strong> Yeah, yeah. I mean, I guess I guess we&#8217;re gonna see very clearly... like much more clearly delineated communication for humans versus communication for AIs. That... I mean we&#8217;re almost kind of there. I mean papers... if you think about like how much effort most scientists put into writing papers vs. how bad the writing is in most scientific papers... why are we even pretending, you know?</p><p><strong>[54:35] Seth:</strong> Yeah. Anyway, well, very interesting to watch. Um, I had one more example I wanted to pull out, which was the biology example, which I was really excited to read given that so many of these were very math-heavy. In this example, the writers of the anecdote uploaded an experimental figure showing the impact of giving some white blood cells a glucose substitute. Right? So the idea is maybe the white blood cells will do differently if they have glucose versus not glucose, and maybe you could like get them to do something that would cure cancer if you give them more or less glucose.</p><p><strong>[55:12] Seth (continuing):</strong> And one of their results was that they tried both giving it no glucose (or a very low amount of glucose) as well as giving it a treatment which is like a glucose <em>substitute</em>. So there was some goo that was gonna gunk up the glucose receptor so that the cell wouldn&#8217;t be able to eat the glucose. GPT-5 seemed to understand the figure, pointed out hypotheses and potential follow-up experiments to understand why the &#8220;fake glucose&#8221; had a different effect than low glucose.</p><p><strong>[55:40] Seth (continuing):</strong> It suggested some potential mechanisms why. ChatGPT writes: &#8220;A low glucose control partly mimics the effect but is weaker than the fake glucose at equal nominal concentrations, suggesting contributions from glycolysis restriction and N-linked glycolysation interference... a known 2-DG [this is the fake glucose] off-target... rather than energy limitation alone.&#8221; Right? So this seems to have been the key contribution of ChatGPT, is that... like the scientists obviously when they made this result they immediately identified, &#8220;Oh that&#8217;s interesting, the fake glucose seems to have a different effect than the zero glucose.&#8221; The insight that the AI seemed to have had is this particular mechanism, is that there&#8217;s an off-target effect of the fake glucose. And suggested, you know, experiments to follow up&#8212;using a different kind of fake glucose, trying some other treatments that would identify whether that was the correct mechanism.</p><p><strong>[56:42] Seth (continuing):</strong> You know, when I say it that way, it doesn&#8217;t seem <em>that</em> impressive, right? Like the scientists were already pretty close to that. The scientist... at least reading them, they seemed more impressed than like <em>my</em> reading of it was. They write&#8212;the authors write&#8212;&#8221;In retrospect in particular, the proposed mechanism of reduced IL-2 signaling via interference with N-linked glycolysation made clear biological sense because it could directly explain the disinhibition of the Th17 cell differentiation under 2-DG treatment. However, this mechanistic hypothesis had not occurred to us.&#8221;</p><p><strong>[57:17] Andrey:</strong> Yeah, I mean... I mean once again, it&#8217;s a thought partner. You know, if you&#8217;re working with people on a problem, you&#8217;re gonna have conversations with them and different co-authors are gonna come up with ideas that you hadn&#8217;t thought about yet. And you know through iteration, that ultimately creates an artifact which is the research paper. And that&#8217;s kind of a series of things like that. And it&#8217;s very rarely that there&#8217;s kind of one Eureka in this. Or even if there&#8217;s like a main insight, you actually have to like take it very seriously to draw out the implications and so on. A lot of... I actually imagine a lot of people had great ideas that ended up eventually being correct science but they just didn&#8217;t pursue them, right?</p><p><strong>[58:10] Andrey (continuing):</strong> So that&#8217;s kind of how maybe we should think about this. Is that it&#8217;s a thought partner, but it doesn&#8217;t yet have agency to pursue the research.</p><p><strong>[58:21] Seth:</strong> That is so interesting because I came away with this feeling like this is an example of AI as deep literature search, right? Because it seems the problem was pretty well defined, right? Shouldn&#8217;t <em>this</em> have the same effect as <em>that</em>? Do deep literature search to see if there&#8217;s any, you know, off-target effects of either the thing. But maybe that&#8217;s viewing this too narrowly.</p><p><strong>[58:42] Andrey:</strong> Yeah. I just... I&#8217;m not an expert enough to know whether it made a connection across, you know, literature... Right? Like it knows a lot of things. I don&#8217;t know if I&#8217;d call that literature review. Just like a scientist would know a lot of things. And then some of the magic happens when it connects two, you know, previously unrelated concepts. I just... to me, saying it&#8217;s <em>just</em> literature review seems a bit reductionist. You know...</p><p><strong>[59:11] Seth:</strong> &#8220;It&#8217;s just a stochastic parrot, Andrey.&#8221; Okay. Are you ready? Do you have any other examples you want to make sure we highlight? Are you ready to move on to our conclusions and posteriors?</p><p><strong>[59:25] Andrey:</strong> Yeah, let&#8217;s move on to the conclusions. Yep.</p><p><strong>[59:28]</strong> <em>[Music / Transition] &#8212; MOVING TO POSTERIORS</em></p><p><strong>[59:35] Seth:</strong> Okay. So I think these were pretty impressive. I don&#8217;t know if there was any, you know, &#8220;dropping my jaw&#8221; ones. The Timothy Gowers being like, &#8220;This is good enough to be my lazy faculty advisor&#8221; is probably the jaw-drop moment, right?</p><p><strong>[59:48] Andrey:</strong> Yeah. I mean just... I think the credibility of people like him or <strong>Terence Tao</strong> saying that they find it useful... I think in some sense it&#8217;s, you know...<br><br><strong>[60:00] Seth:</strong> This is an OpenAI release selling, you know, for a product that they sell for $200 a month.</p><p><strong>[60:09] Andrey:</strong> Yeah, but I mean... I mean... sure. I... I just... I don&#8217;t know. Like... to me, once again, I&#8217;m going back to my priors. Like it&#8217;s obviously useful for science. You have to be truly incurious or, you know, a Luddite to think that it&#8217;s not.</p><p><strong>[60:28] Seth:</strong> Fair enough. Well, actually, I have a theory about your crypto friend. Is it just that, like, cutting-edge crypto is not published widely? Is there some sense in which, like, crypto research might not be in the dataset as much?</p><p><strong>[60:44] Andrey:</strong> I don&#8217;t think so. I don&#8217;t think so. I think he... I don&#8217;t know. I don&#8217;t want to put words in his mouth. But if I like...</p><p><strong>[60:52] Seth:</strong> He&#8217;s a Luddite.</p><p><strong>[60:53] Andrey:</strong> No, no, no. I think if I had to guess, I think he... he kind of views like some deep... deep theoretical insight as maybe the requirement that he has in mind. And that&#8217;s... that&#8217;s the bar that he has. And...</p><p><strong>[61:08] Seth:</strong> Yeah, it&#8217;s not Einstein. It&#8217;s not inventing new paradigms.</p><p><strong>[61:11] Andrey:</strong> Yes, yes. But I guess... I don&#8217;t know. To me, that&#8217;s...</p><p><strong>[61:17] Seth:</strong> I&#8217;m not Einstein! I&#8217;ll take it!</p><p><strong>[61:19] Andrey:</strong> Yeah, yeah. Yeah. Exactly.</p><p><strong>[61:24] Seth:</strong> Um, okay. Uh, and I... I made this point already but I just want to end here which is... I think my takeaway from here is some sort of automatic translation in between sort of machine-language-provable code and like human-language code seems to be the real bottleneck here before speeding up AI a lot. Or at least math-specific AI.</p><p><strong>[61:48] Andrey:</strong> I really don&#8217;t think that&#8217;s the bottleneck, Seth. I truly don&#8217;t. Um.</p><p><strong>[61:52] Seth:</strong> But it con... we keep on seeing examples of it like it gives the wrong answer and you have to be like, &#8220;Well, I thought about this and it&#8217;s the wrong answer,&#8221; and then it does that five times and then it gives you the right answer. We see like three examples of that here.</p><p><strong>[62:05] Andrey:</strong> I... I guess like... this is one... I guess &#8220;bottleneck&#8221; seems like a weird word to me given that there&#8217;s a parallel...</p><p><strong>[62:14] Seth:</strong> Accelerant.</p><p><strong>[62:15] Andrey:</strong> I&#8217;m not... I... okay. There&#8217;s a para... there&#8217;s essentially parallel efforts to... certain things <em>can</em> be formalized in these <strong>Lean provers</strong>. And imagining an OpenAI... like a... like a GPT-like model calling the Lean model is like trivial. Like I... I&#8217;m not saying it&#8217;s trivial like clearly like... I don&#8217;t...</p><p><strong>[62:43] Seth:</strong> If it&#8217;s trivial, why does it keep on giving us wrong answers?</p><p><strong>[62:45] Andrey:</strong> Because OpenA... because I actually think that like the way this system is designed, it&#8217;s kind of using GPT by itself. But actually... my sense is that people in the field who are pushing the envelope are combining these tools. And if you look at DeepMind&#8217;s tools, they&#8217;re not... they don&#8217;t work like this. They <em>are</em> using the formal provers. And so to call it a bottleneck is like implies that like, &#8220;Oh, like actually no one has this working yet.&#8221; And I... and I actually... I... I bet that some people have this working. It&#8217;s... I don&#8217;t think... not... I&#8217;m not sure whether everything can be formalized in these specialized proving languages in the same way. But yeah.</p><p><strong>[63:34] Seth:</strong> It&#8217;s a limitation in <em>these</em> examples, but you&#8217;re saying it&#8217;s not a limitation, you know, tomorrow if you wanted to use the cutting-edge tool.</p><p><strong>[63:41] Andrey:</strong> Yes, yeah. That... that&#8217;s... that&#8217;s my sense. But you know, if listeners disagree, you know, feel free to let us know. Yeah.</p><p><strong>[63:48] Seth:</strong> Yeah, please call in. Okay. Um. Posteriors? Or any other limitation comments you want to make?</p><p><strong>[63:55] Andrey:</strong> No. I... yeah. I mean I...</p><p><strong>[63:57] Seth:</strong> Posteriors. Yeah.</p><p><strong>[63:58] Andrey:</strong> Yeah. I mean I... I don&#8217;t know. Like our... our priors were very loose so I don&#8217;t know the posteriors. I mean I think... yeah. I mean I... you know, I stand by what I say here. I found these examples quite interesting. And it was uh...</p><p><strong>[64:14] Seth:</strong> Okay. So paradigm-wise, you&#8217;re still in the same place? That you think it&#8217;ll be co-working with it today and co-working with it in five years?</p><p><strong>[64:21] Andrey:</strong> Yep.</p><p><strong>[64:22] Seth:</strong> I said right now it&#8217;s super powerful for lit reviews&#8212;deep literature reviews&#8212;and um, maybe we&#8217;re... you know, in five years we will be all the way to AI on its own, at least for math problems. I come away reading this thinking we&#8217;re closer to AI on its own for frontier math research than before reading this. Uh, it really does... and again, I call what I said as a bottleneck or say that it&#8217;s already been removed... but I mean it seems like if this... what we see described here, plus the AI being able to iteratively check itself and just like redo the math... try another approach if it disproves itself... seems like you should be able to just let that fly and find a bunch of cool stuff.</p><p><strong>[65:13] Andrey:</strong> Yeah. And if... if you... if you look at prediction... you know, various forecasts, we see forecasts for by 2030 the Millennium Problems being solved with AI. So... uh, that&#8217;s not a very un...</p><p><strong>[65:28] Seth:</strong> AI is gonna solve the Riemann Hypothesis? That&#8217;s more of a question about the Riemann Hypothesis than AI.</p><p><strong>[65:32] Andrey:</strong> Well, you know. People who are experts, a decent chunk of them forecast that this will happen. So, yeah.</p><p><strong>[65:40] Seth:</strong> Okay. And how impressed were we by the most impressive result? I said we were gonna... I was gonna be like 7 out of 10 impressed, 8 out of 10 impressed. I think that&#8217;s kind of where I end up. If not like a little bit <em>below</em> that. Um, in the sense that I&#8217;m not saying that these mathematical results aren&#8217;t super impressive, but I was hoping for like, &#8220;And we discovered something that was like a treatment we can use tomorrow,&#8221; or &#8220;We discovered...&#8221; I was hoping for something that was kind of more directly practical from at least one of these examples.</p><p><strong>[66:13] Andrey:</strong> Yeah. I mean, to me, if there was something that was very practical, that would be like a 9 out of 10 or 10 out of 10. And you know. Uh, but I... yeah. Once again, I think like nothing blew my mind, but it all seems like we&#8217;re... we&#8217;re... we&#8217;re on the path to this being a very transformative technology for science. Yeah.</p><p><strong>[66:36] Seth:</strong> Yeah. Super, super excited to talk to <strong>Ben Golub</strong> about the AI research tool that he&#8217;s working on. Um, and uh, listeners at home, let us know: How do you use AI in your science or in your life? Post it in the comments, share, comment, and subscribe. All right.</p><p><strong>[66:56] Andrey:</strong> Well, until next time. Keep your posteriors justified.</p><p><strong>[67:00]</strong> <em>[Music fades out]</em></p><div><hr></div><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[One year of justifying our posteriors]]></title><description><![CDATA[For the past year, Seth Benzell and I have been running a particular type of experiment on ourselves with Justified Posteriors, our podcast.]]></description><link>https://empiricrafting.substack.com/p/one-year-of-justifying-our-posteriors</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/one-year-of-justifying-our-posteriors</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Sun, 11 Jan 2026 19:57:19 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!v7wr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>For the past year, Seth Benzell and I have been running a particular type of experiment on ourselves with Justified Posteriors, our podcast. Can we behave like good Bayesian learners about research by stating our priors ex-ante, carefully reading  papers, and then reporting how we&#8217;ve updated our beliefs? This has turned out to be more complicated and more interesting than it seems, something I reflect on in the rest of this essay.</p><p>A foundational assumption of Justified Posteriors is that the claims made in published research papers and other intellectual work do not directly correspond to what we believe after reading them. This should be obvious to anyone who has seriously engaged with intellectual work. But what is less obvious is the degree of the gap between the claims in the work and the beliefs of the reader. Is there a slow accumulation of evidence (a vast literature, as one will read in formulaic introductions) that gradually moves our beliefs from zero to one? Or perhaps there is a critical moment, where one paper causes a rethinking of all that came before it, leading to a new conclusion. <br><br>We could dredge through the history of science, as our predecessors Popper, Kuhn, and Lakatos have, to come up with examples of both. We idealize the pivotality of Einstein&#8217;s <a href="https://en.wikipedia.org/wiki/Tests_of_general_relativity">tests of general relativity</a>. The evidence we have to deal with is much muddier. We live in a time where claims are circulated as a global pastime. Sometimes these findings come with the trappings of academic prestige and peer review, while other times they come in the form of a polemic dropped like a nuclear bomb into the memesphere, as those who have situational awareness may understand.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Justified Posteriors! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>Few have time to read deeply, and even thinking seems like one of those lines in a todo list that is never crossed out. Consider the ubiquitous evals used in AI research and cited throughout social media. The number of people who have read the underlying methodology for each eval is minuscule. The ignorance is so vast that people don&#8217;t know how few <a href="https://shash42.substack.com/p/how-to-game-the-metr-plot">samples</a> are in each eval, let alone the confidence intervals. And yet, a careful evaluation of a new eval such as GDPVal can update our priors by a lot.</p><p>This is the water we swim in with Justified Posteriors. The premise for the show seemed simple, but nothing is as simple as it seems. For one, how do we pick a prior, especially without reading the paper? A conceit of the podcast is that we form our priors with zero information about the paper, but even to pick a paper we need to know something about it. Picking a prior turns out to be one of the topics which we struggle with the most. </p><p>What are we supposed to learn from a theory paper such as Ide and Talamas&#8217;s &#8220;<a href="https://empiricrafting.substack.com/p/ai-and-its-labor-market-effects-in">Artificial Intelligence in the Knowledge Economy?</a>&#8221; A theorist might be satisfied by learning whether this is a useful way of modeling the phenomenon. But we try to translate these into more empirical statements, such as &#8220;what percentage of US workers will have managing or creating teams of AI agents as their main job within 5 years.&#8221; Typically, we don&#8217;t update a lot.</p><p>Of the 22 <a href="http://justifiedposteriors.com">episodes</a> in which we had at least some semblance of priors, the biggest update came for Seth in the <a href="https://empiricrafting.substack.com/p/did-metas-algorithms-swing-the-2020">episode</a> about &#8220;How do social media feed algorithms affect attitudes and behavior in an election campaign?&#8221; The randomized control trial evidence on political beliefs convinced him that whether an algorithmic feed or a reverse chronological feed was shown to a user did not affect their political polarization. I already had this as my prior, given prior literature. </p><p>Nonetheless, neither of us were willing to update much on the larger claims. The reason is that, as always, the real world is complicated. For example, the paper did not study decisions to moderate content, a process which can be algorithmic but which differs from the algorithmic feed. The paper also did not consider truly directed algorithmic interventions, such as those by Elon Musk on X. We can&#8217;t read this paper and just say that algorithmic feeds are not an important determinant of political beliefs. </p><p>For me, the biggest update came in the <a href="https://empiricrafting.substack.com/p/can-ai-make-better-decisions-than">episode</a> &#8220;Can AI Make Better Decisions than Doctors?&#8221; I came in skeptical that AI could overcome the fundamental problems of causal inference without a randomized control trial. The evidence in the paper strongly updated me toward believing we should be more aggressive in inserting AI into ER decisions.</p><p>Interestingly, papers on more macro topics caused smaller updates even if they had much greater implications. Our first episode was fittingly about the now famous <a href="https://situational-awareness.ai/">Situational Awareness</a> document written by Leopold Aschenbrenner in June 2024. We didn&#8217;t have explicit priors, but we thought that AGI was further away than 5 years. We also thought AI was super important and that some of the predictions were plausible. We joked about buying NVIDIA, and didn&#8217;t (we were fools). To me, this episode highlights how easy it is to be directionally right, to read the right materials, but to not take ideas seriously enough. The arguments in the paper about power generation and data centers have especially proven correct. And if you squint, we&#8217;re following the timeline predictions closely even to this day. Claude Code with Opus 4.5 seems to be just on time for Aschenbrenner&#8217;s prediction of a proto-automated-engineer in 2026/2027. </p><p>A common theme in our discussions of papers about the economics of AI is that they are often measuring transitory phenomena, such as changes in productivity or performance at a particular point in time. An extreme example of this is the &#8220;<a href="https://empiricrafting.substack.com/p/the-simple-macroeconomics-of-ai">Simple Macroeconomics of AI</a>&#8221; by Daron Acemoglu, which assumes that AI will stay as good as it was in 2024. These papers are often underwhelming, even when they are well-crafted, because what everyone really cares about is what will happen in the future. </p><p>Much of my learning has come through conversation about the paper, rather than just by reading the paper. My updates would be very different if I read the paper without talking with Seth about it. This is reminiscent of  an academic seminar, in which a group of colleagues focus exclusively on one paper presented by a speaker. Attendees of seminars will know that oftentimes the most interesting part of seminar day occurs in the hallway conversations afterwards, when people share their opinions and discuss. One can tell how serious an academic department is by the quality of the hallway discussion.</p><p>This brings me to the next topic, the validity of podcasting as a worthwhile intellectual pursuit for a professor. I am supposed to primarily demonstrate my work on an intellectual topic by writing papers published in top journals. Yet to me it is obvious that we are doing valuable and original work in reading these papers and interpreting them through broader lenses than just the minimum publishable unit. For each episode, we have to understand literatures, engage deeply with evidence, and reason through the implications. This sort of work is something top researchers often do prior to starting new research projects, but is rarely shared outside of side conversations or lab meetings. What Seth and I do is a valid and valuable intellectual activity, not substantively different from writing a paper or a book. </p><p>One of the great pleasures of doing the podcast is hearing from our awesome readers and listeners! In the coming year, our goal is to improve the quality of our work by increasing our preparation, improving our audio and video quality, and by bringing on insightful guests. I am excited to continue covering emerging measurements of the AI economy and theoretical frameworks related to the impact and diffusion of AI. As always, we would love to hear from you with any feedback.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!v7wr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!v7wr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!v7wr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!v7wr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!v7wr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!v7wr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2536016,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://empiricrafting.substack.com/i/183513705?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!v7wr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!v7wr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!v7wr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!v7wr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1a8a8a31-3952-4353-a5d6-0805223964ae_1536x1024.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>Thanks to Seth Benzell for comments and for being a great co-host.</em></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Justified Posteriors! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Ben Golub: AI Referees, Social Learning, and Virtual Currencies]]></title><description><![CDATA[And yes, we talk about eigenvalues and cow-tipping!]]></description><link>https://empiricrafting.substack.com/p/ben-golub-ai-referees-social-learning</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/ben-golub-ai-referees-social-learning</guid><dc:creator><![CDATA[Andrey Fradkin]]></dc:creator><pubDate>Mon, 29 Dec 2025 13:00:51 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/182718917/14de72ce15610957235da1c20b4097d9.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>In this episode, we sit down with <a href="https://en.wikipedia.org/wiki/Benjamin_Golub">Ben Golub</a>, economist at Northwestern University, to talk about what happens when AI meets academic research, social learning, and network theory.</p><p>We start with Ben&#8217;s startup <a href="http://Refine.ink">Refine</a>, an AI-powered technical referee for academic papers. From there, the conversation ranges widely: how scholars should think about tooling, why &#8220;slop&#8221; is now cheap, how eigenvalues explain viral growth, and what large language models might do to collective belief formation. We get math, economics, startups, misinformation, and even cow tipping.</p><h2><br>Links &amp; References</h2><ul><li><p><strong><a href="https://www.refine.ink">Refine</a></strong> &#8212; AI referee for academic papers</p></li><li><p><strong><a href="http://harmonic.fun">Harmonic</a></strong> &#8212; Formal verification and proof tooling for mathematics</p></li><li><p><strong><a href="https://web.stanford.edu/~jacksonm/">Matthew O. Jackson</a></strong> &#8212; Stanford economist and leading scholar of networks and social learning</p></li></ul><ul><li><p><strong><a href="https://www.scientificamerican.com/article/can-you-tip-a-cow/">Cow tipping (myth)</a></strong> &#8212; Why you can&#8217;t actually tip a cow (physics + folklore)</p></li><li><p><strong><a href="https://www.hachettebookgroup.com/titles/sinan-aral/the-hype-machine/9780316539963/">The Hype Machine</a></strong> &#8212; Sinan Aral on how social platforms amplify misinformation</p></li><li><p><strong>Sequential learning / information cascades</strong> / <strong><a href="https://en.wikipedia.org/wiki/DeGroot_learning">DeGroot Model</a></strong></p></li><li><p><strong><a href="https://www.aivillage.org">AI Village</a></strong> &#8212; Multi-agent AI simulations and emergent behavior experiments</p></li><li><p><strong>Virtual currencies &amp; Quora credits</strong> &#8212; Internal markets for attention and incentives</p></li></ul><p></p><h3><strong>Transcript:<br></strong></h3><p>Seth:  Welcome to Justified Posteriors, the podcast that updates its beliefs about the economics of AI and technology.</p><p>Seth: I&#8217;m Seth Benzel, hoping my posteriors are half as good as the average of my erudite Friends is coming to you from Chapman University in sunny Southern California.</p><p>Andrey: And I&#8217;m Andrey Fradkin coming to you from San Francisco, California, and I&#8217;m very excited that our guest for today is Ben Goleb, who is a prominent economist at Northwestern University. Ben has won the Calv&#243;-Armengol International Prize, which recognizes a top researcher in economics or social science, younger than 40 years old, for contributions to theory and comprehension of mechanisms of social interaction.</p><p>Andrey: So you want someone to analyze your social interactions, Ben is definitely the guy.</p><p>Seth: If it&#8217;s in the network,</p><p>Andrey: Yeah, he is, he was also a member of the Harvard Society of Fellows and had a brief stint working as an intern at Quora, and we&#8217;ve known each other for a long time. So welcome to the show, Ben.</p><p>Ben: Thank you, Andrey. Thank you, Seth. It&#8217;s wonderful to be on your podcast.</p><p><strong>Refine: AI-Powered Paper Reviewing</strong></p><p>Andrey: All right. Let&#8217;s get started. I want us to get started on what&#8217;s very likely been the most on your mind thing, Ben, which is your new endeavor, Refine.Ink. Why don&#8217;t you tell us a little bit about, give us the three minute spiel about what you&#8217;re doing.</p><p>Seth: and tell us why you didn&#8217;t name your tech startup after a Lord of the Rings character.</p><p>Ben: Man, that&#8217;s a curve ball right there. All right, I&#8217;ll tell you what, I&#8217;ll put that on background processing. So, what refine is, is it&#8217;s an AI referee technical referee. From a user perspective, what happens is you just give it a paper and you get the experience of a really obsessive research assistant reading for as long as it takes to get through the whole thing, probing it from every angle, asking every lawyerly question about whether things make sense.</p><p>Ben: And then that feedback, hopefully the really valuable parts that an author would wanna know are distilled and delivered. So as my co-founder Yann Calv&#243; L&#243;pez puts it, obsession is really the obsessiveness is the nature of the company. We just bottled it up and we give it to people. So that&#8217;s the basic product&#8212;it&#8217;s an AI tool. It uses AI obviously to do all of this thinking. One thing I&#8217;ll say about it is that I have long felt it was a scandal that the level of tooling for scholars is a tiny fraction of what it is for software engineers.</p><p>Ben: And obviously software engineering is a much larger and more economically valuable</p><p>Seth: Boo.</p><p>Ben: least</p><p>Andrey: Oh, disagree.</p><p>Ben: In certain immediate quantifications. But I felt that ever since I&#8217;ve been using tech, I just felt imagine if we had really good tools and then there was this perfect storm where my co-founder and I felt we could make a tool that was state of the art for now. So that&#8217;s how I think of it.</p><p>Seth: I have to quibble with you a little bit about the user experience because the way I went, the step zero was first, jaw drops to the floor at the sticker price. How much do you,</p><p>Ben: not,</p><p>Seth: But then I will say I have used it myself and on a paper I recently submitted, it really did find a technical error and I would a kind of error that you wouldn&#8217;t find, just throwing this into ChatGPT as of a few months ago. Who knows with the latest Gemini. But it really impressed me with my limited time using it.</p><p>Andrey: So.</p><p>Ben: is probably, if you think about the sticker price, if you compare that to the amount of time you&#8217;d have, you&#8217;d have had to pay error.</p><p>Seth: Yeah. And water. If I didn&#8217;t have water, I&#8217;d die, so I should pay a million for water.</p><p>Andrey: A question I had: how do you know it&#8217;s good? Isn&#8217;t this whole evals thing very tricky?</p><p>Seth: Hmm.</p><p>Andrey: Is there Is there, a paper review or benchmark that you&#8217;ve come across, or did you develop your own?</p><p>Ben: Yeah. That&#8217;s a wonderful question. As Andrey knows, he&#8217;s a super insightful person about AI and this goes to the core of the issue because all the engineers we work with are immediately like, okay, I get what you&#8217;re doing.</p><p>Ben: Give me the evals, give me the standard of quality. So we know we&#8217;re objectively doing a good job. What we have are a set of papers where we know what ground truth is. We basically know everything that&#8217;s wrong with them and every model update we run, so that&#8217;s a small set of fairly manual evaluations that&#8217;s available. I think one of the things that users experience is they know their own papers well and can see over time that sometimes we find issues that they know about and then sometimes we find other issues and we can see whether they&#8217;re correct.</p><p>Ben: We&#8217;re not at the point where we can make confident precision recall type assessments. But another thing that we do, which I find cool, was whenever tools that our competitors come out, like Andrew Ng put out a cool paper reviewer thing targeted at CS conferences.</p><p>Ben: And what we do is we just run that thing, we run our thing, we put both of them into Gemini 2.0, and we say, could you please assess these side by side as reviews of the same paper? Which one caught mistakes? We try to make it a very neutral prompt, and that&#8217;s an eval that is easy to carry out.</p><p>Ben: But actually we&#8217;re in the market. We&#8217;d love to work with people who are excited about doing this for refine. We finally have the resources to take a serious run at it as founders. The simple truth is because my co-founder and I are researchers as well as founders, we constantly look at how it&#8217;s doing on documents we know.</p><p>Ben: And it&#8217;s a very seat of the pants thing for now, to tell the truth.</p><p>Andrey: Do you think that there&#8217;s an aspect of data-driven here and that one of your friends puts their paper into it and says, well, you didn&#8217;t catch this mistake, or you didn&#8217;t catch that mistake, and then you optimize towards that. Is that a big part of your development process?</p><p>Ben: Yeah, it was more. I think we&#8217;ve reached an equilibrium where of the feedback of that form we hear, there&#8217;s usually a cost to catching it. But early on that was basically, I would just tell everyone I could find, and there were a few. When I finally had the courage to tell my main academic group chat about it and I gave it, immediately people had very clear feedback and this was in the deep, I think the first reasoning model we used for the substantive feedback was DeepSeek R1 and people, we immediately felt, okay, this is 90% slop.</p><p>Ben: And that&#8217;s where we started by iterating. We got to where, and one great thing about having academic friends is they&#8217;re not gonna be shy to tell you that your thought of paper.</p><p><strong>Refereeing Math and AI for Economic Theory</strong></p><p>Andrey: One thing that we wanted to dig a little bit into is how you think about refereeing math and</p><p>Seth: Mm-hmm.</p><p>Andrey: More generally opening it up to how are economic theorists using AI for math?</p><p>Ben: So say a little more about your question. When you say math</p><p>Seth: Well, we see people, Axiom, I think is the name of the company, immediately converting these written proofs into Lean. Is that the end game for your tool?</p><p>Ben: I see, yes. So good. Our vision for the company is that, at least for quite a while, I think there&#8217;s gonna be this product layer between tools, the core AI models and the things that are necessary to bring your median, ambitious</p><p>Seth: Middle</p><p>Ben: not</p><p>Seth: theorists, that&#8217;s what we call ourselves.</p><p>Ben: Well, yeah. Or middle, but in a technical dimension, I think it&#8217;s almost certainly true that the median economist doesn&#8217;t use GitHub almost ever. If you told them, they set up something that, a tool that works through the terminal, think about Harmonic, right?</p><p>Ben: Their tools are all, they say the first step is, go grab this from a repository and run these command line things to, they try to make it pretty easy, but it&#8217;s still a terminal tool. So a big picture vision is that we think the most sophisticated tools will be, there will be a lot of them that are not yet productized and we can just make the bundle for scholars to actually use it in their work.</p><p>Ben: Now about the question of formalization per se, I have always been excited to use formalization in particular to make that product experience happen. For formalized math, my understanding is right now the coverage of the auto formalization systems is very jagged across, even across. If you compare number theory to algebraic geometry, the former is in good shape for people to start solving Erd&#337;s problems or combinatorial number theory, things like that, people can just start doing that. For algebraic geometry, there are a lot of basics that aren&#8217;t built out and so all of the lean proofs will contain a lot of stories that the user has to say, am I fine considering that settled or not?</p><p>Ben: And that&#8217;s not really an experience that makes sense for someone trying to check their econometric draft, right? So we&#8217;re watching and I think as soon as we feel it&#8217;s the moment when we can take the typical, say economic theory proof and give a rigorous certification, we&#8217;ll be right on.</p><p>Ben: I would like us to be in a position to be right on top of it.</p><p>Seth: I blame Grothendieck for algebraic geometry being hard to formalize, hard to make into Lean.</p><p>Andrey: Even short of things like Harmonic, right? It&#8217;s certainly you can get useful things of putting some math or asking for some math from Gemini for example. How are people in the field using those tools and have you noticed that has affected the type and quality of economic theory you&#8217;re seeing?</p><p>Ben: Oh yeah. That&#8217;s zooming out from refine. I&#8217;m obviously a heavy user of AI tools for my own research. I think broadly we&#8217;re seeing two phenomena play out in parallel. It&#8217;s a lot easier, this idea that went viral a few weeks ago of work slop being much easier to produce. I think there is an experience, which I&#8217;ve experienced myself, where you owe your co-author something and you have some ideas, you&#8217;ve done some real work, but it&#8217;s much easier to put a section in the paper that is AI written that looks a lot that our natural checks see as real work. And that introduces obviously new kinds of risk. It makes work faster in some ways and more fragile in others. And I think about that a lot. By the way, one of the main new values of refine is as people are perhaps less moment to moment engaged with the exact, or less line by line engaged with their work, which AI is doing. They need that global eye and that obsessive look, which used to be more in one&#8217;s own head. But that&#8217;s the negative phenomenon. But I think in terms of having a pretty expert consultant in things you don&#8217;t usually work on just for getting started and forgetting ideas.</p><p>Ben: I can already see major gains in my own research. One thing I would be curious to see is just looking at measures of production of scientific literature. We should see something on speed that&#8217;s visible in we should see signs of science speeding up in the areas which are particularly sped up.</p><p>Ben: And I, it would be fun to formulate a hypothesis like where should we be looking to see that</p><p>Seth: Right. We recently recorded an episode, the open AI paper on early uses of AI in social science. And it seems to us one of the most obvious immediate use cases is just, can I find if somebody already proved this and I could just plug it in? Right.</p><p>Andrey: to be clear, not social science, but mathematics.</p><p>Seth: mathematics. Excuse me.</p><p>Seth: Yeah. Yeah. Science, science is,</p><p>Ben: Physics. So yeah,</p><p>Andrey: Yes, exactly.</p><p>Seth: Andrey always calls me out that I say economics or social science when he really means, when I really mean actual science.</p><p>Andrey: Just to be clear, there were</p><p>Ben: important. Yeah,</p><p>Andrey: A bunch of math in that paper, which is very cool.</p><p>Ben: This is known. I think economic theory, it&#8217;s important to me about economic theory that there is really such a thing that&#8217;s called economic theory, very distinct from math. Usually, unless something is going wrong, you don&#8217;t need to do any interesting math.</p><p>Ben: In an economic theory paper, you just find the relevant. So I think a lot of economic theorists who are successful and good at it, a lot of the trade is finding the right thing, learning enough of it to make it valuable for your application and just using it correctly. And that&#8217;s where that search problem is really accelerated. So I&#8217;m with Seth that there&#8217;s gonna be a huge speed up just for maybe not as, it&#8217;s not super intelligence. It&#8217;s better search, but that&#8217;s huge.</p><p>Andrey: So one economic theorist that I&#8217;ve talked with about this is Joshua Gans. I don&#8217;t know if you&#8217;ve had a chance to talk to him, but he&#8217;s been writing a paper a week,</p><p>Seth: Right. The guy, he is grinding him out with the AI help</p><p>Andrey: Is there some sort of weird proof of work thing that&#8217;s starting to fail? Because look, writing down theories of almost anything, it was, it took a lot of work, but you could, there was a recipe, right?</p><p>Andrey: As an</p><p>Seth: you can mathematize Marx right. The fact that I can rewrite marks in math doesn&#8217;t necessarily make Marx good.</p><p>Andrey: Yeah.</p><p>Andrey: So how do you think about that and what do you think are gonna be directions in economic theory that are really changing the game as a result of this?</p><p><strong>AI, Work Slop, and the Future of Economic Theory</strong></p><p>Ben: Yeah. You raise an interesting point. You can think of one vision of what social science is, or what economic theory is, that&#8217;s suggested by what you just said, which is that we&#8217;re commentators on social reality and we&#8217;ve developed a particular style of doing that, which involves, in the case of modern economic theory, a lot of math and the proof of work.</p><p>Ben: There&#8217;s almost an equilibrium where you, in order to say something, you have to really carefully and write well in English, but also do this mathematics and now that, at least superficially can be totally hacked, is that gonna stop? Is that gonna make the commentary aspect of economic theory lower signal in some sense?</p><p>Ben: Is it going to, and that&#8217;s a great question. So let me table that for a second and say what? I have a thought on this topic that&#8217;s related to that. If you&#8217;re really good at that and you produce these really jewel like economic theories and then suddenly everybody can write slop and produce economic theories that at least take a while to distinguish from your beautiful ones, then maybe you feel sad, like your art has been degraded.</p><p>Ben: And I do think that&#8217;s the way poets, I think. I talked to some people who are very interested in the experience of artists with AI and I think that&#8217;s an artist&#8217;s experience with AI. Then there&#8217;s another kind of person I have in mind, which is an idealized cancer biologist.</p><p>Ben: And you tell them, oh, your jewel like blot analysis that you do or whatever. Now they&#8217;re gonna be automated. And I think this guy&#8217;s first reaction is mostly not, oh, how will people be able to admire my art? Will people still appreciate my art as much or what will I do with my time?</p><p>Ben: But they&#8217;re like, oh shit, we might move faster toward curing cancer. So one thing I think is wrong broadly with economic theory is that there are a lot of us whose reactions fall more into the artist category. And I would like, I think economic theory is not done. In fact, it&#8217;s quite bad what we&#8217;ve achieved on the whole.</p><p>Ben: So we should be</p><p>Seth: excluded of course.</p><p>Ben: Yeah. So as a group, as a community, right? And so if we, I would hope that we have it in us to say, look, now we have these incredible tools to take a run at questions that are really where the solution would be genuinely valuable.</p><p>Ben: And we could really try to do them better. And we have this huge resource now. I would like it to be, I would be happier about us if we had more of that reaction. I&#8217;m hoping that there will be parts of the profession, parts of the enterprise that grow and accelerate, because they&#8217;re driven by that as opposed to hand wringing over the art problems.</p><p>Seth: Right. And it seems like you could always add some more, get gatekeepers on the backend. Right? If we just make it easier to enter with, here&#8217;s my mathie paper. And the concern is you get too much slop. Maybe there is some way to filter. You don&#8217;t have to filter on the math anymore. You filter on something else.</p><p>Ben: Totally. All of these offensive weapons are also closely related to defensive weapons. So there&#8217;s a whole, and refine is obviously a natural, we think about that, that we can, at least, at minimum, we can help reject slop that&#8217;s written by cheap models without much skill and maybe we can help</p><p>Seth: How do you defeat slop? How do you defeat slop with bitter slop?</p><p>Ben: Yeah,</p><p>Andrey: Have you talked with some editors? Is there interest here?</p><p>Ben: Yeah. So Refine is doing pilots with several of the very top journals in economics. And we&#8217;ve been really encouraged by, I think because a lot of the editors are super genuinely pro-social people who want to take the tech, who wanna bring technology to bear as fast as possible, to improve the profession.</p><p>Ben: And so we, and I think there&#8217;s a feeling that they have that&#8217;s correct. That this phenomenon is here, and so the best way for the journals, for example, to deal with it is to be as up on it as anybody. And so we, I think the main use that is the easiest sell is just final due diligence right before publication at the conditional accept stage.</p><p>Ben: Can we make sure that papers are, any remediable, any mistakes that the author would be embarrassed to have published, the author has a chance to learn about it. Correct. That&#8217;s, everybody agrees with that. I think there&#8217;s a lot more design required to do it thoughtfully when stuff is incoming.</p><p>Ben: I have heard experiences from editors using REFINE and other tools. When they get a submission that they&#8217;re very suspicious about, they can just quickly run it through refine, see that there seem to be, and they&#8217;re usually experts in, right? So they can see, oh, this is surfacing really serious errors.</p><p>Ben: Now I can, for example, desk reject it with a lot more confidence. So we&#8217;ve, that experience does happen. That&#8217;s purely people&#8217;s own use of the tools, but.</p><p>Andrey: Are you worried that your tool is fundamentally, it&#8217;s interesting. Like many economists, it&#8217;s a tool of rather than constructivism in that it&#8217;s very good at finding problems. But is it ever gonna be, well, this is not a perfect paper, but it&#8217;s a beautiful paper nonetheless.</p><p>Seth: GPT-4o if you wanna sycophant to Andrey.</p><p>Ben: Actually, one thing we think a small version of that, and I&#8217;m curious for your guys&#8217; sometimes refined produces, you give it a 50 page manuscript and it produces six comments. In fact, one of our engineers recently switched. He said, we switched to a new, we did some model upgrades.</p><p>Ben: And then he looked at it and he said, this only produced six comments. And it was on a paper by one of our friends who had been through refine and all the mistakes were gone. And so he was like, oh, it went from, if I just run this on the dumber models, they give me 50. Now it&#8217;s six.</p><p>Ben: And that was actually good because the feature question we have is in that case, should we tell the author, Hey, this has fewer things we can see wrong than 95% of papers. Right? That&#8217;s turns this question mark experience into maybe something encouraging. So we haven&#8217;t rolled that.</p><p>Ben: I&#8217;m curious if you guys think such a badge would be pleasant for an author.</p><p>Seth: Question mark experience.</p><p>Andrey: I, I, think you should, well, you should obviously run the experiment,</p><p><strong>Viral Processes and the Refine Referral Program</strong></p><p>Seth: Uh, maybe an interesting place to start is this referral program that you came up with. So where did that come from? Why did you design it the way you did?</p><p>Andrey: You just, well explain it first. Yeah. I think that&#8217;ll be the first. Yeah.</p><p>Ben: what we have, we actually, we, through the end, through the end of Decem through the end of November, we ran our, our first iteration of our referral program, which we will keep, which will tune and keep running, in various guises. And the way the program works is you, if you refer a friend, if you want to refer friends, you get a referral link from the site. You can share that with anyone you want. And every time somebody, if somebody that you refer ends up actually, paying for a full refine review, at least one, they, they get a full bonus review and you, the referer get one. So we, our, our top reviewers, I don&#8217;t think you&#8217;ll mind me sharing &#8216;cause he, he told, he basically told everyone he knew, but Joshua Gans, he, he was, he&#8217;s like, I think he has like 35 credits now because he just kept referring and</p><p>Seth: God bless.</p><p>Ben:because my co-founder, my co-founder and I were talking and we&#8217;re like, this is than we expected, should we&#8217;d be worried about.</p><p>Ben: So we were like, no, this is only good. This is, there&#8217;s nothing to be stressed out. Um, he can have, he can have lifetime refined use, free for, for being such a good, but that&#8217;s what, so I think economically, I think there are two thing. One, one immediate thing to think about is that some people are gonna be really good ambassadors for your product, but you don&#8217;t know who they are.</p><p>Ben: There&#8217;s an information problem and a referral to the extent, and interestingly, they&#8217;re the ones who are gonna value the credits, if they&#8217;re really good users of it, and they&#8217;re also gonna be the ones that, probably can identify others who know. And so getting those people to raise their hand, is not a trivial problem if you just had to do it without, but it turns out this, it, offering the referral to them kind of puts the incentives in the right place. And then, the others, obviously the other lens that I think of it through is, the lens of network economics and the viral process. So I, I&#8217;m happy to talk, but I actually, the information one, when we were thinking like, who should we recruit as an ambassador? It wasn&#8217;t obvious. And this got them to come forward.</p><p>Seth: You&#8217;ve done some work, I think, both in, definitely theoretically, but maybe even empirically too, about optimal seating. So did that, any results from that play in?</p><p>Ben: That&#8217;s a good, I would say the, the most, honestly, the most important insight that kind of was really top of mind for me was what I, in an, in my undergrad networks class, which I teach from, Networks, Crowds, and Markets by Easley and Kleinberg, they go through the basics of the viral process</p><p>Seth: Will Jackson be insulted that you don&#8217;t use his book?</p><p>Ben: well, no, &#8216;cause it&#8217;s, it&#8217;s graduate book.</p><p>Ben: I</p><p>Seth: Okay.</p><p>Ben: every year. I do say, you can go buy, you can, if you really wanna know everything, you can buy Matt&#8217;s book. But so,</p><p>Andrey: yeah, just as context for the listeners, Matt Jackson was Ben&#8217;s thesis advisor. Yeah.</p><p>Ben: and yeah, collaborator and overall hero. So I, and it&#8217;s funny because I, yeah. Small aside, but when I teach that class, I&#8217;m like, &#8216;cause I realized from these undergrads perspective, Matt Jackson, like, if you read these books, he&#8217;s just like, they think he&#8217;s probably dead. Like, he is like, seems like a very major, a major part of the field.</p><p>Ben: And then I drop somewhere in the middle of the quarter, like, oh, Matt was my, Matt was my advisor. Um,</p><p>Seth: Not dead yet.</p><p><strong>Matt Jackson as an Advisor</strong></p><p>Andrey: talking about this, this is a little bit of a tangent, but I hope you don&#8217;t mind Well. What was he like as an advisor?</p><p>Ben: oh yeah, he is, he was ama I mean, overall amazing. Like, I, I, the main thing to say about it is I met him right as he was about to move from Caltech to Stanford. I came to him as a Caltech Summer research intern student. He didn&#8217;t really havetime, but somehow I, I tricked him into like, not, to, not to being officially on the, on the program.</p><p>Ben: Uh, my advisor in the program. And then we, we started working on our first papers on social learning and information aggregation right then, and. He, I think he&#8217;s ex, the most salient trait of him is that he is just incredibly supportive and encouraging about research, but actually not at all. There was very little teaching that he ever, he ever did, explicitly, here&#8217;s how you do research. Everything I learned from him was, was &#8216;cause he was open to co-authoring and I just saw him do research and I learned by, by apprenticeship. my dad had actually told me that that was the best way to learn and I, and but he had like Soviet physics, in the 1970s as his reference point.</p><p>Ben: So I was pretty sure it was not good advice, but it actually ended up being exactly what worked for me, with Matt. But Matt was not, Matt was not prescriptive he didn&#8217;t, I don&#8217;t think, I think his, his default mode of advising is like, because he&#8217;s so incredible at research. He, his first best advising style is to leave the student alone and let them, and let them do their thing.</p><p>Ben: And one, and I, it made way more sense to me when I talked. I, I think I talked to him about. His experience with his advisor, Darrell Duffie. And I learned that it was just, it was all this dynastic thing where Darrel was exactly the same way. He just, like, Matt brought him a thesis and Darryl was like, this is really interesting.</p><p>Ben: This is good. They had been writing other papers, but that was the extent of, and I, I don&#8217;t, Mike&#8217;s Matt was more, was definitely a great mentor, but I think it was really freeing to have someone basically just trust you to do re to do research and be there as a, be there to teach by example when you needed it.</p><p><strong>Eigenvalues and Network Dynamics</strong></p><p>Andrey: here&#8217;s a question. Who likes eigenvalues more? You or Matt Jackson?</p><p>Ben: Definitely me. &#8216;cause Matt&#8217;s not, Matt&#8217;s not a math nerd. Matt. Matt is a, Matt really is a true, true, true social scientist. He&#8217;ll use whatever tool. I think there&#8217;s, I&#8217;ve always felt a little sheepish that this aesthetic thing of like, what, this tool is really like special to me. He&#8217;s, he&#8217;s not like that and I think it makes him a better social scientist that he&#8217;s not.</p><p>Ben: Whereas I, &#8216;cause I think when you, whenever you care about something other than explaining the social world, that&#8217;s gonna be like, a trade</p><p>Seth: Well let, let&#8217;s slow down for a minute for, people in the audience who don&#8217;t live with the, in the, in the glorious glow of the eigen value. And, thinking about eigen vectors of Jacobian matrices, can you give us a little, give us a little taste to someone who&#8217;s already not in love with eigen values?</p><p>Seth: Why should they love eigenvalues?</p><p>Ben: Yeah, that&#8217;s a great question. Well, so, okay, 0.1 is, algebra describes the world. You guys know that video where the guy that the, the math profs or like, like sweaty t-shirt math guy is yelling like, functions. Describe the world. I think the real thing, linear algebra describes the world, and I think in the AI era, we, we don&#8217;t, as Tyler Cowen says, it&#8217;s Rise.</p><p>Ben: Tyler Cowen says it&#8217;s rising in status. So it&#8217;s quite high in</p><p>Seth: There we go.</p><p>Ben: the tough thing about matrices is that they&#8217;re so damn complicated. There&#8217;s like, matrices, you can the, the whole world into that. And the amazing thing about. Values is that they, they answer the question of if a matrix had to be a number, what number would it be? Like if you, if a matrix lost its privileges of being, of being an end by inbox and couldn&#8217;t store all that information, you have to masquerade as a, as at, at worst, a complex number. What complex number would it, what, what mask would it put on to be itself as a number? And eigenvalues are a wonderful way of, of fully answering that question is the best you can do. And that&#8217;s like, that&#8217;s a powerful idea. And, and I, and so back to viral processes, if you think about a viral process unfolding in a network, there&#8217;s a way to model it as a matrix or a network with all of the, the sort of, activation events being modeled as like basically a big matrix, multiplication, that prop that makes your state kind of, yeah, for the, I guess. Yeah, I don&#8217;t wanna, I don&#8217;t wanna, I understand that this is probably not the most intuitive way of describing it, but it is really true that if you have a large population and you wanna track the evolution of a state like a virus, you can think of that as kind of a matrix operation that acts on the system and updates it to the next step, which is like the thing spreading further.</p><p>Ben: But often what we wanna know about a virus is not everything about how it&#8217;s proceeding, but we wanna know something simpler. Like is it like when back in COVID, is it tending to spread right now or is it dying off? Right? And so it turns out that you can compute an eigen value of a suitably defined operator or, or something that will answer that question.</p><p>Ben: And so when you&#8217;re trying to run a viral contagion, as we are at refine to get more people aware of our product, we are trying to get the viral coefficient, above one. And</p><p>Seth: Right. Okay. So yeah, so tell me what, what&#8217;s the special thing that happens when an eigen value goes from below one to above one?</p><p>Ben: Yeah, well, let&#8217;s think about numbers, right? I said so, sowe have this, this process that we&#8217;ve now distilled down to one number, the viral coefficient. And we&#8217;re, we&#8217;re doing that process, namely the next step of the, of the epidemic over and over, right? The next moment when the epidemic has a chance to do its thing, and mathematically taking a time step is applying the, the operator of the epidemic&#8217;s behavior to the system.</p><p>Ben: So you have a system you hit it with, you say, okay, one more time, step. When we compute the, the eigenvalue kind of captures just the overall extent, captures how a number. And if that number is above one, it means every time it acts, that process tends to expand the set of infected people. And so if you&#8217;re doing it over and over, you think of a number greater than one, like two.</p><p>Ben: If you keep</p><p>Seth: One of my favorite numbers greater than one.</p><p>Ben: Excellent. My, my favorite. Um, if you have two and you keep hitting it, that is multiplying it with two, you keep getting bigger and bigger and that&#8217;s exponential growth. And it&#8217;s, it&#8217;s actually, it actually works with 1.01 as well. Right. And so if you, the la the largest iGen value of the propagation matrix captures exactly that.</p><p>Ben: Is there, when, when you keep hitting that system with itself again, does it behave like raising two or 1.01 to higher and higher powers? That&#8217;s when you have expansiveness, that&#8217;s when you have viral spread.</p><p>Seth: if my eigenvalue were 0.9, my viral spread would be I contaminate 0.9 people who contaminate 0.9 people, and that adds up to a finite amount instead of everybody gets it</p><p>Ben: Exactly. And so,</p><p>Seth: now, tell me what a complex eigenvalue is.</p><p>Ben: no, not today, but I will, what,</p><p>Seth: It&#8217;s not, it&#8217;s not, it&#8217;s not an, it&#8217;s not an interview on Justified Posteriors if the guest doesn&#8217;t refuse a question.</p><p>Ben: But, but, I will say is that I, what I, what I taught in my undergrad class, what, the way that I sort of like, like tried to get them, maybe even a little more excited is, you, when you think about that tipping point 0.9 to 1.1, it doesn&#8217;t look like a big deal. Um, locally, it doesn&#8217;t look like a big deal when you super zoom in on the, on the process.</p><p>Ben: But when you look at the process&#8217;s overall behavior, it, it makes a huge difference. And so what I to what I tell the business minded undergrads that I often teach is, if you&#8217;re running, and this was always just a fanciful little illustration to me, if you&#8217;re running a company and you&#8217;re running a viral promotion, you really could, you might be willing to invest a whole lot of money to move that number only a little bit because</p><p>Seth: Infinite return, dude.</p><p>Ben: yeah. If you, if you can push it, that&#8217;s where the returns to that are very big. And so we&#8217;re, and I amusingly, I think we&#8217;re right there. I we&#8217;re, I think our viral coefficient for this referral program is just about one. I can talk about some subtleties of estimating that, but that means, one of, one of the ways that we wanted to build it is we have that to have prices in there.</p><p>Ben: So the, the, the rewards you get are a price, right? And we can in principle give you, give your give, change the price, give people more free stuff or roll lower, make it an introductory offer with a, and those are the things we can tune to change the viral coefficient.</p><p>Andrey: And I guess the other thing in practice to remember is that the viral coefficient isn&#8217;t constant.</p><p>Seth: Ah, right. So does linear algebra describe the world when it&#8217;s like a first degree Taylor approximation? Actually.</p><p>Ben: Well, the beauty of, yeah, the reason it&#8217;s not co like yeah, it&#8217;s not constant over time. And one of the reasons it&#8217;s not is because as your contagion pro propagates through the network, it&#8217;s hitting different people. Right? Um, and that&#8217;s definitely something that of course as Andrey as, as you both know, and Andre, and I have talked about is that the selection of people as any kind of, of social phenomenon, like a an advertising campaign is progressing.</p><p>Andrey: I.</p><p>Ben: getting as the next rung is, is different. And eigenvalues actually do capture that from a nerdy perspective. Like if you just had to the, if you teach the simplest possible model where you just, like everybody has three friends and they infect these three friends with some probability, there&#8217;s no room for heterogeneity.</p><p>Ben: But if you take a whole network, then actually the heterogeneity is in there and the heterogeneity is, is exactly captured by it. And so in some sense, the largest eigenvalue will tell you the average of this across the whole network. So there are tools, of course when you&#8217;re doing it in real life as I&#8217;m now you&#8217;re just tuning the knobs andyou know, doing it in a somewhat less scientific way.</p><p>Andrey: But I&#8217;ll, I&#8217;ll just say that like after this podcast airs, will have been infected, so</p><p>Seth: Yeah. Oh man. Your I, dude, we&#8217;re getting your eigenvalues up there. We&#8217;re boosting your eigenvalues as we speak, dude. Okay. So we, we talked a little bit about, contamination of like viruses, but now let&#8217;s talk about an even more insidious form of, viral contamination, which is the idea or the meme, which contaminates us with, mental illnesses such as good taste in movies.</p><p><strong>The DeGroot Model of Social Learning</strong></p><p>Seth:Um, I guess if we were bringing these ideas of linear algebra to, social learning, we would think about this thing called the DeGroot model of Social Learning. Can you tell us a little bit about what that is? And then we&#8217;ll kind of build up to why wouldn&#8217;t that be a good way to learn, and how will AI help us think about that?</p><p>Ben: Yeah. So the DeGroot model is just, and I, I, I used to call it the averaging model of social learning, is actually what I worked on with Matt Jackson when I came to him as an undergrad. Um, at Caltech in 2006. I, like many other had rediscovered. Um, the dud model just says, you form your opinion tomorrow by taking a weighted average of what your friends think today. You can forget the weighted part if you, it&#8217;s not that important. So I just look around and my friends, I say, what are, what do they think about whether AI is good for humanity or whether, whether, you know. Um, you should throw away all your black, spatulas because they have toxins in them. And, and then for on issues like that, people form sort of an opinion by, by social communication.</p><p>Ben: And the DeGroot model is the simplest possible model. And we can come back to this. It&#8217;s, it&#8217;s one that economists actually don&#8217;t tend to love when they first encounter it because it is extremely simplistic and kind of, robotic or animalistic. You just, you just take the average. And if you have a bunch of people doing this, that can be summarized with beautiful linear algebra, which is actually exactly the same math, more or less as the math that you do for Markov chain theory. So, that&#8217;s for the nerds. But sociologically it&#8217;s interesting be because if it, because you can immediately start asking questions like, will a population of people updating this way reach a consensus and will that happen fast or slow? And will this consensus be right or wrong? And it sort, it gives this tool, which is like a pocket calculator that, that, um. Anyone with a reasonable applied math, education could, could have reinvented as in fact many people, including me, did. And, and then, but you can immediately take it to also, I think one of the reasons it&#8217;s been, so popular in economics is just it gives you a lot of ways to ask simple questions and get answers, which is something the, I can talk about it, but the standard economic models of learning don&#8217;t actually tend to give, many answers in networks</p><p>Seth: What would a large versus a small eigen value in a DeGroot learning network mean?</p><p>Ben: so in the, the first eigenvalue, which is the first one people talk about, the biggest one happens to always be one for a DeGroot model, which captures the idea that everybody is averaging. So in some sense aren&#8217;t getting, there&#8217;s no natural amplification or shrinking in opinions, because if you&#8217;re averaging, that&#8217;s sort of like the, there&#8217;s an eigenvalue, which just captures that fact</p><p>Seth: There&#8217;s no way for our opinions to fly off to infinity. I guess maybe if I was like negatively waiting you could that happen?</p><p>Ben: That could happen actually, but yeah. But if you, but with normal, with sort of the, the first, the natural assumptions on weights, things will tend to stay confined</p><p>Seth: know. Having negative weights on some people&#8217;s opinion seems pretty natural to me. If you&#8217;ve been on Twitter,</p><p>Ben: I have an under, I have a brilliant undergrad thesis student right now who&#8217;s studying</p><p>Seth: ah.</p><p>Ben: negative weights in the root model. But, yeah, so, but there&#8217;s a, another eigenvalue, the second largest. And what that captures is, is a society converging fast or slow. So the second largest eigenvalue of an updating matrix, if it&#8217;s really close to one, that basically means that. You can, you can start people off. And even if the society is connected and people will eventually be tending to the same opinion, if they talk for a million years, it really will take a million years. They, the, the being close to one captures their being. And it turns out, as Matt and I, Matt Jackson and I discovered to re relate to this phenomenon of homophily, that if your network is basically if and only, if, the only way that can happen is if there are divisions in your society where people put very little weight across Democrats and Republicans or whites and blacks.</p><p>Ben: Uh, andso if that happens, you can converge really slowly and if it, and if the second eigenvalue is, know, not too big, like 0.7 or 0.5, then disagreement is gonna decay like what you Seth was saying before, 0.5 to the end, right? So it gives this beautiful one number measure of the slowness.</p><p>Andrey: what if, what if, one of us was very stubborn and just didn&#8217;t really care what other people thought about them? Would their opinion end up dominating the entire belief process, or were they just washed away in the average?</p><p>Ben: Oh, if, yeah, so, so if there&#8217;s someone who&#8217;s super stubborn, they don&#8217;t listen to, the extremists, they really don&#8217;t listen to anyone. They put all their weight on themselves and</p><p>Seth: Those are, that&#8217;s our rival podcast. Dogmatic posterior.</p><p>Ben: Exactly. So, yeah, so that&#8217;s, that&#8217;s a way to be very, that&#8217;s a way to be very influential. In fact, at the extreme, wewouldn&#8217;t even call that society connected because this one guy&#8217;s not really connected to anyone.</p><p>Seth: It might be connected out. I don&#8217;t know. Maybe.</p><p>Ben: yeah. But even if he puts a tiny little weight on others, if he&#8217;s stubborn enough, he&#8217;ll still dominate</p><p>Seth: And would that be bad?</p><p>Ben: usually. But unless he&#8217;s very, well, unless he&#8217;s very well informed, unless he, and so yeah, we, we ordinarily consider that bad because. A benchmark we like to, in a realistic case, we like to think about is information is dispersed. Everybody. Nobody know. Nobody knows God&#8217;s truth. Exactly. But everybody has has reasonable Yeah.</p><p>Ben: Nobody has</p><p>Seth: The average of this room knows God,</p><p>Ben: Exactly. Exactly. We do. you, if you could take, if you could take the God&#8217;s eye view and look at everyone&#8217;s information together, it would be enough to tell you like a whole, whole lot. But nobody, but everybody&#8217;s individual estimates are pretty, are pretty noisy. And so now how do we, how, can decentralize social learning, which DeGroot is supposed to be a simple model of get you to that.</p><p>Ben: Well, it really depends on whether one guy monopolizes all the influence or a few guys or, or di, whether influence is dispersed.</p><p>Seth: As, as the population goes to infinity, do we have, influential nodes, right, is the way you put it.</p><p>Andrey: So,</p><p>Seth: gonna ask the LLM question? Andre? You go for it.</p><p>Andrey: one second,</p><p>Seth: One sec. We&#8217;ll get there.</p><p><strong>Cow Tipping and False Beliefs</strong></p><p>Andrey: Ben. I don&#8217;t, I don&#8217;t know if you remember, but we, we&#8217;ve actually done a podcast before.</p><p>Ben: I was thinking about.</p><p>Andrey: Now. In that podcast we discussed the interesting phenomenon of cow tipping and how people seemingly believe that this is a thing that one does, even though no one actually goes cow tipping. So my question to you is, the past since</p><p>Seth: Thanks for ruining the joke, Andre, for literally everybody.</p><p>Andrey: Uh, in the past, year since, since we&#8217;ve done the podcast, have you noticed any social learning on this topic? Is it now understood that cow tipping is not a thing or is it still a belief that&#8217;s propagating</p><p>Ben: That&#8217;s very interesting. I have stopped using it as a, I, I somehow found that I have not used it as an undergraduate teaching example since COVID, now that you bring it up. So one thing, something happened to me during COVID teaching. I was teaching my, this was the last year, 2020. I was teaching the last undergrad class I taught at Harvard in fall of 2020. And it was a wonderful group of students actually, but they were all dispersed. Some, most of them at their homes. A few of them lived in like group houses with other students. And I was doing the cow tipping lecture in the way it goes. Just for the, to a little more context. Yeah. So like, it&#8217;s a great,</p><p>Ben: how many people know what cow tipping is? One thing I&#8217;ve noticed by the way, is fewer hands go up because I think Varsity Blues and that generation of movies was an important, was the way that it got into the culture. And kids these days don&#8217;t have an, watch those movies. So I don&#8217;t know whether they&#8217;ve been exposed, but, but these kids sort of knew, they were like, I was like. I asked, the usual question is I asked some factual questions about it. Like, what do you think is the prevalence in the United States? How many incidents of cow tipping have there been in the last year? And people will say, very few people will say like a firm zero. Um, but in the Zoom class, one of the students, they had their, like, their apparent or a relative in the background, and they were like, no, cow tipping happens.</p><p>Ben: I&#8217;ve seen it. So then I had to, like, in the middle of my class, I have to interview this person to, assess like whether my whole understanding of things is wrong. It wasn&#8217;t a very exciting, I was like, well, did you see it? Like, what, what did they, what did</p><p>Seth: Is the cow tipper in the room with us right now?</p><p>Ben: exactly, they were like, they were like, well, they, they were drunk and they really like ran at the cow and they hit the cow.</p><p>Ben: And I&#8217;m like, then what happens to the cow? And they&#8217;re, I don&#8217;t know, I ran away. So that&#8217;s the usual, that&#8217;s like</p><p>Seth: Are you saying that, the eigen values of the cow&#8217;s response to tipping are less than one? Is that,</p><p>Ben: Exactly, yeah. Is I, values are very important in mechanics. So. But for the other piece of context, en engineers have written papers kind of proving that you can&#8217;t under reasonable assumptions, like, knock over a cow with your shoulder or</p><p>Seth: are you gonna tell us that Santa&#8217;s not real, dude? What is this podcast about? We&#8217;re just killing people&#8217;s joy. Or, anyway, I&#8217;ll let you finish your example.</p><p>Ben: In terms of false beliefs, I think things are bad. I think my, my naive sense, it&#8217;s very hard to know &#8216;cause we don&#8217;t, you have to really study it and scientifically, but we had like a, since my wife and I have have, had a baby, we&#8217;ve interacted with, like, we had a baby nurse live with us for three years and she, she was from a very different community.</p><p>Ben: You know, she&#8217;s like, and I heard things her friends were saying, and beliefs and my, my sense is that. Strange beliefs about matters of fact are very much out there. And, and I, and I feel like TikTok, I think like TikTok propagates them actually in a way that&#8217;s more powerful thanany vector I knew that I personally experienced.</p><p>Ben: Like when I was in high school, for</p><p>Seth: Is that interesting? I mean, is that surprising from a DeGroot perspective? &#8216;cause it seems like in from a DeGroot perspective, you get communities with weird beliefs &#8216;cause they&#8217;re disconnected. But now the statement is they&#8217;re connected and that&#8217;s giving them weird beliefs.</p><p>Ben: I think what the basic DeGroot model is missing is that people talk about things very, that that people&#8217;s propensity to, to. First of all, I don&#8217;t think like these beliefs, like claims of cow tipping or other urban legends or, or wild statements about what Hillary Clinton does recreationally are like, I don&#8217;t think they&#8217;re like deru where we average what people think.</p><p>Ben: You just propagate interesting information. And I think what the DeGroot model is really missing and a lot of models of social learning is that what people share depends a huge amount on whether they think it&#8217;s interesting and like surprising and much less on whether it&#8217;s true. And moreover, people don&#8217;t adjust for that when they hear, right?</p><p>Ben: Like Tyler Cowen might, but most people, they&#8217;re not, they&#8217;re not aware of that bias in the information they&#8217;re hearing. And so they&#8217;re not, adjusting their posteriors. They&#8217;re just kind of accepting, you know? And, and so I, and I think TikTok has made it much more power, much more, much more viral to say something really interesting and get it into a lot of minds.</p><p>Ben: And that&#8217;s more like a yes on or off viral state, not like, do you believe, not like. What, what do you think the interest rate&#8217;s gonna be next, next quarter, but more like, do you think that people really landed on the moon, like a yes or no? Or you do you believe in some crazy conspiracy that&#8217;s like, like more like a virus that takes hold of you and it&#8217;s not a matter of degree of belief.</p><p><strong>Sequential Bayesian Learning and Herding</strong></p><p>Seth: Well, so if people, if people aren&#8217;t good bayesians, another model that you&#8217;ve worked with is called, the, or sorry, I guess a Sequential Bayes. If people, if people aren&#8217;t learning this connected way, maybe they&#8217;re learning in this kind of sequential, sort of herding-y way, which is sometimes called a Sequential Bayes model.</p><p>Seth: Uh, Andre, are you gonna let me move on to this topic? Or you wanna jump in with something?</p><p>Andrey: make a, I wanted to make a very brief observation since we&#8217;re talking about this. I happen to notice a book in the, in the background of, of Seth, actually The Hype Machine, which is</p><p>Seth: My machine with Ana roll. Yes. What&#8217;s, yes, what he says. It&#8217;s, it&#8217;s not true. Things that spread. It&#8217;s, novel and emotionally intense things that spread. So shout out to, a friend of the show. Sinan Aral.</p><p>Seth: All right. So, yeah. All right. So pe, so pe No, that&#8217;s good. No, that&#8217;s good. So people don&#8217;t learn in this connect way.</p><p>Seth: Maybe. Maybe, maybe they just see what the last guy did and try to figure out the state of the world from that. Is that a better model of what you&#8217;re describing, or is it also wrong?</p><p>Ben: I think what I&#8217;m describing some, some, like, having in mind intending to propagate, a little pellet of false information, like people tip cows. I think that&#8217;s just like a virus and that&#8217;s a good model. It&#8217;s also not be irrational. I mean, I think there&#8217;s some rationality to it, but I think the best model of it is like, if it&#8217;s interesting enough, it goes viral and a lot of people believe it, but Seth absolutely, like the models, Bayesian sequential updating where you hear something. I think where that model really shines is in thinking about something like, which, you know. Should I get, should I get flood insurance for my house or which accountant in our, there&#8217;s like three accountants in our industry and which one should I use? I think there, people think very much like what that model posits, which is I could research this, I could get my own signal.</p><p>Ben: I don&#8217;t have any special confidence that I would be particularly good at that. And this other person, I know that what they, that they&#8217;re not probably acting on amazing information either, but it&#8217;s probably still got a little more information content in it than mine. And let me just, so let me just follow and so you end up with a lot of like in economic context that I think are important.</p><p>Ben: I think the, the choices people make about insurance. Like when I talk to people their, who thought their whole lives about do people buy enough fire insurance or flood insurance or whatever, they basically talk about it like a social convention. And so you, you buy some and you don&#8217;t buy other, and you don&#8217;t buy stuff that people around you don&#8217;t buy.</p><p>Ben: Not because you&#8217;ve taken any time to analyze your personal, portfolio problem, but just because you assume other people have it like. That the social signal contains more information than you&#8217;re likely to gather.</p><p>Andrey: There&#8217;s also an interesting aspect of it, like if you follow the herd, then even if it goes wrong, you&#8217;re like, well, who can blame me for, for doing that? Right? But if you go against the herd, like, oh, that idiot didn&#8217;t buy insurance. Like he deserves what he, what he got. Right?</p><p>Seth: You have to get an awful, strong signal.</p><p>Ben: in a business context, right. There was this saying nobody ever got fired for buying IBM because, and that was exactly hurting on IBM, that at the, are you gonna really get blamed for using the same vendor that everybody uses?</p><p>Seth: So, how does, so is, is that great? We all coordinate on doing the right thing, or can that fail somehow? Why, why wouldn&#8217;t that be a good approach to learning?</p><p>Ben: You absolutely get big. I mean, the main was, the main first result about the herding model is that you can get quite dramatic failures of information</p><p>Seth: Oh no.</p><p>Ben: Where? Um. If people did experiment, if people, if we could ask like the first a hundred people to make this decision to ignore the social signal or just deprive them of access to other people&#8217;s past choices, and we made them decide based on their private signal, then we&#8217;d get a hundred hunches aggregated, and that would, and then after that we&#8217;d have a hundred people&#8217;s information, averaging into some vibe about what the sensible thing to do is.</p><p>Ben: But, but the sequential model shows that if you, if, if the first people already are contaminated by having access to previous decision makers, it&#8217;s just rationally they won&#8217;t get this started. So you have a kind of tragedy of the commons where collectively, we could like. Maybe compensate the first movers or just pick some of us to be unlucky and have to make this decision solo. And we would, society would learn a lot that way from, but, but what we in fact do is just, herd and actually online platforms spend a lot of energy thinking about like how to get enough experimentation going on. You know, should Google re Google Maps recommend, shortcut that it doesn&#8217;t think is the best to learn about it, should Yelp send people, try to send people to a restaurant that it doesn&#8217;t think is the best to get more information about it.</p><p><strong>LLMs and Information Aggregation</strong></p><p>Seth: How does LLMs change all this? Alright, so I&#8217;m kind of split &#8216;cause I kind of feel like these two models have different implications for whether it&#8217;s gonna help or hurt with aggregation failure. So help me out with this. It seems like in this sort of sequential Bayesian framework, LLM sort of should hurt our information algorithm, aggregation, right?</p><p>Seth: Because, nobody is in the position of being ignorant. We can always just question the model. The model tells us what the last hundred people did. Uh, we&#8217;re gonna herd harder by virtue of all having, none of us being in that state of ignorance, that state of blissful archipelago ignorance. Do you think that that is a mechanism that&#8217;s potentially at play?</p><p>Andrey: Wait, Seth, can you just clarify something? Why</p><p>Seth: Please,</p><p>Andrey: LLM tell you what the last a hundred people said necessarily? I,</p><p>Seth: it&#8217;s gonna tell me what the last hundred books written about the subject are. Let&#8217;s say.</p><p>Andrey: I mean, we can take that as a premise. I&#8217;m not sure if I&#8217;d buy it, but,</p><p>Seth: I mean, well what are they? They&#8217;re based on, this is what I&#8217;m trying to say is LLMs are based on the things LLMs have read. Andyou might say maybe this is a version of model collapse, right. LLMs are based on the last hundred on some thing of some of the last things. The LLMs read</p><p>Andrey: The last</p><p>Seth: just the last hundred tokens.</p><p>Seth: And then, somebody reads that and then they write a book based on having read the LLM. And now we get herd to whatever our opinion was in 1850.</p><p>Ben: What do you think buying it?</p><p>Andrey: no, I mean, I just, I, I guess it depends on the decision, right? But to, to the extent that models are able to reason and to the extent that your,</p><p>Seth: What if it&#8217;s a pure fashion question? What if it&#8217;s, what if it&#8217;s just black shirts are in versus white shirts are in? Could it, could it lead a stronger herding there?</p><p>Andrey: Well, it would rationally know that you don&#8217;t wanna wear what everyone else is wearing. Right. I mean, I mean, there&#8217;s a, there&#8217;s an element of like, that it can really be, have a lot of context about you, which is different than else.</p><p>Seth: Yeah.</p><p>Andrey: that&#8217;s, that&#8217;s the aspect where I&#8217;m not exactly sure that that&#8217;s how we should model it, but I&#8217;m happy to consider that version of the model.</p><p>Andrey: Sure.</p><p>Ben: Um, yeah, I&#8217;ve never thought, I haven&#8217;t thought about it in a sequential learning setting exactly. But I think there&#8217;s a different, a different dimension which seems related and important, which is like a narrative that I&#8217;ve heard repeatedly and that I think has a lot of truth about what&#8217;s happened to western society and politics is that there used to be, a focal provider of, of focal baseline, of facts, basically</p><p>Seth: Catholic church.</p><p>Ben: well, I would say the six o&#8217;clock news,</p><p>Seth: Six. Okay. All right. I always wanna go. I always wanna go back to Habsburg times. Dude, you can see this is my Habsburg wall.</p><p>Ben: I don&#8217;t know. I, and I think this was probably a unique moment because I&#8217;m not sure, I think that, that the newspapers we should ask like, Gentzkow and Shapiro about, newspapers in 1900, which was I&#8217;m sure a very different, environment with all. But like, there&#8217;s this moment which is now kind of seen, which is, valorized a little bit, that there was the, a national truth and you could, you had to get pastsome regulatory, there was regulatory exclusivity for the major broadcasters and basically nothing too crazy.</p><p>Ben: You could get broadcast too widely that Right. And then we move to this TikTok world where, where it&#8217;s a free for all. And, and it does seem like, that has some, the breakdown of a shared reality seems like an, something that&#8217;s happening to some extent and now coming like. ChatGPT. It&#8217;s, I think it&#8217;s a real empirical question.</p><p>Ben: To what extent in normal people&#8217;s normal lives does that serve as like the six o&#8217;clock news? Again, the coordinating device. Um, if you&#8217;re debating something, my wife Annie, who&#8217;s, who&#8217;s a also a Northwestern professor, had a hilarious story at a dinner she was debating. She went to MIT and she&#8217;s a big MIT snob and always reminds me that Caltech, where I went to for undergrad is way worse and is like way less cool.</p><p>Ben: And so there was, but to my surprise, her dinner can be, I wasn&#8217;t at the seminar dinner, but a guest of ours thought that Caltech was great. So I was like, the kids, it</p><p>Andrey: To.</p><p>Ben: and she was, yeah. And she was like, and he was like, wait, are you telling me that if you ask, you ask 10 people, they&#8217;ll all who, who care about this?</p><p>Ben: They&#8217;ll say that MIT is better. She was like, yeah. So of course they took out ChatGPT and that settled, and she,</p><p>Seth: Pirate, get John Horton on the phone. Tony Stark went to, Tony Stark, went to MIT Dude, that&#8217;s what people know about.</p><p>Ben: So I thought that was, and I think that&#8217;s gonna happen a lot around a lot of dinner tables and kind of, it has an effect. I, I think of it as a shared, I think of it as a powerful shared signal. Um, andI think that really reshapes things, in, in a lot of different ways. Um, that&#8217;s the main way I&#8217;ve been thinking about it.</p><p>Andrey: You know, it&#8217;s, it&#8217;s funny &#8216;cause what I, my very opinionated bias take is that the average quality of the undergrads atCaltech is obviously higher than at MIT in my experience, and I think a lot of people who know would agree.</p><p>Ben: Yeah, I think that&#8217;s, I think she&#8217;s been a little bit per, I think she&#8217;s been a little persuaded over time because my, my, my good friends, like the, the relationships I&#8217;ve kept from undergrad are, um. John Schulman, who was a, who was there, were two of the biggest ones. Or John Schulman, who was a, one of the, was maybe the, is often credited as being, a creator of chat, GPT andAdam D&#8217;Angelo, who&#8217;s, who is of course the co founder where I worked and and is a, is a very big figure in ai and I think that does you, there there&#8217;s a sort, so I think that&#8217;s made a, made an impression actually that there&#8217;s some kind of person that the place was good at incubating</p><p>Andrey: So</p><p>Seth: so</p><p>Andrey: is all listeners. This is actually all a ploy to get John Schulman on justified posters.</p><p>Seth: come on.</p><p>Ben: those two are Caltech alum in case it, it was not.</p><p>Seth: Uh, so, okay, so, so let me, so let me take that argument a step further. So, the way we should, one way to think about LLMs in the social information aggregation function is as being a central node that all of us are connected to. Um, we, you just reminded us that in these DeGroot models, having, influential node in the long run means that influential node gets to, set a little bit of the opinion and it might not just be the average of everyone&#8217;s opinions.</p><p>Seth: Is the concern there, or is the observation there that, whoever ends up controlling the most important three LLMs ends up having a real thumb on their scale in the opinions of society.</p><p>Ben: yeah, exactly. So, it&#8217;s funny when it, when Matt and Jackson and I were working on this in 2007, 2008, were very, the ba the basic first observation is exactly what, what you said, that if one person gets a lot of weight, they&#8217;re gonna, their errors are gonna matter. They&#8217;re gonna contaminate everything.</p><p>Ben: And so they&#8217;re gonna prevent, even if society as a whole has the information collectively to wash out all the error, the fact that this guy talked in a way, first or talked loudly, means that everybody&#8217;s going to be influenced by whatever. That note says, but there is an exception. Or when you try to prove those things mathematically, that&#8217;s not necessarily true because something that can happen is if that note is very good at themselves being an aggregator, and it actually does, it figures out the right information.</p><p>Ben: Um, and rebroadcast, that&#8217;s also one of the most efficient ways of figuring it out. So I think</p><p>Seth: A</p><p>Ben: the</p><p>Seth: post, a reliable pollster.</p><p>Ben: Exactly. And so the selfer, there&#8217;s something irritating about the Selfer, way in which some of these AI companies regard themselves, or it&#8217;s like that they, thinking really earnestly about stewardship of, of, the model&#8217;s preferences or whatever.</p><p>Ben: But I actually think this, that, it, if the model is say left bias, this liberal liberal bias, then that&#8217;s gonna, um. it into a lot of opinions andthat matters. And so they, they should think about it. And I, I do actually admire efforts that they make, to be basically good aggregators, good pollsters.</p><p>Ben: And interestingly, like before we could have pollsters on a few issues that you could distill numerically, but now this is a pollster that kind of up internet text about anything. It&#8217;s like a qualitative pollster, which is a really remarkable kind of device that we couldn&#8217;t have imagined when we were writing those papers.</p><p>Seth: Should we be RLH fing these models so that they have the median social opinion on all social issues?</p><p>Ben: What does that even mean? Right? How do you</p><p>Seth: I, you go to Pew and it says, the median person thinks abortion should be legal at 27 months. Whatever. What? Sorry? 27 months. 27.</p><p>Ben: But even that,</p><p>Seth: 27 weeks. Okay.</p><p>Ben: didn&#8217;t like. The interesting thing is that the LMS are doing their own embeddings of these issues into their, so people will just talk to them and say, and talk about abortion in a way. They&#8217;re doing an averaging but not one that&#8217;s, that&#8217;s, that&#8217;s numerical one that&#8217;s qualitative. And, and I, I kind of like it that way. I, I, I don&#8217;t think people have coherent views on almost any issue of public interest. And so if you try to make it numerical and try to average it that way, that would be like garbage and garbage</p><p>Seth: Right.</p><p>Ben: and.</p><p>Seth: Trying to recreate the mind of the median American voter will make you insane.</p><p>Andrey: I, I really wanna go back now to this personalization aspect of things, right? Um, it, especially with something like Chad, GPT, I don&#8217;t view it as a monolith. There is a model router involved. It has all your previous conversations. And if me and you asked it a question, and this is an interesting, it would be an interesting empirical exercise actually, is like. We might get a very different answer about like, is it, is it, normal to, I guess, I guess it depends on what we&#8217;re asking. It&#8217;s like one of the things like for myself, like, is it, should I wear a hoodie to a business meeting? Right. You know, and it might give me a different answer than you guys.</p><p>Seth: Did play League of Legends during the business meeting.</p><p>Andrey: yes, yes, Uh, but, but if I ask it, what does the average person in society think about this question? We might get the same answer, but I don&#8217;t know, these things are a little unpredictable in this way. Right.</p><p>Ben: Yeah, and there&#8217;s a bunch of</p><p>Andrey: I.</p><p>Ben: papers just suggested by what you just asked, right? If people, because of course the system prompt. If you&#8217;ve done a, if you&#8217;ve now had your custom prompt, all bets are off because you could, you could ask it. Please don&#8217;t tell me. Things that might upset me with this mental illness that I have.</p><p>Ben: And then they, we wouldn&#8217;t get probably accurate answers on, on if it&#8217;s really, then it has. So yeah, people do get, the personalization issue is super interesting. but for now, yeah, I just wanna make the point for the moment that as a focal before the market has matured to the point that there&#8217;s a niche little LLM for everybody, these items are actually new kind of animal in the, they&#8217;re not like Facebook, they&#8217;re not like they&#8217;re, they&#8217;re a new kind of sort of public object that everybody interacts with.</p><p>Ben: Um, and despite the heterogeneity that Andrey said, they, that&#8217;s, that might shift things in a way closer to a, a, a former time.</p><p>Seth: Or will people just all choose, I&#8217;m a lefty going in, so I&#8217;m gonna use lefty, LLM, and you&#8217;re already going in. You&#8217;ll use righty. LLM.</p><p>Ben: Right. But it is, isn&#8217;t it remarkable that gra, I mean, there&#8217;s like a popular Twitter joke, but after trying, after trying to train the wokes, the, sorry, the, the anti wokes, LLM imaginable, it has like, it has like wine mom views, like</p><p>Seth: You can only, you can only, you can only, right wing eyes, the LLM so much.</p><p>Ben: Yeah. Except on the rare, like, it&#8217;ll say, it&#8217;ll occasionally say Hitler is great, but other, other than that, it&#8217;ll like,</p><p>Seth: Only when it&#8217;s role playing.</p><p><strong>Simulating Social Learning with LLMs</strong></p><p>Andrey: has anyone tried to</p><p>Seth: Ooh.</p><p>Andrey: some of these social learning games with LLMs?</p><p>Ben: yeah, that&#8217;s, I, that&#8217;s a great, I I&#8217;ve been trying to learn, keep track of this. I, it&#8217;s been proposed to me by students. Um, and I know that there are people. That. So I was gonna say that when we, &#8216;cause before, before the podcast, we&#8217;d sort of discussed, some topics, and I&#8217;ve been thinking about this one that like, how will it affect social learning?</p><p>Ben: But it made me think, how will it affect studies of social learning? And now you can, you can, implement, you can simulate it, you can, try to forecast how groups of people would behave. And it&#8217;s interesting because people like John Horton have done studies of how good is it as a simulator of a, of an individual. the question of how good is it as a simulator of a community, would be super interesting. I think just intellectually, I&#8217;m sure people are doing it. I&#8217;d love to, if people listening are aware, I would love to like tweet it at me or something.</p><p>Seth: You heard it, folks, dm d dm, Ben, with all of your, simulation ideas</p><p>Andrey: yeah.</p><p>Ben: tweet.</p><p>Andrey: Well, I, I guess theclosest thing that I</p><p>Seth: posted on our Discord I&#8217;ll, we&#8217;re at the, we&#8217;re at the end.</p><p>Andrey: Yeah, is the, is the AI village, know, where the, there are like different ais, different models, and they&#8217;re like co cooperating, slash they&#8217;re given a task to do and they see if you can do the task. And some tasks are like, can you sell a t-shirt online?</p><p>Andrey: Or something like that. And it&#8217;s hilarious how they try to cooperate with each other and all their foibles andso on. Uh, which is kind of not narrowly the, the specific formulation of social learning, obviously, but related,</p><p>Ben: Yeah. Yeah.</p><p><strong>Lessons from Quora and Startup Experience</strong></p><p>Andrey:so one, you, you mentioned, your friend Adam D&#8217;Angelo. I&#8217;m curious what, what you learned, at Quora, that you&#8217;re bringing to your current startup experience, or alternatively what you learned at Quora that you brought to your research.</p><p>Ben: Yeah, that it was such a formative time that I really didn&#8217;t understand at the time, how important it would be in my life. That I think the biggest thing, I never thought I would, I never expected that I would do anything entrepreneurial just because, I think that for one, I didn&#8217;t expect that there would be a technology like AI that would be kind of like, have the exact shape that, that is, has been important for, for me to be able to actually try to do something, at the technological frontier.</p><p>Ben: But at that, but I was, what was remarkable to me is that I</p><p>Seth: Thought you said linear you, I thought you knew that Linear algebra destri describe the world and you&#8217;re the king of eigenvalues. Come on, dude.</p><p>Ben: No, but I guess I never had that deep faith or I thought it was a few steps away that I was upstream in</p><p>Seth: Mm-hmm.</p><p>Ben: the innovation</p><p>Seth: Fair enough.</p><p>Ben: of commercial applications. But I remember, like, it was huge for me that they, that they were, that Adam&#8217;s always been very interested in economics. He just reads, like he reads texts on industrial organization recreationally.</p><p>Ben: And, and I think he had, he always had this respect for economists. Um, that was very, and and so he would, we would just occasionally chat about things often through the lens of economics. And Quora had some specific, he had some economic ideas of for, well, one thing I did was moderation. &#8216;cause I was just a very active user.</p><p>Ben: So I was involved in kind of, some of the housekeeping of the moderation operation, which I actually wasn&#8217;t good at. So I, my, at the time, the interesting thing is I wasn&#8217;t like, I wasn&#8217;t a good community community manager and but, but when, then, when I was in the company. Adam got curious about this idea of credits and actually having an internal currency, and that so that people&#8217;s like, basically so that the scarce resource of some people&#8217;s attention, like, especially on early Quora, a lot of the answers were written by really visible people whose, who were, people were very excited to see them there, but their attention was scarce.</p><p>Ben: So how could you efficiently bid for people&#8217;s attention? You wanna create some kind of token, right? And so I was just like the consultant who, thought about the very basics of the design of that system, like the central banking. How much money do you issue it? How do you, but that was what I did. but what I learned was actually like just getting to watch a startup. And it was right at, when I joined there were about, I think 27 people. And so seeing a startup at that stage, I learned a huge amount about. About running a business andespecially in tech, I think the strongest, people often say that startups are like a magnification of the founder&#8217;s personality. Um, and I think that&#8217;s really true in this case. &#8216;cause,</p><p>Seth: Getting, getting how, how, frustrated it, refined was with some of my notation where it was like, you called this a node. I, it took me a while to figure out what you mean, but I would not call it a node. Uh, your personality really does come through.</p><p>Ben: it&#8217;s funny because, yeah, I&#8217;m very, I&#8217;m very pedantic. I, I&#8217;ve spent, I, I, I feel, yeah. So I&#8217;ve created, and Adam is very, very thoughtful and deliberate and kind of like likes to make decisions with principles and in a thoughtful way and make decisions, like I think a lot of good, good leadership skills, like focus on, focus on one focal goal at a time and and. Propagate that and communicate that. And then, think really thoughtfully about design The core was a very design first company andmaking design decisions, not as an afterthought, but as a core thing. I think there were a lot of those like principles, I think similar to growing up in families, like there&#8217;s just certain values that are embodied in where your environment.</p><p>Ben: And when I was there, like I realized after that I, I&#8217;m a pretty good sponge and I wasn&#8217;t directly involved in any like, decisions having to do with design, but you know, the guy I sat next to at Quora was, Joel Lewenstein, who&#8217;s now the, the head of design at Anthropic. And I can, and like, but I didn&#8217;t, I think what the amazing thing is, it was this like, combination of amazing people and all of them were really thoughtful and really good at what they did.</p><p>Ben: And they talked about startup uping in a very intellectual, thoughtful principles first way. And so that when I, I, when it came time to think about a business, I felt like. That was a natural way to be, and I realized I never would&#8217;ve had the, that kind of, those kinds of vibes, if not for those six or eight months that I spent there.</p><p></p><p>Andrey: Very cool. Um, do you have any thoughts about why more companies don&#8217;t use virtual currencies and have you thought about the use case of virtual currency for internal allocations of GPUs?</p><p>Ben: Great questions? Um, I think virtual</p><p>Seth: You imagine going to Walmart and they tried to pay you in Walmart coin instead of money, people would riot.</p><p>Ben: Yeah. Well, but you could, I mean, internal currencies. I think one of the problems that, I wasn&#8217;t around when Quora eventually decided to get rid of them, but I think one of the problems is that, um. Currencies are focal and they create people, they, they motivate people to do things in a way that they sort of take up too much oxygen in the ecosystem. And so when you&#8217;re designing a social product where you want many kinds of incentives to be in balance, having a currency can actually be harmful to the, it&#8217;s a kind of a sociologist insight, but like, so I think there&#8217;s some of, I think you have to be really, I think for platforms where that are truly transactional and economic currencies are always good.</p><p>Ben: And usually that currency becomes money. &#8216;cause it&#8217;s gonna have an exchange rate with real money</p><p>Seth: Right.</p><p>Ben: Um,</p><p>Seth: Love one price.</p><p>Ben: yeah, but for, but I think for, for. It is, I think it&#8217;s an interesting phenomenon that needs to be thought about more. Why it&#8217;s not, why it&#8217;s really generally not a successful route for social for internal markets. I, I&#8217;m very, I I believe that some of the obstacles to internal markets are just frictions having to do with like, basically contracting frictions. Um, and one thought that I have had for a long time actually discussed with, we had some there. Let me just, I, you guys will edit. Let me just say that again. One thought I&#8217;ve been thinking about for a long time is just as contracting intermediaries. Um, and</p><p>Seth: This is a big theme of the</p><p>Ben: Andrey</p><p>Seth: Coasian Singularity Dude.</p><p>Ben: Yeah. This is Andrey&#8217;s paper.</p><p>Andrey: Yeah. So what, what is your thought about this? Yeah.</p><p>Ben: I&#8217;m very curious, so I&#8217;m very curious for your take on it since you&#8217;ve thought about it much more seriously now, but it just, yeah, I think I feel like. A lot of the details were just like implementation details, that if it became your job to implement it at a company, you would, you would decide that it&#8217;s, you&#8217;d have to really have a high valuation of the marginal allocated efficiency of that currency. And it&#8217;s arguable that it&#8217;ll, it&#8217;ll be, it, I think experiment experimenting with it has just become way more valuable once we reach the LLM, capability of being trustworthy to like, negotiate a contract, which I think honestly is not right now, but yeah.</p><p>Ben: I, I see that as a potential, a big organizational impact. I&#8217;m very curious what you think.</p><p>Andrey: I mean, surely the contracting aspect would be hard. but I also think there&#8217;s a social aspect to it as well, right? You&#8217;re the CEO, you create an internal Coasean internal market for GPU resources, then you suddenly see a team that you don&#8217;t want using the GPUs, using a lot of the GPUs. Now, what do you do</p><p>Seth: The whole point of, yeah, the whole point of having a firm is to have a command DI economy. If you wanted everyone making independent economic decisions, you wouldn&#8217;t have a company right.</p><p>Andrey: but there&#8217;s a sense in which there&#8217;s some optimization that you want your teams to be making, like leaving idle GPUs or they&#8217;re using them very stupidly for some reason, and you don&#8217;t, you want that to be kind of disincentivized and. The way it&#8217;s currently done is through these very imperfect monitoring systems and people asking very nicely, can I have, this resource?</p><p>Andrey: Right? So yeah, I&#8217;m, I&#8217;m curious whether the, the AIs can do a better job here.</p><p>Ben: Yeah, I mean I guess the, you might shortcut you, they&#8217;re also becoming better at being the arbiters of requests. Right? So maybe, maybe rather than, but, but I do think money is, one memory I have of Quora actually is that the engineers, they hadbrilliant young people and I very like. Who were first principles thinkers too.</p><p>Ben: And so people would ask me also, I had to just like justify money to the whole, to like the skeptics in the whole company. And so I gave, gave a lot of thought</p><p>Ben: Yeah, why don&#8217;t we have some more multidimensional expression? Right. And there are good answers to that. It&#8217;s like very helpful that money is very legible.</p><p>Ben: That, but, but I guess we, yeah, for companies, I&#8217;m very much with Seth&#8217;s point that if you really believed in the power of the, of monetary incentives to, to do it, you, you wouldn&#8217;t have a company, but you may find it a useful tool within the command. I mean, even, even the command the North Korea has has currency, right?</p><p>Ben: So like it&#8217;s definitely a tool. And I think with the Pareto frontier has changed, but I don&#8217;t know how</p><p><strong>Closing</strong></p><p>Andrey: Very, very cool. So, we&#8217;re just about out of time. Uh, is there anything either of you want to add to our conversation?</p><p>Seth: Ben, do you have any good eigenvalue jokes for us?</p><p>Ben: oh man, I should have prepared. </p><p>Seth: Alright. We had Ben Golub today who&#8217;s made tremendous strides in automated paper reviewing and still has a lot of progress to be achieved on automated Eigenvalue joke, doing, thanks for tuning into this episode of Justified Posteriors. Please like, share, and subscribe. We now have a hoppin&#8217; Discord community for now by invite only DM us on substack Twitter or LinkedIn for your personalized invite code.</p><p>Seth: And why don&#8217;t you keep your posteriors justified?</p><p>Andrey: Thanks, Ben. </p>]]></content:encoded></item><item><title><![CDATA[The Best Books Seth Read in 2025]]></title><description><![CDATA[How advice about murderous dads explains the difference between the U.S. and China]]></description><link>https://empiricrafting.substack.com/p/the-best-books-seth-read-in-2025</link><guid isPermaLink="false">https://empiricrafting.substack.com/p/the-best-books-seth-read-in-2025</guid><dc:creator><![CDATA[Seth Benzell]]></dc:creator><pubDate>Fri, 26 Dec 2025 17:12:50 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/9f0eb425-7381-4f22-b20f-41465884456e_988x518.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In 2025, I read about 35 books, slightly below my targeted pace of 40. Here are some superlatives and books I highly recommend:</p><h3><strong>Best Pairing: </strong><em>Breakneck: China&#8217;s Quest to Engineer the Future </em>by Dan Wang and <br><em>Natural Moralities: A Defense of Pluralistic Relativism</em> by David B. Wong</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Irb2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Irb2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png 424w, https://substackcdn.com/image/fetch/$s_!Irb2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png 848w, https://substackcdn.com/image/fetch/$s_!Irb2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png 1272w, https://substackcdn.com/image/fetch/$s_!Irb2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Irb2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png" width="906" height="676" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:676,&quot;width&quot;:906,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:641432,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://empiricrafting.substack.com/i/182637326?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Irb2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png 424w, https://substackcdn.com/image/fetch/$s_!Irb2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png 848w, https://substackcdn.com/image/fetch/$s_!Irb2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png 1272w, https://substackcdn.com/image/fetch/$s_!Irb2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd51e095-b5f5-46fb-a9ff-5cd22f7691f9_906x676.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>How and why do China&#8217;s and the U.S.&#8217;s political cultures differ? This pair of books, each by a leading D. Wang/Wong, comes at the question from two very different directions.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Empiricrafting! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><em>Breakneck</em> says the divergence is due to China having a leadership/political culture focused on engineering, while the U.S.&#8217; is focused on law and lawyers. Dan Wang argues that the excesses of the one-child policy and lockdowns, and the failure of the US to build infrastructure, can all be understood as downstream of this decision. Dan has good conversations about this take at <a href="https://conversationswithtyler.com/episodes/dan-wang/">Conversations with Tyler</a> and on the <a href="https://www.sinicapodcast.com/p/the-engineering-state-and-the-lawyerly">Sinica podcast</a>.</p><p>Now, don&#8217;t get me wrong, the thesis and the anecdotes used to illustrate <em>Breakneck</em> are excellent. But in both the book and his podcasts, Dan doesn&#8217;t engage with what I see as the spiciest question prompted by this theory. Namely, <em>when and why</em> <em>is it that the two cultures diverged</em>? <br><br>If China is just on the standard Solow growth path, with a US 1950s need for engineering leadership, and will naturally converge to US 2020 levels of lawyerly leadership, this is a <em>VERY </em>different hypothesis than the two countries having fundamentally different moral and political inheritances. If the latter is the case, Tyler&#8217;s objection that (paraphrasing) &#8220;Chinese lawyers might just make autocracy more efficient&#8221; has purchase.</p><p>Isn&#8217;t it plausible that a <a href="https://en.wikipedia.org/wiki/Yu_the_Great">mythic canal king</a>, irrigated rice farming, and a unified empire make a society different than one downstream of Greek philosophy, chivalrized barbarians, and protestantism? <br><br>In David B. Wong&#8217;s &#8220;Natural Moralities: A Defense of Pluralistic Relativism,&#8221; this deep divergence in culture between the US and China is a central theme. D. Wong argues for a form of moral pluralism he calls &#8220;pluralistic relativism&#8221;. Under this view, dramatically different moral systems can be equally moral without descending into anything-goes moral relativism.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a><br><br>IMHO, this is a very attractive move for two reasons: (1) It&#8217;s more plausible than natural law theories of &#8220;absolute morality&#8221;, while still being able to make the obvious point that some social systems are better for human flourishing than others. And (2) It&#8217;s a step towards a vision of how universalizing Westerners can have constructive dialogue with Oriental moral systems -- an essential need in a century that will be defined by East vs. West rivalry.</p><p>This brings me to why <em>Natural Moralities</em> is such a valuable pairing with <em>Breakneck</em>. According to Wong, but in my own words, the divergence between Confucian and Western moral theories -- at a deep level -- is that Confucians are shape rotators while Plato and the Judeo-Christian philosophers are wordcels (&#8220;In the beginning was the word&#8230;&#8221;).</p><h4><br>How What You Should Do When Your Dad Murders Someone Explains the Difference Between the U.S. and China<br></h4><p>To illustrate this, David Wong brings up a story from Mencius, which I&#8217;ll contrast with one from Plato. The question both philosophers are faced with is<strong> &#8220;What should you do if your dad kills someone unjustly?</strong>&#8221; Before I give their answers, maybe think for yourself what you&#8217;d recommend.</p><h4><strong><a href="https://en.wikisource.org/wiki/The_Chinese_Classics/Volume_2/The_Works_of_Mencius/chapter13">Mencius&#8217; Answer</a>: </strong></h4><p><strong>(Note: Gu Sao is Shun&#8217;s dad)</strong><br><br><em>&#26691;&#25033;&#21839;&#26352;&#65306;&#12300;&#33308;&#28858;&#22825;&#23376;&#65292;&#30347;&#38518;&#28858;&#22763;&#65292;&#30653;&#30605;&#27578;&#20154;&#65292;&#21063;&#22914;&#20043;&#20309;&#65311;&#12301;</em></p><p><em>Tao Ying asked, saying, &#8216;Shun being sovereign, and Gao Yao chief minister of justice, if Gu Sou had murdered a man, what would have been done in the case?&#8217;</em></p><p><em>&#23391;&#23376;&#26352;&#65306;&#12300;&#22519;&#20043;&#32780;&#24050;&#30691;&#12290;&#12301;</em></p><p><em>Mencius said, &#8216;Gao Yao would simply have apprehended him.&#8217;</em></p><p><em>&#12300;&#28982;&#21063;&#33308;&#19981;&#31105;&#33287;&#65311;&#12301;</em></p><p><em>&#8216;But would not Shun have forbidden such a thing?&#8217;</em></p><p><em>&#26352;&#65306;&#12300;&#22827;&#33308;&#24801;&#24471;&#32780;&#31105;&#20043;&#65311;&#22827;&#26377;&#25152;&#21463;&#20043;&#20063;&#12290;&#12301;</em></p><p><em>&#8216;Indeed, how could Shun have forbidden it? Gao Yao had received the law from a proper source.&#8217;</em></p><p><em>&#12300;&#28982;&#21063;&#33308;&#22914;&#20043;&#20309;&#65311;&#12301;</em></p><p><em>&#8216;In that case what would Shun have done?&#8217;</em></p><p><em>&#26352;&#65306;&#12300;&#33308;&#35222;&#26820;&#22825;&#19979;&#65292;&#29494;&#26820;&#25949;&#36445;&#20063;&#12290;&#31434;&#36000;&#32780;&#36867;&#65292;&#36981;&#28023;&#28657;&#32780;&#34389;&#65292;&#32066;&#36523;&#35362;&#28982;&#65292;&#27138;&#32780;&#24536;&#22825;&#19979;&#12290;&#12301;</em></p><p><em>&#8216;Shun would have regarded abandoning the kingdom as throwing away a worn-out sandal. He would privately have taken his father on his back, and retired into concealment, living some where along the sea-coast. There he would have been all his life, cheerful and happy, forgetting the kingdom.&#8217;</em></p><p>Here we see a classic shape rotator approach to a moral dilemma: The state needs to enforce justice, but a son needs to protect his father. We get a compromise that hopefully leaves everyone somewhat happy -- Shun should abscond with his father, removing him from being able to do more crimes, but still protecting him. <br><br>We also get advised that, despite what we&#8217;d call a tragic clash of values in the West, Shun should still try to feel good about himself. To your taste, Mencius&#8217; answer is either a nice compromise or a stupidity that fails to satisfy any plausible theory of justice.</p><h4><strong><a href="https://en.wikipedia.org/wiki/Euthyphro">Plato&#8217;s Answer:</a></strong></h4><p><a href="https://en.wikipedia.org/wiki/Euthyphro">In the Socratic dialogue &#8220;Euthyphro&#8221;</a>, Socrates runs into a priest who has decided to turn his murderous dad in to the justice system. <br><br>Unlike Mencius, who tries to split the difference and make everyone happy, Socrates decides to confuse things further. He questions: <br><br><em><strong>Socrates<br></strong>But what is the charge, and what is the suit about?</em></p><p><em><strong>Euthyphro<br></strong>Murder, Socrates.</em></p><p><em><strong>Socrates<br></strong>Heracles! Surely, Euthyphro, most people do not know where the right lies; for I fancy it is not everyone who can rightly do what you are doing, but only one who is already very far advanced in wisdom.</em></p><p><em><strong>Euthyphro<br></strong>Very far, indeed, Socrates, by Zeus.</em></p><p><em><strong>Socrates<br></strong>Is the one who was killed by your father a relative? But of course he was; for you would not bring a charge of murder against him on a stranger&#8217;s account.</em></p><p><em><strong>Euthyphro<br></strong>It is ridiculous, Socrates, that you think it matters whether the man who was killed was a stranger or a relative&#8230;<br><br><strong>Socrates<br></strong>But, in the name of Zeus, Euthyphro, do you think your knowledge about divine laws and holiness and unholiness is so exact that, when the facts are as you say, you are not afraid of doing something unholy yourself in prosecuting your father for murder?</em>&#8221;</p><p>In the rest of the dialogue, Socrates proceeds to shoot down every theory of Euthyphro&#8217;s about the nature of piety and justice.</p><p>The reader is only left with more questions: &#8220;Are the gods just because they behave justly, or is justice simply what the gods command?&#8221; and &#8220;Is piety something different than a commercial relationship with gods?&#8221; In classic wordcel fashion, rather than actually contribute to solving a social dilemma, Socrates critiques and deconstructs  -- and, of course, his dialogue is much, much longer than Mencius&#8217; answer too!</p><p>When framed as wordcel vs. shaperotator culture/morality, I think the Breakneck distinction between Lawyer and Engineer states makes more sense, and is actually a deeper and more interesting thesis. It also puts me closer to Tyler&#8217;s view that just adding more lawyers to China&#8217;s system won&#8217;t actually result in more individual protections and Western-style justice.</p><p>I think I can see the unique strengths of either approach, while still feeling secure in the fact that we each have a system that works well for us: A Western system of critique, individual reason, an openness to the idea of tragic conflicts, and an insistence on conceptual clarity and rights vs. a Confuscian system of practical problem solving at the expense of some of that clarity and Western roadbumps to ill-concieved grand plans. I hope that books like these, which attempt to see the logic in each other&#8217;s systems, can be an important step to peaceful coexistence.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://empiricrafting.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Empiricrafting! Subscribe for free to receive new posts and support our work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h3><strong>Best Non-Fiction: </strong><em>The Allure of Battle: A History of How Wars Have Been Won</em> <em>and Lost</em> by Cathal J. Nolan</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!i0UO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F021cadd5-0789-4344-b11b-073022885bb4_266x400.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!i0UO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F021cadd5-0789-4344-b11b-073022885bb4_266x400.png 424w, https://substackcdn.com/image/fetch/$s_!i0UO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F021cadd5-0789-4344-b11b-073022885bb4_266x400.png 848w, https://substackcdn.com/image/fetch/$s_!i0UO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F021cadd5-0789-4344-b11b-073022885bb4_266x400.png 1272w, https://substackcdn.com/image/fetch/$s_!i0UO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F021cadd5-0789-4344-b11b-073022885bb4_266x400.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!i0UO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F021cadd5-0789-4344-b11b-073022885bb4_266x400.png" width="266" height="400" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/021cadd5-0789-4344-b11b-073022885bb4_266x400.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:266,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!i0UO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F021cadd5-0789-4344-b11b-073022885bb4_266x400.png 424w, https://substackcdn.com/image/fetch/$s_!i0UO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F021cadd5-0789-4344-b11b-073022885bb4_266x400.png 848w, https://substackcdn.com/image/fetch/$s_!i0UO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F021cadd5-0789-4344-b11b-073022885bb4_266x400.png 1272w, https://substackcdn.com/image/fetch/$s_!i0UO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F021cadd5-0789-4344-b11b-073022885bb4_266x400.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>An alluring possibility: What if the real battle is the allies and economic capacity we build up along the way?</p><p>The start of this book is not promising. An overlong introduction of the main theme &#8212; battles and great generals are overrated; attrition and grand strategy underrated &#8212; which bounces between obvious and unsupported claims.</p><p>But what comes next is the greatest single-volume history of warfare I&#8217;ve ever read. A masterful tour from Marathon to the Marne, his discussions of individual wars are better than many dedicated books I&#8217;ve read.</p><p>His coverage of the evolution from pike and shot, to line infantry, to skirmishers is excellent  &#8212; especially because it&#8217;s not presented as a series of innovations by great generals. Rather, the author has a fresh take focused on the interaction of generals&#8217; desire for maneuver warfare with changing fortification and siege technologies, as well as a focus on how quickly these technologies and strategies can diffuse through repeated encounters.</p><p>The author&#8217;s main argument is simple: (1) the relative cumulative economic power of sides is the most important determinant of who wins long wars (2) long wars are expensive also, and therefore (3) Revisionist powers are tempted to plan around short wars because these are the ones that would hypothetically help them. This leads to (4) Revisionist and over-confident powers quickly find themselves in over their heads, leading them to lose long wars.</p><p>Decisive, quick, victorious maneuver warfare is the dream of a Frederick the Great, a Von Moltke, a Yamamoto. The author does a fantastic job of explaining this doctrine &#8212; the lust for a costless victory, ideally a &#8220;cauldron battle&#8221; that would exterminate the enemy army in imitation of Cannae.</p><p>But then the author makes an amazing, obvious, and yet hugely underappreciated point &#8212; why do we idealize the victory at Cannae, when Hannibal&#8217;s strategic failures are what determine the course of the war?</p><p>The author explains why. We idealize the great army geniuses of the past - in part to get adolescents psyched about war, in part to glorify national genius, but worst of all, to justify irrational wars of aggression by revisionist powers. The Japanese in WW2 wanted to revise the international order, but they weren&#8217;t capable (due to internal division in large part &amp; aggressive leaders taking international actions) of aligning themselves on the stronger side of a global conflict (or at limiting the spread of their conflict). Therefore, the only answer was ever more aggressive attacks in the hope of destabilizing, in a series of brilliant campaigns, stronger opponents.</p><p>The argument is basically right. I am convinced. Great book. But it is possible to over-learn this lesson. France&#8217;s side may have eventually won WW2 - perhaps inevitably due to the network of alliances- but their failure to keep up with Wermacht initiative at the beginning of the war made everything so much worse.</p><p>As a child, I fell for the romance of Hannibal. In some ways, the fact that he loses in the end is almost a plus - a heroic standing in the face of the Roman tsunami. But a much better hero is Fabian, the delayer, who, rather than pushing for a quick resolution, showed the patience necessary for Rome&#8217;s advantages to inevitably tell.</p><p>Where does that leave the US today? I conclude that maintaining the alliance system is more important than ever. No country, including China, can challenge the US + EU + India together. We couldn&#8217;t be conquered in decades. Even if these nations cut their militaries to the bone, we could still hold out and win a long war - so long as we remained unified! It also makes me worried for an Israel that, drunk with operational success, may find itself isolated and overextended. <br><br>In sum, Grand Strategy&gt;&gt;&gt;&gt;&gt;Operational Art&gt;=Tactics.</p><h3><strong>Best Sci Fi: </strong><em>The Hydrogen Sonata </em>by Ian M. Banks</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3ZI_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3ZI_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png 424w, https://substackcdn.com/image/fetch/$s_!3ZI_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png 848w, https://substackcdn.com/image/fetch/$s_!3ZI_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png 1272w, https://substackcdn.com/image/fetch/$s_!3ZI_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3ZI_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png" width="316" height="475" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/abb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:475,&quot;width&quot;:316,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3ZI_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png 424w, https://substackcdn.com/image/fetch/$s_!3ZI_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png 848w, https://substackcdn.com/image/fetch/$s_!3ZI_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png 1272w, https://substackcdn.com/image/fetch/$s_!3ZI_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fabb0308c-9bc4-40d7-83f2-dfddc9650b61_316x475.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I read The Culture series as mainly about two things: (1) What would it be like to live in a utopia? And (2) How did (and should have) the US acted internationally during its 1990s unipolar moment. I had been holding off on reading this, the last in the series, wanting to savor one of my favorite series for longer. But I finally gave in.</p><p>I was not disappointed. The book delivers well on both (1) and (2) as well as pointing out interesting connections between them. The core metaphor is the humanoid protagonist&#8217;s dedication to mastering playing an impossibly stupid four-arm-requiring string instrument. No spoilers, but if you&#8217;re interested in either theme, I highly recommend this series. It can be read out of order, and this is possibly my favorite, so feel free to jump in here.</p><h3><strong>Best Play/Opera: </strong><em>Salome </em>by Oscar Wilde</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rX6U!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rX6U!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png 424w, https://substackcdn.com/image/fetch/$s_!rX6U!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png 848w, https://substackcdn.com/image/fetch/$s_!rX6U!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png 1272w, https://substackcdn.com/image/fetch/$s_!rX6U!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rX6U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png" width="1456" height="623" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:623,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rX6U!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png 424w, https://substackcdn.com/image/fetch/$s_!rX6U!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png 848w, https://substackcdn.com/image/fetch/$s_!rX6U!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png 1272w, https://substackcdn.com/image/fetch/$s_!rX6U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01a1ea2a-2a71-4ffe-89c9-df0c8117bee2_1600x685.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I read the play in advance of seeing the excellent Met production. I appreciated how Oscar anticipates the concept of the &#8220;male gaze&#8221; and how sexual abuse perpetuates itself. The play is ambiguous in a way that leaves &#8220;The Dance of the Seven Veils&#8221; and related scenes somewhat sexy. Oscar&#8217;s language in listing the great gifts offered by King Herod is hypnotic.</p><p>Strauss&#8217; music and the Met&#8217;s staging illustrate the play well. The Met production renders explicit how fucked up Salome&#8217;s abuse was, using seven Salomes at various ages, all dressed in school-girl clothes, to make sure you don&#8217;t miss the point, at the expense of sexiness.</p><h3><strong>Wildcard: </strong><em>The Pine Barrens</em> by John McPhee</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fIvC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fIvC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png 424w, https://substackcdn.com/image/fetch/$s_!fIvC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png 848w, https://substackcdn.com/image/fetch/$s_!fIvC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png 1272w, https://substackcdn.com/image/fetch/$s_!fIvC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fIvC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png" width="253" height="394" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:394,&quot;width&quot;:253,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fIvC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png 424w, https://substackcdn.com/image/fetch/$s_!fIvC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png 848w, https://substackcdn.com/image/fetch/$s_!fIvC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png 1272w, https://substackcdn.com/image/fetch/$s_!fIvC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1de40462-a558-41d9-a8ad-281f84c57f3c_253x394.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>If you&#8217;re not from New Jersey, you&#8217;ve probably only encountered the Pine Barrens through hearing of mobsters dumping bodies there or perhaps the Jersey Devil cryptid. Even as someone from North Jersey, my understanding did not extend much further than that. But I really enjoyed learning more about it in this tightly written exploration of an anachronistic region nestled discretely between Philly and NYC.  <br><br>The pines have always been sparsely populated, even in indigenous times, because of the sandy soil unsuitable for most agriculture. It has been a haven for Northeasterners who have wanted to get off the grid for centuries: as America&#8217;s first Native American reservation, for escaped slaves, and for Loyalists during and after the Revolutionary War. Today, it is known for its excellent blueberries, which were intentionally selected, cultivated, and spread by Rutgers University biologists.</p><p>The writing is brisk and respectful while not above pointing out some of the funny or absurd parts of piney life. Truly an underappreciated corner of America!</p><h3><strong>Most Laughable Economic Theory Joke Award:</strong><em><strong> </strong>Ecstasy: Understanding the Psychology of Joy</em> by Robert A. Johnson</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Zsvv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Zsvv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png 424w, https://substackcdn.com/image/fetch/$s_!Zsvv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png 848w, https://substackcdn.com/image/fetch/$s_!Zsvv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png 1272w, https://substackcdn.com/image/fetch/$s_!Zsvv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Zsvv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png" width="312" height="475" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:475,&quot;width&quot;:312,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Zsvv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png 424w, https://substackcdn.com/image/fetch/$s_!Zsvv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png 848w, https://substackcdn.com/image/fetch/$s_!Zsvv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png 1272w, https://substackcdn.com/image/fetch/$s_!Zsvv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cc0deab-08e9-4065-bc5c-7132e45cf23a_312x475.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I have to mention this one because I got the best laugh I&#8217;ve had of the year out of it -- but that was unintentional, the book is pretty bad.</p><p>You may have read about the distinction between the Dionysian and the Apollonian introduced by Nietzsche. Like a D&amp;D alignment table, this chaos vs. order axis is orthogonal to conventional morality, but an important aspect of human psychology. I highly recommend reading &#8220;The Birth of Tragedy&#8221; by Nietzsche, or &#8220;Psychological Types&#8221; by Jung, to learn more about this distinction! The idea that we are cut off from our emotive, intuitionistic tools for creating value is a compelling one, but one difficult to balance with our modern virtues of reason and order. It&#8217;s a really good big idea!</p><p>This book is only sometimes about that big idea, and like many in Nietzsche&#8217;s and Jung&#8217;s shoes, it doesn&#8217;t share their talent for connection and subtlety. Instead, in this book, we get something in between DBT and Jungian shadow-self work.</p><p>Some of these ideas are not necessarily bad -- understanding your counter-social impulses and integrating them is great. But some of the ideas advocated are actually pretty bad and scary. The book seems to advocate different pseudo-schizo approaches to emotional healing with the shadow self -- from building a little shrine of your idol, to doing crazy calisthenics your dream-Dionysis tells you to do. <br><br>In a very short book of 97 pages, it&#8217;s clear the author is running out of steam by the end, with two of the last chapters devoted to reciting not particularly exciting dreams he or a client has had.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fw4J!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2988fc69-98a6-48eb-b149-566a859a8944_640x480.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fw4J!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2988fc69-98a6-48eb-b149-566a859a8944_640x480.png 424w, https://substackcdn.com/image/fetch/$s_!fw4J!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2988fc69-98a6-48eb-b149-566a859a8944_640x480.png 848w, https://substackcdn.com/image/fetch/$s_!fw4J!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2988fc69-98a6-48eb-b149-566a859a8944_640x480.png 1272w, https://substackcdn.com/image/fetch/$s_!fw4J!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2988fc69-98a6-48eb-b149-566a859a8944_640x480.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fw4J!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2988fc69-98a6-48eb-b149-566a859a8944_640x480.png" width="640" height="480" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2988fc69-98a6-48eb-b149-566a859a8944_640x480.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:480,&quot;width&quot;:640,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fw4J!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2988fc69-98a6-48eb-b149-566a859a8944_640x480.png 424w, https://substackcdn.com/image/fetch/$s_!fw4J!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2988fc69-98a6-48eb-b149-566a859a8944_640x480.png 848w, https://substackcdn.com/image/fetch/$s_!fw4J!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2988fc69-98a6-48eb-b149-566a859a8944_640x480.png 1272w, https://substackcdn.com/image/fetch/$s_!fw4J!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2988fc69-98a6-48eb-b149-566a859a8944_640x480.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The book is at its funniest when the author -- who admits to not being very good at book writing or history -- completely makes up a political-economic history of the suppression of Dionysis and his replacement with the debauched Bacchus.<br><br>The peak of this, which I&#8217;ll leave you with, is my new favorite theory of the price level. From page 45, on Dionysus as &#8220;scapegoat&#8221;: <br><br>&#8220;<em>Sheep represent everything of value in our Judeo-Christian World. The sheep, in fact, is the chief determinant of our currency. Every currency in the Western world -- the shilling, the franc, the deutche mark, the lira, the peso, the Austrian thaler (from which we got our dollar) -- was the price of one sheep. For centuries, there was no inflation in the Western world because one of our money pieces was worth a sheep. You could count on that anywhere, anytime.</em>&#8221;</p><p><strong>Literally laughed for a solid 10 minutes</strong>. Unconstrained by reason, the author gave me a moment of joy. And isn&#8217;t that the most Dionysian thing of all?</p><h3><strong>Honorable Mentions:</strong></h3><p><em>Abundance </em>-- Agreed with it too much to find it interesting. But it&#8217;s the book I&#8217;m &#8220;rooting for&#8221; the most this year.</p><p><em>Democracy in America part 1</em> -- Great, but &#8220;The Ancient Regime and the Revolution&#8221; is better, and more unified in its thinking. This is foxy and hard to summarize, but ofc a deserved classic.</p><p><em>The Fundamentals of Heavy Tails</em> -- Great primer on a topic I&#8217;ve launched myself into this year.</p><p><em>Help Wanted</em> -- Read on the recommendation of Jason Furman, a nice little slice of life about minor drama at an upstate NY big-box store and the people who work there. Some good boots-on-the-ground economics about how management, economic incentives, loyalty, and hope play out at a place like this.</p><p><em>Fortune&#8217;s Formula: The Untold Story of the Scientific Betting System That Beat the Casinos and Wall Street</em> -- Good for its discussion of the Kelly criteria and the hilarious fight between gamblers and Samuelson over whether it&#8217;s deeply true. (Spoiler: Obviously, it&#8217;s only utility maximizing from a single specific perspective, but it&#8217;s an awesome and useful heuristic for long-lived institutions.)</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>You might think that this is just Isaiah Berlin-esque Moral Pluralism, but as of this book, D. Wong HATES that term, arguing that Berlin goes full relativist. He argues that Berlin&#8217;s system has no resources for calling e.g. Aztec or Molochian worship systems immoral, while he would argue that only moral systems that plausibly contribute to human flourishing (which he thinks is somewhat universal due to our shared biology). </p><p></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>This is the second book in the last few years where I&#8217;ve run into a weird 1970s UN cybernetics conference giving people disastrous ideas. Here it appears as where China&#8217;s architect of the one-child policy got his inspiration. I have also seen these conferences in  &#8220;Building a Ruin&#8221; about late Soviet economic policymaking, as a source for compromise technocratic ideas that gave Soviet leaders a politically useful (but economically inadequate) third option besides the antiquated command economy  vs. true liberalization.</p><p></p></div></div>]]></content:encoded></item></channel></rss>