<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[arg min]]></title><description><![CDATA[arg min: a blog of minimum value. on the history, foundations, and validity of "optimally" automated decision making.]]></description><link>https://www.argmin.net</link><image><url>https://substackcdn.com/image/fetch/$s_!MpqK!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F5ecc065f-b4b4-488f-9ff9-d842d175475d_256x256.png</url><title>arg min</title><link>https://www.argmin.net</link></image><generator>Substack</generator><lastBuildDate>Tue, 12 May 2026 15:21:31 GMT</lastBuildDate><atom:link href="https://www.argmin.net/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Ben Recht]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[argmin@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[argmin@substack.com]]></itunes:email><itunes:name><![CDATA[Ben Recht]]></itunes:name></itunes:owner><itunes:author><![CDATA[Ben Recht]]></itunes:author><googleplay:owner><![CDATA[argmin@substack.com]]></googleplay:owner><googleplay:email><![CDATA[argmin@substack.com]]></googleplay:email><googleplay:author><![CDATA[Ben Recht]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[The Quantification Trap]]></title><description><![CDATA[A computational paradox of the postmodern condition.]]></description><link>https://www.argmin.net/p/the-quantification-trap</link><guid isPermaLink="false">https://www.argmin.net/p/the-quantification-trap</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Mon, 11 May 2026 14:05:59 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/3d510b0f-6a1c-4334-96de-5d353b13dfd5_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NqLY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NqLY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NqLY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NqLY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NqLY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NqLY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:272940,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/197218423?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NqLY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NqLY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NqLY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NqLY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20d7c535-7b1c-4cba-8165-2461a64b3463_1100x220.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>If we want to make decisions in a complex society, we need a shared language. Experts on the ground must summarize complex situations in their communication with decision makers decoupled from the field. They need to make their experiences <em>legible</em> to those they report to.</p><p>The easiest way to make situations legible is to <em>quantify</em> them. To count things, record figures in tables, compute statistics, and make charts. Quantification sorts complexity into simple bins, simplifying communication both up and down the chain.</p><p>When we speak in such quantified numerical summaries, our statements feel <em>objective</em>. We believe that appropriate quantification isn&#8217;t be subject to the whims and opinions of an individual field worker. By agreeing upon standards, quantified measurements are now scientific facts.</p><p>Once we have objectivity, we have <em>authority</em>. Making decisions based on objective facts is obviously in the best interest of everyone else, and we impose threats of chastisement, ostracism, or violence upon those irrational individuals who disagree.</p><p>And once these numerical summaries that we made out of whole cloth to simplify communication become authoritative, they become <em>real</em>. They become things we should strive to maximize.</p><p>This is the <em>quantification trap</em>.</p><p>The quantification trap is social-scientific canon. You could build this story entirely out of texts written before the year 2000. The role of quantification, measurement, and legibility in statecraft is laid out by James C. Scott&#8217;s <em>Seeing Like a State </em>(1998)<em> </em>and Alain Desrosi&#232;res <em>The Politics of Large Numbers</em> (1993). Theodore Porter&#8217;s <em>Trust In Numbers </em>(1995) highlights the turn to quantification in pursuit of standardization and objectivity. The blind optimization of decontextualized metrics is core to Jean-Fran&#231;ois Lyotard&#8217;s characterization of <em>The Postmodern Condition</em> (1979).</p><p>Twenty-five years into the twenty-first century, I don&#8217;t think you should have to run a Science and Technology Studies sidequest to recognize the quantification trap. It&#8217;s obvious and almost trite when we say it out loud. It&#8217;s trendy to talk about how metrics and benchmarks are bad and to prattle on endlessly about Goodhart&#8217;s, Campbell&#8217;s, or Murphy&#8217;s Laws. And yet, we continue to organize ourselves around statistical summaries. Is the quantification trap an inevitable part of scale?  Is it an inevitable part of efficiency? Is it an inevitable part of the dismal hierarchy of bureaucratic power? The great puzzle of our contemporary condition is why it&#8217;s so hard to escape.</p><p>Part of the puzzle is that making society computable has dramatic benefits paired with every cost. The constant tension in mathematical rationality lies in the interplay between its sweet spots and its limitations. The quantification trap creates an intersubjectivity for collective action. Mathematically rational governance lets systems and hierarchies see, but also makes it easy to maintain their control. It facilitates posing clear questions and objectives, though crowds out nuance and multiplicity. It creates a shared understanding of standardization but removes the discretion of experts. It lets us speak about maximizing the average welfare of populations, but erases individuals.</p><p>If there are such clear trade-offs with quantification, why do we always tend to side with &#8220;the data?&#8221; The acceleration of computation has made the quantification trap exponentially more contagious. As computers became ubiquitous, the quantification process became inevitable and invisible. We don&#8217;t think about how we are tethered to unfathomable computing machines. They&#8217;re just part of who we are now. Our devices measure us all the time, recording time-on-site and click-throughs. Everything has a like button. All of these measurements are churned upon by data scientists hoping to hit their personal promotion metrics, regardless of whether the instrumentation means anything. The quantification trap is built out of an invisible fabric of computation.</p><p>I feel like I say this in the book, but never <em>say</em> it in the <em>Irrational Decision</em>. The book articulates the role of mathematical computation, optimization, and statistics as scaffolding in the elaborate quantification trap. To understand why we optimize what we optimize, it&#8217;s helpful to look at the history of computational methods and language boxing us in. The path from legibility to authority goes straight through computation and computerization. Quantification transforms experience into machine-readable data and a small number of interventions and outcomes. Decisions can only be automated once we throw away the messy, uncomputable parts. We maximize averages because it&#8217;s a convenient way to model uncertainty.</p><p>Now, I am by far not the only person to talk about the quantification trap. I wrote about it today because I felt I needed this placeholder after the last few weeks of talking about my book. However, if you want a reading list from the past 25 years, I can write us an impossibly long bibliography. Even in the past year, crossover books like Healy and Fourcade&#8217;s <em>The Ordinal Society </em>and Nguyen&#8217;s<em> The Score </em>have articulated the same conundrum.</p><p>It&#8217;s good that more people are talking about this. What we count, compute, and optimize is a political decision. Counting flattens complexity, and the choice of what is left is a question of power. The virality of the quantification trap forecloses better futures. We can&#8217;t strive for them if we can&#8217;t see the gilded cage we&#8217;re in.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[The Rationality of the Language Machines]]></title><description><![CDATA[Are LLMs mathematically rational?]]></description><link>https://www.argmin.net/p/the-rationality-of-the-language-machines</link><guid isPermaLink="false">https://www.argmin.net/p/the-rationality-of-the-language-machines</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Thu, 07 May 2026 14:39:58 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/fcd9bb14-ce1a-4ee6-9786-681be514bc56_840x599.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gJXE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gJXE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!gJXE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!gJXE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!gJXE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gJXE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:276418,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/196786875?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gJXE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!gJXE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!gJXE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!gJXE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba6f283-1073-4e5e-b4f1-ac0facbafd38_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>My least favorite part of writing books is their permanence. Minutes after I spend time with a hardcopy, I start seeing a bunch of things I should have added, removed, or modified. I&#8217;m not talking about typos&#8212;I expect and accept those will be abundant&#8212;but rather ways I&#8217;d have written the book differently if I could do it again. In particular, this last week of <em><a href="https://press.princeton.edu/books/hardcover/9780691272443/the-irrational-decision">Irrational Decision</a></em> events crystallized a few things that I wish I had commented on, but maybe couldn&#8217;t have.</p><p>I&#8217;ll spend the next few blog posts going over the questions raised that I wish I had addressed in the book. I was unhappy with my extemporaneous replies, and I&#8217;m hoping blogging might bring me more satisfactory answers.</p><p>I&#8217;ll start with the trillion-dollar elephant in the room: language machines. In our conversation on Tuesday, Lily Hu asked me how LLMs denote a shift away from the mathematical rationality of <em>The Irrational Decision</em>. For a long time, artificial intelligence systems were built by solving problems differently from how people solved them. For example, computers are better at chess than humans, but they do not approximate the way master chess players play. But LLMs, especially the chatbot interaction model that captured global attention and rocketed the valuations of these companies into the stratosphere, uncannily seem to act and respond like us. This seems like a departure from the cold rationality of computation.</p><p>The chat interface leans on the ambiguity of language to make for a satisfying interactive experience. <a href="https://ebiquity.umbc.edu/paper/html/id/1130/How-to-Make-a-Computer-Appear-Intelligent">Joseph Weizenbaum famously lamented</a> that when his colleagues tried to make a computer appear intelligent, they leaned on psychological parlor tricks. A machine that summarizes in iambic pentameter appears intelligent. A machine that solves a linear program in millions of variables is merely mechanically calculating.</p><p>So what does it mean to lean on language machines for decision making? This is a tricky question because LLMs are unfathomably complex software artifacts. If you use them under the harness of a coding agent, you can get them to optimize all sorts of things. For example, they are incredible at making code faster, outperforming any autotuning tool I&#8217;ve ever tried. You can easily get them to implement a Bayesian rational choice engine for you.</p><p>Certainly, the rationalist weirdos in charge at Anthropic have demanded that Claude be trained to recite back summaries of dogma from the <a href="http://lesswrong.com">lesswrong.com</a> comments section. So LLMs can and will parrot back the tenets of mathematical rationality to you. But natural language is not rational language. Indeed, the language of mathematical rationality is a Bayesian language game, always working to box out the unmeasurable and unquantifiable. It demands language without ambiguity, but of course, language is always ambiguous, fluid, and evolving.</p><p>You can try to squeeze out the ambiguity by prompting the machines to reply in logically structured language, like software or mathematics. Yet, even when you try to get the chat interfaces to give you rational answers, you still find perplexing mistakes that aren&#8217;t even logical, let alone rational. I liked this glaringly obvious one from last week.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cMoL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cMoL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png 424w, https://substackcdn.com/image/fetch/$s_!cMoL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png 848w, https://substackcdn.com/image/fetch/$s_!cMoL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png 1272w, https://substackcdn.com/image/fetch/$s_!cMoL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cMoL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png" width="1456" height="614" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:614,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:157962,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/196786875?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cMoL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png 424w, https://substackcdn.com/image/fetch/$s_!cMoL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png 848w, https://substackcdn.com/image/fetch/$s_!cMoL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png 1272w, https://substackcdn.com/image/fetch/$s_!cMoL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5bb847a-a918-4e0c-b40b-92fc3f570f50_1494x630.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>These sorts of errors remain vexingly common, but are becoming increasingly difficult to spot.</p><p>The coding machines I mentioned above, with their ability to execute, check errors, and optimize, feel like they are more rational. However, they force programmers to cosplay as project managers. Software engineers have to prompt in natural language that is kind, inspiring, and encouraging. There are <a href="https://github.com/obra/superpowers">massively popular GitHub repos</a> that help you find the right phrasing to make sure your eager agents have the right artificial mindset so they don&#8217;t make mistakes. This doesn&#8217;t seem very mathematically rational either.</p><p>Moreover, in the broader context of decision making, the vast majority of people do not have LLMs write optimization code to compute the decisions for them. Instead, they treat the chat like a consultant, and get interactive text back, sometimes with charts and figures for quantified comfort. Being advised by these outputs as to how to structure your life or business is closer to a magic eight ball than to a linear program. Writing in imprecise language and having a system trained to respond nicely certainly isn&#8217;t mathematical rationality. It&#8217;s what the early mathematicians tried to excise from decision making. What a weird turn.</p><p>But if we step back from the chat box, it&#8217;s not hard to place LLMs on a linear axis of mathematical rationality. The people who build these things religiously believe in mathematical rationality. They endorse the Bostrom-Russell nonsense that intelligence is attached to singular objective functions, and our robots will kill our dogs to make us coffee. Alignment researchers claim we just have to find the right objective function for post-training, and then we&#8217;ll have perfect human companions to build us a utopia of abundance.</p><p>Similarly, the guts of LLMs are the same computational pillars I describe in the book. LLMs seem like us, but they are still built upon the very unnatural model of statistically summarizing language and code via maximum likelihood estimation. People don&#8217;t do this.</p><p>Language machines run on complex software, hardware, and network infrastructure, but none of it would look alien to a computer engineer from 2006. Just like every computing system of the past two decades, they are tuned by optimizing engagement, maximizing userbase preferences in glorified A/B tests. The advances are charted by the same sort of rational, average-case benchmarking we&#8217;ve been doing in machine learning for decades. We make them better by blindly optimizing without understanding.</p><p>And let us not forget that these products were built upon fraud, theft, and rent extraction. They are enriching a small group of incredibly annoying people who claim a mantle of mathematical rationality and are promising to subjugate the rest of us in unemployment work camps. Though I started this blog lamenting what I didn&#8217;t cover in <em>The Irrational Decision</em>, maybe I got this one right: That we built this wildly irrational technology with mathematical rationality corroborates my argument.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[An Argmin Events Calendar]]></title><description><![CDATA[A quick rundown of four fun events this week.]]></description><link>https://www.argmin.net/p/an-argmin-events-calendar</link><guid isPermaLink="false">https://www.argmin.net/p/an-argmin-events-calendar</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Mon, 27 Apr 2026 14:02:45 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/12d48498-b47d-45e6-9795-729d3eaf34e3_840x599.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FmYH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FmYH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FmYH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FmYH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FmYH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FmYH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:199115,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/195628896?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FmYH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FmYH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FmYH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FmYH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bfaac6f-96ca-4b52-a371-d1af9cdac7c1_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>I wanted to give a quick rundown of some upcoming <em><a href="https://press.princeton.edu/books/hardcover/9780691272443/the-irrational-decision">Irrational Decision</a></em> events, as I&#8217;ve managed to stack four of them into a week.</p><p>On Wednesday, April 29, I&#8217;m visiting the University of Wisconsin-Madison to give the <a href="https://silo.wisc.edu/talk/2026-04-29/?wcs_timestamp=1777467050">SILO seminar</a>. It&#8217;s at 12:30 in the Orchard View Room of the Wisconsin Institute for Discovery building. Rob Nowak and I started this multidisciplinary seminar on systems, information, learning, and optimization fifteen years ago, and I&#8217;m super happy that it&#8217;s still going strong. It&#8217;s fitting that I&#8217;m going to be giving a talk about the roots of the SILO subjects, when they were all considered disciplinary, not multidisciplinary. Do come by if you&#8217;re on campus.</p><p>On Friday, May 1, I&#8217;ll be at the University of Chicago, giving a <a href="https://datascience.uchicago.edu/events/benjamin-recht-uc-berkeley-distinguished-speaker-series/">lecture</a> at the Data Science Institute at noon. I always love visiting Chicago and am looking forward to seeing the new data science building, which was a Lutheran seminary when I was an undergraduate there. Given my book&#8217;s framing, that&#8217;s quite a fitting, unintentional allegory.</p><p>On Tuesday, May 5, I&#8217;ll be participating in a conversation on &#8220;<a href="https://secure.givelively.org/event/boston-critic-inc/artificial-reason-a-conversation-on-ai-rationality-and-violence">AI, rationality, and violence</a>&#8221; with Kevin Baker, Sophia Goodfriend, and moderator Lily Hu. This is a free, online event hosted by The Boston Review at 3 PM Eastern Time. We&#8217;ll be discussing the &#8220;nature and meaning of rationality, how new technology is interfacing with old institutions, what popular AI discourses get wrong, and the consequences for politics, war, and social life in general.&#8221; I can&#8217;t wait to hash it out with three super brilliant people.</p><p>And finally, I&#8217;ll also be giving a talk about <em>The Irrational Decision</em> at 4 PM on May 5 at Berkeley. This presentation will be in conversation with Marion Fourcade and hosted by the Social Science Matrix and the BESI Technology Network. I&#8217;m looking forward to discussing the book with my on-campus colleagues there, as a way to celebrate the end of the semester. <a href="https://besi.berkeley.edu/event/the-irrational-decision-how-we-gave-computers-the-power-to-choose-for-us/">Details here.</a></p><p>Since I&#8217;ll be on the road, I expect blogging will be light for the next week. In transit, I&#8217;ll be writing down some thoughts for the summer session that I&#8217;ll probably post next week. Stay tuned!</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Engineering Architecture: A Syllabus?]]></title><description><![CDATA[Assembling a reading list on the theory of engineering architecture.]]></description><link>https://www.argmin.net/p/engineering-architecture-a-syllabus</link><guid isPermaLink="false">https://www.argmin.net/p/engineering-architecture-a-syllabus</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Thu, 23 Apr 2026 14:20:54 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/5dbd1c5d-76f1-4c64-8da8-caccba132069_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Lizn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Lizn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Lizn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Lizn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Lizn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Lizn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:341799,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/195243710?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Lizn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Lizn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Lizn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Lizn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F307273a6-b691-470d-a91e-d36e427f7e6c_1100x220.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>After spending a week trying to figure out what to say in one lecture, I realized I could teach an entire class on architecture. In fact, in hindsight, that&#8217;s what <em>this</em> class should have been! Oh well. I knew this from the outset, but I couldn&#8217;t figure out how to stitch a syllabus together last December. Inevitably, I had to work my way through a full semester of cleaning out the skeletons in my learning-for-control closet before I could figure out what I wanted to dive more into. To be clear, this is a success story and a model of how teaching classes ought to go.</p><p>Given that I&#8217;m already committed to a schedule for next year,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> a class on architecture will have to wait a bit. But nothing stops me from putting together a syllabus now, right? While preparing this week&#8217;s lecture, I assembled an <a href="https://docs.google.com/document/d/1nMwqRQZXPtpi_MoENbSuJZSkbGZySiBmIpF9mNlMQ44/edit?tab=t.0">unfortunately long reading list for this hypothetical class</a>. Synthesizing this material promises a very interesting story. Let me explain how I&#8217;m thinking about its contents.</p><p>Though architectural theory is figured out on the fly, adapting existing systems to manage newfound complexity, there are repeated patterns that we can extract from our contemporary human-cyber-physical infrastructure. The architecture class would attempt to synthesize the design principles needed for enabling diversity and error handling. The paper I referenced yesterday by Matni, Ames, and Doyle takes a stab at this sort of view, but I want to look beyond robotics. I&#8217;d want to cover as diverse a set of applications as possible while still maintaining some degree of cohesion.</p><p>I&#8217;d probably start with computing systems. You&#8217;ll get different answers about what is needed to build good architectures in your operating systems, programming languages, and networking classes. And maybe you should. I&#8217;ll keep repeating myself: I&#8217;m not convinced that there&#8217;s a &#8220;universal theory&#8221; of architecture. However, that doesn&#8217;t mean we can&#8217;t move up a layer of abstraction and draw the common threads together. What are the shared patterns in hardware, software, and network design? I&#8217;m particularly interested in studying the 75-year development of software from manually mapping bits on registers to the complex high-level languages of today. There are a lot of interesting theories on modularity, abstraction boundaries, and protocol design, and those should be thrown into the mix.</p><p>I&#8217;d also extend downward into the physical layer, adding a &#8220;cyberphysical&#8221; systems view that connects to larger systems like the power grid or transportation network (good references on these two topics are currently missing from the bibliography). I&#8217;d spend time on the history of architectures in robotics and control, where we have settled on a platform, arguably by the discipline-wide adoption of the Robot Operating System. The principles were there in the Apollo project: a separation between low-level control, sensing, navigation, and mid-level feedback, and high-level planning. There have been other proposed architectures, like Brooks&#8217; subsumption architecture, that didn&#8217;t gain much traction beyond the Roomba. There is something inescapable about the standard three-level architecture, and I want to unpack more about this diversity-enabled sweet spot.</p><p>I would like to examine some architectural theories in systems biology, especially those of Gerhart and Kirschner. We&#8217;d have to at least read some parts of <em>The Plausibility of Life</em>, mostly because it&#8217;s really good. I also think that we do learn a lot by reflecting our technology onto biological systems. I&#8217;m sure we&#8217;ll find interesting examples and insights by seeing how others have done this.</p><p>I also want to look at how we engineer architectures for organizing people. The complexity of the corporation and the computer grew symbiotically, and there are clear influences of human organizational behavior on information technology. There&#8217;s clearly a co-evolution of computing architectures with human architectures. Herb Simon, who has had as much influence on management science as computer science, would be a key figure here.</p><p>And since I can&#8217;t pass up an opportunity to dig into more Cold War technological history, we&#8217;d look at some classic theorizing about complex systems. This class would <em>not</em> be an aughties-era complex systems class. But I&#8217;d like to find the point in time before the network science people split off from the cyberneticists. So we&#8217;d go back and read Wiener and Weaver, Ashby and Simon, and look at what they got right and what they missed.</p><p>The <a href="https://docs.google.com/document/d/1nMwqRQZXPtpi_MoENbSuJZSkbGZySiBmIpF9mNlMQ44/edit?tab=t.0">bibliography</a> needs both growth and pruning. But I have plenty of time to get it in order. Help me flesh it out. What topics and references would you add? What other books on architecture, organization, protocols, and design should I throw on there? I&#8217;d love to see your suggestions in the comments.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I&#8217;ll teach &#8220;Forecasting: WTF?&#8221; in the Fall and probability in the Spring.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Freedom From Choice]]></title><description><![CDATA[Building a theory of the architecture of organizing machines and people.]]></description><link>https://www.argmin.net/p/freedom-from-choice</link><guid isPermaLink="false">https://www.argmin.net/p/freedom-from-choice</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Wed, 22 Apr 2026 14:10:02 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/430fda00-da92-4003-9b92-d82a20e8d9e6_840x599.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cnoC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cnoC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cnoC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cnoC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!cnoC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cnoC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:358009,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/195037546?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cnoC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cnoC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cnoC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!cnoC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17611ff8-d494-4868-ad8b-b9facfe65951_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><em>This is a live blog of Lecture 10 of my graduate seminar &#8220;Feedback, Learning, and Adaptation.&#8221; A table of contents is <a href="https://www.argmin.net/p/feedback-learning-and-adaptation-48f">here</a>.</em></p><p>Though the &#8220;theory&#8221; of computer science is most associated with algorithms and complexity, by far the most impactful theories all stem from architecture. Computer architecture, software architecture, network architecture. Architectural theory in computer science is seldom packaged in clean theorems, but there are implicit and explicit design principles that recur across dozens of abstraction layers.</p><p>Computing hardware, software, and network design all share key architectural concepts, but our courses don&#8217;t often cleanly connect the architectural dots across the application domains. In computer science, all of these different theories of architecture focus on designing hierarchical systems to support diversity and robustness. They all use similar building blocks, namely abstraction boundaries, layered hierarchies, and protocols for cross-layer communication. These protocols are all constraints that deconstrain.</p><p>In class, I walked through a few examples, though I had to entirely gloss over all the details. The result was <a href="https://people.eecs.berkeley.edu/~brecht/cs294_s26/ArchitectureLecture.pdf">my most Santa Fe Institute slide deck ever</a>, an endless scroll of ugly graphs of networks. I started with the internet, which has the clearest declarative design of all the architectures. Here&#8217;s a glimpse from Berkeley&#8217;s CS 168:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NqQw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NqQw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png 424w, https://substackcdn.com/image/fetch/$s_!NqQw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png 848w, https://substackcdn.com/image/fetch/$s_!NqQw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png 1272w, https://substackcdn.com/image/fetch/$s_!NqQw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NqQw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png" width="1456" height="592" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:592,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NqQw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png 424w, https://substackcdn.com/image/fetch/$s_!NqQw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png 848w, https://substackcdn.com/image/fetch/$s_!NqQw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png 1272w, https://substackcdn.com/image/fetch/$s_!NqQw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35f406d3-f6f8-42e0-a34d-3ae804770ad6_2048x832.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The internet enables diverse applications to run on diverse networks. It does so by enforcing seven layers of protocols. All of these protocols flow through the &#8220;narrow waist&#8221; of the Internet Protocol (IP), the jewel &#8220;constraint that deconstrains.&#8221; Since every application has to flow through a single protocol, you can have incredibly diverse physical networking below and incredibly diverse applications on top. The protocols fan out above and below IP to support the diverse goals. Notably, the transport layer supports TCP, which lets applications know if their packets arrived, and UDP, which doesn&#8217;t. The internet is designed for robustness by having a strict protocol list, but pushing all of the processing and thinking about those protocols to the edge.</p><p>I also briefly discussed software, operating systems, and hardware architectures in computer science. These systems are physically more localized and have different design constraints. Their main goal is to enable local physical scale so that computers can support fast, general-purpose software. As computers became faster, more complex, and more reliable, their design became more layered and hierarchical. Here&#8217;s an image of the timeline Alberto Sangiovanni-Vincentelli shared with me</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Q2Eo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Q2Eo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png 424w, https://substackcdn.com/image/fetch/$s_!Q2Eo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png 848w, https://substackcdn.com/image/fetch/$s_!Q2Eo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png 1272w, https://substackcdn.com/image/fetch/$s_!Q2Eo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Q2Eo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png" width="1396" height="720" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:1396,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Q2Eo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png 424w, https://substackcdn.com/image/fetch/$s_!Q2Eo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png 848w, https://substackcdn.com/image/fetch/$s_!Q2Eo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png 1272w, https://substackcdn.com/image/fetch/$s_!Q2Eo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F326c0b19-8b0e-4372-92ea-a68d8de3e37d_1396x720.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Rather than trying to design a computer chip from transistors, design cycles accelerate by letting engineers work at higher and higher levels of abstraction. Layered design now comes in to simplify choices. Alberto and Edward Lee like to echo <a href="https://www.youtube.com/watch?v=dVGINIsLnqU">DEVO</a>: &#8220;Freedom from choice is what you want!&#8221; By establishing clean abstraction layers, engineers can innovate at each layer without worrying about what happens above and below.</p><p>Now, this is the point in the lecture where I split with John Doyle. John likes to use layered architectures to understand biology. Yes, you can look at biology and see architecture. Indeed, the constraints that deconstrain terminology were coined by systems biologists. Marc Kirschner and John Gerhart use the notions of constraints and deconstraints to describe how common platforms in biology facilitate agile evolution into diverse phenotypes and species. Because the platform is conserved, this enables rapid evolutionary changes that wouldn&#8217;t be predicted by simple, uniformly random variation.</p><p>However, I always find that people project technology onto biology to organize and understand biological function. In the 1600s, the body was a bunch of clocks. In the 1800s, it was an engine. Now we think of it as a computer. I&#8217;m not saying these projections of technology onto biology aren&#8217;t useful, but I don&#8217;t think that we necessarily learn more about technology from seeking common patterns in biology. Indeed, I&#8217;d rather look at recurring patterns in <em>artificial</em> structures to identify commonalities and general principles.</p><p>So instead of looking to biology, let&#8217;s look to management. Because man, every computer architecture diagram looks like an industrial org chart. This is not accidental. They serve similar functions. Computing and the mega-organization grew symbiotically in the post-war period, and building complex computing infrastructure required complex organizations of people. Some individuals certainly made brilliant, important advances at isolated nodes of these networks. However, the genius of layered architecture is that they admit a diversity of narrow innovations at every layer that locally grows the architectural ruleset without disrupting what everyone else is doing. In organizations, we have specific reporting and evaluation protocols, rules for bonuses and promotions, and schemes for supporting diverse business goals. The organizational architecture serves functions similar to those of computer architecture.</p><p>In &#8220;<a href="https://arxiv.org/abs/2401.15185">Toward a Theory of Control Architecture</a>,&#8221; which I&#8217;ll discuss more in the next post, Nik Matni, Aaron Ames, and John Doyle set the stage with the Apollo project&#8217;s architecture, which bears a striking resemblance to today&#8217;s standard robotic architectures.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7uVu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7uVu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png 424w, https://substackcdn.com/image/fetch/$s_!7uVu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png 848w, https://substackcdn.com/image/fetch/$s_!7uVu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png 1272w, https://substackcdn.com/image/fetch/$s_!7uVu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7uVu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png" width="1456" height="983" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:983,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7uVu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png 424w, https://substackcdn.com/image/fetch/$s_!7uVu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png 848w, https://substackcdn.com/image/fetch/$s_!7uVu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png 1272w, https://substackcdn.com/image/fetch/$s_!7uVu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff388f01-3faf-4d60-a428-a6e04fa869af_1968x1328.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>You have low-level controllers at one layer, a synthesis of sensors and trajectory optimization in the middle, and a high-level planner at the bottom. Part of this is because the abstraction makes it easier to reason locally about mitigating the complexity of launching people to a cold, barren, airless moon. However, such complexity also required massive teams of people. Here&#8217;s a small part of the organization of the Apollo Spacecraft Project Office.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VfHt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VfHt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png 424w, https://substackcdn.com/image/fetch/$s_!VfHt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png 848w, https://substackcdn.com/image/fetch/$s_!VfHt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png 1272w, https://substackcdn.com/image/fetch/$s_!VfHt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VfHt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png" width="746" height="495" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:495,&quot;width&quot;:746,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VfHt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png 424w, https://substackcdn.com/image/fetch/$s_!VfHt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png 848w, https://substackcdn.com/image/fetch/$s_!VfHt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png 1272w, https://substackcdn.com/image/fetch/$s_!VfHt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb83dc2f5-c828-4043-80ab-30c57a744c1b_746x495.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A theory of architecture can&#8217;t neglect a theory of human organization. Both artificial structures work together to create the complex infrastructure underneath our contemporary condition.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Walk the Marble Malls]]></title><description><![CDATA[Identifying the elements of a theory of engineering architecture.]]></description><link>https://www.argmin.net/p/walk-the-marble-malls</link><guid isPermaLink="false">https://www.argmin.net/p/walk-the-marble-malls</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Mon, 20 Apr 2026 14:37:24 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/904f8341-66d4-4636-85c7-15fa5d694b20_840x599.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_kHm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_kHm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!_kHm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!_kHm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!_kHm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_kHm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:252260,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/194803568?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_kHm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!_kHm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!_kHm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!_kHm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd5133ce5-6394-4aac-bf48-07b04d29cc63_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><em>This is a live blog of Lecture 10 of my graduate seminar &#8220;Feedback, Learning, and Adaptation.&#8221; A table of contents is <a href="https://www.argmin.net/p/feedback-learning-and-adaptation-48f">here</a>.</em></p><p>John Doyle, <a href="https://ieeexplore.ieee.org/document/1101812">he of robust control infamy</a>, has been a close friend and mentor of mine for over two decades. And for the entirety of those two decades, he&#8217;s been yelling about the need for a unified theory of engineering architecture. If you know John, you&#8217;ve likely heard the same rants and seen his psychedelic PowerPoint slides with bowties, hourglasses, and bonobos. He&#8217;ll show you a picture of the internet stack and a picture of the organization of bacterial metabolism, and expect you to see that they are the same thing.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!q9mt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!q9mt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png 424w, https://substackcdn.com/image/fetch/$s_!q9mt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png 848w, https://substackcdn.com/image/fetch/$s_!q9mt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png 1272w, https://substackcdn.com/image/fetch/$s_!q9mt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!q9mt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png" width="1456" height="816" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:816,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!q9mt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png 424w, https://substackcdn.com/image/fetch/$s_!q9mt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png 848w, https://substackcdn.com/image/fetch/$s_!q9mt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png 1272w, https://substackcdn.com/image/fetch/$s_!q9mt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7f46f414-488b-49ea-abba-19a3cb73d209_1688x946.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Now, I spent a good chunk of the last week trying to figure out how to pack John&#8217;s grand unified theory into a single lecture. Just like with simulation, I failed. However, I came out much more convinced that John is onto something than I went in. Let me do my best to explain why without sounding like a complex-systems crazy person.</p><p>In this class, and in control research more generally, we keep getting trapped by optimization. It&#8217;s always easiest to frame problems in terms of optimization problems, and then worry about the particulars of the solution method. However, optimal control is far too limited and brittle for any practical application. It is more often than not a <a href="https://www.argmin.net/p/reticulating-splines">simulation steering problem</a>: we build a simulator at some abstraction level, and then shape a policy by designing an appropriate set of costs and constraints. The framework forces us to operate at a particular abstraction layer, which means we end up at a specific point on the <a href="https://www.argmin.net/p/action-impact-tradeoffs">action-impact trade-off curve</a>. <a href="https://www.argmin.net/p/at-least-its-an-ethos">We can&#8217;t design for hidden states, and we&#8217;re sensitive to modeling errors.</a> <a href="https://www.argmin.net/p/you-play-to-win-the-game">It forces us to model unknowns as average-case or worst-case disturbances</a>. Optimization is specific and rigid, whereas control systems need to be diverse and flexible. What&#8217;s the right way to think about diversity and flexibility?</p><p>Fortunately, we have a lot of existence proofs to learn from when trying to answer that question. The world is run on complex, engineered feedback systems with astounding robustness, diversity, and flexibility. We transmit thousands of trillions of bits to each other every second on the internet. We maintain electricity for billions of people. We can have any product we can imagine delivered to our doorstep. We can get from our house to almost any point on earth in a couple of days. We carry around supercomputers in our pockets so we can watch vertical videos whenever we&#8217;re bored. We live in a world of engineering miracles that are more robust than any LQR system. So how in the world do they work?</p><p>John Doyle is not alone in arguing that the answer is <em>architecture</em>, a set of organizing principles for engineering design. Architecture is the rules and protocols for assembling components to enable <em>diversity</em> in system execution. You want systems that can accommodate a diversity of objectives: balancing speed, accuracy, and impact. You&#8217;d like to be able to solve a diversity of tasks. You&#8217;d like to accommodate a diversity of end users. Diversity is a particular kind of robustness. It&#8217;s robustness to intent. And you have to design for it.</p><p>Looking at the last hundred years of engineering, you definitely see repeated patterns in architectural design. First, the need for hierarchy to handle complexity isn&#8217;t surprising. <a href="https://www.jstor.org/stable/985254">Herb Simon was already on this in the sixties</a>. But graph structure and emergulence are not enough.</p><p>Feedback is essential. As we&#8217;ve seen throughout the class, you can take two systems with dramatically different behaviors and put together a system with &#8220;best of both worlds&#8221; through feedback. <a href="https://www.argmin.net/p/links-and-loops">A powerful amplifier and a precise attenuator combine in feedback to make a powerful, precise amplifier.</a> <a href="https://www.argmin.net/p/secrets-of-intelligence-services">A language machine that can recapitulate all software in feedback, using a simple agent harness with iterative exception handling, lets you build and manage complex software packages</a>. Architecture lets you scale this feedback design principle.</p><p>The key to scalable architectures is protocols. Computer science &#8212; the engineering discipline of scaling logical systems &#8212; is obsessed with architectural protocols. To build a complex system like the internet or the modern computer, you build a hierarchy of abstraction boundaries. You design interfaces to talk across these boundaries with clean, well-specified protocols. The protocols let each system operate with a particular set of agreements about what the other will do. When you stack these protocols together, you get ridiculously impressive diversity.  From this design strategy, you can build out the internet, the integrated circuit, the cell, and the control system. The internet serves arbitrary applications on arbitrary physical layers, all funneled through a set of contracts with IP in the middle. We can design a diversity of complex computer chips from standard cells and data flow models. In each of these cases, each layer only speaks to its neighbors in a structured hierarchy.</p><p>Finally, there is a central concept that seems to drive architectural design: constraints that deconstrain. They are restrictions on what we can do at one point of the hierarchy that end up enabling diversity at another. Constraints that deconstrain were proposed by Marc W. Kirschner and John Gerhart as a facilitator of evolution. Systems engineers like Alberto Sangiovanni-Vincentelli and Edward Lee emphasize how they provide a Devo-esque &#8220;Freedom of choice,&#8221; removing the paradox of choice and enabling more efficient design cycles. I&#8217;ll connect this to similar architectural theories of computer scientists and electrical engineers.</p><p>Engineering architecture is far too much to cram into a single lecture, but I&#8217;ll give a brief introduction to the ideas this week on the blog. I&#8217;ve come to the conclusion that there&#8217;s a whole semester&#8217;s class to be taught here. How better to end a class than by describing what the next class looks like?</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Structured Uncertainties]]></title><description><![CDATA[A brief introduction to the structured singular value and what it teaches us about uncertainty quantification.]]></description><link>https://www.argmin.net/p/structured-uncertainties</link><guid isPermaLink="false">https://www.argmin.net/p/structured-uncertainties</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Fri, 17 Apr 2026 16:59:40 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/164ac04c-169a-467b-9ed5-386f31760833_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6ro-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6ro-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6ro-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6ro-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6ro-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6ro-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:285811,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/194537068?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6ro-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6ro-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6ro-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6ro-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3239cf74-d901-40f1-a801-8c835de8da52_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><em>This is a live blog of Lecture 9 of my graduate seminar &#8220;Feedback, Learning, and Adaptation.&#8221; A table of contents is <a href="https://www.argmin.net/p/feedback-learning-and-adaptation-48f">here</a>.</em></p><p>The problem with a fast-paced course is that I keep hitting topics I want to dig into but am forced to move on. Simulation is <em>fascinating</em>, and I need to spend more time with its history and nuance. I guess that just means I&#8217;m going to add it to the syllabus of my next graduate course.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>But I wanted to post about one fun thing I learned this week that, while not directly related to simulation, does seem to have some broader lessons. I came to better appreciate <a href="https://authors.library.caltech.edu/records/9de69-h7s46/files/04642148.pdf">John Doyle&#8217;s </a><em><a href="https://authors.library.caltech.edu/records/9de69-h7s46/files/04642148.pdf">structured singular value</a></em>, often called by its Greek name &#8220;&#120583;&#8221; or &#8220;mu.&#8221; The lessons it teaches about interconnection and uncertainty, though perhaps not always computable, are quite general and important.</p><p>Let&#8217;s go back to the <a href="https://www.argmin.net/p/links-and-loops">recurring simple feedback loop</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RS5r!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RS5r!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png 424w, https://substackcdn.com/image/fetch/$s_!RS5r!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png 848w, https://substackcdn.com/image/fetch/$s_!RS5r!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png 1272w, https://substackcdn.com/image/fetch/$s_!RS5r!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RS5r!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png" width="542" height="125.07692307692308" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:336,&quot;width&quot;:1456,&quot;resizeWidth&quot;:542,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RS5r!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png 424w, https://substackcdn.com/image/fetch/$s_!RS5r!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png 848w, https://substackcdn.com/image/fetch/$s_!RS5r!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png 1272w, https://substackcdn.com/image/fetch/$s_!RS5r!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ee4a9b2-2a11-4afb-9437-252a65755b72_1976x456.png 1456w" sizes="100vw"></picture><div></div></div></a></figure></div><p>Here, C is the controller we&#8217;re designing, P is the plant we&#8217;re trying to steer. I&#8217;ve added a new block, &#916;, to represent some uncertain system in our feedback loop. When &#916; equals zero, we have our nominal system model. The structured singular value asks the following question: if I have a design that works without uncertainty, how much margin for error do I have? How large can &#916; be before the system goes unstable?</p><p>A standard problem in linear control models &#916; as a simple scalar. In that case, because of how the equations work out, the uncertainty question is about the gain margin of the linear system. How much can you amplify or attenuate the plant before the closed-loop system goes unstable? Asked another way, how well do you need to know the amplification factor to guarantee stable operation? Or, let&#8217;s say you have a bunch of plants that all have reasonably similar dynamics but different gains. Is your controller good enough for all of them? The uncertainty could model the mass of a flying vehicle or the insulin sensitivity of a person with diabetes. Steady-state control of both of these systems relies on some robustness to uncertainty.</p><p>One of the classic results about the linear quadratic regulator is that its stability is maintained for any &#916; between -&#189; and infinity. That&#8217;s a good gain margin! Many other control design techniques from classical control using Nyquist plots can also guarantee large gain margins for single-input, single-output systems.</p><p>However, the problem becomes a lot trickier when you need to control a plant with many inputs and outputs. Most control systems are networks of interconnected feedback loops, not just simple single-input, single-output systems. In my favorite control system, the espresso machine, you might have a PID controller for water temperature and another for water pressure. You could calibrate these by tuning each PID parameter, one at a time. But obviously, these two loops interact with each other. They also interact with your grind and your tamping.</p><p>An industrial process, a chemical plant, or a robot has a networked control system of far greater complexity. You might be able to write out performance guarantees for each loop in the system, and that might look fine on its face. But if these loops are coupled, your margin calculations might be misleading.</p><p>To see why, we can look at a static example, like we did with the feedback amplifier. Imagine we&#8217;re trying to get a plant to track a constant reference signal. The controller compares the reference signal with the plant&#8217;s output and applies a new input if the difference is large. This signal sets a different set point for each loop. We can compute the steady state of our system by looking at a matrix equation. Indeed, slightly abusing notation, we can think of the steady-state maps C, P, and &#916; as matrices (these are the DC responses of each system). The map from the error signal input of the controller, e, to the output of the uncertainty, y, is a system of equations:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ftQG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ftQG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png 424w, https://substackcdn.com/image/fetch/$s_!ftQG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png 848w, https://substackcdn.com/image/fetch/$s_!ftQG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png 1272w, https://substackcdn.com/image/fetch/$s_!ftQG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ftQG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png" width="174" height="26" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:26,&quot;width&quot;:174,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ftQG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png 424w, https://substackcdn.com/image/fetch/$s_!ftQG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png 848w, https://substackcdn.com/image/fetch/$s_!ftQG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png 1272w, https://substackcdn.com/image/fetch/$s_!ftQG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb63a9050-2a50-43a6-9fba-d2e680be5f19_174x26.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Using the fact that the error is the difference between the reference and the output, e = r - y, we can compute the steady-state output as a function of the reference signal. It&#8217;s not the prettiest formula, but you can write it out in closed form and stare at it:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1MXz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1MXz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png 424w, https://substackcdn.com/image/fetch/$s_!1MXz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png 848w, https://substackcdn.com/image/fetch/$s_!1MXz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png 1272w, https://substackcdn.com/image/fetch/$s_!1MXz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1MXz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png" width="378" height="28" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/be6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:28,&quot;width&quot;:378,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1MXz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png 424w, https://substackcdn.com/image/fetch/$s_!1MXz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png 848w, https://substackcdn.com/image/fetch/$s_!1MXz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png 1272w, https://substackcdn.com/image/fetch/$s_!1MXz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbe6b2432-ad19-4ffd-8e96-9886896eed84_378x28.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>If the open-loop map from the controller input to plant output &#8212; the matrix (I+&#916;) PC &#8212;  is sufficiently large, the output of the plant will be approximately equal to the reference input. However, there&#8217;s a catch. We need to know that matrix never has an eigenvalue of -1 for any instantiation of the uncertainty. If it does, then the inverse in the above matrix expression isn&#8217;t defined, and the expression blows up in unpleasant ways. We&#8217;d say the closed-loop system was unstable.</p><p>Hence, we can capture a notion of multivariate robustness by finding the smallest perturbation that makes that matrix singular. The tricky part is that you get different answers based on what sorts of uncertainties you believe are plausible.</p><p>Consider the classic gain margin question. For simplicity, define the matrix</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hNJC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hNJC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png 424w, https://substackcdn.com/image/fetch/$s_!hNJC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png 848w, https://substackcdn.com/image/fetch/$s_!hNJC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png 1272w, https://substackcdn.com/image/fetch/$s_!hNJC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hNJC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png" width="212" height="28" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:28,&quot;width&quot;:212,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hNJC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png 424w, https://substackcdn.com/image/fetch/$s_!hNJC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png 848w, https://substackcdn.com/image/fetch/$s_!hNJC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png 1272w, https://substackcdn.com/image/fetch/$s_!hNJC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9528d648-7979-4d82-a65c-2f6e68eb52a8_212x28.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>When &#916; is a scalar, you are just looking for the smallest number such that</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jws3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jws3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png 424w, https://substackcdn.com/image/fetch/$s_!jws3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png 848w, https://substackcdn.com/image/fetch/$s_!jws3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png 1272w, https://substackcdn.com/image/fetch/$s_!jws3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jws3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png" width="176" height="26" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:26,&quot;width&quot;:176,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jws3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png 424w, https://substackcdn.com/image/fetch/$s_!jws3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png 848w, https://substackcdn.com/image/fetch/$s_!jws3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png 1272w, https://substackcdn.com/image/fetch/$s_!jws3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3628ca6b-14b1-4063-b37b-2bddac95e2af_176x26.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>This number is precisely equal to the inverse of the magnitude of the largest eigenvalue of T. By contrast, if you think that you can have uncertainty that couples channels of your system together, the size of the uncertainty you can handle is much smaller. Indeed, you can check that if you allow for the uncertainty to be an arbitrary matrix, the norm of the uncertainty has to be smaller than the inverse of the magnitude of the largest <em>singular value</em> of T.</p><p>Singular values are always larger than eigenvalues. Sometimes, they can be <em>much</em> larger. For instance if</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jAuT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jAuT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png 424w, https://substackcdn.com/image/fetch/$s_!jAuT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png 848w, https://substackcdn.com/image/fetch/$s_!jAuT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png 1272w, https://substackcdn.com/image/fetch/$s_!jAuT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jAuT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png" width="174" height="60" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:60,&quot;width&quot;:174,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jAuT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png 424w, https://substackcdn.com/image/fetch/$s_!jAuT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png 848w, https://substackcdn.com/image/fetch/$s_!jAuT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png 1272w, https://substackcdn.com/image/fetch/$s_!jAuT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F081007bc-7688-4808-b5ba-6a7f5217890d_174x60.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Then the eigenvalues of T are &#189;, and the maximum singular value of T is approximately 250. If the uncertainties were just a multiple of the identity, it would appear very robust to perturbations, handling disturbances with gains up to a magnitude of 2. However, if general matrix uncertainty were allowed, you could only handle disturbances with gains of magnitude at most 0.004.</p><p>The structured singular value lets you figure out what this magnitude is for whatever plausible model of uncertainty you can construct. Maybe only a subset of the loops is coupled. Maybe &#916; has block structure. Each structure gives you a different number in between the spectral radius and the norm of your system&#8217;s complementary sensitivity, T.  The structured singular value generalizes beyond this simple matrix example to general linear systems. It lets you compute bounds even when the uncertain blocks are themselves structured dynamical systems.</p><p>For people designing mission-critical linear feedback systems, you should learn all of the details. For everyone else who is stuck with nonlinear systems, there are still lessons to take away. Nonlinearity seldom makes our lives easier! If a robustness problem presents itself when we look at simple linear instances, we shouldn&#8217;t just hope that it&#8217;s not there on hard nonlinear ones. This is one of the reasons that in our post-math age of YOLO scaling, it&#8217;s useful to learn a little bit of math to be a little bit chastened. Though I suppose if you do it that way, you&#8217;ll never make a dime.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>It is indeed already on there, my friends.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Reticulating Splines]]></title><description><![CDATA[The long legacy of simulation-based control.]]></description><link>https://www.argmin.net/p/reticulating-splines</link><guid isPermaLink="false">https://www.argmin.net/p/reticulating-splines</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Wed, 15 Apr 2026 14:12:27 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/3a0f2f7b-4eeb-4aee-a118-2bb1bc653aaa_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!w0J3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!w0J3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!w0J3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!w0J3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!w0J3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!w0J3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/da2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:294299,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/194299287?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!w0J3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!w0J3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!w0J3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!w0J3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fda2d13bb-5295-4769-b27f-74411160d89d_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><em>This is a live blog of Lecture 9 of my graduate seminar &#8220;Feedback, Learning, and Adaptation.&#8221; A table of contents is <a href="https://www.argmin.net/p/feedback-learning-and-adaptation-48f">here</a>.</em></p><p>As I mentioned Monday, one of the big paradigms in modern robotics and control is the &#8220;sim2real&#8221; pipeline. People invest in complex computer simulators to test their robotic policies. The simulators have detailed dynamic and kinematic models of the robot and how it moves in contact with varied terrain and obstacles. The hope is that by burning through infinite GPU credits to troubleshoot every possibility in simulation, they can deploy code to their actual robot and need no troubleshooting once it&#8217;s unleashed in the real world.</p><p>While the young folks like to make this paradigm sound like a novel new research program, all of optimal control rests on the sim2real pipeline. Think about the core problem of optimal control: the linear quadratic regulator. This problem looks for a control sequence that minimizes a quadratic cost subject to the world evolving according to a linear dynamical system. Control theorists banged their heads against this problem for decades, and we are now taught the beautiful dynamic programming derivations that reduce this problem to solving a compact equation. However, we can also solve it using gradient descent. The gradient computation amounts to simulating the system with the current control policy, computing the sensitivity of the cost trajectory to each control decision, and then adding this information up to compute the gradient.</p><p>The lovely thing about gradient descent is that it gives you a solution technique for general optimal control problems with nonquadratic costs or nonlinear dynamics. You evaluate your policy under the current control, run a dynamical system backward in time to compute how sensitive the trajectory was to your control decisions, and then add up the contributions of each time point to get the full gradient. <a href="https://asmedigitalcollection.asme.org/appliedmechanics/article/29/2/247/386190/A-SteepestAscent-Method-for-Solving-Optimum">Arthur Bryson invented this method</a> to compute gradients of general optimal control problems in 1962. Today, we call his algorithm <a href="https://archives.argmin.net/2016/05/18/mates-of-costate/">backpropagation</a>. This simulation-based gradient method provides incremental improvement of policies for any differentiable dynamical model and any differentiable cost function.</p><p>Now, if your simulation isn&#8217;t differentiable, maybe you&#8217;ll use a different sort of policy optimization method to solve your optimal control problem. However, reinforcement learning for robotics is still optimal control. RL for robotics minimizes a designed cost function subject to dynamics. The modern departure is that no one bothers to write down the equations of motion anymore. They just assume the simulator will compute them.</p><p>This belief pushes a lot of work onto the simulator. GPU cycles are sadly neither free nor abundant. It would be nice to minimize the simulation time and cost required to find a good control policy. It would be particularly nice because many people would like to have a simulator on board the actual robot to compute policies with methods like model predictive control. This begs the question of how accurate your simulation needs to be.</p><p>Unfortunately, no one knows. We all think that if you can act quickly enough with enough control authority, then a really simple model should work. But it&#8217;s impossible to quantify &#8220;enough&#8221; in that sentence. You have to try things out because dynamical processes are always surprising.</p><p>While it feels like increasing the fidelity of a simulator to the minute details of physical law always improves performance, this is not remotely the case. In class on Monday, Spencer Schutz presented a <a href="https://ieeexplore.ieee.org/document/7225830">paper on autonomous driving</a> showing a simple, inaccurate kinematic model with a low sampling rate performed just as well as a more accurate dynamic model. Anyone who&#8217;s spent time with dynamic models knows that very high-dimensional complex systems often look simple when you have limited controllability and observability. This is the basis of thermodynamics, where infinitely many bodies colliding collectively produce fairly boring dissipative behavior. Many complex-looking circuits have the input-output behavior of resistors.</p><p>On the other side of the coin, safe execution demands identification of subtle aspects of input-output relationships. You can have two dynamical systems with nearly identical behavior perform completely differently once in a closed loop circuit. You can also have systems with completely different behavior look the same in closed loop. <a href="https://www.argmin.net/p/the-same-but-different">I worked through a few examples of this phenomenon in a blog post a couple of years ago</a>. Your model needs to be perfect in exactly the right places. But it&#8217;s usually impossible to know those places in advance.</p><p>To make matters worse, you can&#8217;t really identify the parameters of a robot in open loop. An expensive robot is always going to be running with its low-level controllers on both for its safety and yours. The actual parameters of closed-loop systems can&#8217;t be identified.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> So you&#8217;re stuck with guesses in your simulator, and you have to hope that your plausible parameters are good enough for your sim2real application.</p><p>The most popular solution to this identification problem is <em>domain adaptation</em>. Since you can only find a range of parameters that describe reality, you build a control policy that works for randomly sampled parameters. By constantly sampling different parameters in each run, you build a policy that performs well on average across all possible parameters.</p><p>Finding controllers that work for the average model isn&#8217;t new. Indeed, this is just a variant of optimal control called <em>dual control</em>, which has seen bursts of interest since the 1960s. Dual control is literally the problem of minimizing an expected control performance over a distribution of parameters.  Like dual control, domain adaptation needs a good prior model for how the environment &#8220;samples&#8221; parameters. But you can also just YOLO and hope that as long as you include all the edge cases, you&#8217;ll never crash. That&#8217;s the machine learning mindset, after all.</p><p>But what does it mean to sample the coefficient of friction of a surface? What&#8217;s the right distribution of coefficients of friction? This is again a tricky question.</p><p>One approach to modeling the distribution of parameters is to add an element of adversarial behavior to the system. We can adapt the simulations to find hard parameter settings and train more on those. We can have the simulator learn to trip up the robot. Rather than minimizing expected cost, we are working to minimize a worst-case cost, where the supremum is over a distribution of parameters or disturbances. The dual control people were really into this sort of minimax robustness in the 60s. But practice in aerospace applications ultimately pushed the community to robust control.</p><p>But people hate robust control because it gives them conservative policies. Computer scientists love to hack and ship. Look how productive they&#8217;ve been! You only need to write a few tests and make sure your simulator passes those. No bugs detected, LGTM! What could go wrong, right?</p><p>Is that last paragraph about coding agents? It might be.</p><p>But regardless, robust control pointed out that unmodeled uncertainties are everywhere, and they can be out there to bite you if you&#8217;re not careful. For its entire history, robust control advocates have been haranguing people about the limits of simulators. They note a couple of significant problems: first, training on a simulator <a href="https://archives.argmin.net/2020/07/14/there-are-none/">often means fitting to quirks of the simulator that don&#8217;t appear in the real world</a>. This is a major danger, even in linear systems. Second, many apparent parametric robustness properties of optimal controllers break down under scrutiny.</p><p>In class, I introduced the <a href="https://www.sciencedirect.com/science/article/abs/pii/000510989390175S">structured singular value</a> to motivate this issue. The structured singular value showed that when you had a system with many inputs and outputs, and you only considered independent perturbations, you&#8217;d convince yourself that a system was stable when it was not remotely stable. Guaranteeing stable behavior required understanding the dependencies between different errors. But how you test stability in simulation is not clear.</p><p>We are thus left considering a strategy beyond sim2real: sim2real2sim2real. Or sim2real2sim2real2sim2real. You deploy the system and find out what didn&#8217;t work in reality. And then you go back to your simulator, add a few thousand lines of code to account for the mistake, and try again. The software state of mind is that we can always patch mistakes. You can have an all-hands, blameless post-mortem and say it won&#8217;t happen again. This drives the old control theorists mad, but it&#8217;s been great working so far, so why change course?</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>In case you haven&#8217;t encountered this before, suppose you are trying to model a closed-loop system x[t+1] = Ax[t]+ Bu[t], u[t]= Kx[t]. Then for an arbitrary matrix E<sub>B</sub>,</p><p>A+BK = (A -E<sub>B</sub>K)+(B+E<sub>B</sub>) K</p><p>Hence, you can only identify a subspace of possible dynamical systems describing your data.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Purposeful Predictions]]></title><description><![CDATA[What is simulation, and what is it good for?]]></description><link>https://www.argmin.net/p/purposeful-predictions</link><guid isPermaLink="false">https://www.argmin.net/p/purposeful-predictions</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Mon, 13 Apr 2026 14:49:19 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/ad0cb30b-892c-4a0b-845f-e5f97adcdee0_1100x219.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!M4ut!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!M4ut!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!M4ut!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!M4ut!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!M4ut!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!M4ut!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:272200,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/194079361?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!M4ut!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!M4ut!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!M4ut!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!M4ut!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9eb24b7-3750-4a3c-91ae-dd5ee2cf5467_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>Every engineer and scientist knows there is a fundamental difference between a &#8220;simulation&#8221; and a &#8220;prediction,&#8221; but what is the root of that distinction? At the highest level, we contrast simulation against black-box modeling. Simulations are typically thought of as &#8220;transparent boxes&#8221; where we can describe the intent of each part of the model that produces a forecast.</p><p>A roboticist might think of a simulation as a computer system designed to integrate the differential equations that define basic laws of physics. For example, you predict the path the airplane takes based on physical models of lift and drag and how the plane moves under different control settings. Simple simulations based on reduced equations might suffice for some tasks. For others, we might have to rely on computational fluid dynamics to truly capture the behavior we&#8217;re after.</p><p>The transparent box becomes murky when systems are too complex to predict precisely. Many designers accept adding randomness to their simulations, provided they can characterize the statistical models as plausible. The dynamics of coin flipping are too hard to capture precisely, but we&#8217;re usually fine with a random number generator that produces an even number of heads and tails. Noise in measurement devices often reliably has statistics that match those of Gaussian or Poisson random numbers, and such stochastic processes are reasonable stand-ins for the sorts of signals we&#8217;ll encounter in the wild. Maybe you can simulate elections based on random numbers derived from current polling results.</p><p>Where do we draw the line between sampling and simulation? I maintain that <a href="https://www.argmin.net/p/digitally-twinning">LLMs are simulations of language</a>. We train next-token predictors in language models so that their generation matches the statistical properties of the data. Indeed, maximum likelihood selects probability distributions that make past sequences likely in the future. I&#8217;ve received a lot of pushback on this because the samples generated by the transformer are too black-box to count as simulations. This reaction suggests to me that some people want simulations to arise from models with articulable causal explanations.</p><p>The academic literature on simulation is also intentionally vague about the difference between modeling, sampling, and simulation. But this quote from the 1975 textbook <em>Systems Simulation: The Art and Science</em>, by industrial engineer Robert Shannon, highlights a crucial feature of simulation:</p><blockquote><p>&#8220;Simulation is the process of designing a model of a real system and conducting experiments with this model for the purpose either of understanding the behavior of the system or of evaluating various strategies (within the limits imposed by a criterion or set of criteria) for the operation of the system.&#8221;</p></blockquote><p>For Shannon, simulation is <em>purpose-driven</em>. You replace a real system with a model, and then evaluate counterfactuals in the modeled world. A forecast that is not evaluating a counterfactual configuration or strategy is not a simulation. Simulation is anything where we can evaluate counterfactual futures and gain insights from them.</p><p>Simulations can help engineers describe the behavior of complex systems and build theories and hypotheses for why that behavior occurs. Engineers can also use them to predict future behavior of the system if they were to intervene with some new policy or if an external force acted to change some parameters.</p><p>Under this broad tent, <em>optimal control</em> is simulation. Since everyone learns LQR first, we get such clean formulas out that we don&#8217;t think of this as a simulator. We think of this as an analytical technique. But if you instead solve LQR by gradient descent, you&#8217;ll find that you need to simulate to compute a gradient. This is the &#8220;forward pass&#8221; in backpropagation, a method for computing gradients that was initially invented to solve optimal control problems.</p><p>Indeed, the process of solving LQR by gradient descent looks like this: You pick a cost function that seems to match your design specification. You try a particular control policy out. You get a signal back based on its performance under your cost function. You use this signal to modify your control policy to a policy with lower cost and try again. Once you have repeated this enough times so that you don&#8217;t think you can further improve, you deploy the control system trained in simulation.</p><p>AI people have coined a cutesy name for this iterative control design process: &#8220;<a href="https://arxiv.org/abs/2502.08844">sim2real</a>.&#8221; On the one hand, sim2real looks like it&#8217;s doing something far more sophisticated than optimal control. The simulators they use are highly complex, their control policies are neural networks, and their cost functions are a clever pastiche of best past practice. However, robotic sim2real is a short conceptual hop from <a href="https://asmedigitalcollection.asme.org/fluidsengineering/article-abstract/80/8/1820/1132229/Optimal-Synthesis-of-Linear-Sampling-Control?redirectedFrom=fulltext">Kalman&#8217;s papers</a> on basic linearization of chemical plants in the 50s.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> And just as today&#8217;s roboticist wishes Nvidia GPUs were cheaper, Kalman and Koepcke lament how they would be better served by more compute.</p><p>The question then becomes, how good does your simulation need to be for control? In their description of sim2real, <a href="https://arxiv.org/abs/2502.08844">Zakka et al.</a> discuss the demand for the highest-fidelity simulations possible. But what does that even mean for a simulation to be high fidelity? How can you validate the assertion of high fidelity? Components with dramatically different behaviors look the same once they are interconnected in feedback loops. How can we identify what modeling is necessary? Once they are connected in feedback, identifying actual parameters becomes impossible. What is the right way to deal with uncertainty in the simulators? Is &#8220;domain adaptation,&#8221; the hot trend of the last decade where we simulate a lot of different plausible environments, the right way to make progress? Which transparent boxes can be replaced with black boxes? These are some of the questions I&#8217;ll dig into during today&#8217;s lecture. In the next post, I&#8217;ll report on partial answers.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>If you are a roboticist, you should read that paper to see how Kalman should be credited with inventing Iterative LQR.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Calibrated Games]]></title><description><![CDATA[Solving the game of forecasting with accounting strategies.]]></description><link>https://www.argmin.net/p/calibrated-games</link><guid isPermaLink="false">https://www.argmin.net/p/calibrated-games</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Fri, 10 Apr 2026 14:45:51 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/0bdce3cc-619c-4ca7-a8d1-37fce299abec_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Hz8m!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Hz8m!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Hz8m!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Hz8m!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Hz8m!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Hz8m!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:107668,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/193800934?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Hz8m!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Hz8m!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Hz8m!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Hz8m!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa379bf04-1660-496a-a880-511e47f4cc93_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><em>This is a live blog of Lecture 8 of my graduate seminar &#8220;Feedback, Learning, and Adaptation.&#8221; A table of contents is <a href="https://www.argmin.net/p/feedback-learning-and-adaptation-48f">here</a>.</em></p><p>One of the main uses of simulation and forecasting in designed feedback systems is for deciding how to act. If I can map what will happen next, I can choose actions that steer me toward good outcomes. This mindset seems perfectly sensible, and it&#8217;s the backbone of statistical decision theory, tree search in game play, optimal control, and model predictive control. Moreover, people who are good at prediction get clout. You can even win money in markets. It seems like forecasting is a skill and talent, and one that requires deep knowledge of how the world works. And yet, in class on Monday, I discussed how you can make excellent forecasts by simple, strategic accounting.</p><p>To understand why, let&#8217;s examine how we know if forecasts are good. It&#8217;s sort of obvious, but we can only evaluate our predictions of the future once the future has become the past. I can&#8217;t tell how good your forecast is until the forecast event occurs. No matter how much we think about setting up ungamable metrics, forecasters can only be evaluated retrospectively. And this retrospective nature means we can cast forecasting in the game theoretic framework from last week. Let me write out the rules in the format I&#8217;ve been using.</p><p>We have a two-player game with repeated interactions. In every round t,</p><ol><li><p>Information x<sub>t</sub> is revealed to both players.</p></li><li><p>Player One makes the forecast p<sub>t</sub></p></li><li><p>Player Two takes reveals the actual outcome y<sub>t</sub></p></li><li><p>A score s<sub>t</sub> is assigned based on the triple (x<sub>t</sub>,p<sub>t</sub>,y<sub>t</sub>).</p></li></ol><p>Player One is the &#8220;forecaster.&#8221; Their goal is to accumulate as high a score as possible, summed across all rounds. Player Two wants the sum of all of the s<sub>t</sub> to be as low as possible.</p><p>Now, we need to come up with score functions that can&#8217;t be &#8220;gamed,&#8221; and people have thought of many. For example, you might require the forecaster to have a low <em>Brier</em> score.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Rb4U!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Rb4U!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png 424w, https://substackcdn.com/image/fetch/$s_!Rb4U!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png 848w, https://substackcdn.com/image/fetch/$s_!Rb4U!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png 1272w, https://substackcdn.com/image/fetch/$s_!Rb4U!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Rb4U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png" width="226" height="72" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:72,&quot;width&quot;:226,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Rb4U!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png 424w, https://substackcdn.com/image/fetch/$s_!Rb4U!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png 848w, https://substackcdn.com/image/fetch/$s_!Rb4U!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png 1272w, https://substackcdn.com/image/fetch/$s_!Rb4U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8954d8cd-cce1-40cb-8515-ca13c76a873b_226x72.png 1456w" sizes="100vw"></picture><div></div></div></a></figure></div><p>Now, a low Brier score is impossible in the adversarial context. Think about this weird game of bit prediction, where Player One guesses a number 0 or 1 and Player Two responds with the correct answer equal to 0 or 1. Player Two goes second and can always pick the opposite of what Player One says. That seems unfair.</p><p>Last week, I brought up the possibility of judging Player One with <em>regret</em>. Regret would measure the difference between Player One&#8217;s score and the score of a player who knows all of Player Two&#8217;s moves in advance but can only play a single prediction. In math, this quantity is written as</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5gEg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5gEg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png 424w, https://substackcdn.com/image/fetch/$s_!5gEg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png 848w, https://substackcdn.com/image/fetch/$s_!5gEg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png 1272w, https://substackcdn.com/image/fetch/$s_!5gEg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5gEg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png" width="434" height="72" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:72,&quot;width&quot;:434,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5gEg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png 424w, https://substackcdn.com/image/fetch/$s_!5gEg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png 848w, https://substackcdn.com/image/fetch/$s_!5gEg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png 1272w, https://substackcdn.com/image/fetch/$s_!5gEg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb5adc2-f7ca-4a7a-a4e5-5da8d4b64644_434x72.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>But good predictions alone may not be what you care about. Certainly, you&#8217;d want the frequencies to match. If you are predicting a sequence of probabilities, the average of those probabilities should match the average of the actual outcomes. If you consistently predict a player makes 90% of their free throws, we should see 90% of free throws made.</p><p>Similarly, other expected values should match. If you are changing your probabilities over time, the variance of the outcomes should still match the variance of your probabilities.</p><p>Maybe you&#8217;d prefer the predictions to be good across stratifications of the data. For example, if you are predicting free throws, maybe you&#8217;d want your forecasts to be accurate for all players individually. There are lots of subtests and subsets I can inspect, and I&#8217;d like to check that you are making good predictions on all of them.</p><p>Perhaps you&#8217;d like a certain degree of <em>calibration</em> from the forecast. In all of Player One&#8217;s forecasts where they say 20%, Player Two should say 1 only 20% of the time. In the forecasts where they say 60%, Player Two should say 1 60% of the time. If in all of the times Player One says there&#8217;s a 90% chance of a 1, only 10% of the times Player Two plays a 1, we&#8217;d think Player One is a pretty bad forecaster. Trying to achieve calibration across all possible probabilistic predictions seems a lot harder than just getting a single frequency correct in a Brier score game.</p><p>Mathematically, however, all of these problems are basically the same. They list a set of &#8220;test functions&#8221;, and Player 1 wants the following to be small for every single test function:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jvBd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jvBd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png 424w, https://substackcdn.com/image/fetch/$s_!jvBd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png 848w, https://substackcdn.com/image/fetch/$s_!jvBd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png 1272w, https://substackcdn.com/image/fetch/$s_!jvBd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jvBd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png" width="312" height="74" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:74,&quot;width&quot;:312,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jvBd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png 424w, https://substackcdn.com/image/fetch/$s_!jvBd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png 848w, https://substackcdn.com/image/fetch/$s_!jvBd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png 1272w, https://substackcdn.com/image/fetch/$s_!jvBd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6fad9fcf-7753-49b3-a2bc-79cbac767efa_312x74.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>What are the test functions? If all we care about is getting the frequency that y<sub>t</sub> equals one correct, then the test function is the constant function. For calibration, the test function is equal to one when a forecaster predicts probability x% and 0 otherwise. You&#8217;ll have one test function for each calibration bin. For calibration across strata, there will be a function for each stratum. Even Brier scores amount to calibration. You can get a low Brier Score by calibrating the functions</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!q0Fb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!q0Fb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png 424w, https://substackcdn.com/image/fetch/$s_!q0Fb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png 848w, https://substackcdn.com/image/fetch/$s_!q0Fb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png 1272w, https://substackcdn.com/image/fetch/$s_!q0Fb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!q0Fb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png" width="138" height="26" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:26,&quot;width&quot;:138,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!q0Fb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png 424w, https://substackcdn.com/image/fetch/$s_!q0Fb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png 848w, https://substackcdn.com/image/fetch/$s_!q0Fb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png 1272w, https://substackcdn.com/image/fetch/$s_!q0Fb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3227230b-cc3a-4d3c-9bfa-608654ab1c8b_138x26.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>for all values of q.</p><p>The amazing thing is that making calibration errors of the form E<sub>f </sub>small is incredibly mechanical.  Juanky Perdomo and I spell out the general details in Section 3 of the tutorial &#8220;<em><a href="https://arxiv.org/pdf/2506.11848">In Defense of Defensive Forecasting</a></em>.&#8221; More or less, you just have to choose a prediction that makes the future look uncorrelated with the past. And you can always find such a prediction with simple search. Though there are specific details you have to deal with for each case, essentially the same procedure applies to very general sets of calibration functions.</p><p>We found that we could reduce every metric used to evaluate forecasting skill to some form of generalized calibration. There are whole bodies of work on proper scoring rules, conformal prediction, omniprediction, and outcome indistinguishability that reduce to generalized calibration. In the forecasting game, this generalized calibration can be done without specific domain expertise. As long as the evaluation metrics are prescribed in advance, a Defensive Forecaster will do well in fantasy sports, weather prediction, and election forecasting.  It doesn&#8217;t need to know anything about the topic other than the judgment scheme.</p><p>Though Juanky and I wrote up our defense of defensive forecasting almost a year ago, this week was the first time I tried to present it in class. I got a lot of puzzled looks, as if I was playing clever card tricks. That&#8217;s the correct reaction! We are naturally impressed by people who are good at forecasting. We&#8217;re obsessed with predicting the future. Predictions from soothsayers are reassuring even if they&#8217;re consistently wrong.</p><p>And yet, forecasting is often just playing clever tricks for fun and profit. Though Dean Foster and Rahesh Vorha famously showed that percentile calibration amounted to bookkeeping thirty years ago, it turns out that <em>all</em> forms of generalized calibration can be achieved through bookkeeping. Next time someone tries to impress you with their prediction market prowess, remember that cooking the books isn&#8217;t the same as clairvoyance.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Unreal Is Here]]></title><description><![CDATA[Mapping the territory of simulation and its many purposes.]]></description><link>https://www.argmin.net/p/unreal-is-here-117</link><guid isPermaLink="false">https://www.argmin.net/p/unreal-is-here-117</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Tue, 07 Apr 2026 14:09:50 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/6187f124-59d0-44bb-ab49-23de806eedf8_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UW5-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UW5-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!UW5-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!UW5-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!UW5-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UW5-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:363167,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/193467902?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UW5-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!UW5-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!UW5-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!UW5-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c74311c-193d-4eca-a33c-3f9f25cfbb13_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>Though I&#8217;ve been prefacing my <a href="https://www.argmin.net/p/feedback-learning-and-adaptation-48f">lecture blog posts</a> with italicized disclaimers, I want to single this lecture blog out as being targeted a bit more broadly. Because, in a weird confluence, the topic of this week&#8217;s lecture coincides with the topic of an <a href="https://www.nytimes.com/2026/04/06/opinion/ai-polling.html">op-ed by Leif Weatherby and me that appears this morning in the New York Times</a>: <em>forecasting and simulation</em>.</p><p>We can&#8217;t avoid prediction and simulation in a class about feedback systems. Our theories suggest that better predictions and forecasts lead to better plans of action. Additionally, we try to make sense of complex, interconnected systems by simulating their behavior, and simulations often reveal surprising &#8220;emergent&#8221; behavior of the whole, which wasn&#8217;t evident from the modeled behavior of the parts. We also tend to think that the subcomponents of complex, interconnected systems make sense of their surroundings by predicting what other components around them will do.</p><p>I was a bit slippery in that paragraph about what the difference is between simulation and prediction. That&#8217;s because I&#8217;m still not sure how to draw a boundary between the two concepts. The most common axis is opacity: everyone thinks there is a fundamental difference between a model that is &#8220;easy to describe&#8221; from first principles and one that is purely data-driven. We call the latter &#8220;black box&#8221; to mark our disdain. The &#8220;transparent box&#8221; systems might derive from physical laws, and we write down a set of equations that dictate how each step relates to the next. The black box systems might be derived by curve fitting, where we pick a function of convenience, untethered from causal explanation, to describe how inputs have historically mapped to outputs.</p><p>I&#8217;ll talk more about the opacity slider in later posts this week, but today, I want to ask about the <em>purpose</em> of simulation. That axis is more interesting to me. Simulations can be used in many different ways. You might use a simulation to better understand a system itself. Simulations of mechanical systems can give you a feel for their performance limits. You can use simulations to figure out why something went wrong, deriving causal explanations from plausible mechanisms. And, of course, you can use simulations to predict the future. You can use these simulation forecasts to make a plan of action. Or, in our Draft-Kings-addled culture, you might use them to gamble.</p><p><a href="https://www.nytimes.com/2026/04/06/opinion/ai-polling.html">Leif and I</a> talked about this murky simulation landscape in the world of public opinion polling. Specifically, we wrote about the absurdity of silicon sampling. For those unfamiliar with the term, silicon sampling is when you design a social science survey experiment and give the questions to LLMs rather than people. As absurd as this sounds, people are really pushing to do this. There&#8217;s a billion-dollar startup called Aaru that is based entirely on this silly idea. And one of their fake polls <a href="https://web.archive.org/web/20260323180923/https://www.axios.com/2026/03/19/olivia-walton-heartland-forward-maternal-health">slipped its way into Axios last week</a>, without Mike Allen realizing that the &#8220;poll&#8221; he was reporting on was a computer simulation (embarrassed, <a href="https://www.axios.com/2026/03/19/olivia-walton-heartland-forward-maternal-health">Axios later edited the story to reflect the phoniness</a>).</p><p>But why do silicon samples have so much cachet with pollsters and social scientists? As Leif and I argue in our piece, it&#8217;s because polls already rely heavily on simulation methods. Because of remarkably high nonresponse bias, pollsters lean heavily on statistical modeling to tweak their numbers to align with reality. Polls that use multilevel regression and poststratification are already inputting a lot of simulated reality to &#8220;correct&#8221; their summarization of the data they collected. The number isn&#8217;t &#8220;percentage of yesses in my sample,&#8221; it&#8217;s &#8220;what I think the percentage of yesses is in the population given my sample and my beliefs about the population.&#8221;</p><p>Since polling already relies heavily on simulation, tossing out the expensive part of the process&#8212;you know, asking actual people questions&#8212;feels like a logical conclusion. The Nate Silverization of political coverage turned polling into prediction. In the media, the goal of polls stopped being about understanding what people think and became more about predicting the outcome of elections. If all you need to do is predict, you don&#8217;t really need pristine distillations of understanding. You can take your empirical facts and use them solely to predict outcomes. And if the goal is just prediction, you don&#8217;t need to bother asking people at all. In fact, you want more reliable data than the fickle behavior of people nagged by pollsters at the end of some modern transmission line. If your goal is only prediction, you&#8217;re probably better off not talking to people at all.</p><p>But is the purpose of polling prediction? It depends on who you ask, but I&#8217;d like to think that the answer is no. At pure face value, the topline numbers of an opinion poll are a summary of a survey. They reduce a list of ones and zeros into two numbers: a mean and a variance.</p><p>Now, using a bit more social-scientific reasoning, we might interpret this summarization as a <em>measurement</em> of what a group of people believes. With a rigid methodology, we can consider polling to be quantified opinion.  It&#8217;s a bit odd to think that you can &#8220;objectively&#8221; measure opinion in the first place, but this has been a supposition of social science research for a long time.</p><p>Unfortunately, statistics has incredibly slippery semantics that lead people to conflate summarization with measurement and measurement with prediction. Is the percentage of &#8220;people who answered yes&#8221; a summarization of the data? Is it a measured quantity about the opinion of a broader population? Is it a prediction of how people will vote in November? Yes?</p><p>I&#8217;m interested in this conflation for both political and academic reasons. <a href="https://www.argmin.net/p/going-beyond-the-polls">Leif and I think the polling industry is harmful to the public sphere</a>. But setting those politics aside, I think that being upfront about the purpose of simulations and forecasts helps demystify their outputs. Indeed, this week I&#8217;ll describe how purpose <em>dictates</em> forecasts. Prediction of the future is difficult. But if you tell me how my predictions will be evaluated, prediction of the future is trivial. I&#8217;ll explain more about why in the next post.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Arbitrary Geometry]]></title><description><![CDATA[Adversarial regret as a proof technique in learning, optimization, and games]]></description><link>https://www.argmin.net/p/arbitrary-geometry</link><guid isPermaLink="false">https://www.argmin.net/p/arbitrary-geometry</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Fri, 03 Apr 2026 14:23:15 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/53fdbcf4-69f5-45b3-a326-3676085b77ab_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!z3hF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!z3hF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!z3hF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!z3hF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!z3hF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!z3hF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:147193,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/193073885?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!z3hF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!z3hF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!z3hF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!z3hF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F01fc3886-b97e-4f05-9ddd-5428dab2d1ec_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><em>This is the third live blog of Lecture 7 of my graduate seminar &#8220;Feedback, Learning, and Adaptation.&#8221; A table of contents is <a href="https://www.argmin.net/p/feedback-learning-and-adaptation-48f">here</a>.</em></p><p>I closed yesterday&#8217;s blog with a cliffhanger, promising to give a few examples of where I think adversarial regret is a useful concept. On its own, I&#8217;m not sure that it <em>is</em> a useful concept. More on that next week! But today, I&#8217;ll show how you can use adversarial regret to bootstrap interesting arguments linking machine learning, game theory, and stochastic optimization.</p><p>Once again, I list the rules of the game. We have a two-player game with repeated interactions. In every round t,</p><ol><li><p>Information x<sub>t</sub> is revealed to both players.</p></li><li><p>Player One takes action u<sub>t</sub></p></li><li><p>Player Two takes action d<sub>t</sub></p></li><li><p>A score r<sub>t</sub> is assigned based on the triple (x<sub>t</sub>,u<sub>t</sub>,d<sub>t</sub>).</p></li></ol><p>Player One is the &#8220;decision maker,&#8221; and their action has to be computable from a few lines of code. Their goal is to accumulate as high a score as possible, summed across all rounds. Player two wants the sum of all of the r<sub>t</sub> to be as low as possible.</p><p>The adversarial regret compares the score of Player One&#8217;s strategy to that of a player who sees the entire sequence of disturbances but must play the same action at every time step. It&#8217;s a weird setup where we are comparing a player who can change their strategy arbitrarily to an omniscient player forced to play the same move every time. While these two notions don&#8217;t seem worth comparing at first blush, there are a few cases in learning theory and game theory where the comparison is mathematically powerful. It turns out that computing these regret bounds is often quite simple, and they follow from elementary derivations. While these bounds themselves might not be useful, they then imply results you <em>actually</em> care about. Let me give my three favorite examples.</p><h3>Online learning and PAC Learning</h3><p>Online learning is the case argmin readers will have already encountered if they followed my machine learning course blogging. I like to teach online learning because adversarial regret bounds imply the standard model of probabilistic machine learning. Adversarial regret highlights how most of the &#8220;generalization bounds&#8221; we derive in machine learning are artifacts of geometry rather than mystical manifestations of mechanical epiphany.</p><p>In the online learning model, the goal is to predict the disturbance from the information. The actions are predictions. At each round, your score is high if your prediction is correct and low if the prediction is incorrect.</p><p>Let&#8217;s change the notation to match the standard verbiage of machine learning. The prediction is a function f, and the &#8220;disturbance&#8221; is a &#8220;label,&#8221; denoted y. Instead of a high score, we want low loss. In the online learning setting, you get to change your prediction function at every time step and compare your losses to a <em>single</em> model that tries to fit the labels after you see them. In equations, this is</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!j3Tw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!j3Tw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png 424w, https://substackcdn.com/image/fetch/$s_!j3Tw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png 848w, https://substackcdn.com/image/fetch/$s_!j3Tw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png 1272w, https://substackcdn.com/image/fetch/$s_!j3Tw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!j3Tw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png" width="382" height="72" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:72,&quot;width&quot;:382,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!j3Tw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png 424w, https://substackcdn.com/image/fetch/$s_!j3Tw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png 848w, https://substackcdn.com/image/fetch/$s_!j3Tw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png 1272w, https://substackcdn.com/image/fetch/$s_!j3Tw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70373852-f449-48cd-8c04-da6015e3b9b2_382x72.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>You can now do math and show that this expression is bounded by a sublinear function of the number of rounds. <a href="https://www.argmin.net/p/regretfully-yours">This post works out the details</a>. Now, the resulting deterministic bound is necessarily interesting in and of itself, but the magic happens when you declare the xs and ys to be generated by a stochastic process. If, for example, you assume the information-label pairs are identically distributed, independently samples from some data-generating process, then after making a few assumptions about convexity, the regret bound becomes a generalization bound:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NoEZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NoEZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png 424w, https://substackcdn.com/image/fetch/$s_!NoEZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png 848w, https://substackcdn.com/image/fetch/$s_!NoEZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png 1272w, https://substackcdn.com/image/fetch/$s_!NoEZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NoEZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png" width="350" height="38" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:38,&quot;width&quot;:350,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NoEZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png 424w, https://substackcdn.com/image/fetch/$s_!NoEZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png 848w, https://substackcdn.com/image/fetch/$s_!NoEZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png 1272w, https://substackcdn.com/image/fetch/$s_!NoEZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8492bc0f-685c-4e69-bf62-6e5975872985_350x38.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Here, F<sub>T</sub> denotes the random function that your model returns after seeing T examples. This bound compares the predictive accuracy of your algorithm on a new sample to that of the best function computable given the data-generating distribution. If you have sublinear regret, then this quantity tends to zero as T goes to infinity. This is called a generalization bound, or, if you use probability instead of expected values, a PAC Learning bound.</p><p>The technique of deriving a deterministic regret bound and transforming it into a probabilistic generalization bound by taking expected values is called &#8220;online-to-batch conversion.&#8221; It is one of the favorite tricks of learning theorists.</p><h3>Stochastic Optimization</h3><p>Similar techniques can be applied more generally to stochastic optimization. A clever analysis of the stochastic gradient method takes a similar approach: you can prove that gradient descent has low regret even if Player Two is handing you a different convex function at every time step. If you take expectations of the resulting regret bound and apply Jensen&#8217;s inequality, you derive a bound on the <a href="https://www2.isye.gatech.edu/~nemirovs/SA_Sept25_final.pdf">sample average approximation method for stochastic programming</a>. Though substantially more general, the proof is almost identical to the one in online learning.</p><h3>Repeated Games</h3><p>Still closely related to but slightly more challenging than online convex optimization are repeated zero-sum games. In this setup, each round of the game is itself a zero-sum game. The players battle each other for multiple rounds, and Player One&#8217;s goal is to refine their strategy so they eventually achieve an infinite ELO score. Here, a classic result proves that when both players use algorithms with low adversarial regret, they converge to a Nash equilibrium. You assume that both Player One and Player Two are using algorithms that yield low regret against an arbitrary adversary. The baseline is a player forced to use the same strategy every round. If Player One and Two&#8217;s strategic improvements have sublinear regret, their strategies eventually converge to an equilibrium. This result is the backbone of modern poker bots, which use algorithms like <a href="https://proceedings.neurips.cc/paper/2007/file/08d98638c6fcd194a4b1e6992063e944-Paper.pdf">counterfactual regret minimization</a>. Whether or not you think solving poker is a major contribution to humanity and human knowledge is up to you.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Should Have Known Better]]></title><description><![CDATA[Motivating the metric of regret in sequential decision making.]]></description><link>https://www.argmin.net/p/should-have-known-better</link><guid isPermaLink="false">https://www.argmin.net/p/should-have-known-better</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Thu, 02 Apr 2026 14:42:40 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/763c5375-7331-4644-9718-796ec8db9d06_840x599.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tUss!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tUss!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!tUss!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!tUss!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!tUss!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tUss!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/addfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:224223,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/192967846?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tUss!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!tUss!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!tUss!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!tUss!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faddfc182-041a-4ec6-9913-93b6794c00e6_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><em>This is the second live blog of Lecture 7 of my graduate seminar &#8220;Feedback, Learning, and Adaptation.&#8221; A table of contents is <a href="https://www.argmin.net/p/feedback-learning-and-adaptation-48f">here</a>.</em></p><p>When talking about sequential decision making and optimal control, I can&#8217;t avoid discussing the mathematical concept of <em>regret</em>. Regret is the preferred theoretical metric for evaluating bandit algorithms, and bandit algorithms are a core method for online decision-making.</p><p>Invariably, every time I try to explain &#8220;regret,&#8221; I get the question, &#8220;Wait, why do we care about that?&#8221; So to have an answer that I can point to in the future, I&#8217;m going to write a few blog posts.</p><p>We can use the setup from the <a href="https://www.argmin.net/p/you-play-to-win-the-game">last blog</a>. We have a two-player game with repeated interactions. In every round t,</p><ol><li><p>Information x<sub>t</sub> is revealed to both players.</p></li><li><p>Player One takes action u<sub>t</sub></p></li><li><p>Player Two takes action d<sub>t</sub></p></li><li><p>A score r<sub>t</sub> is assigned based on the triple (x<sub>t</sub>,u<sub>t</sub>,d<sub>t</sub>).</p></li></ol><p>Player One is the &#8220;decision maker,&#8221; and their action has to be computable from a few lines of code. Their goal is to accumulate as high a score as possible, summed across all rounds. Player two wants the sum of all of the r<sub>t</sub> to be as low as possible.</p><p>You could think of designing the optimal policy as an optimization problem. Given a description of what Player Two can do, you can design a strategy for Player One. If Player Two is totally random, you could perhaps maximize the expected score. If Player Two is deterministic, you could maximize the score assuming Player Two plays the best possible strategy against you. Regret provides a flexible framework for going beyond both of these formulations.</p><p>To define regret, we imagine a counterfactual world in which Player One knows <em>something</em> about Player Two&#8217;s strategy in advance. The regret of some strategy is the difference between the score of a policy built on knowing this secret of Player Two and the score of the strategy that has to learn as it goes. It is called regret because it estimates how much the score could be improved with the benefit of hindsight.</p><p>Sometimes this regret is really large. Consider the following example. In each round, Player Two thinks of a color, Red or Blue, and Player One has to guess which color Player Two is thinking of. Player One gets a 1 if they guess correctly and a 0 otherwise. Player Two agrees to choose their sequence in advance, but only reveal one number to Player One in each round. In the counterfactual world, Player One would know the entire sequence and would receive a perfect score. In reality, Player One can&#8217;t do better than guessing, so they would be hard-pressed to get more than half of the colors correct.</p><p>This is where regret gets confusing. We ask, what if Player One in the counterfactual world has the benefit of hindsight but is <em>constrained</em> in their strategies? In this color guessing game, what if Player One is forced to choose <em>one</em> color in their counterfactual world? They see the entire sequence but can only pick red or blue. In this case, if Player Two chooses an even number of Red and Blues, the omniscient yet restricted Player One can only get half of the answers correct. A real-world strategy of random guessing will fare just as well as this counterfactual strategy with the benefit of hindsight.</p><p>No matter how many times I explain it, <em>I</em> find this setup confusing. Let me write it again: The regret model requires two things: a secret of Player Two and a restricted strategy set of Player One. In the real world, Player One has a flexible strategy set, but is missing information. In the counterfactual world, Player One has a restricted strategy set, but extra knowledge. Regret bounds the difference in scores achieved in these two worlds.</p><p>You might ask why this particular example of color guessing is interesting. I&#8217;m not sure it is, but it&#8217;s the one we&#8217;ll use next week when discussing forecasting. When someone tells you that they have calibrated predictions, they are doing this sort of sleight of hand and comparing against something that you probably don&#8217;t actually care about.</p><p>But let&#8217;s spend some time discussing examples where regret is reasonable. I&#8217;ll start with the canonical example: the stochastic multiarmed bandit. If Player Two is <em>random</em> and <em>stationary</em>, then the best strategy in hindsight makes a lot more sense. In our game of colors, this is the <em>multiarmed bandit problem</em>, the most annoyingly named subject in decision making. In the classic version of this problem, you have two slot machines and want to find the one with the highest payout. Each round, you are allowed to choose one of the machines to play. We model the payout from each machine as an independent draw from a fixed probability distribution. These distributions have different means, and your goal is to devise a strategy that results in the highest expected payout.</p><p>What would the best policy do? No matter how clever you are, you can&#8217;t beat the strategy of only using the machine with the higher mean payout. If you knew the expected payouts in advance and your goal is to maximize the expected payout, you would use only the machine with the highest expected payout. Thus, we can think of the secret held by Player Two to be the mean payouts of each machine.</p><p>If you didn&#8217;t know this secret, what would you do? You&#8217;d probably spend some time with each machine, look at which one is giving you higher returns, and then pick that one forever. This seems like a reasonable strategy.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> But note that this strategy necessarily has nonzero regret, because you necessarily have to try both machines to figure out which one is best.</p><p>Any strategy you devise for the real world has a particular expected regret, which is the difference between the expected payout of playing the best machine and the expected value of your strategy. In the case of our multiarmed bandit, the worst regret is accrued by always playing the suboptimal machine. So the regret would grow linearly with the number of pulls. Bandit algorithms seek strategies for which regret grows <em>sublinearly</em>.</p><p>Outside the casino, variants of the stochastic multiarmed bandit are <a href="https://www.argmin.net/p/how-to-pick-a-sample-size">reasonable models for adaptive experimentation</a>. Suppose you want to select between treatments A and B that maximizes the average benefit to some cohort of subjects. If you can randomly sample individuals from the cohort, there will be regret associated with the number of subjects assigned to the suboptimal treatment in a randomized experiment, and there will be regret associated with the chance your experiment selects the suboptimal treatment. You would like to minimize application of the wrong treatment, but also be pretty sure you are finding the right one. You can compare this to the policy that assigned everyone to the optimal policy in advance.</p><p>Tomorrow I&#8217;ll talk through three other examples where regret feels like the right concept to me. In hindsight, it&#8217;s not always worth the headaches and confusion associated with regret minimization, but there are enough positive examples to make it a concept worth understanding.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>This is more or less the optimal strategy.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[You Play to Win the Game]]></title><description><![CDATA[The game theory behind algorithmic decision making]]></description><link>https://www.argmin.net/p/you-play-to-win-the-game</link><guid isPermaLink="false">https://www.argmin.net/p/you-play-to-win-the-game</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Tue, 31 Mar 2026 14:38:10 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/37937d35-5006-42f5-9d2a-ce015a37435d_1100x219.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3iXX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3iXX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!3iXX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!3iXX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!3iXX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3iXX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:365432,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/192737499?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3iXX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!3iXX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!3iXX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!3iXX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd412c221-1db5-437c-9ee8-af83d7117df7_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><em>This is a live blog of Lecture 7 of my graduate seminar &#8220;Feedback, Learning, and Adaptation.&#8221; A table of contents is <a href="https://www.argmin.net/p/feedback-learning-and-adaptation-48f">here</a>.</em></p><p>The Monday after Spring Break is always a weird class with people trickling in from their various excursions. So it&#8217;s an ideal time for a weird lecture. I decided it was time for some game theory.</p><p><a href="https://www.argmin.net/p/induction-and-feedback">The goal of this seminar</a> was to focus on the power of feedback, to understand how to think about complex interconnected systems, and to understand how feedback design allows systems to &#8220;generalize&#8221; and &#8220;behave effectively in unknown future settings&#8221;.</p><p>In classical control, you can argue that feedback is for stabilization, for maintaining fixed points, for rejecting disturbances, or for recovering from failure. We covered some of these ideas in the first part of the course.</p><p>However, there&#8217;s another view of feedback, one that&#8217;s ubiquitous in machine learning and artificial intelligence. It&#8217;s the one that&#8217;s most prevalent in the quantitative social sciences. And, increasingly, based on my interactions with Berkeley graduate students, in robotics. That is the idea of feedback as a way to augment optimization.</p><p>In optimal control, feedback is used for the most narrow-minded reason: it lowers cost. Feedback policies, because they search over a larger space of policies, have lower cost than open-loop policies. That&#8217;s it. Feedback provides more information to the decision-maker, and a decision-maker who uses information will achieve a lower cost than one who doesn&#8217;t.</p><p>The optimal control model of feedback is <em>game theoretic</em> with rules of engagement staged as follows:</p><p>In every round t,</p><ol><li><p>Information x<sub>t</sub> is revealed to both players.</p></li><li><p>Player one takes action u<sub>t</sub></p></li><li><p>Player two takes action d<sub>t</sub></p></li><li><p>A score r<sub>t</sub> is assigned based on the triple (x<sub>t</sub>,u<sub>t</sub>,d<sub>t</sub>).</p></li></ol><p>Player One is the &#8220;decision maker,&#8221; and their action has to be computable from a few lines of code. Their goal is to accumulate as high a score as possible, summed across all rounds. Player two wants the sum of all of the r<sub>t</sub> to be as low as possible.</p><p>Player One&#8217;s action can be computed based on the rules of the game and all of the moves they&#8217;ve seen thus far. This is why if they <em>optimally</em> use the revealed information, they will have no worse cost than if they throw the information out.</p><p>Player Two is &#8220;the adversary.&#8221; Their power dictates how hard the game is for the decision maker. In some formulations, Player Two chooses oblivious random actions. You can make Player One&#8217;s life harder by making Player Two an omniscient god that knows Player One&#8217;s strategy in advance and can compute undecidable functions to topple them.</p><p>If there is a single round and Player Two is random, this game is called decision theory. We&#8217;ve collectively decided that the best strategies against random adversaries are those that maximize the expected value of the score. Don&#8217;t ask me why. If there are two rounds, it&#8217;s stochastic programming. If there are an infinite number of rounds, but there is no relationship between the rounds, it&#8217;s a bandit problem. If there are infinitely many rounds and the information follows a Markov chain, this is stochastic optimal control or reinforcement learning. In this case, when the costs are quadratic, the Markovian dynamics linear, and the adversary normally distributed, this is the linear quadratic regulator problem.</p><p>When Player Two is adversarial, Player One seeks a strategy to maximize their score against the best imaginable opponent. If there is a single round and Player Two is adversarial, this is called game theory or robust optimization. If there are an infinite number of rounds, but there is no relationship between the rounds, it&#8217;s a non-stochastic bandit problem. If there are infinitely many rounds and the information follows a Markov chain, this is robust optimal control. The linear version of this robust control problem is called the H<sub>&#9854;&#65039;</sub> optimal control problem. Phew!</p><p>Now, every single one of these problems requires a slightly different algorithmic solution. That&#8217;s what keeps us in business. For every gradation, the solution details can fill a textbook.  But they are all variations of the same game-theoretic framework. Having been formalized in the late 1940s and honed in the military-industrial boom of the 1950s and 1960s, this game-theoretic model of control and decision-making has been standard since the 1970s.</p><p>I&#8217;m not saying any of this is wrong, per se. I am saying that it is a bit limited as a framework. Part of the motivation for this course was to make better sense of this &#8220;graph&#8221; I made in a blog series a <a href="https://www.argmin.net/p/action-impact-tradeoffs">couple of years ago</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8iaw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8iaw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png 424w, https://substackcdn.com/image/fetch/$s_!8iaw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png 848w, https://substackcdn.com/image/fetch/$s_!8iaw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png 1272w, https://substackcdn.com/image/fetch/$s_!8iaw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8iaw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png" width="1348" height="1036" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1036,&quot;width&quot;:1348,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8iaw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png 424w, https://substackcdn.com/image/fetch/$s_!8iaw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png 848w, https://substackcdn.com/image/fetch/$s_!8iaw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png 1272w, https://substackcdn.com/image/fetch/$s_!8iaw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd9d1c6e-77f4-4b58-992f-d5bcd3e937fa_1348x1036.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I observed that decision-making frameworks were distinguished by two variables: the impact each action had on a system external to the decision-maker (the x-axis) and the frequency with which decisions could be made (the y-axis).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> Game-theoretic decision making requires first figuring out where on this graph you want to operate. If you have a problem that calls for a specific level of impact and have the authority to act at a specific speed, you can find a particular solution using a proper game-theoretic formulation. How powerful you make Player Two will affect the complexity of your decision system and its conservatism. Since you have no idea what the future holds, your conception of Player Two is a subjective decision, but at least it&#8217;s one you can precisely describe. In this sense, the optimal framework is nice because you can declaratively compute decision policies based on systems modeling.</p><p>But if you have problems that span multiple regions of this space, or ones that lie below that red curve, the optimization framework gets stuck. If you have problems where the costs are ambiguous or variable, it&#8217;s hard to argue in favor of a policy based on models of cumulative reward. If you care about multiple levels of interaction impacts and speed, optimization stops being helpful.</p><p>The problem is that if you want to move into high-impact regimes where your authority is less than you&#8217;d desire, no single system gets you there. At some point, you have to think a bit more broadly about what systems push hard against this red curve. We&#8217;re forced above the red curve because of different limits, some fundamental, some conceptual. Physical law, computational efficiency, and even the ability to model keep us on one side of the curve.</p><p>I&#8217;m not sure this class helped me understand how to move beyond this curve, but it helped me understand a bit better why we&#8217;re stuck with it. The action-impact curve shows that single optimization problems can&#8217;t govern complex systems on their own. How do existing complex systems, be they natural or artificial, get around it? I&#8217;ll reserve the last two lectures of the class for this sort of abstract navel gazing.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>To read more about that plot and what I intend the axes to mean, <a href="https://www.argmin.net/p/predictions-and-actions-redux">read this post from a couple of years ago.</a></p></div></div>]]></content:encoded></item><item><title><![CDATA[The Poetics of Bureaucracy]]></title><description><![CDATA[Language models are a bureaucratic technology]]></description><link>https://www.argmin.net/p/the-poetics-of-bureaucracy</link><guid isPermaLink="false">https://www.argmin.net/p/the-poetics-of-bureaucracy</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Thu, 26 Mar 2026 14:15:02 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/c95e2c28-4e7f-44db-8321-3b3b80256bc9_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!U2XI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!U2XI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!U2XI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!U2XI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!U2XI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!U2XI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:96318,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/192209144?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!U2XI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!U2XI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!U2XI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!U2XI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa45bcf15-fe51-49e0-a660-3cd91b3d00c4_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>No conference taking a broad view of contemporary culture can escape the bureaucracy sickos (laudatory). Bureaucracy, with the complex social relations it codifies and entails, is one of the most salient aspects of our culture. Bureaucracies box in massively complex bodies of information through standardization, measurement, and policies. Computers are amazing. They are also the physical embodiment of mass bureaucracy. And no computing technology is more bureaucratic than the large language model.</p><p>Several talks at the <a href="https://as.nyu.edu/research-centers/remarque/events/Spring-2026/cultural-ai--an-emerging-field.html">Cultural AI</a> conference threaded together the complexities of language models and bureaucracy. Henry Farrell kicked things off with a characteristically fantastic talk, describing his evolving view of AI as cultural and social technology. He introduced the notion of &#8220;coarse graining,&#8221; a new angle he&#8217;s working on with Cosma Shalizi.</p><p>In physics, coarse graining means &#8220;averaging out&#8221; a lot of complexity to leave you with bulk behavior that describes useful things. Arguably, it&#8217;s how you go from quantum field theory to atomic theory to the ideal gas law.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> There are levels of approximations, and details are lost in the transitions between layers. However, this loss of detail is often worth it because stacking abstractions lets us think simply inside clean layers. Moreover, surfacing coarse graining helps us understand what to look for when one level of description doesn&#8217;t suffice to describe observed phenomena.</p><p>For Farrell, bureaucracies, democracies, and markets are cultural coarse grainings. Bureaucracy establishes relations between parts such that management at one particular location in an organizational web can make decisions without having to understand the fine details at all other locations. It creates a distribution of decision making, simultaneously bound and freed by rules. We can see LLMs as coarse grainings that allow us to access mediated linguistic relationships between end users and the cultural material on which they were trained.</p><p>Good bureaucracy should provide constraints that deconstrain.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> However, so often bureaucracy, in its taming of complexity, obscures sources of power in cultural relationships and the human agency behind decision making. Lily Chumley and Abbie Jacobs both spoke to different angles of this concealment.</p><p>Through the lens of linguistic anthropology, Chumley described how language models obscure contractual relationships underlying enterprise software. The primary interaction with language models is through the chat box. When we squeeze our demands into prompts and skill files that use the institutional language of management, we are mimicking the casual nature of Irving Goffman&#8217;s &#8220;open-state of talk&#8221; with a computer. The interaction feels personal rather than transactional. However, your interactions with all of the work software are contingent on inscrutable vendor contracts with complex webs of accountabilities, restrictions, and obligations. The employee is left with only a chat interface that has been RLHFed into a <a href="https://joinreboot.org/p/alignment">servile caricature of a 1950s secretary</a>. This erases the heavily surveilled, legally bound, hyper monetized relationships between corporate behemoths.</p><p>Chumley illustrated this through the SAAS web on the academic campus. Though we feel like we&#8217;re working with LLMs like they are other co-workers:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-_9h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-_9h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png 424w, https://substackcdn.com/image/fetch/$s_!-_9h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png 848w, https://substackcdn.com/image/fetch/$s_!-_9h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png 1272w, https://substackcdn.com/image/fetch/$s_!-_9h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-_9h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png" width="576" height="300.46367851622875" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:675,&quot;width&quot;:1294,&quot;resizeWidth&quot;:576,&quot;bytes&quot;:53089,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-_9h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png 424w, https://substackcdn.com/image/fetch/$s_!-_9h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png 848w, https://substackcdn.com/image/fetch/$s_!-_9h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png 1272w, https://substackcdn.com/image/fetch/$s_!-_9h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4c01c2b-5d42-4976-9de0-71f25b93bb6e_1294x675.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Every interaction with an LLM or web interface portal or training is mediated by a complex contract with giant corporations, be they Elsevier (who own Interfolio), Salesforce, SLATE Technolutions, Google, Microsoft, NVIDIA, OpenAI, or Anthropic. It is a move of power away from people to a fabric of capital. Gideon Lewis-Kraus commented that these power shifts from engineering to capital have been symptomatic of post-Cold War America and have had dire consequences, as in the example of Boeing.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YYcm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YYcm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png 424w, https://substackcdn.com/image/fetch/$s_!YYcm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png 848w, https://substackcdn.com/image/fetch/$s_!YYcm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png 1272w, https://substackcdn.com/image/fetch/$s_!YYcm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YYcm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png" width="1456" height="815" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:815,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YYcm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png 424w, https://substackcdn.com/image/fetch/$s_!YYcm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png 848w, https://substackcdn.com/image/fetch/$s_!YYcm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png 1272w, https://substackcdn.com/image/fetch/$s_!YYcm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2624f182-c7af-48ca-ad87-34e4b18aec90_1668x934.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Chumley extended her contractual analysis to the bureaucratic war machine that <a href="https://www.theguardian.com/news/2026/mar/26/ai-got-the-blame-for-the-iran-school-bombing-the-truth-is-far-more-worrying">Kevin Baker has been so eloquently writing about</a>. Big Tech owns AI, so this poses complex risks to the financial order as these companies are too big to fail. And yet, Big Tech is <em>really</em> small compared to the state. The relationships between the tech companies and the government established through military contracting are <em>geopolitical.</em> This means that even if we had a functioning Congress,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> the regulation of military AI would be ensnared in transnational agreements. Not only is the use of AI in warfare a smokescreen to avoid talking about the people who control decisions of violence, but it further entangles geopolitics in a big contractual mess.</p><p>From the perspective of measurement theory, Abbie Jacobs discussed how the language of governance, when coarse-grained into AI, creates new meaning. Jacobs argued that operationalizing language always in the context of governance requires conceptualizing how to measure those concepts. And this measurement and quantization are often not talked about by those doing the coding. We see this sort of talk about computing systems all the time. Words like &#8220;high-quality,&#8221; &#8220;relevant,&#8221; &#8220;toxic,&#8221; &#8220;harmful,&#8221; &#8220;age-appropriate,&#8221; &#8220;safe,&#8221; &#8220;responsible,&#8221; &#8220;fair,&#8221; &#8220;<em>intelligence</em>&#8221; are turned into rigid measurements by communities of coders, researchers, and policymakers. This operationalization through bureaucratic technology creates a new kind of coarse graining in which words gain meaning through their institutionalization. Arguments at this operationalized level themselves become exclusionary. Jacobs leans on measurement theory from the quantitative social sciences, arguing that &#8220;Measurement is the (usually hidden, implicit, diffuse) process  through which these concepts are instantiated and made real.&#8221;</p><p>Measurement itself is governance. I associate this assertion with <a href="https://press.princeton.edu/books/paperback/9780691208411/trust-in-numbers?srsltid=AfmBOorYNrrIVYtK3_05SQQiHAIrOQjFYdVK3qvVJEZHrXRwWq_T13tT">Theodore Porter</a>, though he&#8217;d probably credit Horkheimer and Adorno&#8217;s <em><a href="https://dn710607.ca.archive.org/0/items/pdfy-TJ7HxrAly-MtUP4B/Dialectic%20of%20Enlightenment%20-%20Theodor%20W.%20Adorno%2C%20Max%20Horkheimer.pdf">Dialectic of Enlightenment</a></em>. Jacobs argues that we have to bring such measurement to the surface of social technology before we go about asking our coding agents to coarse-grain it. If we can uncover the measurement process itself, then these hidden webs of governance perhaps become more legible to all of us caught in the middle. By fighting about operationalization, you are implicitly fighting about values. You are fighting about how the state sees you.</p><p>This will be my last dispatch on the Cultural AI conference for now. I don&#8217;t think I fully did justice to the speakers&#8217; arguments or to the discussion at the conference, but the talks will be available on YouTube soon.</p><p>I&#8217;ll close with a few thoughts about &#8220;conferences&#8221; more generally. We use the same word to describe an academic gathering of ten people as fifty thousand, but those meetings couldn&#8217;t be more different. The one thing I wish we were better at was marking the proceedings of these small workshops in some non-empheral state. There is value in simply getting people in a room and then seeing influential intellectual artifacts manifest in later work. Some conversations are better when everyone knows there will be no permanent record. <a href="https://www.argmin.net/p/too-much-information">Not every conversation needs to become an Overleaf</a>. Still, capturing something about the moment has value, too. I guess <a href="https://realizable.substack.com/p/artificial-intelligence-interactive">Max</a> and I are blogging a bit, and that&#8217;s not nothing. There will be YouTube videos, as I have mentioned. But I&#8217;ve been thinking a lot about what it would mean to organize, archive, and coarse grain these small moments of intellectual discourse. To be continued.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Real heads know that jumping between these abstraction levels is far less cut and dried than the physicists want us to believe.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Feel free to share examples of good bureaucracy in the comments.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>LOL.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Information Transit Got the Wrong Man]]></title><description><![CDATA[The mechanized tradition of peer review and the absurdity of bureaucratic conference review.]]></description><link>https://www.argmin.net/p/information-transit-got-the-wrong</link><guid isPermaLink="false">https://www.argmin.net/p/information-transit-got-the-wrong</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Tue, 24 Mar 2026 14:36:41 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/a1678b69-bcf5-4386-aef6-f71e756dee75_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fPs6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fPs6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg 424w, https://substackcdn.com/image/fetch/$s_!fPs6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg 848w, https://substackcdn.com/image/fetch/$s_!fPs6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!fPs6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fPs6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg" width="1100" height="220" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:96576,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/191987037?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fPs6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg 424w, https://substackcdn.com/image/fetch/$s_!fPs6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg 848w, https://substackcdn.com/image/fetch/$s_!fPs6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!fPs6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1da810-06f5-4baa-a690-5e0fbd01a0e5_1100x220.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>In case you hadn&#8217;t heard, people are using LLMs to create their peer reviews. I know, you&#8217;re as shocked as I am. To their credit, the program committee of the International Conference on Machine Learning (ICML) has been doing things to address the problem. Their attempts reveal the systematic problems here that are unfixable without a dramatic teardown.</p><p>Let&#8217;s recap the situation, though even typing it out makes me feel like a character in a Terry Gilliam movie.</p><p>In November 2025, <a href="https://icml.cc/Conferences/2026/Intro-LLM-Policy">the PC sent a series of surveys to past ICML reviewers</a> to gauge their sentiment about LLMs in reviewing. They assumed that this list must also include a bunch of authors because they mandated a reciprocal reviewing policy for the 2025 conference, under which<a href="https://icml.cc/Conferences/2025/CallForPapers"> all submissions must have had at least one author who agreed to serve as a reviewer</a>. In their final survey, they proposed the following policies for LLM reviewing at future conferences, and asked for preferences:</p><ul><li><p><strong>Policy A (Conservative)</strong>: Use of LLMs for reviewing is strictly prohibited.</p></li><li><p><strong>Policy B (Permissive)</strong>: Reviewers may input the submission text into privacy-compliant LLMs. However, the assessment of the paper and the writing of the review must not be delegated to LLMs.</p></li></ul><p>A random sample of 500 reviewers received the survey. 74 (15%) answered it. And the survey says!</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!V-0O!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!V-0O!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png 424w, https://substackcdn.com/image/fetch/$s_!V-0O!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png 848w, https://substackcdn.com/image/fetch/$s_!V-0O!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png 1272w, https://substackcdn.com/image/fetch/$s_!V-0O!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!V-0O!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png" width="1456" height="507" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:507,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!V-0O!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png 424w, https://substackcdn.com/image/fetch/$s_!V-0O!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png 848w, https://substackcdn.com/image/fetch/$s_!V-0O!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png 1272w, https://substackcdn.com/image/fetch/$s_!V-0O!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F002f9e12-956d-490b-840f-eb17b66a62d7_1552x540.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This plot uses default matplotlib colors, so it must be science. I&#8217;m not sure what this measures, but whatever, surveys are the democratic way to make policy, amirite? Since the results weren&#8217;t equivocal, they decided to make both policy options available in 2026. That&#8217;s democracy.</p><p>In the multistage reviewer enrollment process for the 2026 conference, they gave reviewers the option of adhering to either Policy A or Policy B or saying they didn&#8217;t care. The people who didn&#8217;t care were then <em>assigned</em> to one of the two policies in one of the many absurdly long emails they send you when you participate in these conferences.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>Now here&#8217;s where it gets weird. The program committee decided to run a sting operation, watermarking the PDFs to trap people who uploaded them directly into LLMs. You can read about their ornate sting operation <a href="https://blog.icml.cc/2026/03/18/on-violations-of-llm-review-policies/">here</a>. The details are boring. What&#8217;s interesting is their response. If they found that someone assigned to Policy A used LLMs in reviewing, then they rejected all papers of which that person was an author.  They had 800 reviews that they flagged as violating policy A. They checked every one of these by hand to avoid false positives. Every. Single. One. Because that&#8217;s a good use of human manual labor. They ultimately rejected 500 papers associated with these naughty reviewers.</p><p>Everyone is patting themselves on the back about this, and tut-tutting those horrible people who dared violate the sacred random assignment of policy, and thanking the committee for their cleverness and transparency. But come on, this whole thing is absurd. The PC argues:</p><blockquote><p>&#8220;&#8203;&#8203;We hope that by taking strong action against violations of agreed-upon policy we will remind the community that as our field changes rapidly the thing we must protect most actively is our trust in each other. If we cannot adapt our systems in a setting based in trust, we will find that they soon become outdated and meaningless. &#8220;</p></blockquote><p>I&#8217;m sorry, but it&#8217;s already meaningless! ICML received over 33000 submissions. A random subset of 20-25% of these will be approved as &#8220;papers acceptable to go on one&#8217;s CV.&#8221; The process will churn forward. Everyone who attends the conference knows this process is impossibly bad, but the only proposed solutions make the paper-generation process more onerous for humans. This naturally leads people to offload work to LLMs. Next year, people will use watermark detection before they put the LLM into ChatGPT. The wheels of progress will continue rolling.</p><p>It&#8217;s unfortunate that the natural bureaucratic editorial mode is to assume everyone is cheating and to go on witch hunts to claim progress. The board wrote:</p><blockquote><p>&#8220;Conferences must adapt, creating rules and policies to handle the new normal, and taking disciplinary action against those who break the rules and violate the trust that we all place in the review process. &#8220;</p></blockquote><p>Psychology took this sort of approach in the 2010s. Though we all got to revel in the high-profile fraud schadenfreude, the field did not come out better for it.</p><p>Rejecting 800 of 33000 papers because of possibly inappropriate LLM use, when your LLM use policy is based on the most bizarre, arbitrary decision-making built upon a semblance of objective quantitative social science, is <em>farce</em>. At this point, the AI reviewing process can be nothing but farce. As Kevin Baker succinctly put it in his <a href="https://artificialbureaucracy.substack.com/p/context-widows">authoritatively inflectional essay on AI for science</a>:</p><div class="pullquote"><p>Systems can persist in dysfunction indefinitely, and absurdity is not self-correcting.</p></div><p>One nice thing about LLMs is that they show us which parts of our systems of intellect are mechanical traditions. LLMs are a good way to stress-test our systems for organizing experience and expertise. We&#8217;ll need to be more creative about what we want to do moving forward.</p><p>Moving forward requires us to talk more about the <em>point</em> of peer review. Yes, the AI conferences are the most absurd manifestation of this problem, but don&#8217;t think that your community is insulated from rampant LLM reviewing. At the <a href="https://as.nyu.edu/research-centers/remarque/events/Spring-2026/cultural-ai--an-emerging-field.html">Cultural AI</a> conference, Mel Andrews showed us dozens of headlines across academia <em>advocating</em> for LLM review. Arguing that LLM review was <em>better</em> than human review. There are economists launching startups to do this as a service.</p><p>Andrews argued that the arguments in favor of LLM reviewing consistently conflated institutional and epistemic concerns. The institutional concerns are well known to us. Reviewing is an enormous burden of unpaid labor that further enriches rent-seeking publication houses, and reviewership is unfairly distributed across academia. The epistemic concerns worry that peer review doesn&#8217;t properly weed out invalid papers. At least in the sciences, peer review is supposedly meta-epistemic, judging the validity of papers that aim to get at scientific knowledge, understanding, and explanation. Many studies have found the current state of peer review unfit for this task.</p><p>Advocates for LLM peer review argue it solves both problems. Andrews took a hard line, claiming that it can&#8217;t solve the epistemic problems. Andrews&#8217; boldest claim is that the relationship of the text generated by LLMs to semantic content and truth is always accidental or incidental. Hence, the mechanical aspects of peer review can only increase confabulation and error. Following tradition means not having to think, but peer review&#8217;s epistemic function demands thinking.</p><p>I don&#8217;t fully endorse Mel&#8217;s argument, but it&#8217;s a position worth airing and engaging with. By focusing solely on <em>process and rules,</em> tweaks to peer review make it more mechanical. Mechanization only makes LLMs better suited for the job. If epistemic cultivation of expertise and experience demands something beyond tradition, then more complex systems of checks and balances only stifle it.</p><p>Program committees in computer science used to be small groups of people who met in person to discuss every paper that would be presented at a conference. They are now ministries of truth that haruspicate the statistics of poorly designed surveys and build ornate policies for the masses. Program committees have become bureaucrats of the state, and they are forced to see like it. The bureaucratization of academic work product threatens its very epistemic nature. Perhaps the fix has to arise from spontaneous order.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p><a href="https://docs.google.com/document/d/1YfnxOOFvGxIaHjP-gRZCAU3jbtYWyE9hCtZqEC61Wws/edit?usp=sharing">Here&#8217;s an email I received this morning</a> asking me to be an area chair for one of the other big conferences, NeuRIPS. My AI detector flagged this email as &#8220;highly likely AI generated.&#8221;</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Cosma Shalizi Is Aware of All Internet Traditions]]></title><description><![CDATA[Shalizi&#8217;s frame of artificial intelligence as mechanized tradition]]></description><link>https://www.argmin.net/p/cosma-shalizi-is-aware-of-all-internet</link><guid isPermaLink="false">https://www.argmin.net/p/cosma-shalizi-is-aware-of-all-internet</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Fri, 20 Mar 2026 14:07:28 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/319ad87c-1705-408f-ad56-0722342a9a48_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NuOl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NuOl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NuOl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NuOl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NuOl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NuOl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:155246,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/191584284?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NuOl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NuOl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NuOl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NuOl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc21f543c-68f2-4f03-a273-e56d5c33873f_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>I&#8217;ve been wanting to write a summary of the <a href="https://as.nyu.edu/research-centers/remarque/events/Spring-2026/cultural-ai--an-emerging-field.html">Cultural AI</a> conference I attended at NYU last week, but I&#8217;ve been struggling to succinctly capture my thoughts. That&#8217;s indicative of the depth and complexity of how AI meets culture, and the different perspectives and disciplines might not lend themselves to a tidy summary.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> As I often do when trying to wrap my head around complex things, I will stop worrying and just blog through it.</p><p>The talk that serves as my hub in the complex network of cultural AI is Cosma Shalizi&#8217;s  &#8220;Aware of All Internet Traditions: Large Language Models as Information Retrieval and Synthesis.&#8221; That language models simultaneously retrieve information and synthesize new content isn&#8217;t controversial. Nor is the fact that this synthesis is <em>formulaic</em>. The current synthesis is next-token prediction trained on all written information, whose output is warped by some selective post-training. By design, language models mechanistically reproduce the recurring regularities in their training data. That training data consists of all the text files on the internet and what is easily available in printed books. Hence, the regularities are the tropes, stereotypes, templates, conventions, and genres of language and code.</p><p>The formulaic generation of discourse <em>looks</em> like discourse in ways we could never have imagined. But with hindsight, we shouldn&#8217;t be surprised. Human culture is very formulaic! There are long-standing formulas for oral tradition, for generating small talk, or for generating scientific papers. As Cosma put it, in the single sentence that summarizes the entire Cultural AI conference:<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><div class="pullquote"><p>Following a tradition means not having to think for oneself.</p></div><p>Not having to think is often a good thing! Tradition lets us externalize certain processes so we can focus on other tasks. Formalities strengthen cultural connections. Traditions in communication help us understand each other better and come to consensus faster.</p><p>Indeed, our vast externalized cultural intelligence is the jewel of human tradition. Cosma cites Jacques Barzun&#8217;s conception of the <em><a href="https://archive.org/details/houseofintellect0000unse">House of Intellect</a></em>: intellect is the communal form of society&#8217;s intelligence. &#8220;[I]t is intelligence stored up and made into habits of discipline, signs and symbols of meaning, chains of reasoning and spurs to emotion &#8212; a shorthand and a wireless by which the mind can skip connectives, recognize ability, and communicate truth.&#8221; According to Barzun, intellect lets society share and externalize knowledge. It belongs to society, not any individual. It connects individual intelligences. It lives after any single intelligence dies.</p><p>GenAI is the mechanization of this intellect. It is the mechanization of all of our traditions.</p><p>With James Evans, Henry Farrell, and Alison Gopnik, Cosma has been preaching the gospel that AI is a cultural technology for several years. He&#8217;s gone through several iterations of what that means and what it implies, but <em>mechanized tradition</em> is the characterization that resonates most with me. Mechanized tradition of Barzun&#8217;s artificial intellect is a far better description of GenAI technology than &#8220;artificial intelligence.&#8221; This frame helps us get away from the silly C-suite sci-fi navel-gazing about the personalities inside the data centers. <a href="https://www.newyorker.com/magazine/2026/02/16/what-is-claude-anthropic-doesnt-know-either">Claude is not a person</a>. It is a mechanized intellect. A <a href="https://www.argmin.net/p/lore-laundering-machines">Lore Laundering Machine</a>. The frame of mechanized tradition helps me build a social metascience of our LLM condition.</p><p>Let me give you a fun example.</p><p>In the same session as Cosma, Wouter Haverals gave a rhizomatic inspection of the tradition of literary style. What is style anyway? We love to ask LLMs to write in new styles. It&#8217;s funny to have it generate poetry. One of my most common queries is how to rewrite emails to sound less angry.</p><p>But humans are also great at mimicking style. It can be a fun, creative game to do the sort of rewriting we now task AI with. And our audience can all tell when something hits or misses the mark when we <a href="https://www.instagram.com/p/DWAK6XjEfv5/">very ape a particular tradition</a>.</p><p>Wouter introduced Raymond Queneau&#8217;s <em>Exercices de Style</em>, a book consisting of 99 rewritings of the same story in different styles. The main story is simple enough. Here&#8217;s Barbara Wright&#8217;s 1958 translation:</p><blockquote><p>In the S bus, in the rush hour. A chap of about 26, felt hat with a cord instead of a ribbon, neck too long, as if someone&#8217;s been having a tug-of-war with it. People getting off. The chap in question gets annoyed with one of the men standing next to him. He accuses him of jostling him every time anyone goes past. A snivelling tone which is meant to be aggressive. When he sees a vacant seat he throws himself on to it.</p><p>Two hours later, I meet him in the Cour de Rome, in front of the gare Saint-Lazare. He&#8217;s with a friend who&#8217;s saying: You ought to get an extra button put on your overcoat.&#8221; He shows him where (at the lapels) and why.</p></blockquote><p>Queneau rewrites this story in the past and the present. In reported speech. In the passive voice. In haiku.</p><blockquote><p><em>Summer S long neck</em></p><p><em>plait hat toes abuse retreat</em></p><p><em>station button friend</em></p></blockquote><p>Obviously, you can feed an LLM Queneau&#8217;s original story and prompt it to write in each of the prescribed styles. Can LLM capture the style? How could you know that LLM did a good job?</p><p>The only way to answer such questions is to lean on the tradition of <a href="https://www.unpopularfront.news/p/against-vulgar-positivism">vulgar positivism</a>. In a delightfully recursive metanarrative,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> Wouter and his co-author Meredith Martin ran a survey experiment. On the platform <a href="https://www.prolific.com/">Prolific</a>, they asked &#8220;real people&#8221; a series of questions about Wright&#8217;s translations of Queneau&#8217;s original stories and AI-generated versions. They ran several variants. In one, mimicking the style of Kevin Roose&#8217;s <a href="https://www.nytimes.com/interactive/2026/03/09/business/ai-writing-quiz.html">mechanistically obnoxious New York Times quiz</a> last week, they didn&#8217;t tell the participants how the two stories were generated and simply asked which better captured the style. In the second variant, they asked which captured the style better when the participants knew whether the story was Queneau&#8217;s or AI&#8217;s. In the third experiment, they asked for preferences with the true labels switched.</p><p>What happened next was entirely predictable. Without attribution of authorship, the &#8220;people&#8221; on Prolific slightly preferred the AI version, choosing the AI 55% of the time. There were 186 participants and 930 pairwise judgments, so statistical tradition would spew out a confidence interval somewhere between 3 and 7 percentage points wide, depending on the pedantry of Reviewer 2. Make of that what you will. On the other hand, with the correct labels, &#8220;people&#8221; only chose the AI 48% of the time. Most hilariously, when the labels were swapped, &#8220;people&#8221; chose what they <em>thought</em> was human 62% of the time.</p><p>To situate these numbers within our broader house of intellectual tradition, Haverals and Martin adopted a recently instituted social-scientific tradition: <a href="https://arxiv.org/abs/2301.07543">silicon sampling</a>. They ran a survey experiment where the participants were LLMs. When prompted with the same survey, LLMs chose AI-writing 50% of the time without labels. But with the correct labels, the machines judged Queneau superior 70% of the time. And with the swapped labels, AI chose what was presented as Queneau 64% of the time.  As the title of Wouter and Meredith&#8217;s paper says, &#8220;<a href="https://arxiv.org/abs/2510.08831">Everyone prefers human writers, even AI</a>.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zQMX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zQMX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png 424w, https://substackcdn.com/image/fetch/$s_!zQMX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png 848w, https://substackcdn.com/image/fetch/$s_!zQMX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png 1272w, https://substackcdn.com/image/fetch/$s_!zQMX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zQMX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png" width="472" height="461.7391304347826" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:810,&quot;width&quot;:828,&quot;resizeWidth&quot;:472,&quot;bytes&quot;:60737,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/191584284?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zQMX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png 424w, https://substackcdn.com/image/fetch/$s_!zQMX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png 848w, https://substackcdn.com/image/fetch/$s_!zQMX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png 1272w, https://substackcdn.com/image/fetch/$s_!zQMX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60eaa06e-9cd5-4f0d-8c00-ba7017bed6cf_828x810.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>There&#8217;s nothing surprising in these survey results, and that shouldn&#8217;t be surprising. Survey experiments are a woefully limited way to understand the social condition. They are completely mechanical. Of course, this sort of impoverished social science can be done by mechanical literary analysis. Silicon-sampled survey experiments enable us to mechanically generate stories from illusory correlations. These stories are interpreted traditionally as either informative or absurd, depending on the academic tradition in which you were raised. The recursion continues indefinitely. There are so many patterns and regularities in human behavior, and by simulating common text strings, we get text conforming to these regularities. To rephrase <a href="https://fitelson.org/confirmation/goodman_1955.pdf">Nelson Goodman</a>, regularities are where you find them, and in human tradition, you find them everywhere.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>That said, Maxim Raginsky gave a fun synthesis talk on assemblage, feedback, and cybernetics at the end of the conference. I hope he writes up his expletive-laden thoughts on <a href="https://realizable.substack.com/">The Art of Realizable</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>I wrote Cosma asking whether that quote was a Shalizi-ism or if I was misattributing it. He replied, &#8220;It&#8217;s not a conscious quotation on my part, but wouldn&#8217;t it be better if it was?&#8221;</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>This blogpost is all recursive metanarrative.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Small World Models]]></title><description><![CDATA[How much do you need to know about a system to control it?]]></description><link>https://www.argmin.net/p/small-world-models</link><guid isPermaLink="false">https://www.argmin.net/p/small-world-models</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Mon, 16 Mar 2026 14:19:53 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/9af04f0f-f1eb-402a-b2f6-63662fe70906_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!G6cV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!G6cV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!G6cV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!G6cV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!G6cV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!G6cV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:276176,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/191133103?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!G6cV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!G6cV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!G6cV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!G6cV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1bb046a0-93d2-4973-92f1-917c35a8eead_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><em>This is a live blog of Lecture 6 of my graduate seminar &#8220;Feedback, Learning, and Adaptation.&#8221; A table of contents is <a href="https://www.argmin.net/p/feedback-learning-and-adaptation-48f">here</a>.</em></p><p>The backbone of control engineering is the assumption of a reasonably reliable, reasonably simple dynamical system with inputs and outputs. We have to believe that the behavior of the thing we want to steer is consistent enough so that whatever we design in the lab will work on the road.</p><p>Now, what exactly do I need to know about this reliably consistent dynamical system to get it to do what I want to do? You want a model of the system that is rich enough to describe everything you might ever see, but small enough so that you can computationally derive control policies and, perhaps, performance guarantees. Simpler system descriptions yield simpler state-estimation and control-design algorithms, both online <em>and</em> offline. What&#8217;s the right balance between modeling precision and simplicity? This is the question of <em>system identification.</em></p><p>System identification is the natural place where machine learning and statistics meet control engineering. You need to either <em>estimate</em> parameters of models you believe are true or build <em>predictions</em> of how the system will respond to a string of inputs. What sorts of statistical infrastructure you need varies on your control engineering task</p><p>Sometimes you really believe that for all intents and purposes, the system behavior is captured by simple differential equations. An object falling in space without friction will obey Newton&#8217;s laws. To identify this system, you just need to measure the object&#8217;s mass. Easy peasy.</p><p>For more complicated mechanical systems, like quadrotors or simple, slow-moving wheeled vehicles, you still can get away with relatively simple modeling where the geometry of the problem gives you a nice differential equation with a few parameters determined by the build of your drone. In these cases, where you don&#8217;t need particularly high performance, you need only break out the ruler to scale to guestimate all of the parameters.</p><p>Sometimes you can&#8217;t determine the parameters from simple measurements, as environmental conditions dictate their values. For example, coefficients of friction might depend on temperature and the particulars of the flooring. Now you can find the parameters by repeatedly testing your system and running a nonlinear regression to minimize the input-output error.</p><p>As your problem gets even more complicated, maybe you don&#8217;t want to bother building a sophisticated simulator and would be perfectly fine with a &#8220;black-box&#8221; prediction of outputs from inputs. We&#8217;ve developed a zoo of methods to do this sort of prediction. The simplest are the &#8220;ARMAX&#8221; models, which predict outputs as a linear combination of the past inputs and outputs. You can fit these using least squares. If you want to be fancy, you can even compute nice &#8220;state-space&#8221; models from these linear ARMAX models, using a family of methods that are called subspace identification. This will yield smaller models and simplify your control synthesis problem.</p><p>On the other hand, you can go in a completely different direction and make your time-series predictor <em>nonlinear</em>. You can use a neural network to predict the next output from your history. If you want to get extra fancy, throw a transformer at the problem. I&#8217;m sure this will work great and build the best simulator without knowing anything about the problem at all.</p><p>So what&#8217;s the right level of modeling granularity for your problem? I don&#8217;t have a clear answer. In optimal control, the better your estimate, the better your performance. But maybe you care about the minimal amount of information you need to control something. How much is it?</p><p>You might think none. We&#8217;ve seen in class already that two systems that look completely different in open loop look the same in closed loop. Feedback can correct modeling errors. The simplest example is</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IOTn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IOTn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png 424w, https://substackcdn.com/image/fetch/$s_!IOTn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png 848w, https://substackcdn.com/image/fetch/$s_!IOTn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png 1272w, https://substackcdn.com/image/fetch/$s_!IOTn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IOTn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png" width="292" height="62" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:62,&quot;width&quot;:292,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:16320,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/191133103?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!IOTn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png 424w, https://substackcdn.com/image/fetch/$s_!IOTn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png 848w, https://substackcdn.com/image/fetch/$s_!IOTn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png 1272w, https://substackcdn.com/image/fetch/$s_!IOTn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2129833d-3ff9-439f-96b9-67999f00c0ad_292x62.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Input u[0]=1, and the &#8220;x&#8221; variable goes to infinity, but the &#8220;z&#8221; variable goes to zero. However, under the negative state feedback rule &#8220;u[t]=-x[t]&#8221;, the systems are identical</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ygtt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ygtt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png 424w, https://substackcdn.com/image/fetch/$s_!Ygtt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png 848w, https://substackcdn.com/image/fetch/$s_!Ygtt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png 1272w, https://substackcdn.com/image/fetch/$s_!Ygtt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ygtt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png" width="202" height="62" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:62,&quot;width&quot;:202,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:13926,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/191133103?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ygtt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png 424w, https://substackcdn.com/image/fetch/$s_!Ygtt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png 848w, https://substackcdn.com/image/fetch/$s_!Ygtt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png 1272w, https://substackcdn.com/image/fetch/$s_!Ygtt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa16d99b7-db1f-4007-ab82-f05b8f6e173d_202x62.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>which both quickly converge to zero.</p><p>Negative feedback is powerful and can drive solid performance in the face of huge model uncertainties. If you simply care about robust tracking or homeostatic behavior, perhaps you can get away with the most minimal system identification. Unfortunately, it&#8217;s not quite that easy. You can have two systems that look the same in open loop but have completely different closed-loop behavior. Karl Astrom has a relatively simple example that I described in <a href="https://www.argmin.net/p/the-same-but-different">an earlier post</a>. There, one system has a filter between the controller and the state that slightly attenuates the frequencies needed to stabilize the system.</p><p>Now the question is whether Astrom&#8217;s pathological counterexample&#8212;where two systems look similar in open loop simulation but are catastrophically different under feedback&#8212;is indicative of widespread problems. Probably not. I&#8217;m not convinced that you have to learn sophisticated robust control for most small-scale robotics demos. (Sorry, John, though complex aerospace systems are certainly another story). I think the takeaway from Astrom&#8217;s examples is that your model should represent the sorts of disturbances and signals you should see out in the world. And it should be cognizant of the fact that you are going to use the model in a closed-loop, so you have to understand whether there are delays and noise between the actuation signal and the actuation action.</p><p>Of course, this makes sense to any graduate student who has worked on a real robot.  Every robotics grad student I&#8217;ve spoken to has told me that investing the time in system identification makes the robotic performance infinitely better. Sometimes we have to sit with our dynamical systems for a long time before we know what we need to control them. Understanding what it means for our models to be good enough is the tricky part.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Benchmarking Culture]]></title><description><![CDATA[A summary of my presentation at the Cultural AI Conference]]></description><link>https://www.argmin.net/p/benchmarking-culture</link><guid isPermaLink="false">https://www.argmin.net/p/benchmarking-culture</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Tue, 10 Mar 2026 13:58:32 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/d6c4b6b1-e44a-4336-801a-9c5d9b533402_840x600.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cEfi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cEfi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cEfi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cEfi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!cEfi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cEfi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/baa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:133380,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/190496945?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cEfi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!cEfi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!cEfi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!cEfi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaa53d1a-ce73-4be5-8d7c-b959ab29f63d_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>What&#8217;s been clear so far about this conference on <a href="https://as.nyu.edu/research-centers/remarque/events/Spring-2026/cultural-ai--an-emerging-field.html">Cultural AI</a> is the organizers were interested in a broadly construed definition of AI and Culture. That works for me, as my talk ended up being about two ways of construing the culture of benchmarking. Here&#8217;s a summarized version of what I said.</p><p>I&#8217;ve been a machine learning researcher for nearly 25 years now (yikes), and I opened with a slide describing machine learning that I originally made in 2003.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sgwe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sgwe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png 424w, https://substackcdn.com/image/fetch/$s_!sgwe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png 848w, https://substackcdn.com/image/fetch/$s_!sgwe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png 1272w, https://substackcdn.com/image/fetch/$s_!sgwe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sgwe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png" width="450" height="267.99628942486083" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:642,&quot;width&quot;:1078,&quot;resizeWidth&quot;:450,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sgwe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png 424w, https://substackcdn.com/image/fetch/$s_!sgwe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png 848w, https://substackcdn.com/image/fetch/$s_!sgwe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png 1272w, https://substackcdn.com/image/fetch/$s_!sgwe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bf1cb92-053f-463d-b61e-d1002728c609_1078x642.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>It still works. Machine learning is prediction from examples, and that&#8217;s it. You have some blob of stuff that you call X&#8217;s. You have some blob of stuff you call Y&#8217;s, and you build computer programs to predict the Y&#8217;s from the X&#8217;s. The key thing that makes machine learning algorithms different than other kinds of predictions is that you deliberately try to bake in as few assumptions as possible, other than the fact that you have examples.</p><p>I find the online discourse castigating those who say &#8220;LLMs are just next token predictors&#8221; beyond annoying. They <em>are</em> just next token predictors. And that&#8217;s <em>fascinating</em>.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>The fascinating part comes in convincing yourself that your function works on new examples. How do you do that? Anybody who has read David Hume knows you can&#8217;t do it with formal proof. We convince ourselves through a particular system of evaluation. And then we built an entire engineering discipline on top of this.</p><p>Now, what is evaluation? <a href="https://www.argmin.net/p/machine-learning-evaluation-631">In our 2025 course</a>, Deb Raji and I <a href="https://www.argmin.net/p/machine-learning-evaluation">adapted </a>Peter Rossi&#8217;s definition, which he developed for social scientific program evaluation.</p><div class="pullquote"><p>Evaluation is measuring the difference between articulated expectations of a system and its actual performance.</p></div><p>This definition seems reasonable enough, but in a world obsessed with quantification, this sets into motion an inevitable bureaucratic collapse. If you want to make your evaluation legible and fair to all stakeholders, you must make it quantitative. If you want to handle a diversity of contexts, you must evaluate on multiple instantiations and report the average behavior. Quantification has to become statistical. And once you declare your expectations and metrics, everything becomes optimization.  Evaluation inevitably becomes statistical prediction.</p><p>This bureaucratic loop swallows up not just social scientific program evaluation but engineering evaluation more broadly. If you are calculating mean-square errors, you&#8217;re shoehorning your evaluation into statistical prediction. Everyone loves to lean on the artifice of clean statistical facts. Once you have set this stage, machine learning is <a href="https://arxiv.org/abs/2509.04546">practically optimal by definition</a>.</p><p>Machine learning as a discipline has no foundation beyond evaluation. This is a descriptive, not normative statement. The most successful machine learning papers work like this: I say that Y is predictable from X by Method M, and you should be impressed. I then make billions of dollars in a startup. Maybe I have to tell a story about how Method M relates to the brain or mean-field approximations in statistical physics. Fantastic stories don&#8217;t seem to hurt.</p><p>Now, here&#8217;s an invalid AI paper, which a lot of critics like to write: &#8220;Y is not predictable from X.&#8221; <a href="https://www.argmin.net/p/the-unpredictability-conundrum">It is impossible to refute this claim</a>. You can&#8217;t even refute it for simple methods, because what&#8217;s gonna happen is some high school kid is gonna go and change the rules slightly and prove you wrong. Then he will dunk on you on Twitter, gleefully writing &#8220;skill issue.&#8221;</p><p>The logical reconstruction makes the logical positivists roll over in their graves. The field is fueled by pure induction. We progress research programs by demonstration alone. And the way we convince others that our demos are cool is by sharing data and code.</p><p>Core to machine learning is the culture of data sets. I&#8217;m not sure if some poor soul is still trying to update <a href="https://en.wikipedia.org/wiki/List_of_datasets_for_machine-learning_research">this wikipedia page</a>, but the field thrives on shared data with common tasks. The data sets give you an easy path to impress your colleagues. You can argue about the novelty of your method M, which achieves high accuracy on a dataset that others agree is challenging.</p><p>People have turned datasets into literal competitions, starting with the <a href="https://www.argmin.net/p/prediction-games">Netflix Prize</a>, moving to the <a href="https://www.image-net.org/challenges/LSVRC/">ImageNet Large Scale Visual Recognition Challenge (ILSVRC)</a>, and ending with a <a href="https://www.kaggle.com/">company that hosts hundreds of competitions</a>. Not to get all monocausal on you, but the ImageNet Challenge is why we use neural networks today instead of other methods. More fascinating is that we declared protein folding solved because Alphafold did really well on a <a href="https://predictioncenter.org/casp13/">machine learning competition</a>. Competitive testing on benchmarks can produce Nobel Prizes.</p><p>You might be put off by how everything in machine learning becomes an optimization competition. But let&#8217;s applaud machine learning for its brutal, ingenuous honesty about how researchers are driven by ruthless competition. If you want to read more on how this works in practice, read Dave Donoho&#8217;s construction of <a href="https://hdsr.mitpress.mit.edu/pub/g9mau4m0/release/2">frictionless reproducibility</a>, my <a href="https://hdsr.mitpress.mit.edu/pub/8dqgwqiu">concurrence and analysis of the mechanism and costs</a>, the history Moritz Hardt and I lay out in <em><a href="https://mlstory.org/">Patterns, Predictions, and Actions</a></em>, or how it fits into the bigger story of <em><a href="https://press.princeton.edu/books/hardcover/9780691272443/the-irrational-decision">The Irrational Decision</a>.</em><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><p>Perhaps the weirdest transition of the last decade was a move from dataset evaluation to &#8220;generalist&#8221; evaluation. Tracking the GPT series of papers is instructive. <a href="https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf">GPT2</a> declared language models to be general-purpose predictors, but OpenAI made their case with rather standard evaluations and metrics. <a href="https://arxiv.org/abs/2005.14165">GPT3</a> moved on to harder to pin down evaluations in &#8220;natural language inference,&#8221; but there were still tables with numbers and scores. With <a href="https://arxiv.org/abs/2303.08774">GPT4</a>, we didn&#8217;t even get a paper. We got a press release formatted like a paper, which is fascinating in and of itself.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> That press release bragged about the LLM&#8217;s answers on standardized tests.</p><p>Of course, this got people excited, resulting in breathless press coverage and declarations of the end of education and white-collar work. Hypecycles are part of culture, too. Part of the goal of predicting Y from X is impressing people, and the results were very impressive. Based on the reaction, you can&#8217;t say that GPT4 didn&#8217;t surpass people&#8217;s expectations.</p><p>Now, if you live by the evaluation, you die by the evaluation. Notably, Facebook nuked their AI division after flopping on GPT4-style evaluation. Not only did its userbase think the model sucked, but they were caught cheating on their evaluations, too. In a flailing attempt to recover, Facebook went out and spent $14 billion to buy some random AI talent willing to report to King Z, and they&#8217;ve thrown orders of magnitude more at their subsequent AI investments. And what did this buy them? Literally <a href="https://techcrunch.com/2025/09/25/meta-launches-vibes-a-short-form-video-feed-of-ai-slop/">Vibes</a>.</p><p>We&#8217;re in this fascinating world now where research artifacts are consumer products, and the evaluation is necessarily cultural. Nonprofits funded by the same dirty money that funds AI companies might argue that they can measure the power of coding agents with objective statistical evaluations of yore. But agents are evaluated by coders&#8217; experiences and managers&#8217; fever dreams. I <a href="https://www.argmin.net/p/evaluation-or-valuation">wrote this a year ago</a>, and it remains true today: Generative AI lives in the weird liminal space between productivity software and science fiction revolution. The future of generative AI will be evaluated with our wallets. No leaderboard will help us do that.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>It should go without saying that we interface with via most corporate APIs is <a href="https://www.argmin.net/p/secrets-of-intelligence-services">much more than LLMs now</a>. That&#8217;s a topic for another day.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Out today! w00t.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Culture!</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Minimal Updates]]></title><description><![CDATA[A few argmin announcements from the road]]></description><link>https://www.argmin.net/p/minimal-updates</link><guid isPermaLink="false">https://www.argmin.net/p/minimal-updates</guid><dc:creator><![CDATA[Ben Recht]]></dc:creator><pubDate>Mon, 09 Mar 2026 13:59:34 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/cb5fe71d-0ecf-4063-a340-e3a51390f646_840x599.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dcVa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dcVa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dcVa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dcVa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dcVa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dcVa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg" width="1100" height="219" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:219,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:357448,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.argmin.net/i/190378368?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dcVa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dcVa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dcVa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dcVa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6a75ae8-af41-4c7d-bf13-5c3c6d42be2f_1100x219.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>This week, I&#8217;m attending a workshop at NYU organized by Tyler Shoemaker and Leif Weatherby called &#8220;<a href="https://as.nyu.edu/research-centers/remarque/events/Spring-2026/cultural-ai--an-emerging-field.html">Cultural AI: An Emerging Field</a>.&#8221; What is Cultural AI? I&#8217;m not sure yet, but this workshop is going to find out. At this point in my career, the invite list is far more important than the planned topic, and this list is top notch. My brief talk will be about the culture of benchmarking in machine learning and how weird it is now that we&#8217;re trying to benchmark culture. I&#8217;ll let you know what the group synthesizes later this week.</p><p>In other exciting news, <em><a href="https://press.princeton.edu/books/hardcover/9780691272443/the-irrational-decision">The Irrational Decision</a></em> comes out tomorrow! Get it wherever you buy books. You can even get it in one of those atavistic bindings of printed pages wrapped in a pretty dust cover. I hear they look good in the background of your podcast.</p><p>I should say something longer about the book, but I want to see how release day moves me. So until tomorrow, <a href="https://www.argmin.net/p/announcing-the-irrational-decision">here&#8217;s what I wrote about it last fall</a>. <em>Technology Review </em><a href="https://www.technologyreview.com/2026/02/18/1132579/robots-predict-future-book-review/">wrote a nice review</a> that companioned it with two other great books, <em><a href="https://press.uchicago.edu/ucp/books/book/chicago/M/bo255887145.html">The Means of Prediction: How AI Really Works (and Who Benefits)</a></em> by Max Kasy and <em><a href="https://www.penguinrandomhouse.com/books/759692/prophecy-by-carissa-veliz/">Prophecy: Prediction, Power, and the Fight for the Future, from Ancient Oracles to AI</a></em> by Carissa V&#233;liz. I need to read both of these. In the interdisciplinary journal <em>Critical AI, </em>Jathan Sadowski further <a href="https://criticalai.org/2025/12/10/sneak-preview-review-benjamin-rechts-the-irrational-decision/">contextualized the book within the broader sphere of science studies</a>.</p><p>Finally, for my friends and readers in the Bay Area, the folks at <a href="https://www.bloombergbeta.com/">Bloomberg Beta</a> and <a href="https://joinreboot.org/">Reboot</a> are sponsoring a launch event for <em>The Irrational Decision </em>on March 17 in San Francisco. If you&#8217;re in the area and interested, you can find more information and register <a href="https://luma.com/xijgl2st">here</a>. Space is limited, but it would be great to meet you in person.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.argmin.net/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.argmin.net/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item></channel></rss>