<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Deep Tech for Non Tech]]></title><description><![CDATA[Learning in public, one post at a time. Curious about many things, obsessed with deep tech, physical products. If you leave with one new idea, it worked!]]></description><link>https://clairechoi616.substack.com</link><image><url>https://substackcdn.com/image/fetch/$s_!aRQL!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc7b67785-ecfc-4439-8f25-11d53020d72c_1280x1280.png</url><title>Deep Tech for Non Tech</title><link>https://clairechoi616.substack.com</link></image><generator>Substack</generator><lastBuildDate>Sat, 23 May 2026 08:39:48 GMT</lastBuildDate><atom:link href="https://clairechoi616.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Claire]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[clairechoi616@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[clairechoi616@substack.com]]></itunes:email><itunes:name><![CDATA[Deep Tech for the Non Tech]]></itunes:name></itunes:owner><itunes:author><![CDATA[Deep Tech for the Non Tech]]></itunes:author><googleplay:owner><![CDATA[clairechoi616@substack.com]]></googleplay:owner><googleplay:email><![CDATA[clairechoi616@substack.com]]></googleplay:email><googleplay:author><![CDATA[Deep Tech for the Non Tech]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Deep-Tech Decoded: #11 Robot Training 101, Part 3: How Robots Learn What Happens Next with World Models]]></title><description><![CDATA[A robot can move its gripper toward a cup and still not understand the cup.]]></description><link>https://clairechoi616.substack.com/p/deep-tech-decoded-11-robot-training</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/deep-tech-decoded-11-robot-training</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Mon, 18 May 2026 14:30:56 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-6Fx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>A robot can move its gripper toward a cup and still not understand the cup.</p><p>The motion may be correct. But the consequence is still unknown.</p><p>The cup might lift cleanly. It might slide away. It might tip over. It might spill. It might be heavier than it looks. It might be stuck to a coaster. The robot may touch the cup, but the important question comes after contact:</p><blockquote><p>What happens next?</p></blockquote><p>That is the gap between motion and intelligence.</p><p>A robot does not just need to know what to do. It needs to know what its action will do to the world.</p><p>That is where world models come in.</p><p>The difference between a robot that follows a motion and a robot that behaves intelligently is prediction. A world model is the layer that lets a robot ask:</p><blockquote><p>What happens next if I do this?</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!V-dt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!V-dt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!V-dt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!V-dt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!V-dt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!V-dt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1274200,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!V-dt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!V-dt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!V-dt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!V-dt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0646badc-8135-40c3-ad7f-be30bace57a1_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h2>From policy and data to prediction</h2><p>In Part 1, we talked about <strong>policies</strong>: how robots choose actions. A policy maps what the robot observes to what it should do next. If the robot sees a cup, the policy may tell it to move the gripper toward the handle.</p><p>In Part 2, we talked about <strong>data</strong>: what robots need to learn from. Robot data is not just video. It is interaction data: what the robot saw, what it did, what it felt, and what happened next.</p><p>Now we move to the next layer: <strong>prediction</strong>. If the policy says, &#8220;move this way,&#8221; the world model asks, &#8220;what will happen if I move this way?&#8221;</p><p>A policy chooses. A world model predicts.</p><div><hr></div><h2>A short history: from mental models to game dreams to physical prediction</h2><p>World models are not a brand-new idea.</p><p>The deeper root is the old idea that intelligent systems use internal models of reality to reason before acting. Kenneth Craik is often credited with formalizing this idea in 1943: the mind can construct &#8220;small-scale models&#8221; of reality to anticipate events before they occur. </p><p>The modern AI version became famous through David Ha and J&#252;rgen Schmidhuber&#8217;s 2018 paper <strong>World Models</strong>. Their agent learned a compressed representation of game-like environments such as CarRacing and VizDoom, then used that learned model for control. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FOCc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FOCc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!FOCc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!FOCc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!FOCc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FOCc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1171595,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FOCc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!FOCc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!FOCc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!FOCc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3148b65-f4d3-4792-ad7f-f6a959b2c004_1448x1086.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: World Models (2018), David Ha &amp; J&#252;rgen Schmidhuber</figcaption></figure></div><p>That game setting matters. Games are controlled worlds. The rules are hidden, but stable. If an agent can learn the dynamics of the game, it can plan inside its learned version of that world.</p><p>Robotics makes the idea harder and more useful. A physical robot cannot cheaply crash, drop, fall, or break things millions of times. <strong>DayDreamer</strong> applied Dreamer-style world models directly to real robots, including a quadruped, robot arms, and a wheeled robot, to learn from physical interaction with less dependence on simulators. </p><p>Today, the term is also used more broadly. NVIDIA describes <strong>Cosmos</strong> as a world foundation model platform for physical AI and robotics workflows. Google DeepMind describes <strong>Genie 3</strong> as a general-purpose world model for generating interactive environments. Wayve describes <strong>GAIA-2</strong> as a controllable video-generative world model for autonomous driving. </p><p>So the arc is not just &#8220;games to robots.&#8221; It is:</p><blockquote><p>mental models &#8594; game dreams &#8594; physical prediction &#8594; generated worlds</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-6Fx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-6Fx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!-6Fx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!-6Fx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!-6Fx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-6Fx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1390255,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-6Fx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!-6Fx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!-6Fx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!-6Fx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad317c63-2c0b-4894-872b-7b54980e2226_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h2>What is a world model?</h2><p>A world model is the robot&#8217;s internal predictor of what might happen next.</p><p>It does not need to perfectly simulate the universe. It does not need to model every molecule, reflection, or microscopic contact point.</p><p>It only needs to be useful.</p><p>If the robot pushes a cup, the world model predicts whether the cup may slide, tip, or stay still.</p><p>If the robot reaches around a chair, it predicts whether the arm may collide.</p><p>If the robot places a plate near the table edge, it predicts whether the placement is stable.</p><p>A world model is not the world. It is a useful guess about how the world changes.</p><p>This is why world models are so appealing in robotics. They give the robot a way to rehearse possible futures before acting in the physical world.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!P5Mh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!P5Mh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!P5Mh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!P5Mh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!P5Mh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!P5Mh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1124054,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!P5Mh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!P5Mh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!P5Mh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!P5Mh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80109f1d-c761-4b6d-9ac3-f2bfd3858393_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h2>Policy vs. world model</h2><p>The easiest way to understand a world model is to contrast it with a policy.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dkn4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dkn4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png 424w, https://substackcdn.com/image/fetch/$s_!dkn4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png 848w, https://substackcdn.com/image/fetch/$s_!dkn4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png 1272w, https://substackcdn.com/image/fetch/$s_!dkn4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dkn4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png" width="1512" height="475" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:475,&quot;width&quot;:1512,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:771661,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6cf845f-7c41-4e84-9fb8-193f1ec67297_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dkn4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png 424w, https://substackcdn.com/image/fetch/$s_!dkn4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png 848w, https://substackcdn.com/image/fetch/$s_!dkn4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png 1272w, https://substackcdn.com/image/fetch/$s_!dkn4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d52c8c0-d086-4ef8-9fc8-e59d33363240_1512x475.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A policy might output: pull the drawer.</p><p>A world model predicts: the drawer may open, jam, resist, or collide with the object in front of it.</p><p>A policy is the action chooser. A world model is the consequence predictor.</p><p>The best systems may use both. The policy proposes actions. The world model helps evaluate what those actions might cause. Then the robot can choose a safer, more useful, or more reliable action.</p><p>This is robot imagination - not imagination in the poetic sense, but in the practical sense of testing futures before touching the world.</p><div><hr></div><h2>Why prediction is harder in robotics</h2><p>Prediction is hard in every domain. In robotics, it is physically hard.</p><p>The world pushes back.</p><p>A cup behaves differently depending on its weight, surface friction, grip angle, and whether it is full.</p><p>A towel can fold, bunch, stretch, wrinkle, or slip.</p><p>A cable can bend, tangle, snag, or resist being pulled.</p><p>A humanoid footstep can stabilize the body or make it fall.</p><p>A person can unexpectedly move into the robot&#8217;s path.</p><p>This is different from predicting the next word in a sentence. The robot&#8217;s action changes the world, and the changed world becomes the next problem.</p><p>Small errors compound. A slightly bad grasp changes the object pose. The changed object pose makes the next action harder. The next action creates a new state the robot may not have seen before.</p><p>Physical intelligence is consequence-aware intelligence.</p><p>The robot does not just need to identify a cup. It needs to predict the future of the cup under action.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fkvQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fkvQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!fkvQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!fkvQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!fkvQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fkvQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1270050,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fkvQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!fkvQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!fkvQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!fkvQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F637b1ff6-70b8-497b-9553-c3dcb5d57790_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h2>A practical map of world models</h2><p>I don&#8217;t think there is one universally accepted taxonomy of world models. Researchers and companies categorize them by architecture, modality, training objective, or use case.</p><p>But as an ex consultant I always have the urge to structure things, so I tried to make a practical taxonomy here - not a canonical industry standard, but a reader-friendly map to separate ideas that often get collapsed into the same phrase.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zMNO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zMNO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png 424w, https://substackcdn.com/image/fetch/$s_!zMNO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png 848w, https://substackcdn.com/image/fetch/$s_!zMNO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png 1272w, https://substackcdn.com/image/fetch/$s_!zMNO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zMNO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png" width="1586" height="626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:626,&quot;width&quot;:1586,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1292189,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d425038-7a7e-43e3-b6fe-32d29906ff5d_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zMNO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png 424w, https://substackcdn.com/image/fetch/$s_!zMNO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png 848w, https://substackcdn.com/image/fetch/$s_!zMNO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png 1272w, https://substackcdn.com/image/fetch/$s_!zMNO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc5cf462-c2d4-4b4c-9c8d-bb8aa3707f9d_1586x626.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The important point is not the taxonomy itself.</p><p>The important point is that &#8220;world model&#8221; is less a single architecture than a job description:</p><blockquote><p>Predict how the world changes.</p></blockquote><p>Some world models predict in a compact hidden space. Some predict videos. Some represent objects more explicitly. Some generate environments.</p><p>They are different ways of answering the same question: what might happen next?</p><div><hr></div><h2>What must a robot world model learn?</h2><p>A robot world model needs to learn more than appearance.</p><p>It needs to learn the ingredients of physical consequence.</p><p><strong>Objects: </strong>What things are, where they are, and how they can move<br><strong>Physics: </strong>Sliding, falling, rolling, balance, resistance<br><strong>Contact: </strong>What happens when the robot touches, pushes, grasps, or pulls<br><strong>Time: </strong>How small actions compound into future states<br><strong>Agents: </strong>How humans, pets, or other robots may move nearby<br><strong>Uncertainty: </strong>When the robot should be unsure and act cautiously</p><p>Contact is especially hard.</p><p>Before contact, vision can do a lot. The robot can see the cup, estimate its position, and plan a path.</p><p>After contact, the world becomes less predictable. The cup may slip. The gripper may press too hard. The surface may be sticky. The object may deform. A tiny difference in angle can change the outcome.</p><p>The world is not just a scene. It is a set of possible futures.</p><div><hr></div><h2>From experience to prediction</h2><p>This is where world models connect back to robot data.</p><p>Article 2 was about the ingredients: the data stack. What signal was captured? Which body produced it? What task was attempted? How was the data collected? How much time did it cover?</p><p>A world model uses that data differently.</p><p>It does not just store what happened. It learns patterns between action and consequence.</p><p>A static image can teach what a drawer looks like.</p><p>A robot episode can teach: reach toward handle, touch handle, pull, drawer opens.</p><p>A failure trace can teach: pull from the wrong angle, drawer jams.</p><p>A deployment log can teach: the same action sometimes succeeds and sometimes fails depending on object weight, friction, prior state, lighting, or human interruption.</p><p>Robot data is what happened. A world model is what the robot thinks might happen next.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fKXb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fKXb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!fKXb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!fKXb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!fKXb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fKXb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1286052,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fKXb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!fKXb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!fKXb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!fKXb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F048f57b9-4eb0-4d41-9224-d2d436237f79_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h2>A few robot-relevant world models worth knowing</h2><p>The robotics industry is not using one single &#8220;world model.&#8221; The field is still early, and different teams use the phrase in different ways.</p><p>A more useful way to read the market is to look at <strong>where world models are showing up in robot learning workflows</strong>. I&#8217;ll call out some world models that I feel like are being actively used / discussed.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Pw3h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Pw3h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!Pw3h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!Pw3h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!Pw3h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Pw3h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1245718,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Pw3h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!Pw3h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!Pw3h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!Pw3h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F039af8de-3117-4eab-9371-805b01b05c0b_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>DayDreamer</strong> is one of the cleanest robotics examples. It applied Dreamer-style world models directly to physical robots, including robot arms, a quadruped, and a wheeled robot. The important idea is not that every robot company uses DayDreamer specifically. It is that a robot can learn a predictive model from real-world interaction, then use that model to improve behavior with fewer physical trials. </p><p><strong>NVIDIA Cosmos</strong> represents the &#8220;world foundation model&#8221; direction for physical AI. NVIDIA positions Cosmos as a platform for building customized world models for robotics and autonomous vehicles, including predictive video worlds, synthetic data, edge cases, and robot-centric simulation. This is less about one robot policy and more about infrastructure: giving robotics teams a way to generate, simulate, and reason about physical scenarios before deploying in the real world. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!glm4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!glm4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png 424w, https://substackcdn.com/image/fetch/$s_!glm4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png 848w, https://substackcdn.com/image/fetch/$s_!glm4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png 1272w, https://substackcdn.com/image/fetch/$s_!glm4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!glm4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png" width="1456" height="562" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:562,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1748733,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!glm4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png 424w, https://substackcdn.com/image/fetch/$s_!glm4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png 848w, https://substackcdn.com/image/fetch/$s_!glm4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png 1272w, https://substackcdn.com/image/fetch/$s_!glm4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7241a58e-fabf-4aff-a821-b0de0c6f0721_2596x1002.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: NVIDIA</figcaption></figure></div><p><strong>Ctrl-World</strong> is a newer research example (2026) that gets closer to the problem robot companies actually care about: evaluating and improving generalist robot policies without running endless real-world rollouts. It is a controllable, multi-view world model for robot manipulation, trained on DROID trajectories, and designed to let policies roll out in &#8220;imagination space.&#8221; The paper reports that it can rank policy performance without real-world rollouts and improve policy success through synthetic successful trajectories. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Yo-3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Yo-3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png 424w, https://substackcdn.com/image/fetch/$s_!Yo-3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png 848w, https://substackcdn.com/image/fetch/$s_!Yo-3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png 1272w, https://substackcdn.com/image/fetch/$s_!Yo-3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Yo-3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png" width="1456" height="442" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:442,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:782938,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Yo-3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png 424w, https://substackcdn.com/image/fetch/$s_!Yo-3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png 848w, https://substackcdn.com/image/fetch/$s_!Yo-3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png 1272w, https://substackcdn.com/image/fetch/$s_!Yo-3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F67b87ddd-4082-4678-853f-856b0ac78da1_2054x624.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Ctrl-World: A Controllable Generative World Model for Robot Manipulation, Yanjiang Guo, Lucy Xiaoyang Shi, Jianyu Chen, Chelsea Finn</figcaption></figure></div><p><strong>Wayve GAIA-2</strong> is not a household-robot model, but it is useful as an autonomous-driving example of where world models are commercially heading. Wayve describes GAIA-2 as a controllable video-generative world model for driving, using video, text, and action inputs to generate possible driving futures. Driving is a more constrained robotics domain than general-purpose manipulation, but the underlying logic is similar: predict how the world may evolve under actions. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oF9X!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oF9X!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png 424w, https://substackcdn.com/image/fetch/$s_!oF9X!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png 848w, https://substackcdn.com/image/fetch/$s_!oF9X!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png 1272w, https://substackcdn.com/image/fetch/$s_!oF9X!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oF9X!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png" width="1456" height="790" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:790,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:810571,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oF9X!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png 424w, https://substackcdn.com/image/fetch/$s_!oF9X!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png 848w, https://substackcdn.com/image/fetch/$s_!oF9X!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png 1272w, https://substackcdn.com/image/fetch/$s_!oF9X!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62e69737-c73e-4fc4-8328-42185d5ed9dc_2352x1276.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving, Wayve</figcaption></figure></div><p>The takeaway is simple: in robotics, &#8220;world model&#8221; is becoming less of a single model category and more of a <strong>prediction layer</strong> across the training stack.</p><p>Some world models help robots practice in imagination.<br>Some help generate synthetic scenarios.<br>Some help evaluate policies before physical rollout.<br>Some are merging into broader robot foundation models.</p><p>Not every important robot model is a world model. But many frontier systems are circling the same problem:</p><blockquote><p>How do we connect physical understanding, prediction, and action?</p></blockquote><div><hr></div><h2>Why world models could change the robotics race</h2><p>World models matter because real-world trial and error is expensive.</p><p>A robot that can predict consequences can test more options before acting.</p><p>It can imagine several grasps before touching the object.</p><p>It can predict whether a package will slide before pushing it.</p><p>It can estimate whether a humanoid step is stable before shifting weight.</p><p>It can notice that opening a cabinet might collide with something nearby.</p><p>This can improve planning, safety, recovery, sample efficiency, and generalization.</p><p>But world models are not magic.</p><p>A bad world model can be worse than no world model if the robot trusts it too much. If the model predicts that a glass will stay stable but the glass tips, the robot still fails. If the model underestimates human movement, safety can break. If the model was trained on clean lab data, it may not predict messy homes or factories.</p><p>The goal is not to simulate the whole universe.</p><p>The goal is to predict enough of the next few seconds to act better.</p><p>For robotics companies, that could become a major advantage. The best systems will not only collect embodied data. They will turn that data into better predictions, better evaluations, safer actions, and faster learning loops.</p><p>Better prediction can turn the same experience into better behavior.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rVwn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rVwn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!rVwn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!rVwn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!rVwn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rVwn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1269907,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/196683458?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rVwn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!rVwn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!rVwn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!rVwn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dcec4a3-ff35-4b0e-90ef-aefa7c81e4ba_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h2>Consequence-aware robots</h2><p>A robot does not need perfect physics for every object in a kitchen.</p><p>It needs enough prediction to avoid breaking the glass, spilling the drink, crushing the fruit, tangling the cable, or trapping itself in a bad state.</p><p>That is why world models matter.</p><p>They are not just another technical module. They are a way to make robot intelligence less reactive and more consequence-aware.</p><p>A policy tells the robot what to do.</p><p>A world model helps it understand what the world might do back.</p><p>The robot that wins may not be the one that moves fastest.</p><p>It may be the one that best understands what its movement will cause.</p>]]></content:encoded></item><item><title><![CDATA[Deep-Tech Decoded: #10 Robot Training 101 Part 2: The Data Stack Robots Need Before They Can Learn]]></title><description><![CDATA[Continuing on from my last article on robot training.]]></description><link>https://clairechoi616.substack.com/p/deep-tech-decoded-10-robot-training</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/deep-tech-decoded-10-robot-training</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Mon, 11 May 2026 14:31:10 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!c7ID!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Continuing on from my last article on robot training.</p><p>This time, I&#8217;ll talk about what must come before training - the Data.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c7ID!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c7ID!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!c7ID!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!c7ID!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!c7ID!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c7ID!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1135047,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!c7ID!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!c7ID!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!c7ID!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!c7ID!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc1e1c92-e055-4104-8fff-f08aa1d63a9b_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A robot-training dataset can be <strong>tactile</strong>, <strong>egocentric</strong>, <strong>long-horizon</strong>, <strong>human-generated</strong>, and <strong>kitchen-task data</strong> at the same time.</p><p>That sentence is why robot data gets confusing.</p><p>When people talk about the &#8220;types of data&#8221; robots need, the conversation often becomes a flat list: vision data, tactile data, teleoperation data, simulation data, egocentric video, humanoid data, long-horizon data.</p><p>But those are not the same kind of category.</p><p>&#8220;Tactile&#8221; is a <strong>modality</strong>.<br>&#8220;Egocentric&#8221; is a <strong>viewpoint or collection method</strong>.<br>&#8220;Humanoid&#8221; is an <strong>embodiment</strong>.<br>&#8220;Kitchen task&#8221; is a <strong>task domain</strong>.<br>&#8220;Long-horizon&#8221; is a <strong>temporal structure</strong>.</p><p>A single dataset can be all of them at once.</p><p>That is the first shift: <strong>robot data is not a pile. It is a stack.</strong></p><p>And the robotics data problem is not just a scale problem. It is a composition problem.</p><p>Robots do not simply need more data. They need the right mix of embodied experience: what was sensed, whose body produced it, what task was being performed, how the data was collected, and how much time the episode covered.</p><p>Text models needed the internet. Robots need something harder: a structured record of bodies acting in the world.</p><div><hr></div><h2>Robot data is interaction data</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rQwg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rQwg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png 424w, https://substackcdn.com/image/fetch/$s_!rQwg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png 848w, https://substackcdn.com/image/fetch/$s_!rQwg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png 1272w, https://substackcdn.com/image/fetch/$s_!rQwg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rQwg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png" width="1456" height="1030" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1030,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1251150,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!rQwg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png 424w, https://substackcdn.com/image/fetch/$s_!rQwg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png 848w, https://substackcdn.com/image/fetch/$s_!rQwg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png 1272w, https://substackcdn.com/image/fetch/$s_!rQwg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51ee67e8-a6b0-44a9-ba3a-b371d9cd3ea2_1491x1055.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Internet data mostly records what humans wrote, said, saw, or uploaded.</p><p>Robot data records something different: an embodied system acting in the world and observing what happened next.</p><p>An image of a cup can teach a model what a cup looks like. A video of a person picking up a cup can teach intent and procedure. But a robot episode teaches more: where the gripper moved, when it touched the cup, how much force it applied, whether the cup slipped, how the robot corrected, and whether the task succeeded.</p><p>That is the core distinction:</p><blockquote><p>Robot data is not just observation data. It is interaction data.</p></blockquote><p>A robot does not only need to recognize a drawer. It needs to know how to reach the handle, how much resistance to expect, what to do if the drawer sticks, and how to stop if a human hand enters the workspace.</p><p>This is why robotics cannot simply copy the data playbook of language or vision models. The physical world does not just sit there waiting to be classified. It pushes back.</p><div><hr></div><h2>The five-axis data stack</h2><p>A clean way to categorize robot-training data is to ask five questions.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yCOH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yCOH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png 424w, https://substackcdn.com/image/fetch/$s_!yCOH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png 848w, https://substackcdn.com/image/fetch/$s_!yCOH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png 1272w, https://substackcdn.com/image/fetch/$s_!yCOH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yCOH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png" width="1514" height="650" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:650,&quot;width&quot;:1514,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1519428,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feabe83ec-5c54-4928-9e48-03d699b8d0c7_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yCOH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png 424w, https://substackcdn.com/image/fetch/$s_!yCOH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png 848w, https://substackcdn.com/image/fetch/$s_!yCOH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png 1272w, https://substackcdn.com/image/fetch/$s_!yCOH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1374fa7-97a4-4aa5-a8ad-cb9e8b7da265_1514x650.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>These are not five separate buckets of data. They are five labels that can describe the same episode.</p><p>For example:</p><blockquote><p>Egocentric RGB video plus hand-pose data, generated by a human body, during a kitchen task, collected through wearable capture, over a long-horizon sequence.</p></blockquote><p>Or:</p><blockquote><p>RGB, depth, proprioception, and action data, generated by a robot arm, during tabletop manipulation, collected through teleoperation, over a short task episode.</p></blockquote><p>This is why a MECE taxonomy matters. Without it, &#8220;tactile data&#8221; and &#8220;long-horizon data&#8221; sound like competing options. They are not. One describes the signal. The other describes the time scale.</p><p>The stack makes the conversation cleaner.</p><div><hr></div><h2>1. Modality: what signal is captured?</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5IWV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5IWV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png 424w, https://substackcdn.com/image/fetch/$s_!5IWV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png 848w, https://substackcdn.com/image/fetch/$s_!5IWV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png 1272w, https://substackcdn.com/image/fetch/$s_!5IWV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5IWV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png" width="1456" height="1030" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1030,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1458594,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!5IWV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png 424w, https://substackcdn.com/image/fetch/$s_!5IWV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png 848w, https://substackcdn.com/image/fetch/$s_!5IWV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png 1272w, https://substackcdn.com/image/fetch/$s_!5IWV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6ecb2ad0-7d7b-4eb7-b74c-361e2d79e7f7_1491x1055.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A modality is a channel of experience.</p><p>For robots, common modalities include RGB video, depth, proprioception, force, torque, tactile pressure, language, and action commands.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_eGo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_eGo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png 424w, https://substackcdn.com/image/fetch/$s_!_eGo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png 848w, https://substackcdn.com/image/fetch/$s_!_eGo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png 1272w, https://substackcdn.com/image/fetch/$s_!_eGo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_eGo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png" width="1392" height="753" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:753,&quot;width&quot;:1392,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1246882,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fe57bd-e338-41fd-8149-fe1a786a3e59_1491x1055.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_eGo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png 424w, https://substackcdn.com/image/fetch/$s_!_eGo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png 848w, https://substackcdn.com/image/fetch/$s_!_eGo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png 1272w, https://substackcdn.com/image/fetch/$s_!_eGo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff3d5dc41-a1a5-4720-9a39-ab742c19eef9_1392x753.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Each modality teaches something different.</p><p>Vision tells the robot what the world looks like. Depth helps it understand shape and distance. Proprioception tells the robot where its own body is. Force and torque reveal resistance. Tactile data captures pressure, slip, texture, and contact. Language tells the robot the goal. Action data records what movement was actually taken.</p><p>The key line is simple:</p><blockquote><p>Vision tells the robot what the world looks like. Touch and force tell it what happens when the world pushes back.</p></blockquote><p>This matters most in contact-rich tasks.</p><p>A camera may see that a tomato is red and round. It may not know whether the tomato is firm, slippery, bruised, or about to burst under pressure. A camera may see a cable. It may not know how the cable bends, tangles, or resists being pulled.</p><p>The more a task depends on contact, the less sufficient vision alone becomes.</p><p>This is why tactile and force data are so strategically interesting. They add a missing layer of physical feedback. They help robots move from seeing objects to handling them.</p><div><hr></div><h2>2. Embodiment: whose body generated the data?</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bkSY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bkSY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!bkSY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!bkSY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!bkSY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bkSY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ebebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1263619,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!bkSY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!bkSY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!bkSY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!bkSY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Febebe7ce-843d-4629-bb83-d82a6c8cd774_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Generated by ChatGPT</figcaption></figure></div><p>In robotics, the body is part of the dataset.</p><p>A human hand, a two-finger gripper, a robot arm, a quadruped, a bimanual robot, and a humanoid do not experience the same world. They have different joints, sensors, reach, balance, strength, and action spaces.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QdOe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QdOe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png 424w, https://substackcdn.com/image/fetch/$s_!QdOe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png 848w, https://substackcdn.com/image/fetch/$s_!QdOe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png 1272w, https://substackcdn.com/image/fetch/$s_!QdOe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QdOe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png" width="1376" height="767" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:767,&quot;width&quot;:1376,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1365805,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe4f82a1b-dd6a-497b-a7e0-bfddacbc6e6d_1491x1055.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QdOe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png 424w, https://substackcdn.com/image/fetch/$s_!QdOe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png 848w, https://substackcdn.com/image/fetch/$s_!QdOe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png 1272w, https://substackcdn.com/image/fetch/$s_!QdOe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F060f8ef7-434c-4eb4-b2c1-7a3b8fc28d0f_1376x767.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is why human video is useful but incomplete.</p><p>A video of a person folding laundry can teach procedure: pick up the towel, find the corners, fold, align, stack. But the human hand has soft skin, many degrees of freedom, and rich tactile sensing. A robot gripper may not have those capabilities. The task is the same. The body is not.</p><p>This is also why cross-embodiment robot data matters. Open X-Embodiment contains more than one million real robot trajectories across 22 robot embodiments, from single robot arms to bimanual robots and quadrupeds. Its value is not only scale; it is the attempt to pool experience across different bodies. (<a href="https://robotics-transformer-x.github.io/?utm_source=chatgpt.com">Robotics Transformer X</a>)</p><p>The lesson is subtle but important:</p><blockquote><p>A dataset does not just describe a task. It describes a task performed by a particular body.</p></blockquote><p>A humanoid learning from robot-arm data may gain useful priors. But it still has to translate that experience into a different body, with different constraints.</p><div><hr></div><h2>3. Task domain: what behavior is being learned?</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OLgZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OLgZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png 424w, https://substackcdn.com/image/fetch/$s_!OLgZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png 848w, https://substackcdn.com/image/fetch/$s_!OLgZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png 1272w, https://substackcdn.com/image/fetch/$s_!OLgZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OLgZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png" width="1456" height="1030" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1030,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1559627,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!OLgZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png 424w, https://substackcdn.com/image/fetch/$s_!OLgZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png 848w, https://substackcdn.com/image/fetch/$s_!OLgZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png 1272w, https://substackcdn.com/image/fetch/$s_!OLgZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5b11c69-a720-49b6-b11e-be277c90d2f1_1491x1055.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Not all robot data teaches equally useful behavior.</p><p>A million examples of pick-and-place do not automatically teach a robot to clean a kitchen.</p><p>Task domain asks: what kind of behavior is inside the data?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kEvv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kEvv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png 424w, https://substackcdn.com/image/fetch/$s_!kEvv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png 848w, https://substackcdn.com/image/fetch/$s_!kEvv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png 1272w, https://substackcdn.com/image/fetch/$s_!kEvv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kEvv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png" width="1381" height="747" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:747,&quot;width&quot;:1381,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1288633,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F410a7426-3e3b-4e0c-b262-43bc2cac2a58_1491x1055.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kEvv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png 424w, https://substackcdn.com/image/fetch/$s_!kEvv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png 848w, https://substackcdn.com/image/fetch/$s_!kEvv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png 1272w, https://substackcdn.com/image/fetch/$s_!kEvv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f320228-8cc1-4739-8655-b1c309660fad_1381x747.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The value of robot data depends on what behavior it teaches.</p><p>This is why task diversity has become so important. DROID, a large in-the-wild robot manipulation dataset, contains 76,000 demonstration trajectories, about 350 hours of interaction data, collected across 564 scenes and 86 tasks. The point is not only to collect more robot data, but to collect it across more varied environments and tasks so policies generalize better. (<a href="https://droid-dataset.github.io/?utm_source=chatgpt.com">DROID Dataset</a>)</p><p>For PMs and investors, the question should not be: how many episodes does the company have?</p><p>The better question is:</p><blockquote><p>What behaviors do those episodes actually cover?</p></blockquote><p>A robot company with 500,000 narrow lab demos may be less interesting than one with fewer but richer episodes across real-world variation, failure, and recovery.</p><div><hr></div><h2>4. Collection method: how was the data produced?</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0qQV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0qQV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!0qQV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!0qQV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!0qQV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0qQV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1337676,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!0qQV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!0qQV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!0qQV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!0qQV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc54f8018-8919-4df9-840d-fc1edbd87ff8_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The way data is collected determines the learning signal.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!S5k4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!S5k4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png 424w, https://substackcdn.com/image/fetch/$s_!S5k4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png 848w, https://substackcdn.com/image/fetch/$s_!S5k4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png 1272w, https://substackcdn.com/image/fetch/$s_!S5k4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!S5k4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png" width="1371" height="749" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:749,&quot;width&quot;:1371,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1365193,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F324f93a7-b70d-4ff5-bc10-54372b5efcf4_1491x1055.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!S5k4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png 424w, https://substackcdn.com/image/fetch/$s_!S5k4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png 848w, https://substackcdn.com/image/fetch/$s_!S5k4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png 1272w, https://substackcdn.com/image/fetch/$s_!S5k4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ae92c92-b1a5-474d-8d5d-28a5ff7a4c19_1371x749.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>Teleoperation gives robot-specific demonstrations. A human controls the robot, and the system records observations and actions. This is valuable because the data is already in the robot&#8217;s body and action space. But it can be expensive and slow.</p><p>Human video gives scale. The world has far more videos of humans doing tasks than robots doing tasks. Egocentric video is especially useful because it captures activity from a first-person perspective. Ego4D includes 3,670 hours of first-person video from 923 participants across 74 worldwide locations, making it a useful reference point for large-scale egocentric data. </p><p>But human video has a translation problem. It can show intent and procedure, but it does not directly provide robot motor commands.</p><p>Simulation gives cheap practice. Robots can fall, fail, and retry thousands of times without breaking hardware. It is useful for rare events, dangerous scenarios, and scalable variation. But simulation has a reality gap: what works in a physics engine may not work perfectly in the real world.</p><p>Synthetic data sits somewhere nearby. NVIDIA describes Isaac GR00T tools as generating large synthetic trajectory datasets from a small number of human demonstrations, using GR00T-Mimic and Cosmos as part of the data-generation workflow.</p><p>Deployment logs may be the most commercially valuable source. They capture the messiness of the real world: lighting changes, object variation, human interruptions, edge cases, and failure modes that no lab fully anticipated.</p><p>The collection method is not a logistics detail. It determines what the robot can learn.</p><div><hr></div><h2>5. Temporal structure: how much time does the data cover?</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QX0Z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QX0Z!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!QX0Z!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!QX0Z!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!QX0Z!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QX0Z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1253376,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QX0Z!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!QX0Z!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!QX0Z!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!QX0Z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd6ed4d9d-4ddb-468a-83ac-94c55574b544_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>Robot data also differs by time scale.</p><p>Short data can teach skills. Long data teaches work.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hXZP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hXZP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png 424w, https://substackcdn.com/image/fetch/$s_!hXZP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png 848w, https://substackcdn.com/image/fetch/$s_!hXZP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png 1272w, https://substackcdn.com/image/fetch/$s_!hXZP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hXZP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png" width="1424" height="729" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:729,&quot;width&quot;:1424,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1200932,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22ad765f-0517-420f-a619-d6d644ac374c_1491x1055.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hXZP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png 424w, https://substackcdn.com/image/fetch/$s_!hXZP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png 848w, https://substackcdn.com/image/fetch/$s_!hXZP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png 1272w, https://substackcdn.com/image/fetch/$s_!hXZP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ff7394-16a5-4146-8a61-abc38d40b4f4_1424x729.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>A robot may learn to grasp a cup from short episodes. But clearing a table requires more: find all objects, decide what belongs where, sequence actions, avoid collisions, recover from mistakes, and know when the task is complete.</p><p>That is a different data problem.</p><p>The longer the horizon, the more fragile the behavior becomes. Every additional step creates another chance for error. A robot that succeeds at step one may still fail at step five.</p><p>This is why long-horizon data is so valuable. It teaches continuity.</p><p>The robot does not just need to perform isolated motions. It needs to carry intention across time.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NuuT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NuuT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NuuT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NuuT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NuuT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NuuT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg" width="4284" height="3486" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:3486,&quot;width&quot;:4284,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1719550,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed1e6fdb-5945-456b-9e60-2a1f9333815c_4284x5712.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NuuT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NuuT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NuuT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NuuT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcb2f4153-901b-44a9-bf38-dc3baa150cb4_4284x3486.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The importance of long horizon videos is undeniable - Me and my team of former colleagues from Opengraph Labs won 1st place at AGI House robot hackathon on this topic!</figcaption></figure></div><div><hr></div><h2>The missing data: failure and recovery</h2><p>The cleanest robot demos usually hide the most important training signal.</p><p>Failure.</p><p>A cup slips.<br>A drawer sticks.<br>A cable tangles.<br>A towel folds unevenly.<br>A person walks into the workspace.<br>The robot bumps the object and has to replan.</p><p>These moments are not just errors. They are data.</p><p>Perfect demonstrations teach the path. Failure data teaches resilience.</p><p>This matters because real robots will not live inside perfect trajectories. They will live inside distribution shifts: new homes, new warehouses, new lighting, new objects, new humans, new interruptions.</p><p>A robot trained only on clean demonstrations may perform beautifully until the world deviates. Then it may freeze, amplify the mistake, or need human intervention.</p><p>Robustness lives in the recovery trace.</p><p>That means valuable robot datasets should not only ask: did the task succeed?</p><p>They should ask: what went wrong, when did it go wrong, what did the robot sense, what correction was attempted, and did recovery work?</p><div><hr></div><h2>Why deployment data becomes a moat</h2><p>This is where the technical discussion becomes strategic.</p><p>The most valuable robot data may come after deployment.</p><p>A lab can design tasks. A simulator can generate variation. A teleoperator can demonstrate skills. But the real world produces edge cases that are hard to imagine in advance.</p><p>A warehouse robot encounters unusual packages, damaged boxes, reflective surfaces, blocked aisles, and humans moving unpredictably.</p><p>A household robot encounters pets, children, clutter, low lighting, spilled liquids, furniture variation, and vague instructions.</p><p>An industrial robot encounters part variation, tool wear, alignment drift, safety interruptions, and unexpected resistance.</p><p>These are not side cases. They are the path to reliability.</p><p>That is why the real moat may not be the initial dataset. It may be the loop that keeps producing better data.</p><p>Deploy robots.<br>Capture real episodes.<br>Identify failures.<br>Retrain the policy.<br>Evaluate safety.<br>Redeploy.<br>Repeat.</p><p>The companies that own this loop can learn from the physical world faster than companies limited to lab demos or public datasets.</p><p>This does not mean every company with deployed robots automatically wins. The data has to be captured cleanly, labeled usefully, filtered intelligently, and connected back into training and evaluation.</p><p>But the strategic principle holds:</p><blockquote><p>The real data moat is not just the dataset. It is the system that keeps turning real-world experience into better behavior.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rnCN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rnCN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!rnCN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!rnCN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!rnCN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rnCN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1406064,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195940197?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rnCN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!rnCN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!rnCN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!rnCN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F568bcc64-9fec-4b29-8a77-33ddd0d69179_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><div><hr></div><h2>The real robotics data race</h2><p>The robotics race is often framed as a hardware race.</p><p>Who has the best humanoid?<br>Who has the strongest gripper?<br>Who has the most elegant demo?<br>Who has the cheapest robot arm?</p><p>Hardware matters. Models matter. But neither is enough.</p><p>The deeper race is about learning infrastructure: the ability to collect, structure, evaluate, and reuse embodied experience.</p><p>That is why robot data should not be treated as a flat list of data types. &#8220;Vision,&#8221; &#8220;tactile,&#8221; &#8220;egocentric,&#8221; &#8220;simulation,&#8221; and &#8220;long-horizon&#8221; are useful words, but they answer different questions.</p><p>A better frame is the stack:</p><p>What signal was captured?<br>Whose body produced it?<br>What task was being learned?<br>How was the data collected?<br>How much time did it cover?</p><p>Once you see the stack, the bottleneck becomes clearer.</p><p>Robots do not need data in the abstract. They need the right composition of experience: enough sensory richness to perceive and feel, enough embodiment diversity to generalize across bodies, enough task coverage to become useful, enough real-world grounding to avoid brittle behavior, and enough temporal depth to complete real jobs.</p><p>Software intelligence learned from the internet. Physical intelligence will learn from the world - but only if that world is captured as more than video.</p><p>It must be captured as embodied experience: sensed, acted, felt, failed, recovered, and repeated.</p><p>Robot data is not a pile.</p><p>It is a stack.</p>]]></content:encoded></item><item><title><![CDATA[Deep-Tech Decoded: #9 Robot Training 101 Part 1: How Machines Learn to Move]]></title><description><![CDATA[Over the past few weeks, a surprising number of classmates have asked me for a &#8220;robot training 101.&#8221; Maybe that&#8217;s the unofficial sign that a frontier-tech field is heating up: first the researchers get excited, then the founders, then MBAs start asking how it works &#128521;]]></description><link>https://clairechoi616.substack.com/p/deep-tech-decoded-9-robot-training</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/deep-tech-decoded-9-robot-training</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Mon, 04 May 2026 14:31:25 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!6Aot!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Over the past few weeks, a surprising number of classmates have asked me for a &#8220;robot training 101.&#8221; Maybe that&#8217;s the unofficial sign that a frontier-tech field is heating up: first the researchers get excited, then the founders, then MBAs start asking how it works &#128521;</p><p>What I also noticed, though, was that explaining robot training out loud is a different muscle from understanding it or writing about it. Most of my conversations so far have been with Stanford lab researchers, where the discussion can quickly get technical. But translating the same ideas for smart non-engineers forced me to ask a harder question: what is the simplest version of this field that is still accurate?</p><p>Since I love talking about this space anyway, I thought I&#8217;d write the version I wish existed: friendly enough for a smart non-engineer, but precise enough to make the core ideas stick.</p><div><hr></div><p>A chatbot can be wrong and remain safely inside the screen.</p><p>A robot cannot.</p><p>When a language model makes a mistake, it produces a bad sentence. When a robot makes a mistake, it may drop a glass, pinch a finger, scrape a countertop, block a hallway, or push too hard against an object it does not understand.</p><p>That is why robot learning is not simply &#8220;AI with arms and legs.&#8221; It is a different category of intelligence. The model is not only predicting information. It is choosing physical action.</p><p>The core idea is simple:</p><blockquote><p>Robot training is the process of turning experience into a policy &#8212; a model that maps what the robot observes to what it should do next.</p></blockquote><p>That sentence contains most of the field.</p><p>A robot needs to see the world. It needs to understand the task. It needs to know where its own body is. It needs to decide how to move. It needs to sense contact. It needs to recover when something slips. And it needs to do all of this safely, in a world that is messy, changing, and full of edge cases.</p><p>The first mental model is this:</p><blockquote><p>Robots are not trained to know things. They are trained to do things.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zYtW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zYtW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!zYtW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!zYtW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!zYtW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zYtW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1134401,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195715261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zYtW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!zYtW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!zYtW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!zYtW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd04036-b528-41fd-8794-fab10d1b6820_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Generated by ChatGPT</figcaption></figure></div><div><hr></div><h2>From prediction to movement</h2><p>Modern AI is often explained through prediction. A language model predicts the next token. An image model predicts visual patterns. A recommendation model predicts what a user may click, watch, or buy.</p><p>A robot also predicts. But its prediction becomes movement. Instead of predicting the next word, a robot policy predicts the next physical action.</p><p>That action could be tiny: move the gripper two centimeters left, rotate the wrist, close the fingers slightly, slow down, stop.</p><p>Or it could be higher-level: pick up the cup, open the drawer, walk across the room, hand the object to a person.</p><p><strong>Useful robots need both. They need high-level understanding of the goal and low-level control of the body.</strong></p><p>This is why a simple instruction like &#8220;put the mug in the sink&#8221; is not simple at all.</p><p>A human hears that sentence and fills in thousands of invisible assumptions: find the mug, approach it, choose a grasp point, avoid the plate, lift with enough force, navigate to the sink, place it gently, release.</p><p>A robot has to learn those assumptions from data.</p><div><hr></div><h2>The policy is the robot&#8217;s decision engine</h2><p>In robot learning, the most important word is <strong>policy</strong>.</p><p>A policy is the model that decides what action to take based on the current situation.</p><p>In its simplest form:</p><blockquote><p>observation &#8594; action</p></blockquote><p>The observation is what the robot knows right now. It may include camera images, depth, joint positions, gripper state, force readings, tactile signals, and a language instruction.</p><p>The action is what the robot does next. It may be a motor command, a gripper command, a movement trajectory, or a short sequence of planned motions.</p><p>So if an LLM turns text context into the next word, a robot policy turns physical context into the next move.</p><p>That is the bridge from AI to robotics.</p><p>Older robots were often programmed with explicit rules. Engineers defined the motion path, the object location, the grip pattern, and the safe operating zone. This works well in structured environments, like factories, where objects are predictable and tasks repeat.</p><p>But the real world is not structured.</p><p>Homes are cluttered. Warehouses change. Restaurants are chaotic. People move unpredictably. Lighting shifts. Objects deform. A towel is not a cup. A ripe tomato is not a metal bolt.</p><p>Modern robot training tries to replace brittle hand-coded behavior with learned policies that can adapt.</p><p>The goal is not magic. The goal is to stop writing every rule by hand.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fvFI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fvFI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!fvFI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!fvFI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!fvFI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fvFI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:921950,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195715261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fvFI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!fvFI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!fvFI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!fvFI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44489adf-5c4a-43ad-ac79-7d11e695f339_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: ChatGPT Generated</figcaption></figure></div><div><hr></div><h2>What counts as robot experience?</h2><p>For a robot, experience is not just video.</p><p>It is a record of what the robot saw, what it felt, what it did, and what happened next.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hgpC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hgpC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!hgpC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!hgpC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!hgpC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hgpC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1136903,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195715261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hgpC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!hgpC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!hgpC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!hgpC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b31d38d-6d92-4779-a4fc-81b567abf1cf_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: ChatGPT Generated</figcaption></figure></div><p></p><p>A common unit is a <strong>trajectory</strong>: a time-ordered sequence of observations and actions.</p><p>Imagine a robot learning to open a drawer. One trajectory might include the camera view of the drawer, the robot&#8217;s joint positions, the gripper moving toward the handle, the force signal when contact happens, the pull command, the drawer opening, and the final success label.</p><p>An <strong>episode</strong> is one complete attempt at a task.</p><p>A <strong>demonstration</strong> is usually a successful episode shown by a human, often through teleoperation. The human controls the robot. The robot records the observations and actions. The model learns to imitate the pattern.</p><p>This is why robotics data is so different from internet data.</p><p>A robot does not just need examples of objects. It needs examples of interaction.</p><p>It needs to know what happens when it touches the world.</p><div><hr></div><h2>The four ways robots learn</h2><p>Most robot-learning methods can be understood through four buckets: imitation learning, reinforcement learning, diffusion policies, and vision-language-action models.</p><p>Each teaches the robot something different. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6Aot!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6Aot!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!6Aot!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!6Aot!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!6Aot!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6Aot!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1078825,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195715261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6Aot!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!6Aot!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!6Aot!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!6Aot!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45920322-1481-4f19-bef3-2442cacfdbda_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Generated by ChatGPT</figcaption></figure></div><p>My personal belief is that no robot can be trained meaningfully via using only one type of learning - it&#8217;s better to have a mix. Let&#8217;s dive into what each type of learning is, to understand the pros and cons and how they could complement each other.</p><div><hr></div><h2>1. Imitation learning: copy the expert</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7fSe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7fSe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!7fSe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!7fSe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!7fSe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7fSe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1411569,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195715261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7fSe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!7fSe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!7fSe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!7fSe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F474a717a-9c37-4a4b-afc0-215b1357e016_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Imitation learning is the apprenticeship model of robotics.</p><p>A human demonstrates a task, and the robot learns to copy the pattern.</p><p>For example, a person might teleoperate a robot arm to pick up a sponge, wipe a table, and place the sponge back down. The system records the images, robot positions, gripper commands, and outcome. After many examples, the policy learns what actions tend to follow what situations.</p><p>This is practical because humans already know how to do useful tasks. Instead of asking a robot to discover everything from scratch, we show it what good behavior looks like.</p><p>The simplest version is <strong>behavior cloning</strong>: when the observation looks like this, take this action.</p><p>But imitation has a weakness. It teaches the robot what success looks like. It does not always teach the robot what to do after failure.</p><p>A human demo may show the perfect way to pick up a cup. But what if the robot nudges the cup? What if the cup rotates? What if the gripper closes too early? What if the object slips?</p><p>Small errors push the robot into situations it may never have seen in training.</p><p>That is why perfect demonstrations are not enough. Robots also need corrections, failures, and recovery examples.</p><p>A robot trained only on clean demos may look impressive in a video and fragile in the real world.</p><ul><li><p><strong>Companies / labs to know:</strong> AgiBot, Tesla Optimus, Mobile ALOHA / Stanford-DeepMind research. AgiBot has been reported to combine teleoperation with reinforcement learning for manufacturing tasks, Tesla has shifted Optimus training toward human video collection, and Mobile ALOHA is a well-known imitation-learning reference point for bimanual mobile manipulation. </p></li></ul><div><hr></div><h2>2. Reinforcement learning: practice with consequences</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VhEQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VhEQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!VhEQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!VhEQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!VhEQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VhEQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1276092,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195715261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VhEQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!VhEQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!VhEQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!VhEQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb681d1c7-513e-4cb2-8028-02e858fc2e15_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Generated by ChatGPT</figcaption></figure></div><p>Reinforcement learning trains a robot through trial and error.</p><p>The robot tries actions. It receives rewards for good outcomes and penalties for bad ones. Over time, it learns which behaviors lead to success.</p><p>This is powerful because the robot can discover strategies that humans did not explicitly demonstrate. It can improve through practice.</p><p>But physical practice is expensive.</p><p>If a simulated robot falls 10,000 times, nothing breaks. If a real humanoid falls 10,000 times, the lab has a problem.</p><p>That is why reinforcement learning is often used in simulation, in controlled settings, or as a fine-tuning layer after imitation learning. It is especially useful for skills like locomotion, where balance, recovery, and adaptation matter.</p><p>A simple way to think about it:</p><blockquote><p>Imitation learning gives the robot a first draft. Reinforcement learning helps it practice.</p></blockquote><p>But practice in robotics has a cost. Robots move slowly. Hardware wears down. Unsafe behavior is unacceptable. Real-world data is precious because it is expensive to collect.</p><p>This is one reason robotics has not scaled like language models. The internet already had text. It did not already have billions of clean robot trajectories.</p><ul><li><p><strong>Companies / labs to know:</strong> Boston Dynamics, Google DeepMind, Skild AI. Boston Dynamics has publicly described integrating reinforcement learning into Spot&#8217;s locomotion, while Skild AI describes simulation-heavy training for robots across diverse bodies and conditions. </p></li></ul><div><hr></div><h2>3. Diffusion policy: generate the motion, not just the next move</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9zFP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9zFP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!9zFP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!9zFP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!9zFP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9zFP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:993320,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195715261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9zFP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!9zFP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!9zFP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!9zFP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb1df171-9ec3-4e25-8f53-e528bf061b1a_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Generated by ChatGPT</figcaption></figure></div><p>Diffusion policy is one of the more important recent ideas in robot learning.</p><p>The intuition comes from image generation.</p><p>An image diffusion model starts with noise and gradually turns it into a coherent image. A diffusion policy does something similar, but for action.</p><p>Instead of generating a picture, it generates a sequence of robot movements.</p><p>This matters because many physical tasks have more than one correct solution. A robot can pick up a cup from the left, from the right, or from above. It can fold a cloth in different ways. It can move around an obstacle through several valid paths.</p><p>Some models struggle when the same situation has many possible next actions. They average the options and produce a bad middle path.</p><p>A diffusion policy can represent multiple plausible futures. It can generate a smooth action sequence that fits the scene.</p><p>The key idea:</p><blockquote><p>A diffusion policy does not just choose the next tiny move. It sketches the next few seconds of motion.</p></blockquote><p>That is useful for manipulation, where timing and coordination matter.</p><p>Picking up a soft object is not one action. It is a sequence: approach, align, touch, adjust, close, lift, stabilize.</p><p>The quality of the motion matters as much as the goal.</p><ul><li><p><strong>Companies / labs to know:</strong> Toyota Research Institute, Columbia / Diffusion Policy research, Physical Intelligence. TRI explicitly announced a generative AI approach based on Diffusion Policy to teach robots dexterous skills; the original Diffusion Policy research framed robot behavior as conditional denoising over action sequences; Physical Intelligence&#8217;s &#960;0 uses a related generative policy direction for generalist robot control. </p></li></ul><div><hr></div><h2>4. Vision-language-action models: connect meaning to movement</h2><p>The next frontier is the vision-language-action model, often called a VLA.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Hi_G!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Hi_G!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!Hi_G!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!Hi_G!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!Hi_G!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Hi_G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1021524,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195715261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Hi_G!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!Hi_G!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!Hi_G!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!Hi_G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F558bea7a-6131-46aa-b9b5-870f87cccb39_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Generated by ChatGPT</figcaption></figure></div><p>A VLA connects three things:</p><p>Vision: what the robot sees.<br>Language: what the human asks.<br>Action: what the robot does.</p><p>This is where robotics begins to look more like foundation-model AI.</p><p>A traditional robot might be trained for one narrow task: pick up the red block. A VLA-style robot aims to understand broader instructions, such as &#8220;move the empty cup next to the coffee machine&#8221; or &#8220;put the toy back where it belongs.&#8221;</p><p>That requires more than motor control. It requires meaning.</p><p>The robot must know what a cup is, what &#8220;empty&#8221; means, what &#8220;next to&#8221; means, and what a coffee machine looks like. Then it must translate that understanding into motion.</p><p>This is why large AI models matter for robotics. Internet-scale image and language training can give robots useful world knowledge.</p><p>But world knowledge is not the same as physical skill.</p><p>A model may understand the word &#8220;drawer.&#8221; That does not mean it knows how hard to pull one.</p><p>Robots need both semantic intelligence and embodied control.</p><p>That is the central challenge of physical AI: connecting meaning to movement.</p><ul><li><p><strong>Companies / labs to know:</strong> Physical Intelligence, Google DeepMind, Figure AI, Covariant. Physical Intelligence&#8217;s &#960;0 is described as a general-purpose robot foundation model trained to follow text instructions; Figure&#8217;s Helix is positioned as a humanoid VLA model; Covariant&#8217;s RFM-1 combines internet data with real-world robot interaction data for warehouse robotics. </p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dtsT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dtsT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png 424w, https://substackcdn.com/image/fetch/$s_!dtsT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png 848w, https://substackcdn.com/image/fetch/$s_!dtsT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png 1272w, https://substackcdn.com/image/fetch/$s_!dtsT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dtsT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png" width="1456" height="831" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:831,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1422219,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195715261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!dtsT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png 424w, https://substackcdn.com/image/fetch/$s_!dtsT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png 848w, https://substackcdn.com/image/fetch/$s_!dtsT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png 1272w, https://substackcdn.com/image/fetch/$s_!dtsT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1e24c99f-2523-4335-8c7c-26db425a490c_2500x1426.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Physical Intelligence</figcaption></figure></div><div><hr></div><h2>Why robot learning is hard</h2><p>Robotics is hard because the world pushes back.</p><p>In software, the environment is mostly symbolic. In robotics, the environment has friction, weight, texture, lighting, occlusion, gravity, and humans.</p><p>A robot can see a shirt, but fabric deforms. It can see a tomato, but pressure matters. It can see a door handle, but the hinge may be stiff. It can see a person, but that person may suddenly move.</p><p>Four problems show up again and again.</p><p>First, contact is difficult. Many useful tasks require touching, pushing, pulling, sliding, twisting, or grasping. Once contact happens, vision alone may not be enough. The robot may need force or tactile feedback.</p><p>Second, small errors compound. If a gripper is slightly misaligned, the object may slip. If the object slips, the next observation changes. If the model has never seen that state, it may make things worse.</p><p>Third, generalization is hard. A policy trained in one lab may fail in another room. A robot that picks up one mug may fail on a transparent glass, a heavy ceramic cup, or a mug with an unusual handle.</p><p>Fourth, long-horizon tasks are fragile. A robot may succeed at grasping but fail at the fifth step of cleaning a table. The more steps a task has, the more chances there are to drift.</p><p>This is why robot demos can be misleading.</p><p>A single successful run proves the robot succeeded once, under those conditions.</p><p>The real question is different:</p><blockquote><p>How often does it succeed, across how much variation, with how little human help, and how safely?</p></blockquote><div><hr></div><h2>Evaluation is not just accuracy</h2><p>In most AI products, evaluation can happen on test sets, user studies, or online metrics.</p><p>Robotics needs those too. But it also needs physical proof.</p><p>A useful robot policy is usually tested in layers.</p><p>Offline evaluation asks whether the model predicts reasonable actions on held-out data.</p><p>Simulation asks whether it works across many synthetic variations.</p><p>Lab testing asks whether it works on the real robot in a controlled setting.</p><p>Pilot deployment asks whether it works in a real environment with limited scope.</p><p>Full deployment asks whether it remains reliable over time.</p><p>The important metrics are physical operating metrics:</p><p>Task success rate.<br>Time to completion.<br>Collision rate.<br>Human intervention rate.<br>Recovery rate.<br>Performance on new objects.<br>Performance in new environments.<br>Safety under ambiguous instructions.</p><p>For PMs and investors, the key point is:</p><blockquote><p>A robot is not valuable because it can do a task once. It is valuable when it can do the task repeatedly, safely, and economically.</p></blockquote><p>That is why robotics progress can look slower than software AI progress.</p><p>The demo bar is low. The deployment bar is high.</p><div><hr></div><h2>Safety is part of training</h2><p>Safety in robotics is not a final checklist before launch.</p><p>It has to be part of the training loop.</p><p>A robot must learn how to act, but also when not to act.</p><p>If a human hand enters the workspace, the robot should slow down or stop. If the instruction is unsafe, it should refuse. If confidence is low, it should ask for help. If force exceeds a threshold, it should back off.</p><p>Safety has many layers.</p><p>Mechanical safety limits force, speed, torque, and range of motion.</p><p>Control safety prevents unstable movements.</p><p>Perception safety detects people, obstacles, fragile objects, and unexpected changes.</p><p>Policy safety prevents the learned model from selecting dangerous actions.</p><p>Semantic safety helps the robot understand that some instructions should not be followed.</p><p>Human override allows a person to stop, correct, or take over.</p><p>This is another reason data matters. Safe robots need more than successful demonstrations. They need near misses, human interventions, failed grasps, ambiguous scenes, unsafe commands, and recovery behavior.</p><p>In robotics, failure data is not a footnote.</p><p>It is training material.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WTWV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WTWV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!WTWV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!WTWV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!WTWV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WTWV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1025837,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195715261?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!WTWV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!WTWV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!WTWV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!WTWV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7aed5140-9f81-47b5-9a4d-e627ac35e034_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h2>The training loop is the product</h2><p>The most important robotics companies will not just build better machines.</p><p>They will build better learning loops.</p><p>The loop looks like this:</p><p>Define the task.<br>Collect demonstrations.<br>Train a policy.<br>Test it in simulation and the lab.<br>Deploy in a limited environment.<br>Capture successes, failures, interventions, and edge cases.<br>Improve the policy.<br>Repeat.</p><p>This loop is the real asset.</p><p>A company with robots in the field can collect data that a research lab cannot. A company with better teleoperation can generate demonstrations faster. A company with better simulation can test rare scenarios more cheaply. A company with better evaluation can improve without breaking old skills. A company with stronger safety infrastructure can deploy earlier and learn faster.</p><p>That is how robot data becomes a moat.</p><p>But not all data is equal.</p><p>Ten thousand near-identical pick-and-place demos may be less valuable than a smaller set of diverse episodes across objects, environments, failures, and recovery cases.</p><p>The better question is not:</p><blockquote><p>How much data does this company have?</p></blockquote><p>It is:</p><blockquote><p>Does this company have a loop that makes the robot measurably more useful over time?</p></blockquote><p>That is the commercial heart of robot training.</p><div><hr></div><h2>The simple version</h2><p>Robot training can sound intimidating because the field has so many terms: policies, trajectories, imitation learning, reinforcement learning, diffusion, VLAs, sim-to-real, embodiment, teleoperation.</p><p>But the basic loop is simple.</p><p>A robot observes the world.<br>A policy chooses an action.<br>The robot moves.<br>The world changes.<br>The result becomes new data.<br>The policy improves.</p><p>That is robot training.</p><p>The hard part is that the loop happens in the physical world, where mistakes are costly and every environment is slightly different.</p><p>This is why physical AI will not be won by the best demo alone. It will be won by the best learning system: the one that can turn messy embodied experience into reliable behavior.</p><p>The next robotics race will depend on better models, yes. But the deeper advantage will come from better data, better evaluation, better safety, and better feedback loops.</p><p>Software intelligence learned to speak by absorbing the internet.</p><p>Physical intelligence will learn differently.</p><p>It will learn by seeing, touching, failing, recovering, and trying again - until movement becomes reliable enough to leave the lab.</p>]]></content:encoded></item><item><title><![CDATA[Strategy Decoded: #5 Apple Doesn’t Need the Best AI Model. It Needs the Front Door.]]></title><description><![CDATA[TL;DR Apple may look late to AI, but it may be playing a different game.]]></description><link>https://clairechoi616.substack.com/p/strategy-decoded-5-apple-doesnt-need</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/strategy-decoded-5-apple-doesnt-need</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Mon, 27 Apr 2026 22:22:29 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!gJQU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>TL;DR </strong></p><ul><li><p>Apple may look late to AI, but it may be playing a different game.</p></li><li><p>Model companies are racing to build the smartest AI; Apple wants to own the doorway.</p></li><li><p>Tim Cook built the fortress, but not the next iPhone.</p></li><li><p>John Ternus signals Apple&#8217;s bet on AI as a device-native, ambient experience.</p></li></ul><p></p><div><hr></div><p>Everyone is asking the wrong question about Apple.</p><p>The question is usually: <strong>Can Apple catch up in AI?</strong></p><p>It is easy to see why. OpenAI has ChatGPT. Google has Gemini. Anthropic has Claude. NVIDIA has become the symbol of the AI boom. Meanwhile, Apple&#8217;s AI rollout has felt cautious, delayed, and strangely quiet for a company that once made the future feel obvious.</p><p>But Apple has rarely won by being first to a technology.</p><p><strong>Apple wins when it turns technology into a habit.</strong></p><p>So the better question is not whether Apple can build the smartest AI model. The better question is:</p><p><strong>Can Apple become the place where everyday people meet AI?</strong></p><p>That distinction matters. One question is about raw intelligence. The other is about distribution, trust, hardware, payments, apps, and daily behavior.</p><p>Apple may be late to the AI model race. But it may still be early to the AI doorway.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PuN_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PuN_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!PuN_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!PuN_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!PuN_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PuN_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1454317,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195684288?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!PuN_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!PuN_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!PuN_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!PuN_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F66dcd378-365c-4942-a1a1-359175fc7529_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Generated by ChatGPT</figcaption></figure></div><p></p><div><hr></div><h2>Apple is weak where AI is loud, but strong where AI becomes daily</h2><p>The visible AI race is happening in chat windows.</p><p>That is where Apple looks weakest. Siri, once an early symbol of consumer AI, now feels behind. ChatGPT can write, reason, code, summarize, and explain. Gemini is being pushed across Google&#8217;s products. Claude has become a favorite among knowledge workers.</p><p>Siri still often feels like a voice command tool from another era.</p><p>That is the obvious story.</p><p>The less obvious story is that most people do not choose AI by comparing benchmark scores. They choose what is already in front of them.</p><p>They do not wake up wondering which foundation model has the strongest reasoning performance this week.</p><p>They wonder: <strong>Can my phone help me get this done?</strong></p><p>That is Apple&#8217;s opening.</p><p>Apple still controls some of the most valuable consumer surfaces in technology: the iPhone in your hand, the Mac on your desk, the AirPods in your ears, the Watch on your wrist, the App Store where apps are discovered, Apple Pay where transactions happen, and iOS where permissions are granted.</p><p>OpenAI, Google, and Anthropic may own powerful AI brains. But Apple owns many of the places those brains need to live.</p><p>A simple analogy: the model companies are building better chefs. Apple owns the hotel - the lobby, the rooms, the concierge desk, and the payment system.</p><p>The chef matters. But the guest relationship often belongs to the hotel.</p><div><hr></div><h2>Apple&#8217;s real strategy: own the switchboard</h2><p>Apple&#8217;s AI strategy is probably not to out-OpenAI OpenAI.</p><p>It is to make Siri, iOS, and Apple devices the routing layer for AI.</p><p>Imagine asking Siri to plan dinner with a friend. That sounds simple. But it requires multiple small jobs: read the message, check your calendar, understand location, compare restaurants, maybe book a table, and draft a reply.</p><p>No single &#8220;AI answer&#8221; is enough. The system needs to move across apps, data, permissions, and actions.</p><p>That is where Apple wants to sit.</p><p>The user says: &#8220;Plan dinner with Sarah next Thursday.&#8221;</p><p>Behind the scenes, Apple decides what should handle each part: an Apple model, Gemini, ChatGPT, Claude, Maps, Calendar, Messages, or a restaurant app.</p><p>The user does not care which model did the work. The user only cares that it worked.</p><p>That is the <em><strong>switchboard strategy</strong></em>. Apple does not need to own every AI brain. It needs to decide which brain gets called, when, and inside what experience.</p><p>This is why Apple&#8217;s partnerships with external AI models should not be read only as weakness. They may also be Apple&#8217;s classic &#8220;buy the ingredient, own the experience&#8221; move.</p><p>Apple did not manufacture every component in the iPhone. It controlled the product. Apple did not invent every technology inside the Mac. It controlled the experience.</p><p>AI may follow the same logic. <strong>Apple wants models to become suppliers. Apple wants to remain the interface.</strong></p><div><hr></div><h2>Why this could work: Apple owns the stack</h2><p>Apple&#8217;s advantage is not one thing. It is the stack.</p><p>A stack simply means the layers that make a product work. In Apple&#8217;s case, that stack includes chips, devices, operating systems, apps, services, payments, identity, and retail distribution.</p><p>Most companies own one or two layers.</p><p><strong>Apple owns many.</strong></p><p>That matters more in AI than it did in the app era.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ChrK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ChrK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!ChrK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!ChrK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!ChrK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ChrK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1253142,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195684288?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ChrK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!ChrK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!ChrK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!ChrK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F575a252f-8ba4-4f08-8d04-990cf621c397_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Generated by ChatGPT</figcaption></figure></div><p></p><p>AI is not just another app category. The more useful AI becomes, the more access it needs. It needs your messages to summarize conversations. Your calendar to schedule things. Your photos to find memories. Your files to retrieve documents. Your location to make recommendations. Your payment tools to complete transactions.</p><p>That is powerful. It is also sensitive.</p><p>A personal AI agent is only useful if it can touch your life. But the more it touches your life, the more trust matters.</p><p>This is where Apple has a real advantage. For years, Apple has trained users to see it as the privacy-first consumer technology company. That positioning becomes more valuable when AI shifts from answering questions to taking actions.</p><p>Because in the agent era, trust is not branding.</p><p>Trust is infrastructure.</p><p><strong>The second layer is hardware.</strong></p><p>AI sounds like software, but it is deeply physical. It needs chips, memory, batteries, sensors, microphones, cameras, and thermal management. If AI runs on your phone, it must feel instant, private, and efficient enough not to destroy battery life.</p><p>This is why Apple Silicon matters.</p><p>Apple&#8217;s chip strategy is not just a performance story. It gives Apple control over how intelligence runs across its devices. Some tasks can happen on-device. Some can go to Apple&#8217;s private cloud. Some can go to external models. The user sees one experience. Apple orchestrates the work behind the curtain.</p><p>Think of Apple Silicon as the nervous system.</p><p>The AI may be the brain, but the nervous system decides how quickly, privately, and smoothly signals move through the body.</p><div><hr></div><h2>Tim Cook built the fortress. He did not build the next iPhone.</h2><p>This is where the CEO transition becomes strategically important.</p><p>Apple announced that Tim Cook will become executive chairman and John Ternus, Apple&#8217;s senior vice president of Hardware Engineering, will become CEO effective September 1, 2026. Apple described the transition as the result of a long-term succession process approved unanimously by its board. </p><p>Cook&#8217;s record is extraordinary.</p><p>He turned Apple into one of the most valuable companies in the world. He built a world-class supply chain. He expanded services. He scaled Apple Watch and AirPods. He made Apple less dependent on one heroic product launch.</p><p>But one critique stayed with him: Cook did not create the next iPhone.</p><p>Apple Car was canceled after years of effort. Vision Pro is technically impressive, but it has not become a mass-market platform. Its problem is not only price. It is clarity. Most consumers still do not have an instant answer to: <strong>Why do I need this every day?</strong></p><p>That is the difference between a beautiful technology and a killer product.</p><p>The iPhone did not require a long explanation. It collapsed the phone, iPod, browser, camera, and app platform into one object people immediately understood.</p><p>Vision Pro did not.</p><p>So Cook&#8217;s legacy has a paradox. He may be one of the most successful operators in corporate history, yet Apple enters the AI era with an unanswered product question:</p><p><strong>What is the next personal computer?</strong></p><p>Apple II made computing personal at home.<br>The Mac made it graphical.<br>The iPhone made it mobile.<br>The next version may be agentic.</p><p>An agent is not just software that answers. It is software that acts.</p><p>Not &#8220;here is the answer.&#8221;</p><p>More like: &#8220;I handled it.&#8221;</p><p>That is the product Apple is now chasing.</p><div><hr></div><h2>Why a hardware CEO makes sense in an AI era</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gJQU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gJQU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg 424w, https://substackcdn.com/image/fetch/$s_!gJQU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg 848w, https://substackcdn.com/image/fetch/$s_!gJQU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!gJQU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gJQU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gJQU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg 424w, https://substackcdn.com/image/fetch/$s_!gJQU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg 848w, https://substackcdn.com/image/fetch/$s_!gJQU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!gJQU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50c6b57b-546c-48ac-a5d9-3100e03209a3_1920x1080.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Apple</figcaption></figure></div><p>At first, choosing a hardware leader to run Apple in the AI era sounds backward.</p><p>Shouldn&#8217;t the next CEO be an AI researcher? A cloud executive? A software platform leader?</p><p>Maybe - if Apple were Google or OpenAI.</p><p>But Apple&#8217;s deepest skill has never been raw invention. It is translation.</p><p>Apple turns complex technology into objects normal people understand.</p><p>That makes John Ternus&#8217;s background more meaningful. He is not just a &#8220;hardware person.&#8221; He represents a bet that the next AI platform will not live only in a chatbot. It will live in devices.</p><p>Maybe smart glasses let AI see what you see.<br>Maybe AirPods become an always-available assistant.<br>Maybe Apple Watch becomes the health and context layer.<br>Maybe the home gets a new AI device.<br>Maybe the iPhone itself becomes less like an app launcher and more like an agent hub.</p><p>The common thread is simple:</p><p><strong>AI will not stay inside a text box. It will move into the devices around us.</strong></p><p>If that is true, a hardware-centric CEO is not a retreat into Apple&#8217;s past.</p><p>It is a bet on where AI goes next.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xezV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xezV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!xezV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!xezV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!xezV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xezV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1519611,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/195684288?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xezV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!xezV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!xezV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!xezV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3417ac5-c58c-4e07-b0e7-c005b79918e7_1672x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Generated by ChatGPT</figcaption></figure></div><p></p><div><hr></div><h2>The market implication: model companies become ingredients</h2><p>If Apple succeeds, the AI market may not evolve the way people expect.</p><p>Right now, model companies are the stars. They compete on intelligence, speed, cost, and brand.</p><p>But consumer technology often rewards the company that owns the interface.</p><p>Most people do not know which payment processor powers a checkout page. They know the store.</p><p>Most people do not know which supplier made a phone component. They know the device.</p><p>Most people may not know which model answered a question if the answer appears naturally inside Siri, Messages, Mail, Photos, or Calendar.</p><p>That is the opportunity.</p><p>Apple can let model companies fight the expensive intelligence war while it controls the consumer relationship.</p><p>This does not mean models are unimportant. Bad AI will break the experience. But if multiple models become good enough for many consumer tasks, distribution becomes more powerful.</p><p>That is when Apple becomes dangerous.</p><p>Model companies need users. Apple has users.<br>Model companies need trusted surfaces. Apple has trusted surfaces.<br>Model companies need monetization. Apple has payments.<br>Model companies need context. Apple has the device ecosystem where context lives.</p><p>This is why Apple can look late and still matter. &#127822;&#128170;</p><div><hr></div><h2>What to watch next</h2><p>The first signal is Siri.</p><p>Do not watch whether Siri gets a prettier animation. Watch whether it can complete multi-step tasks across apps. Can it find the right email, summarize the context, make a decision, and take action with permission?</p><p>The second signal is model choice.</p><p>If Apple lets users access multiple AI models through Siri or system settings, Apple becomes the switchboard. Model competition becomes Apple leverage.</p><p>The third signal is on-device AI.</p><p>Listen for Apple talking about privacy, local inference, neural engines, Apple Silicon, and private cloud execution. That language means Apple is not just adding AI features. It is building AI into the architecture of the product.</p><p>The fourth signal is new hardware.</p><p>Smart glasses, AI-enabled AirPods, home robots, foldables, or new ambient devices would suggest Apple believes the next interface is not a chat window.</p><p>The fifth signal is developer access.</p><p>If developers can plug their apps into Siri and Apple Intelligence in a useful way, Apple&#8217;s AI strategy becomes a platform strategy.</p><p>That is when the story gets much bigger.</p><div><hr></div><h2>The door still has to work</h2><p>There is one major caveat.</p><p>Apple cannot win the AI doorway if the doorway is broken.</p><p>If Siri remains unreliable, users will go directly to ChatGPT, Gemini, or Claude. If Apple moves too slowly, habits may form elsewhere. If regulation weakens Apple&#8217;s control over defaults and App Store economics, its gatekeeper power shrinks. If new hardware repeats the Vision Pro problem - technically beautiful, but not obviously necessary - the strategy stalls.</p><p>Apple also has a cultural challenge.</p><p>AI products improve through messy iteration. Apple prefers polish. The agent era may reward companies willing to ship imperfect tools, learn quickly, and tolerate public mistakes.</p><p>Apple has to move faster without breaking the trust that makes its strategy possible.</p><p>That is hard.</p><p>But Apple has one advantage few companies have: patience backed by distribution.</p><p>Tim Cook built the fortress.</p><p>John Ternus now has to answer the question Cook never fully answered: what is the next product that makes Apple feel inevitable again?</p><p>The answer may not look like another iPhone.</p><p>It may look like the iPhone, AirPods, Watch, Mac, and home quietly becoming one personal agent.</p><p>Not a chatbot you visit.</p><p>A layer of intelligence that follows you.</p><p>Apple does not need to build the smartest brain in AI.</p><p>It needs to make sure that when AI enters everyday life, it enters through Apple&#8217;s door.</p>]]></content:encoded></item><item><title><![CDATA[Strategy Decoded: #4 Why Elon Musk Isn’t Really Building a Chip Factory]]></title><description><![CDATA[TL;DR: The New Infrastructure of Ambition Beyond the Chip]]></description><link>https://clairechoi616.substack.com/p/strategy-decoded-4-why-elon-musk</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/strategy-decoded-4-why-elon-musk</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Wed, 25 Mar 2026 14:03:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Ew0a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>TL;DR: The New Infrastructure of Ambition Beyond the Chip</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ew0a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ew0a!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!Ew0a!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!Ew0a!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!Ew0a!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ew0a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png" width="1456" height="813" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:813,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:6801669,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/192051693?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ew0a!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!Ew0a!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!Ew0a!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!Ew0a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a55006-fd40-4ae5-b0d8-cd44db9db10b_2752x1536.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Generated by NotebookLLM</figcaption></figure></div><p></p><p>During my consulting years, I worked closely with traditional semiconductor companies - IDMs, foundries, system players. The industry was built on specialization. Design here. Manufacturing there. Integration somewhere else.</p><p>And for a long time, I assumed that was permanent. The capital intensity was too high. The technical barriers were too deep. No single player could realistically control the entire value chain. So I used to think this kind of integration was unrealistic. </p><p>But this is exactly what makes this moment so striking to me. Why is the same person suddenly talking about chips, datacenters, and space&#8230; in one strategy?</p><p>Over the past year, Elon Musk has been floating an idea called <em>Terafab</em> - a proposed ~$20&#8211;25 billion semiconductor facility, with ambitions ranging from ~100,000 wafers per month initially to far larger long-term capacity, and targeting advanced AI chips.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yBm6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yBm6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg 424w, https://substackcdn.com/image/fetch/$s_!yBm6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg 848w, https://substackcdn.com/image/fetch/$s_!yBm6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!yBm6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yBm6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg" width="1170" height="737" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:737,&quot;width&quot;:1170,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:174163,&quot;alt&quot;:&quot;Austin will have an advanced technology fab. TERAFAB location is TBD. Elon  confirmed it is far too massive for Giga Texas and would dwarf everything  there combined. Multiple sites are being evaluated,&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Austin will have an advanced technology fab. TERAFAB location is TBD. Elon  confirmed it is far too massive for Giga Texas and would dwarf everything  there combined. Multiple sites are being evaluated," title="Austin will have an advanced technology fab. TERAFAB location is TBD. Elon  confirmed it is far too massive for Giga Texas and would dwarf everything  there combined. Multiple sites are being evaluated," srcset="https://substackcdn.com/image/fetch/$s_!yBm6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg 424w, https://substackcdn.com/image/fetch/$s_!yBm6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg 848w, https://substackcdn.com/image/fetch/$s_!yBm6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!yBm6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9cef889c-7360-46cf-8a03-3c9e625d2e41_1170x737.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>But this is not what it looks like. This is not really about replacing TSMC.</p><p>To understand what&#8217;s really happening, we need to zoom out - from chips, to the entire system that produces AI.</p><div><hr></div><h2>1. Decode the Basics: Build the Mental Model</h2><p>For decades, the semiconductor world was modular. Designers designed. Foundries manufactured. Cloud providers deployed.</p><p>That worked - until AI scaled fast enough to break the seams between them. Today, AI is constrained by four things at once: chip supply, power, cooling, and system integration. And increasingly, the hardest problems sit at the boundaries. Not inside the chip - but between chips and memory, between racks, between systems.</p><p>This is where Terafab is better understood not as a traditional fab, but as a step toward a more integrated compute system.</p><div><hr></div><h2>2. What Is Actually Happening</h2><p>Tesla&#8217;s trajectory is moving from designing chips toward <strong>selectively controlling parts of the supply chain that matter most</strong>. Not necessarily to replace existing foundries - but to reduce dependency where it becomes strategically risky.</p><p>At the same time, the <strong>definition of performance is changing</strong>. Modern AI systems depend heavily on how tightly logic chips are connected to memory (like high-bandwidth memory) and how efficiently they are packaged together. In many cases, advanced packaging (not the chip itself) is becoming the bottleneck. That&#8217;s why <strong>integration is becoming more important</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HENQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HENQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg 424w, https://substackcdn.com/image/fetch/$s_!HENQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg 848w, https://substackcdn.com/image/fetch/$s_!HENQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!HENQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HENQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg" width="1456" height="700" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:700,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!HENQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg 424w, https://substackcdn.com/image/fetch/$s_!HENQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg 848w, https://substackcdn.com/image/fetch/$s_!HENQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!HENQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb94a8f0-fde0-4db3-930c-fa985bc9a032_2890x1390.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Then comes the <strong>next constraint: infrastructure</strong>. Datacenters are no longer just server farms - they are power-constrained, heat-constrained systems. And this is where the conversation expands again - into <em><strong>space</strong></em>. Not as an immediate solution, but as a long-term exploration of <strong>where compute could live if Earth-based constraints become binding</strong>.</p><p>Finally, all of this connects into a broader pattern. Tesla, xAI, SpaceX, and Terafab are best understood not as separate bets - but as <strong>different layers of a tightly coupled system.</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Gq8u!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Gq8u!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Gq8u!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Gq8u!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Gq8u!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Gq8u!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg" width="1456" height="858" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:858,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Gq8u!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Gq8u!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Gq8u!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Gq8u!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15c18033-81e3-4ac9-b4aa-bb5482641bce_2766x1630.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Elon&#8217;s dream kingdom requires A LOT</figcaption></figure></div><div><hr></div><h2>3. Insights per Deep Tech</h2><h3>3.1 Semiconductors</h3><p><strong>The most valuable part of a chip is no longer the chip.</strong></p><p>For a long time, semiconductor value was easy to locate. It sat inside the chip - better architecture, smaller nodes, higher performance.</p><p>But in modern AI systems, that&#8217;s no longer where the bottleneck lives. The constraint is increasingly in how chips are <em>combined</em> - especially how compute chips interact with memory.</p><p>Take high-bandwidth memory (HBM). It&#8217;s not just an add-on - it fundamentally determines how much of a chip&#8217;s theoretical performance you can actually use. If memory can&#8217;t keep up, the chip spends most of its time idle.</p><p>What&#8217;s interesting is where this shifts value. Not just to chip designers&#8212;but to whoever controls:</p><ul><li><p>memory integration</p></li><li><p>advanced packaging</p></li><li><p>system-level layout</p></li></ul><p>In other words, the competitive edge is moving from <em>designing the fastest chip</em> to <em>designing the most efficient system around it</em>.</p><p>And that&#8217;s a very different game.</p><div><hr></div><h3>3.2 Terafab + Datacenter</h3><p><strong>The scarcest resource in AI may not be chips - it may be </strong><em><strong>coordinated capacity</strong></em><strong>.</strong></p><p>When people talk about scaling AI, they often talk about more GPUs.</p><p>But the harder problem isn&#8217;t adding more hardware&#8212;it&#8217;s making everything scale together. You don&#8217;t just need chips. You need chips that arrive on time, are packaged correctly, can be deployed into systems, powered reliably, cooled efficiently, and networked together without bottlenecks. </p><p>Each step is manageable on its own. What&#8217;s hard is synchronizing all of them.</p><p>This is where the idea behind something like Terafab becomes interesting - not as a &#8220;better factory,&#8221; but as a way to reduce coordination friction across the entire pipeline. Because at scale, delays and inefficiencies don&#8217;t add&#8212;they multiply.</p><p>And the companies that win may not be the ones with the best individual components, but the ones that can align the entire system most tightly.</p><div><hr></div><h3>3.3 Spacetech </h3><p><strong>Compute is becoming a location-sensitive decision for the first time.</strong></p><p>Historically, computing had a kind of abstraction. It didn&#8217;t really matter where your servers were, as long as they were connected.</p><p>But that abstraction is starting to break. As AI infrastructure scales, physical constraints&#8212;power grids, cooling environments, land availability - start to matter more. Which means location starts to matter more. Space is one extreme version of this idea. </p><p>But even before that, we&#8217;re already seeing it on Earth:</p><ul><li><p>datacenters moving closer to cheap energy</p></li><li><p>clusters being built around specific geographies</p></li><li><p>infrastructure decisions shaped by physics, not just latency</p></li></ul><p>So the real shift isn&#8217;t &#8220;we&#8217;re moving to space.&#8221; It&#8217;s that compute is no longer location-agnostic.</p><p>And once location matters, infrastructure strategy becomes much more complex - and much more strategic.</p><div><hr></div><h3>3.4 Overall Industry Structure</h3><p><strong>Control is shifting from layers to interfaces.</strong></p><p>For decades, the semiconductor industry was organized in layers. Design &#8594; manufacturing &#8594; systems &#8594; deployment. Each layer created value independently.</p><p>But what&#8217;s emerging now is something different. The most strategic control points are no longer the layers themselves&#8212;but the <em>interfaces between them</em>.</p><ul><li><p>Between chip and memory</p></li><li><p>Between hardware and datacenter</p></li><li><p>Between infrastructure and deployment</p></li></ul><p>That&#8217;s where performance is won or lost. And that&#8217;s where dependency becomes risky.</p><p>As I mentioned earlier in this article, this is what makes the current shift feel so unusual - especially from the perspective of someone who has worked inside this industry.</p><p>I used to think full integration was unrealistic. And it still might be.</p><p>But what we&#8217;re starting to see isn&#8217;t full integration - it&#8217;s something more targeted.</p><p>Companies pulling control over specific interfaces where:</p><ul><li><p>bottlenecks are forming</p></li><li><p>dependencies are highest</p></li><li><p>differentiation is hardest to outsource</p></li></ul><p>And that&#8217;s what makes this moment feel both impressive and slightly unsettling. Because if control moves to those interfaces, then the balance of power in the industry doesn&#8217;t just shift slightly. It reshapes entirely.</p><div><hr></div><p><strong>Mini Section:</strong> &#128640; <strong>How to become a SpaceX shareholder &#128184;</strong></p><blockquote><p>&#9642;&#65039; There&#8217;s basically no clean way for retail investors to buy SpaceX. It&#8217;s still private. Secondary markets exist, but they&#8217;re opaque and dominated by institutions</p><p>&#9642;&#65039; So the market starts creating <em>proxies</em>. A closed-end fund holding private AI/defense names recently traded at <strong>~1,550% premium to NAV </strong>&#8594; investors paying <strong>15&#215;+</strong> just for indirect exposure</p><p>&#9642;&#65039; In another case, a fund with <strong>$19 NAV traded at ~$315 </strong>&#8594; effectively paying $16 to get $1 of SpaceX exposure</p><p>&#9642;&#65039; More workarounds emerging:</p><ul><li><p>funds tracking private unicorn indices without owning shares</p></li><li><p>structured deals mimicking exposure via swaps</p></li><li><p>companies like EchoStar rising <strong>300%+</strong>, driven more by SpaceX linkage than fundamentals</p></li></ul><p>&#9642;&#65039; This isn&#8217;t just hype - it&#8217;s a structural signal.<br>&#8594; the most important AI/infrastructure companies are staying private longer<br>&#8594; access itself is becoming scarce</p><p>&#9642;&#65039; The same pattern we see in compute is now showing up in capital markets<br>scarcity &#8594; bottlenecks &#8594; premiums</p><p>&#9642;&#65039; Not just who can build the future &#8594; but who can <em>own</em> it</p></blockquote><div><hr></div><h2>4. What&#8217;s Real vs What&#8217;s Still Speculative</h2><p>There are real signals here. </p><p>AI is pushing against physical limits - chips, power, cooling. Companies are starting to think in full-stack terms. Integration is becoming strategic.</p><p>But there&#8217;s also a lot that remains uncertain.</p><p>Building a $20&#8211;25 billion fab is one thing. Building one that competes at 2nm, scaling from 100,000 wafers per month to 1 million, is something else entirely. And orbital datacenters, while conceptually compelling, are still far from proven at scale.</p><p>So the architecture of the idea may be real. The execution is still a very open question.</p><div><hr></div><h2>5. What This Means</h2><p>This isn&#8217;t really about Terafab. It&#8217;s about how the AI race is being redefined.</p><p>We used to think the winners would be those with the best models. Now it&#8217;s starting to look like the winners may be those who control the infrastructure beneath them.</p><p>Not just chips. But power. Cooling. Manufacturing. Even geography.</p><p>The center of gravity is shifting - from software to systems.</p><div><hr></div><h2>6. What Changed My Mind</h2><p>A year ago, I would have been much more certain about how this industry works. But now I have so many questions.</p><ul><li><p>Can a new entrant realistically build and operate an advanced-node fab at scale?<br><s>(Given what I&#8217;ve seen Elon has done before&#8230; I&#8217;m more open to it than I would have been three years ago.)</s></p></li><li><p>Will selective vertical integration actually create durable advantage - or will the complexity eventually outweigh the benefits?</p></li><li><p>And is space-based compute a real future - or a compelling narrative layered on top of more grounded constraints?</p></li></ul><p>But stepping back, I think the more important realization is this. I used to think the structure of this industry was fixed. That specialization was not just efficient - but inevitable.</p><p>Now I&#8217;m not so sure. Because what&#8217;s changing isn&#8217;t just technology. It&#8217;s where the bottlenecks are. And when bottlenecks shift, the structure of an industry tends to follow.</p><p>So maybe this isn&#8217;t really about Terafab or even about Elon Musk. Maybe it&#8217;s about a deeper transition: from a world where intelligence was built on modular software stacks&#8230; to one where it depends on tightly integrated physical systems.</p><p>And if that&#8217;s true, then the question isn&#8217;t just: Who builds the best models?</p><p>It&#8217;s: <em><strong>Who controls the system that makes those models possible.</strong></em></p>]]></content:encoded></item><item><title><![CDATA[Event De-coded: #2 CES 2026 - AI Wears Robot]]></title><description><![CDATA[A non-engineer robot enthusiast's walk through the first CES where robots talked less and worked more]]></description><link>https://clairechoi616.substack.com/p/event-de-coded-ces-2026-the-day-ai</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/event-de-coded-ces-2026-the-day-ai</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Mon, 19 Jan 2026 17:00:47 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!KW2z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>Why CES Was on My Bucket List </h2><p>I didn&#8217;t grow up thinking CES was some mythical temple of technology.</p><p>During my McKinsey projects in deep tech, CES was simply one of many reference points - along with academic papers, startup demos, and customer interviews&#8212;when we tried to understand <em>where a technology actually stood</em>. But every January I would watch the announcements and feel that mix of curiosity and distance: frontier ideas appearing on a stage I had never seen in person.</p><p>So CES slowly became less of a benchmark and more of a quiet wish:<br><strong>one day, go and see it myself.</strong></p><p>When CES 2026 announced the theme <strong>&#8220;Innovators Show Up,&#8221;</strong> I finally decided to stop watching from a laptop screen. I bought the ticket, sent that slightly uncomfortable email to my professor, and flew to Las Vegas expecting a parade of futuristic concepts.</p><p>What I found was different.<br>Less spectacle than I imagined - and much more practicality.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KW2z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KW2z!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KW2z!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KW2z!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KW2z!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KW2z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg" width="728" height="929.1803278688525" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:3815,&quot;width&quot;:2989,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:2311507,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/184888842?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4db8fcc-4dcc-46db-86fc-51dca642de1e_2989x5313.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KW2z!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KW2z!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KW2z!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KW2z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6f1c3ec-eae1-4e49-ada4-e80a0526ff09_2989x3815.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><div><hr></div><h2>&#127482;&#127480; Silicon Valley - Where AI Starts Doing Chores</h2><h3>Dexterity: Function Before Charm</h3><p>The first demo that held me was Dexterity&#8217;s warehouse system - two large arms on a mobile base stacking irregular boxes into a truck. No face, no attempt to look human, just mechanical confidence.</p><p>The operator explained:</p><blockquote><p>&#8220;Two workers for 30 minutes &#8594; one robot for 10.&#8221;</p></blockquote><p>As someone raised on beautifully designed Korean electronics, the deliberate ugliness felt strange. But the logic was hard to argue with: this robot existed to <strong>earn its salary</strong>, not to win design awards.</p><h3>Matternet: Drones That Already Have Customers</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!O9vb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!O9vb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png 424w, https://substackcdn.com/image/fetch/$s_!O9vb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png 848w, https://substackcdn.com/image/fetch/$s_!O9vb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png 1272w, https://substackcdn.com/image/fetch/$s_!O9vb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!O9vb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png" width="700" height="365" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:365,&quot;width&quot;:700,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A person standing in front of a drone\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A person standing in front of a drone

AI-generated content may be incorrect." title="A person standing in front of a drone

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!O9vb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png 424w, https://substackcdn.com/image/fetch/$s_!O9vb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png 848w, https://substackcdn.com/image/fetch/$s_!O9vb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png 1272w, https://substackcdn.com/image/fetch/$s_!O9vb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0104a86-4c78-45f4-89f0-f9b7f34f998f_700x365.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Matternet</figcaption></figure></div><p>Matternet&#8217;s booth added another layer of realism. They weren&#8217;t talking about hypothetical pilots; they were describing <strong>active routes</strong> in California - medical samples between hospitals, grocery deliveries in suburban neighborhoods, contracts with municipalities.</p><p>The business model sounded more like a logistics company than an aviation startup: per-delivery pricing, service-level agreements, fleet maintenance schedules. It was refreshing to hear a robot company speak the language of <strong>operations rather than imagination.</strong></p><h3>Psyonic: When Price Becomes the Innovation</h3><p>At Psyonic I was reminded that commercialization can be as simple - and as difficult -as lowering cost. Their <strong>Ability Hand</strong> used to cost tens of thousands of dollars; through new manufacturing and AI-based control it has dropped to roughly <strong>$10&#8211;20k</strong>, opening insurance coverage and everyday adoption.</p><p>Watching someone hold a cup and feel pressure through vibration was a quiet but powerful example of value creation - <strong>needs first, technology second.</strong></p><div><hr></div><h3>NVIDIA - The Invisible Spine of Physical AI</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!t6sf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!t6sf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg 424w, https://substackcdn.com/image/fetch/$s_!t6sf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg 848w, https://substackcdn.com/image/fetch/$s_!t6sf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!t6sf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!t6sf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg" width="1456" height="820" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:820,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;NVIDIA Rubin Platform, Open Models, Autonomous Driving: NVIDIA Presents  Blueprint for the Future at CES | NVIDIA Blog&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="NVIDIA Rubin Platform, Open Models, Autonomous Driving: NVIDIA Presents  Blueprint for the Future at CES | NVIDIA Blog" title="NVIDIA Rubin Platform, Open Models, Autonomous Driving: NVIDIA Presents  Blueprint for the Future at CES | NVIDIA Blog" srcset="https://substackcdn.com/image/fetch/$s_!t6sf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg 424w, https://substackcdn.com/image/fetch/$s_!t6sf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg 848w, https://substackcdn.com/image/fetch/$s_!t6sf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!t6sf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a7f2a53-4485-4b97-ad4a-eeffc99b689c_1921x1082.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: NVIDIA</figcaption></figure></div><p>Among all these individual machines, NVIDIA felt like the city planner.</p><p>Their story wasn&#8217;t one robot but the <strong>stack</strong> that makes robots possible: chips, simulation, reasoning models, and deployment tools. Jensen Huang framed this as a &#8220;Physical AI moment,&#8221; and on the floor I began to understand why.</p><p>The new <strong>Rubin</strong> platform focused on cheaper, faster real-time inference - crucial when a robot must react instantly rather than wait for the cloud. The <strong>Alpamayo</strong> model family aimed at <em>reasoning about actions</em>, not only recognizing objects. Robots, they argued, need judgment more than sharper eyes.</p><p>What interested me most were tools like <strong>Cosmos</strong> and <strong>OSMO</strong> for synthetic data and training loops. Real robot data is expensive; NVIDIA is trying to industrialize learning itself. I saw these systems powering demos from Boston Dynamics, Caterpillar, LG, Franka, and NEURA - an entire ecosystem orbiting one middleware layer.</p><p>It struck me that NVIDIA isn&#8217;t selling a chip.<br>They&#8217;re selling the <strong>operating system for a profession of robots.</strong></p><div><hr></div><h2>&#128200; From Demonstrations to Business Models</h2><p>Not every booth had this clarity. Many companies still looked like they were rehearsing for a future without customers.</p><p>And that&#8217;s fine - CES has always been part dream, part spreadsheet.</p><p>What felt new was how many conversations revolved around <strong>unit economics</strong> rather than hero videos: interventions per hour, utilization, service contracts. Physical AI was beginning to sound like <strong>operations</strong>, not science fiction.</p><h3>Why Now?</h3><p>Three trends aligned:</p><ol><li><p>Multimodal models connecting vision and language</p></li><li><p>Hardware costs pushed down by smartphone supply chains</p></li><li><p>Labor gaps in logistics, care, and delivery</p></li></ol><p>Together they turned robots into <strong>financially legible workers.</strong></p><div><hr></div><h2>&#127472;&#127479; Korea - More Strategic Than I Expected</h2><h3>LG: Turning the Home Into Robot Infrastructure</h3><p>LG&#8217;s <strong>CLOi-D</strong> booth was packed, and not just because the robot looked friendly. What stood out was the <strong>system thinking</strong> behind it.</p><p>Most home-robot demos try to squeeze a machine into today&#8217;s messy kitchens. LG flipped the premise:<br><strong>redesign the environment first.</strong></p><p>Their demo apartment showed:</p><ul><li><p>fridges with standardized handle geometry</p></li><li><p>cabinets positioned for robotic reach</p></li><li><p>appliances exposing structured &#8220;task APIs&#8221;</p></li></ul><p>The robot was careful and sometimes slow, but the idea was larger: CLOi-D as an orchestrator across appliances LG already dominates. From a strategy lens this felt very Korean; use existing market power, control the environment layer, and let the robot improve gradually on top.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MX3s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MX3s!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg 424w, https://substackcdn.com/image/fetch/$s_!MX3s!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg 848w, https://substackcdn.com/image/fetch/$s_!MX3s!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!MX3s!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MX3s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg" width="1440" height="839" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:839,&quot;width&quot;:1440,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;LG Electronics Presents LG ClOiD Home Robot To Demonstrate &#8220;Zero Labor  Home&#8221; at CES 2026&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="LG Electronics Presents LG ClOiD Home Robot To Demonstrate &#8220;Zero Labor  Home&#8221; at CES 2026" title="LG Electronics Presents LG ClOiD Home Robot To Demonstrate &#8220;Zero Labor  Home&#8221; at CES 2026" srcset="https://substackcdn.com/image/fetch/$s_!MX3s!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg 424w, https://substackcdn.com/image/fetch/$s_!MX3s!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg 848w, https://substackcdn.com/image/fetch/$s_!MX3s!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!MX3s!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2d370b19-9439-4550-b0ff-657d7cca1733_1440x839.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><a href="https://www.youtube.com/watch?v=GvLbSQ0Qelo">www.youtube.com &#8250; watch</a></p><p>I did notice limits: manipulation relied on known object locations and recovery was manual. Yet as a first step toward a &#8220;Zero Labor Home,&#8221; it felt grounded rather than theatrical.</p><h3>Hyundai Atlas: Legs as Enterprise Software</h3><p>I entered the Hyundai/Boston Dynamics area skeptical. Wheels seemed more rational than legs.</p><p>Atlas complicated that view. The demo focused on <strong>workflows</strong>:</p><ul><li><p>balancing asymmetric loads</p></li><li><p>stepping over uneven floors</p></li><li><p>two-handed tool alignment</p></li></ul><p>Watching it move, I realized the obvious: <strong>factories and ships are built for human proportions.</strong> Rebuilding all of that for wheeled robots might cost more than teaching robots to use legs.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;616d973a-bb68-4fba-80a2-1ef695fd6b4f&quot;,&quot;duration&quot;:null}"></div><p></p><p>Hyundai spoke about Georgia plant pilots and RaaS contracts - language closer to industrial automation than spectacle. Atlas began to feel less like a stunt and more like an enterprise roadmap.</p><h3>A Korea Pattern I Didn&#8217;t Expect</h3><p>Seeing LG and Hyundai together changed my perspective:</p><ul><li><p>LG &#8594; control the <strong>domestic environment</strong></p></li><li><p>Hyundai &#8594; control the <strong>industrial environment</strong></p></li><li><p>startups &#8594; supply specialized brains and components</p></li></ul><p>Instead of chasing one heroic humanoid, Korea seemed to be assembling an <strong>ecosystem play</strong> across appliances, cars, batteries, and telecom.</p><div><hr></div><h2>&#127472;&#127479; Korean Startups - The Conversations I Remember</h2><h3>WIRobotics: From Wearables to Humanoids</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xAzp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xAzp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg 424w, https://substackcdn.com/image/fetch/$s_!xAzp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg 848w, https://substackcdn.com/image/fetch/$s_!xAzp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!xAzp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xAzp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg" width="2820" height="3279" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/df4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:3279,&quot;width&quot;:2820,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1508904,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/184888842?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F513a44f4-a33d-48f8-912f-a4cc4ed71667.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xAzp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg 424w, https://substackcdn.com/image/fetch/$s_!xAzp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg 848w, https://substackcdn.com/image/fetch/$s_!xAzp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!xAzp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf4df033-83e7-44db-8bbc-cf03311dc12f_2820x3279.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I knew WIRobotics from their <strong>WIM S</strong> walking assist and appreciated how they started with real mobility needs. At CES they introduced <strong>ALLEX</strong>, a humanoid concept. The professor and CEO Yongjae Kim at the booth explained how their <strong>hand design</strong> emphasized compliance and human-like manipulation, and mentioned collaboration with <strong>Realworld Labs</strong> in Korea.</p><p>The speed from assistive devices to humanoids surprised me. It felt like a natural evolution: help humans walk &#8594; learn about bodies &#8594; build bodies.</p><h3>Tommoro Robotics: Brains Before Muscles</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SnUR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SnUR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg 424w, https://substackcdn.com/image/fetch/$s_!SnUR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg 848w, https://substackcdn.com/image/fetch/$s_!SnUR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!SnUR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SnUR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg" width="1064" height="1611" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1611,&quot;width&quot;:1064,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:301185,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/184888842?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SnUR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg 424w, https://substackcdn.com/image/fetch/$s_!SnUR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg 848w, https://substackcdn.com/image/fetch/$s_!SnUR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!SnUR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8ac8c89-7698-4ae4-9e8a-4d89555b05b1_1064x1611.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>I had known <strong>Tommoro Robotics</strong> mainly from their work on the <strong>Robot Foundation Model (RFM)</strong> and a few logistics pilots in Korea. Seeing them inside the <strong>HUMANOID M.AX Alliance</strong> booth at CES reframed them from an interesting research-driven startup to a company chasing very concrete problems.</p><p>The demo was refreshingly practical: the humanoid <strong>RB-Y1</strong> picked a specific item from mixed objects and transferred it onto a rail system after a feeder robot delivered a box. No acrobatics - just a task that looked like something a warehouse would actually pay for. The team shared that similar workflows are already being tested with a major Korean logistics company and a cosmetics manufacturer.</p><p>What I liked was the philosophy: <strong>&#8220;One Brain, a Thousand Bodies.&#8221;</strong> Tommoro isn&#8217;t trying to sell a humanoid as a marvel, but automation powered by a model that can be reused across different forms. The environment was still structured, but the direction - software-first, task-first - felt realistic.</p><div><hr></div><h2>&#127464;&#127475; China - Forget human-wave tactics; this is robot-wave tactics</h2><p>I believe the Chinese robotics presence deserves its own piece on pricing and iteration speed, so I&#8217;ll cover that separately soon!</p><div><hr></div><h2>What CES Adjusted in My Head</h2><h3>1) Form Factor Humility</h3><p>Wheels often beat legs. Economics beat ego. The most convincing systems fit the task rather than imitated the human shape.</p><h3>2) Robots Are Platforms, Not Products</h3><p>A robot is closer to cloud software than to a toaster: data pipelines, updates, ops teams, service contracts. Hardware is only the beginning.</p><h3>3) Korea&#8217;s Real Opportunity</h3><p>Korea designs the environments robots will enter - apartments, appliances, cars, factories. That ecosystem may matter more than a single breakthrough model.</p><div><hr></div><h2>On the Flight Back</h2><p>One demo showed a robot folding a towel. It was slow and dropped the fabric once. Nothing cinematic.</p><p>Yet on the flight home I kept thinking about it. AI had stepped out from behind glass and tried to help, imperfectly but seriously.</p><p>As a Korean student balancing Silicon Valley lectures and Seoul news feeds, I felt less like a spectator and more like someone watching the first day of a new profession - one that might hire both machines and people like me.</p><p>CES 2026 wasn&#8217;t about robots becoming human.<br>It was about AI finally becoming useful.</p>]]></content:encoded></item><item><title><![CDATA[Strategy Decoded: #3 Google’s Second Act on AI]]></title><description><![CDATA[How Full-Stack AI Is Forcing OpenAI into Code Red]]></description><link>https://clairechoi616.substack.com/p/strategy-decoded-3-googles-second</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/strategy-decoded-3-googles-second</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Mon, 15 Dec 2025 06:02:08 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!zZIe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The past few weeks have been chaos in my personal life (first finals at Stanford!) but also in the AI world - and it have felt strangely familiar.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HlBU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5143a303-a5ec-42f5-a265-cda765025b34_900x413.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HlBU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5143a303-a5ec-42f5-a265-cda765025b34_900x413.png 424w, https://substackcdn.com/image/fetch/$s_!HlBU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5143a303-a5ec-42f5-a265-cda765025b34_900x413.png 848w, https://substackcdn.com/image/fetch/$s_!HlBU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5143a303-a5ec-42f5-a265-cda765025b34_900x413.png 1272w, https://substackcdn.com/image/fetch/$s_!HlBU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5143a303-a5ec-42f5-a265-cda765025b34_900x413.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HlBU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5143a303-a5ec-42f5-a265-cda765025b34_900x413.png" width="900" height="413" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5143a303-a5ec-42f5-a265-cda765025b34_900x413.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:413,&quot;width&quot;:900,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!HlBU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5143a303-a5ec-42f5-a265-cda765025b34_900x413.png 424w, https://substackcdn.com/image/fetch/$s_!HlBU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5143a303-a5ec-42f5-a265-cda765025b34_900x413.png 848w, https://substackcdn.com/image/fetch/$s_!HlBU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5143a303-a5ec-42f5-a265-cda765025b34_900x413.png 1272w, https://substackcdn.com/image/fetch/$s_!HlBU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5143a303-a5ec-42f5-a265-cda765025b34_900x413.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Google released <strong>Gemini 3</strong>. Benchmark leaderboards reshuffled. Developers quietly changed their defaults. Enterprise conversations stalled mid-decision. And then - almost on cue - reports surfaced that <strong>OpenAI had declared an internal &#8220;Code Red.&#8221;</strong> <em>(update: and released GPT 5.2 few days ago!)</em> </p><p>If that phrase rings a bell, it should. In early 2023, <em>Google</em> was the one sounding Code Red after ChatGPT blindsided the company and triggered what was widely seen as an existential crisis.</p><p>Two years later, the alert has changed hands.</p><p>At first glance, this looks like another episode in the AI model arms race: better reasoning, better coding, better benchmarks. But that framing misses what&#8217;s actually happening. What we&#8217;re witnessing isn&#8217;t a model war&#8212;it&#8217;s a <strong>systems war</strong>. And Google&#8217;s resurgence explains why OpenAI is suddenly under pressure.</p><p>To understand OpenAI&#8217;s Code Red, you have to rewind&#8212;not just to Gemini 3, but to Google&#8217;s lowest point.</p><div><hr></div><h2>2023: when Google lost the narrative</h2><p>I remember the <strong>Bard launch</strong> vividly.</p><p>At the time, I was working on a McKinsey project that touched imaging sensors and edge AI. ChatGPT had just exploded, and when Google announced Bard, I was genuinely excited. <em>Finally</em>, Google - the company that invented the transformer - was stepping back into the spotlight.</p><p>I started testing Bard almost immediately, feeding it questions about CMOS sensor architectures and ISP pipelines. Within minutes, the excitement faded.</p><p>At one point, Bard confidently explained that a smartphone image sensor &#8220;temporarily stores photons in on-chip DRAM before uploading them to the cloud for neural enhancement.&#8221; Another answer suggested that rolling-shutter distortion occurred because &#8220;pixel rows fail to synchronize on a distributed ledger.&#8221;</p><p>I laughed. Then I stopped testing.</p><p>This wasn&#8217;t just hallucination - it was <em>un-Google-like</em>. For a company that trained generations of ML researchers, Bard felt rushed, fragile, and oddly shallow. The now-infamous demo error that wiped roughly 7% off Google&#8217;s market cap only reinforced what many of us felt: <strong>Google had the research, but not the product system.</strong></p><p>And that distinction mattered.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3mLp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3mLp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png 424w, https://substackcdn.com/image/fetch/$s_!3mLp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png 848w, https://substackcdn.com/image/fetch/$s_!3mLp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png 1272w, https://substackcdn.com/image/fetch/$s_!3mLp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3mLp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png" width="1456" height="1134" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1134,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:225619,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/181549963?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3mLp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png 424w, https://substackcdn.com/image/fetch/$s_!3mLp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png 848w, https://substackcdn.com/image/fetch/$s_!3mLp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png 1272w, https://substackcdn.com/image/fetch/$s_!3mLp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6053b1b-1805-4e68-95c0-7e44c1ef53ea_1690x1316.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Google&#8217;s stock price plunged after Bard made an error in a demo in 2023.</figcaption></figure></div><div><hr></div><h2>Google didn&#8217;t lack tech - it lacked activated tech</h2><p>It&#8217;s tempting to say Google &#8220;had the tech but not the product muscle.&#8221; That&#8217;s directionally right, but incomplete.</p><p>Google didn&#8217;t lack intelligence. It lacked <strong>activated, unified, product-grade intelligence</strong>.</p><p>Before ChatGPT, Google had:</p><ul><li><p>World-class model architectures</p></li><li><p>Massive training infrastructure</p></li><li><p>Deep multimodal data</p></li><li><p>Multiple internal LLMs competing for ownership</p></li></ul><p>What it did <em>not</em> have was:</p><ul><li><p>A single, externally accountable AI product</p></li><li><p>A production-ready inference + safety + UX stack</p></li><li><p>Organizational permission to ship something imperfect</p></li><li><p>Clear default surfaces for distribution</p></li></ul><p>OpenAI didn&#8217;t out-invent Google. It <strong>out-assembled</strong> it.</p><p>ChatGPT wasn&#8217;t smarter than Google&#8217;s internal models in 2022 - but it existed, shipped, and learned from users. Bard, by contrast, was an integration Google wasn&#8217;t yet ready to own end-to-end.</p><p>That&#8217;s why Bard felt brittle. It wasn&#8217;t a failure of research. It was a failure of system readiness.</p><div><hr></div><h2>The reset: five structural bets</h2><p>Google&#8217;s recovery wasn&#8217;t a single clever fix. It was a coordinated reset across the company.</p><p>(1) First, <strong>Google Brain and DeepMind were merged</strong>. For nearly a decade, two elite AI orgs operated in parallel - brilliant, but duplicative and competitive. The merger wasn&#8217;t about org charts; it was about collapsing internal friction and forcing focus.</p><p>(2) Second, <strong>12,000 layoffs</strong>. Historically taboo at Google, the cuts created space to reallocate talent aggressively toward AI. Crisis gave permission to move.</p><p>(3) Third, <strong>founder gravity returned</strong>. Sergey Brin re-engaged deeply in Gemini&#8217;s development. In large tech companies, founders don&#8217;t just advise - they compress indecision. Their presence changes risk tolerance.</p><p>(4) Fourth, <strong>Gemini became the single AI brand</strong>. One name for the model, the consumer assistant, and the enterprise offering. This reduced cognitive load and concentrated demand.</p><p>(5) Finally - and most importantly - Google doubled down on <strong>full-stack AI</strong>:<br>models, custom silicon (TPUs), data centers, cloud, consumer products, and distribution via Android, Chrome, Search, YouTube, and hardware partnerships.</p><p>Gemini wasn&#8217;t built to win a leaderboard. It was built to power an ecosystem.</p><div><hr></div><h2>Why Gemini 3 changed the conversation</h2><p>When <strong>Gemini 3</strong> landed, the reaction was different.</p><p>On LMArena, it debuted at the top, surpassing OpenAI&#8217;s latest models in reasoning and coding. But more telling was the qualitative feedback. Developers described Gemini as &#8220;thinking before answering.&#8221; Slower, yes - but more reliable on multi-step tasks.</p><p>This wasn&#8217;t accidental. Gemini 3 leaned into <strong>explicit internal planning modes</strong>, trading latency for correctness. For enterprise use cases - coding, math, research - that tradeoff matters.</p><p>Two structural advantages amplified this leap.</p><p>First, <strong>multimodal data</strong>. Google owns YouTube. No competitor comes close. Gemini&#8217;s strength in image and video understanding isn&#8217;t magic - it&#8217;s structural advantage.</p><p>Second, <strong>silicon-model co-design</strong>. Gemini 3 was trained entirely on <strong>TPUs</strong>, not NVIDIA GPUs. This allowed Google to optimize architectures around its own hardware constraints, improving performance per watt and inference economics.</p><p>That combination - better reasoning, multimodality, and cheaper scaling - shifted perception fast.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zZIe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zZIe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!zZIe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!zZIe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!zZIe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zZIe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png" width="1456" height="813" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:813,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:6526037,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/181549963?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zZIe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png 424w, https://substackcdn.com/image/fetch/$s_!zZIe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png 848w, https://substackcdn.com/image/fetch/$s_!zZIe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!zZIe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6c879bfe-12fe-4b09-8901-734577032cff_2752x1536.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">NotebookLM created an inforgraphic for me!</figcaption></figure></div><div><hr></div><h2>When product wins, everything unlocks</h2><p>It&#8217;s easy to over-index on org changes. But none of them would have mattered if <strong>Gemini hadn&#8217;t crossed a real quality threshold</strong>.</p><p><strong>The flywheel only restarted when people wanted to use Gemini.</strong></p><p>Once that happened, everything unlocked.</p><p>TPUs - long dismissed as internal science projects - suddenly looked credible to external customers. Companies like Meta and Anthropic began seriously evaluating TPU-based training and inference. Google Cloud&#8217;s AI story tightened: Gemini wasn&#8217;t just <em>on</em> GCP; it was <em>of</em> GCP.</p><p>Consumer products followed. Gemini&#8217;s quality made it safe to embed deeply into Search, Workspace, and Android. That mattered enormously. Distribution only works when the product doesn&#8217;t embarrass you.</p><p>This is the subtle but critical point: <strong>Google&#8217;s distribution advantage only becomes an advantage after product quality clears a bar</strong>. Until then, scale just amplifies failure - as Bard painfully demonstrated.</p><p>Once Gemini crossed that bar, the logic flipped. Distribution accelerated learning. Learning improved the model. Improved models justified more TPU investment. TPU investment lowered marginal costs. Lower costs made bundling feasible. Bundling drove consumer usage. Consumer usage justified hyperscale capex.</p><p>Product quality wasn&#8217;t just one variable. It was the <strong>unlock condition</strong>.</p><div><hr></div><h2>From the GPU era to the ASIC era</h2><p>Underneath all of this is also a deeper structural shift: we&#8217;re moving from a <strong>GPU-dominated AI era to an ASIC-dominated one</strong>.</p><p>NVIDIA GPUs were the perfect accelerator for AI&#8217;s explosive phase - flexible, powerful, and available. OpenAI&#8217;s rise is inseparable from that ecosystem.</p><p>But GPUs are general-purpose. As workloads stabilize and inference volume explodes, general-purpose becomes inefficient.</p><p>Google understood this early. TPUs were never about beating NVIDIA on peak performance. They were about <strong>control</strong>: predictable supply, tighter software integration, and better performance per watt at hyperscale.</p><p>Gemini 3 training entirely on TPUs is not a footnote - it&#8217;s a signal. Google believes its custom silicon is now good enough to anchor frontier models, not just internal workloads.</p><p>If that belief holds, the implications are profound.</p><p>Cost curves bend. Dependence on external suppliers weakens. Competitive advantage shifts back toward companies that own the full stack.</p><p>This mirrors earlier transitions: AWS&#8217;s Graviton didn&#8217;t kill Intel overnight, but it permanently changed the server CPU market. Hyperscalers don&#8217;t need to replace GPUs everywhere - just <em>enough</em>.</p><p>For AI, custom silicon is how intelligence becomes <strong>cheap, reliable, and ubiquitous</strong>.</p><div><hr></div><h2>Why OpenAI is in Code Red</h2><p>Seen through this lens, OpenAI&#8217;s Code Red isn&#8217;t about ego or benchmarks. It&#8217;s about math.</p><p>AI is commoditizing faster than expected. Open-source pressure from China is real. Performance gaps are narrowing.</p><p>At the same time, <strong>costs are exploding</strong>. Even optimistic scenarios suggest OpenAI needs hundreds of billions in additional capital to sustain growth. Planned data-center investments alone stretch toward the trillion-dollar range.</p><p>Worse, OpenAI lacks <strong>consumer amortization</strong>.</p><p>ChatGPT is an app users choose. <strong>Gemini is increasingly an assistant users inherit</strong> - preloaded on Android, embedded in Search and Workspace, and soon hardware like smart glasses <s>(I&#8217;m so excited about these glasses and will write a separate article on it!!!!)</s></p><p>This is why Anthropic looks calm. Anthropic chose a B2B-first path with tighter scope and lower free-user load. OpenAI chose scale - and now has to feed it.</p><p>Code Red isn&#8217;t about Gemini being &#8220;smarter.&#8221;<br>It&#8217;s about Google activating a system OpenAI can&#8217;t replicate quickly.</p><div><hr></div><h2>From model wars to systems wars</h2><p>The real competition now isn&#8217;t GPT vs Gemini vs Claude.</p><p>It&#8217;s <strong>full-stack vs partial-stack AI</strong>.</p><p>Google controls training hardware, inference economics, consumer distribution, enterprise cloud, and personal context. OpenAI controls a beloved interface and world-class research velocity - but not the full loop.</p><p>Once AI becomes infrastructure-heavy, systems win over features. We&#8217;ve seen this before in compute, mobile, and cloud.</p><p>AI is entering that phase now.</p><div><hr></div><h2>Closing thought</h2><p>Two years ago, Google looked slow and complacent. Today, it looks methodical - and dangerous.</p><p>OpenAI&#8217;s Code Red is understandable. But it&#8217;s also revealing. It signals that the game has shifted from sprinting ahead on models to <strong>building systems that compound</strong>.</p><p>Google didn&#8217;t win by inventing intelligence first.<br>It&#8217;s winning by turning intelligence into infrastructure.</p><p>And that&#8217;s the real lesson of this moment.</p>]]></content:encoded></item><item><title><![CDATA[Deep-Tech Decoded: #8: From brain to business: robots as a Physical API]]></title><description><![CDATA[As robot brains mature, the real shift isn&#8217;t one cool humanoid demo, but treating physical work itself as something you can call, compose, and pay for like an API.]]></description><link>https://clairechoi616.substack.com/p/deep-tech-decoded-8-from-brain-to</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/deep-tech-decoded-8-from-brain-to</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Fri, 21 Nov 2025 19:30:42 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!A5To!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>From robot training to robot products</h2><p>In the last piece, we stayed inside the robot&#8217;s head. We looked at how teleoperation, massive simulation, and Vision&#8211;Language&#8211;Action (VLA) models are slowly giving robots something they never had: an <em>internet of actions</em> to learn from, instead of the web-scale internet of text that powered large language models. That was the &#8220;how robots actually learn&#8221; story - the data flywheel and the emerging robot brain.</p><p>This article picks up right where that one left off. Once you <em>do</em> have that brain and that training pipeline, a new question appears:</p><blockquote><p>How should the world <strong>use</strong> it?</p></blockquote><p>For decades, programming a robot has felt less like asking for help and more like negotiating with a very stubborn machine.</p><p>To get a simple task done - say, clearing a table - you don&#8217;t just say &#8220;clear the table.&#8221; You specify which arm to use, how fast to move each joint, where to grip each object, how much force is safe before something slips or breaks, how to avoid collisions with chairs and people, and what to do when the world doesn&#8217;t look exactly like the CAD model. Every variation in layout, lighting, or furniture creates a new pile of edge cases.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YWU9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YWU9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!YWU9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!YWU9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!YWU9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YWU9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2571180,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/178937257?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YWU9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!YWU9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!YWU9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!YWU9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F456f9f1f-6b68-4a76-9e6f-15b13ae21e4e_1536x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>This is why most robots have lived in tightly controlled bubbles: factory lines with fixed fixtures, fenced-off workcells, carefully scripted motions. When everything is bolted down and nothing changes, you can afford to handcraft the &#8220;how.&#8221; As soon as you step into an open-ended caf&#233;, warehouse, or hospital, that approach collapses under its own complexity.</p><p>Software went through a similar phase. Early on, apps had to know far too much about the systems around them: different operating systems, bespoke protocols, one-off integrations. The web didn&#8217;t take off because we made every interaction <em>more detailed</em>; it took off because we hid detail behind <strong>clean interfaces</strong>. You stopped worrying about exactly how someone else&#8217;s server worked and started trusting that a simple call would get you what you asked for.</p><p>The same kind of abstraction is starting to appear in robotics.</p><p>In #7, we stayed with the training stack - teleop, sim, VLA, data flywheel. In #8, we zoom out one layer and treat that stack as a <strong>service</strong>. Instead of thinking about robots one by one, we think about <strong>capabilities</strong> - &#8220;clear this aisle,&#8221; &#8220;scan these shelves,&#8221; &#8220;reset this rack&#8221; - that can be invoked without micromanaging every joint.</p><p>That&#8217;s the idea behind a <strong>Physical API</strong>.</p><div><hr></div><h2>1. What we mean by a Physical API</h2><p>In software, an API hides complexity. You express <em>what</em> you want done; the service decides <em>how</em> to do it.</p><p>A <strong>Physical API</strong> is the analogous idea for physical work:</p><ul><li><p>You express a goal in a structured way:<br>&#8220;clear this table,&#8221; &#8220;restock this aisle from this inventory,&#8221; &#8220;assemble this rack to this spec.&#8221;</p></li><li><p>You attach constraints and preferences:<br>timing (&#8220;finish before opening&#8221;), safety rules (&#8220;never cross this line&#8221;), priority (&#8220;this row first&#8221;), quality thresholds (&#8220;no visible crumbs / defect rate &lt; X%&#8221;).</p></li><li><p>The system decides which robots to use, which skills to invoke, and which trajectories to execute. It also monitors the task, surfaces metrics (&#8220;success rate,&#8221; &#8220;average cycle time&#8221;), and recovers from small errors when it can.</p></li></ul><p>You&#8217;re no longer thinking, &#8220;How do I program this specific arm?&#8221; You&#8217;re thinking, &#8220;How do I <strong>call</strong> the &#8216;clear table&#8217; capability in this space?&#8221;</p><p>Under the hood, the Physical API layer has to:</p><ul><li><p>translate goals into <strong>skills</strong> (pre-trained behaviors) that exist in the robot brain,</p></li><li><p>ground those skills in a concrete environment using maps and perception,</p></li><li><p>schedule and route work across multiple robots,</p></li><li><p>and keep operators in the loop when something ambiguous happens.</p></li></ul><p>VLA models and world models provide much of the intelligence; the robots are the actuators behind that interface. The operator interacts with the <em>API surface</em>, not the joint-space details.</p><div><hr></div><h2>2. Smarter AI &#8594; simpler, cheaper hardware</h2><p>Historically, industrial robots had to be extremely precise and predictable. The &#8220;intelligence&#8221; lived in:</p><ul><li><p>humans who designed fixtures and jigs,</p></li><li><p>engineers who hand-tuned trajectories and safety envelopes,</p></li><li><p>and offline planning tools.</p></li></ul><p>The robot itself was a very accurate player piano.</p><p>As perception and control models improve, you can tolerate <strong>less perfect, more affordable hardware</strong>:</p><ul><li><p>Slight mechanical slack can be compensated by feedback control.</p></li><li><p>Variations in part placement can be handled by vision rather than hard stops.</p></li><li><p>Small misalignments can be corrected on the fly instead of causing a fault.</p></li></ul><p>We&#8217;ve already seen a broad cost movement:</p><ul><li><p>from six-figure research platforms,</p></li><li><p>to five-figure industrial arms in many factories,</p></li><li><p>to simpler systems and components in the mid- to low-five figures (with some elements dipping into the high four figures), depending on spec and volume.</p></li></ul><p>Exact numbers vary by vendor and configuration, but the direction is clear: if most of your &#8220;intelligence&#8221; lives in shared models and software, the <strong>marginal cost</strong> of adding another robot endpoint to your Physical API can keep dropping over time.</p><p>That doesn&#8217;t mean hardware becomes trivial. Reliability, safety, maintainability, and supply chain still matter. But it means you can start to think about robots less as bespoke capex projects and more as <strong>programmable terminals</strong> that sit behind the same logical interface.</p><div><hr></div><h2>3. Forward looking into the &#8220;Skill Economy&#8221;</h2><p>Once you have:</p><ul><li><p>a reasonably standardized brain (a VLA-style model trained on your &#8220;internet of actions&#8221;),</p></li><li><p>a growing base of shared experience across tasks and environments,</p></li><li><p>and fleets of compatible robot bodies,</p></li></ul><p>it&#8217;s natural to imagine a <strong>skill economy</strong> forming.</p><p>A &#8220;skill&#8221; here is more than a macro. It could encapsulate:</p><ul><li><p>how to make a specific drink across a family of caf&#233;s (different machines, same end result),</p></li><li><p>how to execute an inspection routine in different data centers (different rack layouts, same checklist),</p></li><li><p>how to prep a hospital room according to protocol (different room shapes, same infection-control requirements),</p></li><li><p>or how to assemble a particular product in a light manufacturing cell.</p></li></ul><p>Each skill embeds:</p><ul><li><p><strong>tacit expert judgment</strong>: what &#8220;clean enough,&#8221; &#8220;tight enough,&#8221; or &#8220;properly aligned&#8221; actually looks like in that domain,</p></li><li><p><strong>safety checks and fallbacks</strong>: when to stop and ask a human, when to log an anomaly, when a hazard threshold is crossed,</p></li><li><p><strong>language hooks</strong>: natural-language controls like &#8220;do it more gently,&#8221; &#8220;skip dusting the top shelves at night,&#8221; &#8220;prioritize this side first.&#8221;</p></li></ul><p>In practice, skills might be:</p><ul><li><p>authored initially via teleop + sim + offline tuning,</p></li><li><p>packaged as reusable modules inside the robot software stack,</p></li><li><p>parameterized for different customers or facilities (&#8220;this SKU list,&#8221; &#8220;these aisle lengths&#8221;).</p></li></ul><p>We&#8217;re not at a mature skill marketplace yet. But if the ecosystem keeps maturing, you can imagine:</p><ul><li><p>manufacturers shipping skills tuned to their own equipment,</p></li><li><p>service companies encoding their SOPs as skills that work across many sites,</p></li><li><p>third-party developers specializing in niche skills (e.g., &#8220;museum artifact handling&#8221;),</p></li><li><p>and operators selecting and configuring skills the way they adopt SaaS integrations today.</p></li></ul><p>At that point, a Physical API starts to feel like a <strong>real platform</strong>, not just a metaphor.</p><div><hr></div><h2>4. From buying robots to buying &#8220;tasks as a service&#8221; (my imagination)</h2><p>Today, bringing robots into an operation often looks like:</p><ul><li><p>buying or leasing expensive hardware,</p></li><li><p>hiring or contracting a robotics team,</p></li><li><p>doing months of integration and safety validation,</p></li><li><p>then living with whatever capabilities you managed to hard-wire in.</p></li></ul><p>It feels a lot like buying and running your own servers in the pre-cloud era.</p><p><strong>A mature Physical API</strong> opens the door to different business models. This part is speculative, but directionally plausible:</p><ul><li><p>Instead of buying robots outright, customers <strong>subscribe</strong> to a set of physical capabilities:<br>&#8220;nightly aisle scans,&#8221; &#8220;per-shift pallet moves,&#8221; &#8220;rooms cleaned per day.&#8221;</p></li><li><p>They pay based on outcomes or usage - the work done - not just the list price of hardware.</p></li><li><p>Providers handle hardware selection, maintenance, and skill updates behind the scenes, much like cloud providers upgrade their infrastructure.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!A5To!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!A5To!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!A5To!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!A5To!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!A5To!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!A5To!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2377376,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/178923367?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!A5To!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!A5To!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!A5To!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!A5To!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F620862f0-4079-46d2-b115-359f1db7b902_1536x1024.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Under the hood, there are still technicians, spare parts, replacement mops, and blocked drains. But from the customer&#8217;s perspective, they&#8217;re buying <strong>completed tasks</strong>, not machines.</p><p>Early hints of this model already exist in some cleaning, delivery, and warehouse-automation contracts: customers are sold uptime and throughput, not just units of hardware. The Physical API concept pushes that logic further, into a world where a single abstraction layer coordinates many types of robots and skills across many sites.</p><div><hr></div><h2>5. CosmicBrain AI: one example of where this is going</h2><p><a href="http://cosmicbrainai.com">CosmicBrain AI</a> is one of several startups that are explicitly trying to build towards this Physical API world.</p><p>Publicly, they describe themselves as a <strong>&#8220;cursor for robots&#8221;</strong>: an all-in-one, no-code/low-code environment for creating, testing, and deploying robot skills.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EqSk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EqSk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png 424w, https://substackcdn.com/image/fetch/$s_!EqSk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png 848w, https://substackcdn.com/image/fetch/$s_!EqSk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png 1272w, https://substackcdn.com/image/fetch/$s_!EqSk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EqSk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png" width="1456" height="722" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:722,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:297844,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/178937257?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EqSk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png 424w, https://substackcdn.com/image/fetch/$s_!EqSk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png 848w, https://substackcdn.com/image/fetch/$s_!EqSk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png 1272w, https://substackcdn.com/image/fetch/$s_!EqSk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4be64d3-a75d-40d5-b9da-cbb723b9f6c8_2536x1258.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Cosmicbrain AI</figcaption></figure></div><p>A few things stand out from their materials and interviews:</p><ul><li><p><strong>Skill-creation IDE.</strong> They offer a visual programming environment and drag-and-drop interface so non-expert users can define robot behaviors and task sequences without deep robotics coding. Think of it as a design studio for skills rather than for single robots.</p></li><li><p><strong>Simulation-first workflow.</strong> Their platform integrates simulation tools so that skills can be tested and validated before touching real hardware, leveraging physics engines and world models to shrink the gap between sim and real.</p></li><li><p><strong>Skills as APIs, not data sales.</strong> In one IBM interview, the founder sketches the ambition as an &#8220;app store for robots,&#8221; but with a twist: instead of selling data, they aim to sell <strong>skill sets exposed as APIs</strong>, so that customers keep control of their own data while tapping into shared capabilities.</p></li><li><p><strong>Shared foundation, reused intelligence.</strong> In public posts they emphasize that robots shouldn&#8217;t have to relearn everything from scratch for each deployment; a shared foundation and reusable skills should carry over across customers and fleets.</p></li></ul><p>CosmicBrain is still early, and the market is young. But they&#8217;re a useful concrete example: not just building another robot, but trying to build <strong>the tools and marketplace layer</strong> that makes a Physical API usable - and commercial.</p><p>They won&#8217;t be the only ones. If the Physical API thesis is right, we should expect:</p><ul><li><p>IDEs and toolchains for skill authoring,</p></li><li><p>marketplaces for distributing and monetizing those skills,</p></li><li><p>and orchestration platforms that let enterprises plug multiple hardware vendors into a single abstraction layer.</p></li></ul><p>CosmicBrain is one of the first teams whose branding and product explicitly point in that direction.</p><div><hr></div><h2>6. Macro: robots and AI&#8217;s capex wall</h2><p>Zooming out, this has implications beyond any one startup.</p><p>Most forward-looking AI roadmaps over the next decade involve <strong>massive capital expenditure</strong>:</p><ul><li><p>more and larger data centers,</p></li><li><p>chip fabs and advanced packaging facilities,</p></li><li><p>power generation and transmission,</p></li><li><p>solar and wind build-out,</p></li><li><p>logistics networks to keep all of this supplied.</p></li></ul><p>Even if you can finance all that infrastructure, you still hit a constraint: <strong>human labor</strong> to build, deploy, inspect, and maintain it.</p><p>If robots with useful autonomy reach reliable operation in key niches - warehouses, plants, solar fields, data centers - within the next decade, they could become a <strong>significant enabler</strong> for scaling AI and clean-energy infrastructure. Not the only lever, but a meaningful one.</p><p>In that world:</p><ul><li><p>The <strong>internet of actions</strong> is the data and training fabric that makes robots competent.</p></li><li><p>The <strong>Physical API</strong> is how we expose that competence to the rest of the economy.</p></li><li><p>Skill platforms and marketplaces are how we distribute it.</p></li></ul><p>The abstraction layer we&#8217;ve been talking about stops being a thought experiment and becomes the way you <strong>program the physical side of the AI boom.</strong></p><div><hr></div><h2>7. Conclusion: from learning to leverage</h2><p>If you read #7 and #8 together, there&#8217;s a simple story hiding under the jargon.</p><p>Article #7 was about <strong>learning</strong>:<br>how robots get their experience when there is no web to scrape; how teleop, simulation, and VLA models slowly add up to an internet of actions; how a data flywheel can, if we&#8217;re lucky and careful, push robots toward more robust autonomy.</p><p>Article #8 is about <strong>leverage</strong>:<br>once you have that brain and that experience, how do you turn it into something the world can actually <em>use</em>? How do you hide complexity behind a clean Physical API, package expertise into skills, and shift from buying metal to buying completed work?</p><p>The physical Turing test - &#8220;clean the house and make dinner, and I can&#8217;t tell who did it&#8221; - is still aspirational. The &#8220;single-digit years&#8221; timeline for broad autonomy is very much a bet, not a guarantee. But the trajectory is clearer than it was a few years ago:</p><ul><li><p>Data, simulation, and VLAs are giving robots a learning substrate that looks more and more like an internet of actions.</p></li><li><p>Platforms like CosmicBrain AI hint at how that substrate might be turned into programmable skills and services.</p></li><li><p>And the Physical API framing gives PMs, founders, and investors a way to reason about robots not just as machines, but as <strong>infrastructure</strong> - a new layer in the tech stack where value, and power, will concentrate.</p></li></ul><p>For builders and investors, the most interesting question is no longer simply:</p><blockquote><p>&#8220;Will we have autonomous robots?&#8221;</p></blockquote><p>A sharper version is:</p><blockquote><p>&#8220;As physical work becomes increasingly programmable, <strong>who will own the data, the skills, and the distribution - and what will you build on top of that?</strong>&#8221;</p></blockquote><p>That is the shared through-line of #7 and #8 - and it&#8217;s the real strategic stake in the emerging internet of actions and the Physical APIs that will sit on top of it.</p>]]></content:encoded></item><item><title><![CDATA[Deep-Tech Decoded: #7: The Internet of Actions: How Robots Actually Learn]]></title><description><![CDATA[Robots never got a web to learn from - so we&#8217;re hacking together an internet of actions with teleop, simulation, and VLA brains]]></description><link>https://clairechoi616.substack.com/p/deep-tech-decoded-7-the-internet</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/deep-tech-decoded-7-the-internet</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Sat, 15 Nov 2025 01:01:27 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!YwOq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>TL;DR </h2><ul><li><p>Large language models grew up with a <strong>web-scale internet of text and images</strong>. Robots don&#8217;t have an equivalent &#8220;internet of actions&#8221; for physical behavior. Existing robot datasets are tiny and narrow in comparison.</p></li><li><p>For many labs pushing <strong>general-purpose robots</strong>, a big chunk of real-world experience still comes from <strong>teleoperation</strong>. It&#8217;s powerful but slow, expensive, and fundamentally capped by human time.</p></li><li><p><strong>Simulation</strong> is becoming the &#8220;clean energy&#8221; for robot learning: thousands of parallel physics worlds plus generative tools that spin up endless virtual kitchens, warehouses, and offices.</p></li><li><p>If teleop, simulation, and VLA models <strong>do</strong> click into a self-reinforcing <strong>data flywheel</strong>, robots can climb a ladder from scripted demos &#8594; narrow autonomy &#8594; something closer to a &#8220;physical Turing test.&#8221;</p></li></ul><div><hr></div><h2>LLMs had the internet. Robots don&#8217;t.</h2><p>Large language models had an unfair advantage: they were born into a world where billions of people had already poured text, images, code, and conversations onto the web for decades.</p><p>Turning that mess into training data was non-trivial, but the raw material was there. The internet acted as a fossil record of human cognition.</p><p>Robots don&#8217;t have the same luck.</p><p>There&#8217;s no public, web-scale dataset of &#8220;how to unload a delivery truck,&#8221; &#8220;how to reset a breaker in a data center,&#8221; or &#8220;how to clean up after a hackathon and cook a candlelit dinner.&#8221; We have videos, but not the thing a robot actually needs: continuous joint angles, forces, contact events, and fine-grained corrections.</p><p>That&#8217;s why you hear people in embodied AI talk in terms of a <strong>&#8220;physical Turing test&#8221;</strong>:</p><blockquote><p>Walk into a shared house on Sunday night after a chaotic weekend. Someone has cleaned the place and made dinner. You shouldn&#8217;t be able to tell if it was a human or a robot.</p><p><em>&#8212; Jim Fan, Director &amp; Distinguished Research Scientist at NVIDIA</em></p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YwOq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YwOq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!YwOq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!YwOq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!YwOq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YwOq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2278378,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/178923367?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YwOq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!YwOq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!YwOq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!YwOq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe92bf137-dd2b-4f8a-9db6-a87716b5f2b5_1536x1024.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We are nowhere near that benchmark today. Current humanoids still fumble cereal and cables. But the interesting story is not about one viral demo; it&#8217;s about how the field is slowly constructing an <strong>&#8220;internet of actions&#8221;</strong> and how, if that succeeds, it naturally leads toward something like a <strong>Physical API</strong> for the real world.</p><div><hr></div><h2>Where robots are actually stuck: data, not (just) hardware</h2><p>In top-tier labs and companies working on general-purpose robots, you rarely see a shortage of hardware:</p><ul><li><p>robot arms, mobile bases, prototype humanoids;</p></li><li><p>multiple cameras and depth sensors;</p></li><li><p>racks of GPUs in the back.</p></li></ul><p>One major bottleneck is <strong>experience</strong> - especially diverse, high-quality experience.</p><h3>1. Teleoperation as &#8220;human fuel&#8221;</h3><p>For many cutting-edge embodied AI projects, a large share of ground-truth experience still comes from <strong>teleoperation</strong>:</p><ul><li><p>A human wears a VR headset or uses special controllers.</p></li><li><p>They see from the robot&#8217;s point of view.</p></li><li><p>They manually drive the robot&#8217;s joints through tasks: take bread out of a toaster, pick up a cup, open a cabinet, pour honey without spilling.</p></li></ul><p>Every one of these sessions produces a rich trajectory: joint angles, velocities, contacts, successes, failures. That&#8217;s precious supervision.</p><p>But teleop has hard ceilings:</p><ul><li><p>It&#8217;s tiring and attention-heavy.</p></li><li><p>It doesn&#8217;t scale linearly with money the way scraping websites does.</p></li><li><p>In theory, you could teleop one robot 24 hours a day; in practice, you quickly run into human limits.</p></li></ul><p>And every new environment - a different kitchen, warehouse, or hospital - often needs <strong>fresh data</strong>, because the small details (layout, lighting, fixtures) change everything.</p><p>Relative to the web for LLMs, robot learning is still running on very expensive calories.</p><h3>2. Why &#8220;more demos&#8221; isn&#8217;t enough</h3><p>You could imagine just doing <strong>more</strong> teleop, but that doesn&#8217;t fully solve the problem.</p><p>A genuinely useful robot doesn&#8217;t just need:</p><ul><li><p>the &#8220;average&#8221; demo under ideal conditions.</p></li></ul><p>It also needs:</p><ul><li><p>robustness to odd cases (slippery plates, torn packaging, weird clutter),</p></li><li><p>the ability to work in slightly different layouts,</p></li><li><p>graceful recovery from small mistakes,</p></li><li><p>and efficiency and safety across all of that.</p></li></ul><p>That&#8217;s not just <em>more</em> of the same data; it&#8217;s <strong>different kinds</strong> of data:</p><ul><li><p>more diversity in environments,</p></li><li><p>more strange edge cases,</p></li><li><p>and more repeated practice under systematic variation.</p></li></ul><p>So if there&#8217;s no internet of actions to scrape, and human teleop doesn&#8217;t scale without limit, where does the rest of the experience come from?</p><div><hr></div><h2>Simulation as clean energy for robot experience</h2><p>One major answer is <strong>simulation.</strong> </p><p>Early robot simulators looked like basic game engines with physics turned on. Modern ones look more like experience factories: thousands of parallel environments on GPUs, each with randomized mass, friction, lighting, and clutter, all feeding data into policies that never get tired.</p><h3>The simulation ladder: Sim 1.0 &#8594; 1.5 &#8594; 2.0</h3><p>You can think of the field as climbing a simple ladder along one axis: <strong>speed vs. diversity</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!STrS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!STrS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png 424w, https://substackcdn.com/image/fetch/$s_!STrS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png 848w, https://substackcdn.com/image/fetch/$s_!STrS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png 1272w, https://substackcdn.com/image/fetch/$s_!STrS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!STrS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png" width="1456" height="693" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/06835885-544b-4403-9742-a8442455724f_2234x1064.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:693,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:576115,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/178923367?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!STrS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png 424w, https://substackcdn.com/image/fetch/$s_!STrS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png 848w, https://substackcdn.com/image/fetch/$s_!STrS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png 1272w, https://substackcdn.com/image/fetch/$s_!STrS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06835885-544b-4403-9742-a8442455724f_2234x1064.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Jim Fan, Director &amp; Distinguished Research Scientist at NVIDIA</figcaption></figure></div><ul><li><p><strong>Sim 1.0 &#8211; Digital Twin:</strong> very fast, high-fidelity physics copies of a specific robot and workspace, but environments are mostly hand-built and limited in variety.</p></li><li><p><strong>Sim 1.5 &#8211; Digital Cousin:</strong> still physics-based, but scenes and motions are generated and randomized by models, so you trade some speed for much richer diversity.</p></li><li><p><strong>Sim 2.0 &#8211; Digital Nomad:</strong> robots train inside neural &#8220;world models&#8221; or video models that can imagine countless futures, slower today but with virtually unlimited diversity as compute and data scale.</p></li></ul><p>Most industrial and academic systems today live somewhere between <strong>1.0 and 1.5</strong>. The interesting question for humanoids and Physical AI is how quickly we can move more of the learning signal into <strong>2.0</strong>.</p><blockquote><h3>From fossil fuel &#9981;&#65039; to clean energy &#127807;</h3><p>A helpful analogy:</p><ul><li><p><strong>Teleoperation</strong> is fossil fuel: potent but human-limited. You can get high-quality data, but only as fast as humans can drive robots.</p></li><li><p><strong>Simulation</strong> is more like clean base-load power: once you&#8217;ve built the plant (the physics engine + world model), you can run it continuously.</p></li></ul><p>With Sim 1.0&#8211;1.5, you can already run <strong>years of practice in hours of GPU time</strong> and expose robots to failures you&#8217;d never tolerate in a live facility. With Sim 2.0, the &#8220;plant&#8221; becomes a <strong>neural world</strong> that can keep inventing new, plausible situations.</p><p>The long-term aim is to flip the ratio:</p><ul><li><p>Use simulation &#8211; especially Sim 2.0&#8211;style neural worlds &#8211; for the <strong>bulk of pre-training and diversity</strong>.</p></li><li><p>Use real-world data and teleop to <strong>anchor, correct, and fine-tune</strong>.</p></li></ul><p>That&#8217;s how the field is trying to manufacture an &#8220;internet of actions&#8221; where none existed: a shared, ever-growing reservoir of experience that future robots can draw from before they ever touch the real world.</p></blockquote><h3>1. Sim 1.0&#8211;1.5: fast but capped</h3><p>In <strong>Sim 1.0</strong>, the pattern is:</p><ul><li><p>Build a <strong>digital twin</strong> of your robot and workspace.</p></li><li><p>Use <strong>domain randomization</strong> &#8211; deliberately randomize gravity, friction, object weights, textures, camera noise.</p></li><li><p>Train policies on huge amounts of synthetic experience.</p></li></ul><p>This has already given us:</p><ul><li><p>Quadrupeds that learn to walk, jump, and recover from pushes in sim, then transfer to real hardware.</p></li><li><p>Manipulators that learn to spin objects, re-grasp tools, or open doors after seeing many stochastic variants.</p></li></ul><p><strong>Sim 1.5</strong> pushes diversity further by letting generative tools help build the world:</p><ul><li><p>Language describes tasks and layouts: &#8220;a cluttered sink with dishes, a drying rack, a sponge, and a bottle of soap.&#8221;</p></li><li><p>3D generators populate scenes with varied objects and materials.</p></li><li><p>Image models synthesize realistic textures and lighting.</p></li></ul><p>The robot model and core physics stay grounded; almost everything else is procedurally created. Instead of hand-authoring a few canonical scenes, you can train across thousands of plausible kitchens, offices, and warehouses that nobody manually modeled.</p><p>But even with 1.5, you are still fundamentally <strong>authoring environments</strong> and <strong>pushing data into the simulator</strong>. Diversity expands, but not infinitely.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;30e02aff-d84f-4168-89aa-25e6287cbacb&quot;,&quot;duration&quot;:null}"></div><p><em>Source: NVIDIA</em></p><h3>2. Sim 2.0: robots living inside neural worlds</h3><p><strong>Sim 2.0</strong> flips the perspective: instead of manually building more and more complex 3D worlds, you train a <strong>neural world model</strong> that <em>is</em> the world.</p><p>In practice, Sim 2.0 looks like:</p><ul><li><p>A large video or 3D generative model that has digested millions of real-world clips: kitchens, offices, warehouses, streets.</p></li><li><p>The model can <strong>roll out many possible futures</strong> from a given state: the same object picked up with a different grip, in slightly different lighting, with a different obstacle in the way.</p></li><li><p>A robot policy interacts inside this &#8220;dreamspace,&#8221; trying actions, receiving feedback, and learning how the world tends to respond.</p></li></ul><p>Instead of saying &#8220;here is a perfectly modeled warehouse,&#8221; we say &#8220;here is a world model that knows 10,000 ways warehouses can vary&#8221; &#8211; and we let the robot <strong>be a digital nomad</strong>, wandering through that distribution.</p><p>For humanoids and Physical AI companies, this matters because:</p><ul><li><p><strong>Diversity beats hand-crafting.</strong> Edge cases become the norm: spilled coffee, odd furniture, awkward human poses, half-open doors. These are exactly the situations where brittle policies fail.</p></li><li><p><strong>Long-horizon skills become learnable.</strong> World models can simulate not just single grasps but entire routines &#8211; clean a table, fetch an item, reset a room &#8211; and provide gradients over the whole sequence.</p></li><li><p><strong>Scaling becomes a compute problem, not a labeling problem.</strong> Once the world model is trained, generating more experience is &#8220;just&#8221; more compute, not more teleoperation hours or manual scene design.</p></li></ul><p>This is the jump from &#8220;we simulate a specific factory bay very well&#8221; to &#8220;we simulate enough of the world&#8217;s distribution that a robot can be dropped into a new building and still cope.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Vu51!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Vu51!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png 424w, https://substackcdn.com/image/fetch/$s_!Vu51!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png 848w, https://substackcdn.com/image/fetch/$s_!Vu51!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png 1272w, https://substackcdn.com/image/fetch/$s_!Vu51!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Vu51!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png" width="1456" height="828" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:828,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:575399,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/178923367?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Vu51!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png 424w, https://substackcdn.com/image/fetch/$s_!Vu51!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png 848w, https://substackcdn.com/image/fetch/$s_!Vu51!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png 1272w, https://substackcdn.com/image/fetch/$s_!Vu51!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddb11b16-519e-42e3-bf86-34684740a359_2272x1292.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>3. Why this is a big deal for PMs and VCs?</h3><p>If you&#8217;re evaluating humanoid or Physical AI startups, Sim 2.0 is not just a technical curiosity; it shapes <strong>who can actually compound</strong>:</p><ul><li><p><strong>Compute curve:</strong> Who has a credible path to scaling world-model training and simulation rollouts as GPUs get cheaper or more specialized? Are they treating simulation like a first-class compute workload, or an afterthought?</p></li><li><p><strong>Data flywheel:</strong> Who has access to rich, real-world video and robot logs to train world models &#8211; and a plan to keep that data flowing (deployments, teleop, partnerships)?</p></li><li><p><strong>Standards and tooling:</strong> Are they building on an ecosystem that can become a <strong>de facto standard</strong> (APIs, scene formats, evaluation benchmarks), or are they locked into a bespoke stack that won&#8217;t attract partners?</p></li><li><p><strong>Platform posture:</strong> Are they positioning their simulator / world model as an <strong>internal advantage only</strong>, or as a platform others can train on (e.g., developers, integrators, OEMs)? The latter tends to generate defensibility and network effects.</p></li></ul><p>In other words: Sim 2.0 is where &#8220;simulation as a feature&#8221; turns into <strong>simulation as infrastructure</strong> &#8211; the equivalent of a cloud for robot experience.</p><div><hr></div><h2>Vision&#8211;Language&#8211;Action models: the new robot brain</h2><p>Data alone doesn&#8217;t make a robot useful. You also need a brain that can:</p><ul><li><p>understand scenes,</p></li><li><p>interpret instructions,</p></li><li><p>and produce actions over time.</p></li></ul><p>A leading candidate for that role is the <strong>Vision&#8211;Language&#8211;Action (VLA)</strong> model.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!q53A!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!q53A!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png 424w, https://substackcdn.com/image/fetch/$s_!q53A!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png 848w, https://substackcdn.com/image/fetch/$s_!q53A!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png 1272w, https://substackcdn.com/image/fetch/$s_!q53A!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!q53A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png" width="1456" height="802" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:802,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Understanding pi0 by Physical Intelligence: A Vision-Language-Action Flow Model for General Robot Control&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Understanding pi0 by Physical Intelligence: A Vision-Language-Action Flow Model for General Robot Control" title="Understanding pi0 by Physical Intelligence: A Vision-Language-Action Flow Model for General Robot Control" srcset="https://substackcdn.com/image/fetch/$s_!q53A!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png 424w, https://substackcdn.com/image/fetch/$s_!q53A!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png 848w, https://substackcdn.com/image/fetch/$s_!q53A!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png 1272w, https://substackcdn.com/image/fetch/$s_!q53A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F877f993c-77ef-4713-9747-227036c0fafd_2194x1208.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Physical Intelligence</figcaption></figure></div><h3>1. What a VLA model actually is</h3><p>In simple terms, a VLA model ingests:</p><ul><li><p>what the robot <strong>sees</strong> (pixels or other sensor inputs),</p></li><li><p>what the human <strong>wants</strong> (a text instruction, spoken command, or higher-level goal),</p></li><li><p>sometimes a history of states/actions,</p></li></ul><p>and outputs:</p><ul><li><p>low-level motor commands, or</p></li><li><p>higher-level action primitives that other controllers can execute.</p></li></ul><p>It&#8217;s a generalist controller: see + read &#8594; decide &#8594; move.</p><h3>2. Planner + controller: splitting the work</h3><p>A pattern you see in several modern systems:</p><ul><li><p>A <strong>vision&#8211;language backbone</strong> (often adapted from a general-purpose model) handles perception and reasoning. It parses the scene, interprets the goal, and plans in terms of sub-tasks: &#8220;pick up the sponge, then the plate, then rinse.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ulMj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ulMj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png 424w, https://substackcdn.com/image/fetch/$s_!ulMj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png 848w, https://substackcdn.com/image/fetch/$s_!ulMj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png 1272w, https://substackcdn.com/image/fetch/$s_!ulMj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ulMj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png" width="1456" height="421" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:421,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ulMj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png 424w, https://substackcdn.com/image/fetch/$s_!ulMj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png 848w, https://substackcdn.com/image/fetch/$s_!ulMj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png 1272w, https://substackcdn.com/image/fetch/$s_!ulMj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d66ad4-e603-4abd-b14e-f91d9e9752ec_2000x578.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Physical Intelligence</figcaption></figure></div></li><li><p>A <strong>high-frequency action expert</strong> - typically a smaller network trained heavily in sim and real data - handles the millisecond-scale control needed to execute those sub-tasks.</p></li></ul><p>Architecturally, many of these systems resemble a kind of <strong>mixture-of-experts</strong>: different components specialize in planning, grasping, locomotion, etc., while sharing a common representation of the world.</p><h3>3. Language as supervision, not just interface</h3><p>As these models get better, language starts to play two roles:</p><ul><li><p><strong>Interface:</strong> humans can say &#8220;wipe down this counter&#8221; instead of specifying trajectories.</p></li><li><p><strong>Supervision:</strong> humans can <em>talk while they demonstrate</em>, and those utterances become labels.</p></li></ul><p>Imagine teleoperating a robot and narrating:</p><ul><li><p>&#8220;Now grab the handle.&#8221;</p></li><li><p>&#8220;The mug is slipping; re-grasp from the side.&#8221;</p></li><li><p>&#8220;Push more slowly until you feel the click.&#8221;</p></li></ul><p>Those phrases can be aligned with the motion data. Over time, the system can internalize:</p><ul><li><p>what &#8220;re-grasp&#8221; means in different contexts,</p></li><li><p>how &#8220;gently&#8221; changes contact forces,</p></li><li><p>and when to apply those corrections on its own.</p></li></ul><p>This is still an active research frontier, but the direction is clear: humans become both <strong>puppet masters</strong> and <strong>teachers</strong> of the internet of actions.</p><div><hr></div><h2>From teleop to data flywheel to physical Turing test</h2><p>Putting these pieces together, you get a loop:</p><ol><li><p><strong>Teleoperation and scripted policies</strong> provide the seed dataset and safe starting behaviors.</p></li><li><p><strong>Simulation</strong> amplifies that seed into millions of varied experiences, including edge cases.</p></li><li><p><strong>VLA models</strong> distill all that into reusable, language-guided skills.</p></li><li><p><strong>On-the-job learning and human feedback</strong> refine those skills further in real deployments.</p></li></ol><p>That loop is what we&#8217;re calling the <strong>data flywheel</strong>.</p><p>Every new deployment - whether in a warehouse, a backroom, or a lab - doesn&#8217;t just use existing skills. It also:</p><ul><li><p>exposes the system to new layouts and tools,</p></li><li><p>surfaces novel failure modes,</p></li><li><p>and produces more data that can be folded back into training.</p></li></ul><p>In that picture, autonomy doesn&#8217;t flip from &#8220;off&#8221; to &#8220;on&#8221; overnight. It expands gradually:</p><ul><li><p><strong>Phase 1:</strong> robots reliably execute a few tight loops (fold boxes, shuttle items, run a simple inspection).</p></li><li><p><strong>Phase 2:</strong> they handle small variations and recover from minor errors.</p></li><li><p><strong>Phase 3:</strong> they string together multi-step routines (&#8220;close down this station,&#8221; &#8220;reset this rack,&#8221; &#8220;prep this room&#8221;).</p></li><li><p><strong>Phase 4 (farther out):</strong> they approach something like the physical Turing test at least in constrained domains.</p></li></ul><p>Some prominent researchers have publicly speculated that, <em>if</em> current trends in data, sim, and models continue, we could see robots performing genuinely useful, multi-step loops in everyday environments within <strong>single-digit years</strong>. Others are more cautious and emphasize remaining gaps in safety, reliability, and sim-to-real transfer.</p><p>Either way, if the flywheel starts spinning, robots begin to look less like fragile prototypes and more like dependable infrastructure for specific slices of the physical world.</p><p>That&#8217;s where the <strong>Physical API</strong> framing starts to make sense.</p><div><hr></div><h2>Conclusion</h2><p>The surface story in robotics is still dominated by hardware - new humanoids, new demos, new grippers. But underneath, the real shift is epistemic: we are, piece by piece, building something like an <em>internet of actions</em> from teleoperation logs, massive simulation, and on-the-job learning, then distilling it into Vision&#8211;Language&#8211;Action brains that can understand both pixels and instructions. </p><p>Whether the &#8220;physical Turing test&#8221; arrives in five years or fifteen, this is the path it will walk along. If you care about where autonomy is really coming from, it&#8217;s less about any single robot video and more about this growing, shared pool of experience that future robots will all be swimming in.</p><h2></h2>]]></content:encoded></item><item><title><![CDATA[Strategy Decoded: #2 Adobe MAX 2025: From “AI feature” to “creative OS”]]></title><description><![CDATA[Choice + Trust + Agents: how Adobe&#8217;s &#8220;creator OS&#8221; takes shape]]></description><link>https://clairechoi616.substack.com/p/strategy-decoded-2-adobe-max-2025</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/strategy-decoded-2-adobe-max-2025</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Sat, 08 Nov 2025 19:30:35 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!h0OZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I love switching hats - from PM, engineer, to designer - but designing lights up a different part of my brain. A recent push with the Stanford Robotics Club to codify our branding (ahead of next year&#8217;s big funding &#129401;) put me back in InDesign, Illustrator, and Photoshop - AND hands-on with Adobe&#8217;s new AI features. </p><p>It sparked a bigger question: where is Adobe taking the creative stack next? After MAX 2025, the answer looks less like &#8220;more features&#8221; and more like a creative OS.</p><div><hr></div><p><strong>How Adobe is turning model choice, trust, and agentic UX into a moat</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!h0OZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!h0OZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg 424w, https://substackcdn.com/image/fetch/$s_!h0OZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg 848w, https://substackcdn.com/image/fetch/$s_!h0OZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!h0OZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!h0OZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg" width="1431" height="804" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:804,&quot;width&quot;:1431,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A person standing in front of a large screen\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:&quot;A person standing in front of a large screen\n\nAI-generated content may be incorrect.&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A person standing in front of a large screen

AI-generated content may be incorrect." title="A person standing in front of a large screen

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!h0OZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg 424w, https://substackcdn.com/image/fetch/$s_!h0OZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg 848w, https://substackcdn.com/image/fetch/$s_!h0OZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!h0OZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe40fd9f-4c0a-4ecf-834f-1c4efec2b418_1431x804.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Adobe&#8217;s opening at MAX 2025 was less &#8220;a new toy&#8221; and more &#8220;a new operating model.&#8221; The company is steadily reframing Firefly + Creative Cloud as a <strong>single workspace</strong> where you pick the best AI model for each task, keep provenance intact, and move from intent to finished asset without hopping tools. Practically, that means Firefly now hosts <strong>Adobe&#8217;s own models together with partner models</strong> - Google (Gemini, Imagen, Veo), OpenAI (GPT-Image), Runway, Luma, Topaz, ElevenLabs, and more - &#8220;<strong>in one place, at one price</strong>.&#8221; For creators and teams, this isn&#8217;t a slogan; it&#8217;s a different unit of work: specify outcome &#8594; route to the right model &#8594; keep control and credits &#8594; ship.</p><div><hr></div><h2>What actually changed</h2><p>Three shifts stood out.</p><p><strong>1) Model routing becomes productized.</strong><br>Inside Firefly and core apps like Photoshop/Express, you can now <strong>choose among multiple frontier and specialty models</strong> at execution time. Adobe isn&#8217;t forcing a single stack; it&#8217;s <strong>brokering the best model per subtask</strong> (e.g., different models for Generative Fill vs. upscaling), and it&#8217;s already wiring partner options like <strong>Topaz Bloom/Gigapixel</strong> into Photoshop&#8217;s pipelines for Generative Upscale. This is an aggregator posture with opinionated rails.</p><p><strong>2) Personalization moves to the model layer.</strong><br><strong>Firefly Custom Models</strong> (private beta) let you drag-and-drop your own reference images and train a <strong>private, on-brand style model</strong>. The promise is consistency at scale - campaign families, character systems, house looks - owned by the creator or brand and <strong>not accessible to others without authorization</strong>. That shifts lock-in from file formats to <strong>institutionalized style</strong>, which is much harder to rip-and-replace.</p><p><strong>3) Agentic UX is arriving in the tools you already use.</strong><br>Adobe previewed <strong>Project Moonlight</strong>, a conversational, context-aware &#8220;creative director&#8221; that pulls from Creative Cloud libraries and even linked social accounts to propose ideas, compose first passes, and keep outputs on-brand. In parallel, <strong>AI Assistants</strong> land in <strong>Photoshop</strong> and <strong>Express</strong> to take on repetitive edits and surface step-by-step help, without removing tactile control. This isn&#8217;t a separate chatbot; it&#8217;s the UI layer inside the app.</p><div class="pullquote"><p>To be honest, <strong>the AI Assistant is really appealing to someone like me, who often becomes a designer for the team, but was never trained professionally</strong> so don&#8217;t know all the shortcuts / cheat keys to design work. Like, &#8216;rename all my layers&#8217; may seem like a tiny job, but that would save a ton of time in my work with my current robotics club.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vbVV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vbVV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vbVV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vbVV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vbVV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vbVV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg" width="770" height="431" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:431,&quot;width&quot;:770,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Adobe unveils AI Assistant for Photoshop to automate creative workflows&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Adobe unveils AI Assistant for Photoshop to automate creative workflows" title="Adobe unveils AI Assistant for Photoshop to automate creative workflows" srcset="https://substackcdn.com/image/fetch/$s_!vbVV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vbVV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vbVV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vbVV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55cd33bb-8e93-46f5-ae3a-ab45620c02c9_770x431.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source; Adobe</figcaption></figure></div></div><h2>The feature rundown </h2><ul><li><p><strong>Firefly Image Model 5 (public beta):</strong> native <strong>4-megapixel</strong> generation, stronger photorealism (lighting, textures), and <strong>Prompt-to-Edit</strong> so you can <em>describe</em> edits in plain language; layered image editing is coming (in development). This narrows the &#8220;mock &#8594; comp &#8594; polish&#8221; loop. </p></li><li><p><strong>Audio joins the stack:</strong> <strong>Generate Soundtrack</strong> (timed to picture) and <strong>Generate Speech</strong> (multilingual voices via Firefly + <strong>ElevenLabs</strong>, with controllable emotion/emphasis). It&#8217;s a credible &#8220;no-copyright-drama&#8221; answer for commercial video.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5oyL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5oyL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5oyL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5oyL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5oyL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5oyL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg" width="790" height="450" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:450,&quot;width&quot;:790,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A screenshot of a music player\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A screenshot of a music player

AI-generated content may be incorrect." title="A screenshot of a music player

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!5oyL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5oyL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5oyL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5oyL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ced63c9-c610-4e82-a166-e7394b48d245_790x450.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p><strong>YouTube distribution built-in:</strong> a <strong>Create for YouTube Shorts</strong> space inside <strong>Premiere mobile</strong> - templates, effects, and <strong>one-tap publish</strong> to Shorts - puts Adobe tools at the moment of posting, not just production. Expect this to expand their creator funnel.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aPWq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aPWq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg 424w, https://substackcdn.com/image/fetch/$s_!aPWq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg 848w, https://substackcdn.com/image/fetch/$s_!aPWq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!aPWq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aPWq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg" width="828" height="466" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:466,&quot;width&quot;:828,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;A collage of a couple of images\n\nAI-generated content may be incorrect.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="A collage of a couple of images

AI-generated content may be incorrect." title="A collage of a couple of images

AI-generated content may be incorrect." srcset="https://substackcdn.com/image/fetch/$s_!aPWq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg 424w, https://substackcdn.com/image/fetch/$s_!aPWq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg 848w, https://substackcdn.com/image/fetch/$s_!aPWq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!aPWq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7bda75a1-316c-4e70-9317-f26a114ffa28_828x466.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p><strong>Partner models, not just partners:</strong> Beyond Google/OpenAI/Runway/Luma, Adobe is actively wiring <strong>Topaz</strong> into Photoshop flows (e.g., Generative Upscale) and <strong>ElevenLabs</strong> voices into Firefly. This is a concrete interoperability stance, not a press-release logo wall.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Xw-1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Xw-1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Xw-1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Xw-1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Xw-1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Xw-1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg" width="1200" height="724" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:724,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Adobe MAX 2025: Adobe Goes All In With AI Use Across Ecosystem | Geek  Culture&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Adobe MAX 2025: Adobe Goes All In With AI Use Across Ecosystem | Geek  Culture" title="Adobe MAX 2025: Adobe Goes All In With AI Use Across Ecosystem | Geek  Culture" srcset="https://substackcdn.com/image/fetch/$s_!Xw-1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Xw-1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Xw-1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Xw-1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F037bda3d-7468-4e22-ad1d-29f3def2e65c_1200x724.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Geek Culture</figcaption></figure></div></li><li><p><strong>Provenance as policy:</strong> <strong>Content Credentials</strong> (the CAI/C2PA &#8220;nutrition label&#8221;) remain central; with broader industry adoption, this is increasingly an <strong>enterprise procurement unlock</strong> where provenance is non-negotiable.</p></li></ul><div><hr></div><h2>Strategy: why this is durable</h2><p><strong>Choice with trust.</strong> Adobe&#8217;s bet is that creative work is heterogeneous: one &#8220;best&#8221; model rarely wins across ideation, edit, upscale, and voice. By <strong>routing intent to the right model</strong> inside a <strong>trusted environment</strong> (clear data contracts, provenance badges), Adobe converts fragmentation into a competitive edge. It also blunts single-model platform risk while capturing the UI/workflow layer where value accrues.</p><p><strong>Style as lock-in.</strong> When a brand&#8217;s look lives in a <strong>Custom Model</strong> that assistants can invoke across Photoshop/Express/Premiere, switching isn&#8217;t just moving files; it&#8217;s retraining institutional taste. That raises <strong>retention</strong> and <strong>seat expansion</strong> in a way point tools will struggle to match.</p><p><strong>Agentic UX &#8594; lower CAC, higher LTV.</strong> Assistants shorten time-to-first-value for newcomers and offload drudge work for pros. If Moonlight meaningfully stitches context across apps and channels, expect <strong>higher DAU and project throughput</strong> - a subscription flywheel, not a one-off feature bump.</p><p><strong>Distribution as a growth loop.</strong> The YouTube Shorts lane plants Adobe where creators actually publish. Templates + one-tap posting are subtle but powerful <strong>network effects</strong> - when the path to audience is paved, upstream creation tends to follow.</p><div><hr></div><h2>What to watch</h2><ul><li><p><strong>Assistant usage inside Photoshop/Express:</strong> multi-turn sessions per user/week, not just activation. </p></li><li><p><strong>Custom Model adoption:</strong> number of teams shipping on a house style model; assistant calls that reference those models. </p></li><li><p><strong>Partner-model mix:</strong> how often Photoshop/Firefly jobs route to <strong>non-Adobe</strong> models (e.g., Topaz, Google) vs. Firefly; tells you whether Adobe is truly neutral or nudging. </p></li><li><p><strong>Provenance penetration:</strong> % of exported assets with <strong>Content Credentials</strong> across agencies/brands - now feasible given CAI/C2PA ecosystem traction. </p></li></ul><div><hr></div><h2>Founder/PM takeaways</h2><ul><li><p>Build for <strong>intent &#8594; routing &#8594; control</strong>, not a monolithic model.</p></li><li><p>Treat <strong>style as a first-class artifact</strong> (versioned, sharable, testable).</p></li><li><p>Meet users at <strong>distribution</strong>; creation and posting shouldn&#8217;t be separate journeys.</p></li><li><p><strong>Instrument trust</strong> (provenance, permissions, training data posture) as product, not legal copy.</p></li></ul><div><hr></div><p><strong>Bottom line:</strong> Adobe isn&#8217;t winning because it has &#8220;more AI.&#8221; It&#8217;s winning because it&#8217;s turning <strong>choice, trust, and agentic workflows</strong> into a cohesive <strong>creative OS</strong>. If they keep the rails tight and the ecosystem truly open, that&#8217;s a hard position to disrupt. </p><p>Let me wrap up this article with Adobe&#8217;s official video debriefing what happened at AdobeMAX.</p><div id="youtube2-4haZJxpf9Bo" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;4haZJxpf9Bo&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/4haZJxpf9Bo?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div>]]></content:encoded></item><item><title><![CDATA[Event Decoded: #1 U.S. Taiwan High-Tech Forum: Physical AI LIVE 2025]]></title><description><![CDATA[The control plane for the real world: how assistants, robots, and ops are turning capability into kept promises]]></description><link>https://clairechoi616.substack.com/p/event-de-coded-1-us-taiwan-high-tech</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/event-de-coded-1-us-taiwan-high-tech</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Mon, 03 Nov 2025 15:31:33 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!D5Uh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>90-second recap</h2><ul><li><p>The internet era was built on <strong>ranking</strong> (search + recs). The next era is <strong>acting</strong>: assistants and robots that plan and do.</p></li><li><p>Assistants moved from seq-to-seq &#8594; chain-of-thought &#8594; post-training &#8594; <strong>agentic workflows</strong>, with <strong>personalization</strong> as the reward model.</p></li><li><p>Robots get &#8220;general&#8221; through a <strong>recipe</strong>: broad pre-train, <strong>curated</strong> post-train, protect language, and plan <strong>environment diversity</strong>.</p></li><li><p>Early wins in deployment look like <strong>skills portfolios</strong> plus <strong>human oversight</strong>, with reliability and intervention rates driving ROI.</p></li><li><p>My takeaway: <strong>Assistants plan; robots prove.</strong> Reliability, not model size, separates demos from products.</p></li></ul><div><hr></div><h2>Why this afternoon mattered</h2><p><em>(I&#8217;ve just been so excited to find this event existed!! Huge thanks to my Taiwanese classmates :))</em></p><p>The internet&#8217;s big money machines - search and recommender systems - sorted choices. Now we&#8217;re moving to systems that <strong>take actions</strong>: assistants that coordinate tools and robots that work in messy spaces. UTHF (U.S. Taiwan High-Tech Forum) and NATEA (North America Taiwanese Engineering Association) prepared this event to make this shift feel real.</p><div><hr></div><h2>Event primer</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!D5Uh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!D5Uh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg 424w, https://substackcdn.com/image/fetch/$s_!D5Uh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg 848w, https://substackcdn.com/image/fetch/$s_!D5Uh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!D5Uh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!D5Uh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg" width="1456" height="1001" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1001,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&#51060;&#48120;&#51648;&#50640; alt &#49549;&#49457;&#51060; &#50630;&#51020;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="&#51060;&#48120;&#51648;&#50640; alt &#49549;&#49457;&#51060; &#50630;&#51020;" title="&#51060;&#48120;&#51648;&#50640; alt &#49549;&#49457;&#51060; &#50630;&#51020;" srcset="https://substackcdn.com/image/fetch/$s_!D5Uh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg 424w, https://substackcdn.com/image/fetch/$s_!D5Uh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg 848w, https://substackcdn.com/image/fetch/$s_!D5Uh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!D5Uh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5eda786a-b70b-4567-9f6d-cd4063f1e317_1938x1332.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Where &amp; who:</strong> Computer History Museum, Mountain View. Keynotes by:</p><ul><li><p><strong>Dr. Ed H. Chi </strong>(VP of Research @ <span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Google DeepMind&quot;,&quot;id&quot;:216622977,&quot;type&quot;:&quot;user&quot;,&quot;url&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84767d5b-b6fb-4751-8c73-b2e1825ef8d9_144x144.png&quot;,&quot;uuid&quot;:&quot;8041fe46-b9c9-4fce-88f5-182bf9d92106&quot;}" data-component-name="MentionToDOM"></span> ) on <strong>&#8220;The Future of Personalized Universal Assistant&#8221;</strong></p></li><li><p><strong>Professor Chelsea Finn&#8217;s </strong>(Co-Founder @ <span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Physical Intelligence&quot;,&quot;id&quot;:3271138,&quot;type&quot;:&quot;pub&quot;,&quot;url&quot;:&quot;https://open.substack.com/pub/waltereerens&quot;,&quot;photo_url&quot;:null,&quot;uuid&quot;:&quot;22873509-cd30-4eb9-9111-a0549efd534f&quot;}" data-component-name="MentionToDOM"></span>, Assistant Professor @ Stanford university) talk on <strong>&#8220;Bringing AI to the Physical World&#8221;</strong></p></li><li><p><strong>Dr. Ashish Kapoo (Founder @</strong><span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;General Robots&quot;,&quot;id&quot;:1361620,&quot;type&quot;:&quot;pub&quot;,&quot;url&quot;:&quot;https://open.substack.com/pub/generalrobots&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f98ea4c9-b240-41a1-a7b7-89cd09813c02_512x512.png&quot;,&quot;uuid&quot;:&quot;29f4c06b-6d7f-43ea-bb9f-cf0d3e0a71fc&quot;}" data-component-name="MentionToDOM"></span> <strong>and Dr Chi Chiu (Founder @CosmicbrainAI) </strong>panel talk on <strong>&#8220;Scaling from Simulation to Reality&#8221;</strong>.</p></li></ul><p><strong>Why now:</strong> The theme was simple: turn breakthroughs in <strong>agentic AI</strong> and <strong>embodied models</strong> into systems that keep promises outside the lab.</p><div><hr></div><h2>TL;DR: What shifted in my head</h2><ol><li><p><strong>Ranking &#8594; Generation &#8594; Agency.</strong> Assistants aren&#8217;t just answering; they&#8217;re <strong>planning multi-step workflows</strong>. Personalization is the reward signal.</p></li><li><p><strong>&#8220;Generalist&#8221; is a recipe, not a parameter count.</strong> Pre-train wide, <strong>post-train right</strong>, protect language, and budget environment entropy like a test plan.</p></li><li><p><strong>Hierarchies help.</strong> A high-level planner translates open-ended prompts into simple steps for a low-level controller; synthetic relabeling scales supervision.</p></li><li><p><strong>Deployment &#8800; demo.</strong> Early rollouts will look like <strong>skills portfolios with human oversight</strong>, CI/CD-style ops, and strict filters on synthetic data.</p></li><li><p><strong>Reliability is the moat.</strong> Imitation often caps around ~80%; <strong>RL-on-top</strong> + memory is the real path to 95%+.</p></li></ol><div><hr></div><h2>&#8220;The Future of Personalized Universal Assistant&#8221;</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CpYy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CpYy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!CpYy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!CpYy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!CpYy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CpYy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg" width="1456" height="1092" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1092,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:341903,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/177783407?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CpYy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!CpYy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!CpYy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!CpYy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68cd0916-323a-4ced-ab50-f62e0297f98d_2048x1536.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Dr Chi took his time to guide us through the journey of how we&#8217;ve moved from <strong>sequence-to-sequence</strong> to <strong>chain-of-thought</strong> to <strong>post-training</strong>. That stack pushed us past ranking into <strong>token-by-token generation</strong> that can chain tools. He showed &#8220;Project Astra&#8221;, a prototype assistant that reads a manual, finds a YouTube tutorial, pulls info from your email, and <strong>calls a store</strong> - <strong>System-1 patterning</strong> plus <strong>System-2 planning</strong> in one loop. The ingredients he emphasized: <strong>multi-step reasoning</strong>, <strong>agent workflows</strong>, <strong>synthetic data</strong>, and <strong>personalization</strong>.</p><h3>Why it matters</h3><p><strong>Ranking taught us what &#8220;good&#8221; looks like;</strong> agency adds <strong>planning and action</strong>. The assistants that win will look like <strong>Planner (System-2)</strong> &#8596; <strong>Skills (System-1)</strong> with a <strong>user-grounded reward</strong>. Personalization isn&#8217;t garnish; it&#8217;s the <strong>objective</strong>. A metric I want teams to publish: <strong>goal completions per minute of user attention saved</strong>.</p><h3>Why this matters to Physical AI</h3><p>A <strong>personalized universal assistant</strong> is the <strong>control plane</strong> for the physical world. It keeps a live view of <em>your</em> context (vision, audio, text), remembers preferences and state, and <strong>hands structured commands</strong> to actuators - robots, appliances, services - through tool APIs. Personalization is the <strong>reward model</strong> that keeps actions aligned once real things move. In short: <strong>assistants plan; robots prove.</strong></p><h3>How my view evolved</h3><p>I think many people around me, including myself till a year ago, tend to treat personalization as nice-to-have UX. We should now see it as the <strong>optimization target</strong> and the clean bridge from software agents to reliable physical action.</p><div><hr></div><h2>&#8220;Bringing AI to the Physical World&#8221;</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AzTa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AzTa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg 424w, https://substackcdn.com/image/fetch/$s_!AzTa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg 848w, https://substackcdn.com/image/fetch/$s_!AzTa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!AzTa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AzTa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg" width="3213" height="2413" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2413,&quot;width&quot;:3213,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1183107,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/177783407?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d11744e-0c22-4e6d-a1c2-fc98520233df.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!AzTa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg 424w, https://substackcdn.com/image/fetch/$s_!AzTa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg 848w, https://substackcdn.com/image/fetch/$s_!AzTa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!AzTa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe8742d1-94c1-40f6-8a3b-1b9b6b05d4f6_3213x2413.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Professor Chelsea Finn opened by reframing &#8220;generalist robots&#8221; as a <strong>recipe</strong>, not a parameter race - then walked the room through that recipe using concrete cases from her company, <strong>Physical Intelligence</strong>, with clear visuals: a PaliGemma-based VLM driving a separate Action Expert, demos spanning single-arm, bimanual, and mobile manipulation, and long-horizon laundry tasks that transferred to <strong>unseen homes</strong>. </p><p>Her core insights were pragmatic: generalization is <strong>engineered</strong> (budget environment entropy like test coverage), real-world data is <strong>costly</strong>, so make each minute count (reuse legacy static datasets, synthesize supervision via <strong>LLM relabeling</strong>, add <strong>experience-retrieval memory</strong>, and use <strong>targeted RL</strong> when imitation plateaus around ~80%). The headline takeaway: the near-term &#8220;generalist&#8221; is multi-object and multi-layout <strong>inside bounded workflows</strong> - ship with a disciplined recipe now, then widen the envelope on a schedule.</p><h3>What Physical Intelligence has built</h3><ul><li><p><strong>Two-part model.</strong> A <strong>~3B</strong> vision-language backbone conditions a <strong>~300M</strong> <strong>Action Expert</strong> that emits <strong>~50-step</strong> control chunks (diffusion/flow-matching). Inputs: <strong>1&#8211;3 images + language</strong>; outputs: joint commands for <strong>single-arm, bimanual, and mobile</strong> platforms.</p></li><li><p><strong>Protect language.</strong> Naive end-to-end fine-tuning caused the policy to <strong>ignore instructions</strong>. Their fix: <strong>tokenized actions</strong> and a <strong>stop-gradient</strong> barrier from the control head, so the VLM keeps its instruction-following &#8220;brain.&#8221;</p></li><li><p><strong>Recipe &gt; soup.</strong> <strong>Broad pre-train</strong> (robot + web) + <strong>curated post-train</strong> on <strong>gold</strong> teleop demos beat &#8220;train on all data,&#8221; especially on long-horizon tasks.</p></li><li><p><strong>Generalization you can engineer.</strong> <strong>More and more diverse locations</strong> &#8594; better transfer to <strong>unseen homes</strong>. Prior <strong>static-arm</strong> data still helps <strong>mobile</strong> tasks.</p></li><li><p><strong>Open-ended prompting.</strong> A <strong>high-level planner</strong> turns &#8220;make a vegan sandwich (no pickles)&#8221; into a simple command sequence; <strong>LLM relabeling</strong> creates instruction&#8211;response pairs without expensive human-robot chats for every variant.</p></li></ul><h3>Where it still struggles</h3><ul><li><p><strong>Perception &amp; occlusion</strong> (esp. thin/transparent objects) &#8594; needs better views, sometimes different sensors.</p></li><li><p><strong>Deformable manipulation</strong> (laundry, bags) &#8594; benefits from higher-quality teleop and focused data.</p></li><li><p><strong>Long-horizon drift</strong> (losing the thread mid-task) &#8594; needs <strong>experience-retrieval memory</strong>.</p></li><li><p><strong>Premature &#8220;good-enough&#8221; stops</strong> &#8594; responds to <strong>RL fine-tuning</strong> on those failure clusters.</p></li></ul><h3>The cost reality, and how they reduce it</h3><p>Real-world data is <strong>expensive and slow</strong> to collect: getting into homes/worksites, supervising teleop, resets, and safety. Their workaround is better <strong>data economics</strong>:</p><ul><li><p><strong>Pre-train wide; post-train right</strong> on a small <strong>gold</strong> set.</p></li><li><p><strong>Protect language</strong> so demos remain reusable across tasks and robots.</p></li><li><p>Plan <strong>location entropy</strong> (fewer visits, more diversity).</p></li><li><p>Reuse <strong>legacy/static</strong> datasets to subsidize mobile skills.</p></li><li><p>Scale supervision with <strong>hierarchical prompting</strong> and <strong>LLM relabeling</strong>.</p></li><li><p>Use <strong>targeted RL</strong> when imitation plateaus.</p></li><li><p>Add <strong>experience-retrieval memory</strong> for multi-stage tasks like &#8220;put it back where it was.&#8221;</p></li></ul><h3>How my view evolved</h3><p>I came in thinking &#8220;more data = more general.&#8221; I left convinced the bottlenecks are <strong>instruction-set design</strong> and <strong>coverage planning</strong>. Bound both, ship earlier, then widen the envelope (fold &#8594; laundry; clear table &#8594; clean kitchen).</p><div><hr></div><h2>&#8220;Scaling from Simulation to Reality&#8221;</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TJZj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TJZj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic 424w, https://substackcdn.com/image/fetch/$s_!TJZj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic 848w, https://substackcdn.com/image/fetch/$s_!TJZj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic 1272w, https://substackcdn.com/image/fetch/$s_!TJZj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TJZj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9a253a75-178e-438d-b65d-9f08405f810c.heic&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1942321,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/heic&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/177783407?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!TJZj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic 424w, https://substackcdn.com/image/fetch/$s_!TJZj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic 848w, https://substackcdn.com/image/fetch/$s_!TJZj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic 1272w, https://substackcdn.com/image/fetch/$s_!TJZj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a253a75-178e-438d-b65d-9f08405f810c.heic 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>Both kept us honest. Today&#8217;s blocker isn&#8217;t clever models; it&#8217;s getting <strong>reliable skills</strong> into messy, revenue-bearing settings. Early scale will include <strong>human oversight</strong>, <strong>CI/CD with simulation</strong>, and <strong>rollback plans</strong>. Synthetic data helps but must be filtered. Expect <strong>fit-for-purpose rigs</strong> to land before household humanoids; ROI is driven by <strong>utilization</strong> and <strong>intervention rates</strong>, not parameter counts.</p><p><strong>How I judge vendors now:</strong> show <strong>interventions/hour</strong>, <strong>MTBF/MTTR</strong>, <strong>success vs. #locations</strong>, and <strong>episodes to add a new SKU</strong>. If those curves aren&#8217;t on the table, it&#8217;s still a demo.</p><div><hr></div><h2>Field notes (quick hits)</h2><ul><li><p><strong>Stop-gradient &#8220;wall.&#8221;</strong> Small architectural change, big effect: keeps instruction-following intact while control learns.</p></li><li><p><strong>&#8220;Rare but relevant&#8221; slices.</strong> Tiny amounts of mobile data still moved transfer - curate, don&#8217;t hoard.</p></li><li><p><strong>Coverage, not luck.</strong> Transfer improved steadily with <strong>location count</strong> - treat data like test coverage.</p></li><li><p><strong>Grounded open-ended prompting.</strong> LLM relabeling scales breadth; keep prompts tied to <strong>state changes</strong>, not just fluent text.</p></li></ul><div><hr></div><h2>Short playbook that I&#8217;ve scribbled on my way back</h2><p><strong>If you&#8217;re building:</strong><br>Freeze a language-faithful VLM, attach a separate <strong>Action Expert</strong>, and <strong>stop-grad</strong> the control head. Stand up a <strong>gold</strong> post-train set and report deltas vs. pre-train. Launch a <strong>location-coverage</strong> plan and chart transfer. Add <strong>experience-retrieval memory</strong> for multi-stage tasks. When imitation plateaus, use <strong>RL</strong> on the tightest failure clusters.</p><p><strong>If you&#8217;re buying or investing:</strong><br>Ask for <strong>pre+post vs. all-data ablations</strong>, instruction-following under shift, <strong>location-count curves</strong>, and <strong>live intervention stats</strong>. Review the ops plan: oversight ratio, CI/CD-in-sim, and rollback procedures.</p><div><hr></div><h2>My final synthesis - Assistants plan, robots prove</h2><p>Mr. Chi&#8217;s assistant stack, Professor Finn&#8217;s robot stack, and Dr. Kapoo and Mr. Chiu&#8217;s deployment forecast rhyme. Assistants <strong>plan across tools</strong> with personalized rewards; robots <strong>prove those plans</strong> in the world&#8217;s entropy. Both need <strong>memory</strong> and <strong>post-training</strong> grounded in real tasks. Moats look like <strong>personalization reward models</strong> (assistants) and <strong>coverage-curated data ops</strong> (robots).</p><div><hr></div><h2>What I&#8217;ll watch next </h2><ol><li><p><strong>Assistant &#8594; robot handshake</strong> becomes a shared convention (planner API + action schema). <em>Wrong if:</em> bespoke glue still dominates by late 2026..?</p></li><li><p><strong>Experience-retrieval memory</strong> becomes table-stakes for household-style tasks. <em>Wrong if:</em> we see &gt;95% success on &#8220;put-back-where-it-was&#8221; with no memory.</p></li><li><p><strong>RL-on-top</strong> is the path from ~80% &#8594; 95%+. <em>Wrong if:</em> production systems pass 95% with imitation only.</p></li></ol><div><hr></div><h2>Closing - From capability to commitments</h2><p>&#8220;Generalist&#8221; won&#8217;t be a single model. It&#8217;ll be a <strong>disciplined recipe</strong>: protect language, budget environment diversity, and treat reliability as a product requirement. That&#8217;s how prompts become policies, and how demos graduate into deployments that keep their promises.</p><p></p><p><em><strong>Thank you NATEA for bringing together amazing speakers! And I really appreciate all the speakers for sharing their Saturday afternoon to share your view from the frontier.</strong></em></p>]]></content:encoded></item><item><title><![CDATA[Deep-Tech Decoded: #6 XR devices - Smart Glasses]]></title><description><![CDATA[Can &#8220;assistant-first&#8221; glasses finally earn a place on your face - and your balance sheet?]]></description><link>https://clairechoi616.substack.com/p/deep-tech-decoded-6-xr-devices</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/deep-tech-decoded-6-xr-devices</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Thu, 30 Oct 2025 14:55:21 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!YMGv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>The Big Question</h2><p>Smart glasses have already ridden two hype waves: see-through AR in the 2010s and &#8220;camera-glasses with an assistant&#8221; now. Are we still chasing sci-fi - or is there a practical product wedge that crosses from novelty to daily habit for millions?</p><h2>TL;DR</h2><p>The consumer wedge isn&#8217;t full 3D; it&#8217;s <strong>eyes-up utility</strong>: translation, wayfinding, capture&#8594;memory, quick answers - in &lt;10 seconds, privately, without pulling a phone. MR/VR headsets will keep winning where <strong>immersion and 3D</strong> matter (training, design). What changed: <strong>miniaturized NPUs</strong>, better <strong>waveguides</strong>, and a fashion-first approach proved by <strong>Ray-Ban/Meta</strong>. The next 2-3 years are about brighter text outdoors, seamless <strong>on-device&#8596;phone&#8596;cloud</strong> routing, and B2B scale-ups that fund the ecosystem.</p><div><hr></div><h2>1) Decoding the device map </h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YMGv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YMGv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg 424w, https://substackcdn.com/image/fetch/$s_!YMGv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg 848w, https://substackcdn.com/image/fetch/$s_!YMGv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!YMGv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YMGv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg" width="870" height="370" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:370,&quot;width&quot;:870,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YMGv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg 424w, https://substackcdn.com/image/fetch/$s_!YMGv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg 848w, https://substackcdn.com/image/fetch/$s_!YMGv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!YMGv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dd75489-f1df-486d-9a4d-9ed31d7a89d1_870x370.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ul><li><p><strong>AR glasses (see-through, heads-up)</strong>: Look like glasses. Small projector + waveguide or &#8220;birdbath&#8221; combiner injects <strong>text/icons</strong> into your real view. Optimized for 2D utility (captions, arrows, menus). Socially acceptable when light and stylish.</p></li><li><p><strong>MR headsets (mixed reality)</strong>: Visor with <strong>micro-OLED/LCD</strong> displays and <strong>video pass-through</strong>. Anchors 3D objects with depth/occlusion. Magical for 3D tasks, but bigger/heavier/costlier.</p></li><li><p><strong>VR headsets</strong>: Fully occluded, synthetic worlds. Best for games, simulation, focused training.</p></li></ul><p><strong>&#8220;AI smart glasses&#8221; (today&#8217;s focus)</strong>: Lightweight AR-ish frames with <strong>camera(s), mics, tiny display, and on-device AI</strong>; escalate heavier work to phone/cloud. Not holograms - <strong>useful 2D overlays + context-aware assistant</strong>.</p><p><strong>Product truth:</strong> AR glasses excel at <strong>glanceable utility</strong> and all-day wear; MR wins at <strong>rich 3D</strong>. Mixing those jobs in one form factor usually yields heavy, hot, unloved hardware.</p><div><hr></div><h2>2) What actually changed (why &#8220;now&#8221; feels different)</h2><ul><li><p><strong>Miniaturization + on-device AI.</strong> Wearable-class SoCs/NPUs can now do wake-word, OCR, short translation, and simple vision <strong>locally</strong> (sub-watt), while routing complex prompts to phone/cloud. (Think Snapdragon AR-class splits across glasses + phone for ~2&#8211;3&#215; on-glass AI perf at lower power.)</p></li><li><p><strong>Optics getting practical.</strong> Newer <strong>waveguides</strong> improve in/out coupling and efficiency, pushing toward <strong>outdoor-readable text</strong> without nuking battery. &#8220;Birdbath&#8221; variants still win on brightness for some SKUs.</p></li><li><p><strong>Fashion meets function.</strong> Ray-Ban/Meta showed that <strong>industrial design</strong> is a feature. Partnerships with Warby Parker / Gentle Monster signal the category is done shipping &#8220;tech goggles.&#8221;</p></li><li><p><strong>Clearer use-case focus.</strong> Teams are doing <strong>less, better</strong>: translation, wayfinding, capture&#8594;memory, quick answers. (Not dragons in the living room.)</p></li></ul><blockquote><p>Human angle worth noting: Knowlab highlights accessibility wins (e.g., <strong>AI glasses paired with retinal implants</strong> restoring functional vision in studies). Beyond consumer convenience, there&#8217;s real social value - live captions for hearing-impaired, scene description for low vision, cognitive assist. </p></blockquote><div><hr></div><h2>3) Who&#8217;s shipping what (and what&#8217;s brewing)</h2><p>The Tech Giant Arms Race is ongoing, and every companies are investing into theri own smart glasses. Four years ago when I was doing my first project on XR devices, it seemed like companies have halted aggressive investment, waiting for consumer tech trendsetters (e.g. Apple) to release and make the market first. But companies never totally stopped - they were always brewing new patents and partnership underneath. It&#8217;s exciting to see those finally come to life.</p><ul><li><p><strong>Meta &#215; Ray-Ban</strong>: Category leader in &#8220;assistant camera-glasses.&#8221; Recent Display model adds a <strong>small screen</strong> plus <strong>neural wristband</strong> control. Trying to earn mainstream <strong>style</strong> + viral utility = volume.</p><div id="youtube2-yyzJQmveKZU" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;yyzJQmveKZU&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/yyzJQmveKZU?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div></li><li><p><strong>Apple</strong>: Multiple reports point to <strong>iPhone-companion glasses</strong> targeted for <strong>announcement ~2026, ship ~2027</strong>: all-day wear, <strong>low-power Apple-Watch-class silicon</strong>, assistant-first before heavy AR. Strategy rhymes with Vision Pro: seed devs early, ship when comfortable. There aren&#8217;t any &#8216;public &amp; official&#8217; videos yet, so brought one that reviews on most recent leaks.</p><div id="youtube2-lfc8z9GV1fY" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;lfc8z9GV1fY&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/lfc8z9GV1fY?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div></li><li><p><strong>Samsung / Google (Android XR)</strong>: <strong>Galaxy XR</strong> headset launched; <strong>AI glasses</strong> in partnership with <strong>Warby Parker</strong> (mass) and <strong>Gentle Monster</strong> (fashion), integrated with <strong>Gemini</strong>. Google gets a second at-bat, this time with style and an AI-first stack.</p><div id="youtube2-r0AyzvfTRLo" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;r0AyzvfTRLo&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/r0AyzvfTRLo?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div></li><li><p><strong>Amazon Echo Frames:</strong> not AR, but firmly in the smart-glasses lane &#8212; and as an international student I basically live on Amazon &#128514;. They&#8217;re <strong>audio-first</strong> (no display/camera): open-ear audio + Alexa for quick, hands-free tasks. Different wedge from Meta&#8217;s camera+display (<strong>ambient assistant, zero visuals</strong>) that still builds habit and ecosystem pull. But <strong>what&#8217;s brewing?</strong> Amazon is <strong>piloting AR glasses (Amelia glasses) for delivery drivers</strong> (HUD for navigation, scanning, proof-of-delivery) - enterprise-first, not broadly released yet.</p><div id="youtube2-PpjFpp4bfGE" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;PpjFpp4bfGE&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/PpjFpp4bfGE?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div></li><li><p><strong>China plays offense</strong>: <strong>Alibaba Quark AI Glasses</strong> and <strong>Xiaomi AI Glasses</strong> push aggressive price/feature mixes (real-time translation, payments, livestream), validating consumer demand at scale.</p></li><li><p><strong>Snap (2026)</strong>: Public commitment to consumer AR <strong>Specs</strong> after &gt;$3B invested across 11 years - betting on see-through lenses, creator-led use.</p><div id="youtube2-wQANHWANCps" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;wQANHWANCps&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/wQANHWANCps?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div></li><li><p><strong>MR headsets</strong>: Apple <strong>Vision Pro</strong>, HoloLens 2/Magic Leap 2, Meta Quest family - increasingly <strong>pro/work</strong> tilted: training, design reviews, remote support, spatial workflows.</p></li></ul><p><strong>Read the room:</strong> Today&#8217;s divergence is clear - <strong>AR glasses aim B2C</strong> (eyes-up micro-tasks), while <strong>MR/VR aim B2B</strong> (immersion-heavy jobs with measurable ROI).</p><div><hr></div><h2>4) Use cases: The jobs to be done </h2><p><strong>Thesis:</strong> PMF likely to land <strong>faster in B2B</strong> (centralized buyers, immediate ROI, indoor optics) with <strong>MR/VR headsets</strong>. <strong>Consumer demand for AR glasses would</strong> follow once weight, outdoor readability, and price clear thresholds - anchored by travel/translation, wayfinding, capture&#8594;memory.</p><p>Below are the breakdown of use cases that already exist + I think would likely come into reality soon. <em><s>Also tried to bring back memories from my past projects on XR devices :)</s></em></p><h4><strong>B2C &#8211; Everyday, eyes-up utility</strong></h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QhMD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QhMD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png 424w, https://substackcdn.com/image/fetch/$s_!QhMD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png 848w, https://substackcdn.com/image/fetch/$s_!QhMD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png 1272w, https://substackcdn.com/image/fetch/$s_!QhMD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QhMD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png" width="1456" height="769" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:769,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:6631805,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/177328849?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QhMD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png 424w, https://substackcdn.com/image/fetch/$s_!QhMD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png 848w, https://substackcdn.com/image/fetch/$s_!QhMD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png 1272w, https://substackcdn.com/image/fetch/$s_!QhMD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F929f1747-4da2-4ac2-8ebd-68f1708a7165_3022x1596.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Smart glasses will earn their spot on your face not through holograms, but through <em>micro-moments of value</em> that happen dozens of times a day.</p><ol><li><p><strong>Capture &#8594; Memory</strong><br>This isn&#8217;t just about taking memorable 3D pictures/videos that you can already do on Apple Vision Pro. This is the &#8220;never forget again&#8221; layer. When you see something worth remembering - say, where you left your keys, a whiteboard note, or a wine label - the glasses capture that moment with context (&#8220;Saved: keys on desk, 10:32 AM&#8221;). Later, you can simply ask, &#8220;Where did I leave my keys?&#8221; and it recalls the visual. It&#8217;s not life-logging; it&#8217;s <em>human-scale recall</em>, bridging the gap between short-term memory and searchable vision. How far this could go? <strong>It&#8217;s not just keys that it can remember - things, locations, people&#8217;s face and characteristics, and more.</strong></p></li><li><p><strong>Travel &amp; Translation</strong><br>You glance at a menu in Tokyo or a metro sign in Paris - the text auto-translates and prices convert instantly (&#8220;Ramen Bowl &#8211; $8.50&#8221;). The detected language appears as a soft badge in your view. This isn&#8217;t about showing off AR tricks; it&#8217;s <em>reducing friction in unfamiliar contexts</em>, turning every traveler into a confident local without breaking conversational eye contact.</p></li><li><p><strong>Fitness &amp; Outdoors</strong><br>Think of it as a <em>heads-up dashboard for movement</em>. While running or cycling, you see pace, heart rate, and calories inline with the road ahead, plus contextual info like your playlist or route cues. No wrist flicks or screen taps - just continuous feedback that keeps your eyes on the path. XR here isn&#8217;t about 3D worlds; it&#8217;s <em>about flow</em> - keeping you present while still informed.</p></li><li><p><strong>Events &amp; Venues</strong><br>In crowded, sensory-heavy spaces like concerts or sports games, smart glasses surface <em>context without distraction</em>. Lyrics, next-song cues, and event info pop up as lightweight overlays, while still letting you enjoy the moment. Imagine your favorite band starting the encore and your glasses quietly reminding you which song is next or where your friends are sitting. It&#8217;s the difference between <em>capturing the vibe</em> and missing it while checking your phone.</p></li><li><p><strong>Wayfinding &amp; Local Discovery</strong><br>Navigation becomes ambient. Arrows align with the street ahead, and local offers or recommendations surface in context (&#8220;Caf&#233; 120 m &#8594; 10 % off latte&#8221;). You&#8217;re no longer toggling between the world and a map - the city <em>annotates itself</em>. It&#8217;s not just directions; it&#8217;s <em>eyes-up discovery</em> that blends movement, commerce, and curiosity into one frame.Together, these B2C cases revolve around one theme - <strong>micro-tasks in under ten seconds</strong>, done without pulling a phone. That&#8217;s the true wedge before AR goes mainstream.</p></li></ol><h4>[B2B Use cases]</h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bYkQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bYkQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png 424w, https://substackcdn.com/image/fetch/$s_!bYkQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png 848w, https://substackcdn.com/image/fetch/$s_!bYkQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png 1272w, https://substackcdn.com/image/fetch/$s_!bYkQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bYkQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png" width="1456" height="885" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:885,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:4555473,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/177328849?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bYkQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png 424w, https://substackcdn.com/image/fetch/$s_!bYkQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png 848w, https://substackcdn.com/image/fetch/$s_!bYkQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png 1272w, https://substackcdn.com/image/fetch/$s_!bYkQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F952f52ac-b560-449d-9bfc-438295443daa_2792x1698.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>B2B &#8211; Immersion with ROI</strong></p><p>In the enterprise world, adoption is faster because ROI is clear, optics are indoors, and training costs are high. XR devices already show measurable gains here.</p><ol><li><p><strong>Simulation Training</strong><br>Picture a new hotel front-desk employee or retail associate wearing lightweight MR glasses. Instead of static manuals, they practice live scenarios - greeting guests, checking reservations, handling edge cases - all guided by an overlayed trainer avatar. The headset tracks eye contact and timing, offering real-time cues (&#8220;Step 1 &#8594; Greet guest &#10003; Step 2 &#8594; Check reservation&#8221;). It&#8217;s learning-by-doing, scaled through simulation. What used to take days of shadowing now happens interactively in minutes.</p></li><li><p><strong>Inventory &amp; Logistics</strong><br>In a warehouse or fulfillment center, AR glasses display what&#8217;s next - shelf location, quantity, pick confirmation - directly in the worker&#8217;s view (&#8220;Shelf A3 &#183; 18 units &#183; Next Item &#8594;&#8221;). No handheld scanners or clipboards; both hands stay free. The system syncs live with ERP data, cutting picking errors and boosting throughput. It&#8217;s the quiet side of XR: <em>clear direction, zero confusion</em>.</p></li><li><p><strong>Remote Assist &amp; Maintenance</strong><br>When a technician faces a complex machine fault, they can summon an expert instantly. The expert sees what the technician sees through the headset&#8217;s camera and can draw annotations directly into the field of view (&#8220;Step 3 &#8594; Tighten bolt&#8221;). This collapses distance - senior engineers no longer need to fly out, and even junior techs can perform advanced repairs safely. The payoff is faster resolution and higher first-time-fix rates.</p></li><li><p><strong>Design Visualization &amp; Prototyping</strong><br>Engineers and designers can &#8220;pull&#8221; digital prototypes into the physical world. A new phone casing or robotic arm appears in mid-air, rendered at real scale and adjustable by hand (&#8220;Material: Aluminum &#183; Angle: 32&#176;&#8221;). Instead of imagining CAD models on a flat screen, teams walk around and iterate instantly - testing proportions, materials, and ergonomics in context. XR becomes <em>a spatial whiteboard for product imagination</em>.</p></li><li><p><strong>Factory Quality Inspection</strong><br>Inspectors see the line through computer vision assistance. The system flags misaligned screws, missing labels, or cosmetic defects in real time (&#8220;Defect: Misaligned screw &#8594; Fail&#8221;). Each check is logged automatically with timestamp and item ID. The glasses act as a second set of eyes - tireless, consistent, and precise - ensuring that every unit meets spec before it ships.</p></li></ol><div><hr></div><h2>5) Hard constraints, and how teams are attacking them</h2><h3>A) Displays &amp; optics (brightness, efficiency, eyebox)</h3><ul><li><p><strong>Waveguides</strong> trending to higher coupling efficiency &#8594; <strong>fewer nits for same legibility</strong>; reflective variants help outdoors.</p></li><li><p><strong>Micro-OLED/&#181;LED</strong> + better collimation raise brightness without cooking the temple.</p></li><li><p><strong>Mechanical</strong>: distributed batteries (both temples/nose), slimmer driver boards, using the <strong>frame as heat spreader</strong>.</p></li></ul><h3>B) Meta-optics for cameras/sensors (thinner &#8220;eyes&#8221;)</h3><ul><li><p><strong>Metasurface lenses</strong> compress multi-element stacks into sub-mm &#8220;flat&#8221; optics - already shipping in depth/eye-tracking modules.</p></li><li><p>Expect early wins in <strong>gesture/eye tracking</strong> and short-range depth (smaller FOV needs), enabling <strong>sleeker fronts</strong> without bulging camera pods.</p></li></ul><h3>C) Vision comfort &amp; eyesight</h3><ul><li><p>Fix the <strong>vergence&#8211;accommodation</strong> mismatch with comfortable virtual focal distance for text (~1.5&#8211;4 m), conservative motion, high contrast.</p></li><li><p><strong>Prescription integration</strong> (lens inserts, 3D-printed Rx with embedded films), <strong>electrochromic dimming</strong> outdoors, and per-user calibration.</p></li></ul><h3>D) On-device AI &amp; routing (the real unlock)</h3><ul><li><p><strong>Tiered compute</strong>: glasses NPU handles wake word, OCR, short translation, simple vision; phone/cloud do heavy LLM or multimodal reasoning.</p></li><li><p><strong>Product magic</strong> is an <strong>invisible router</strong> that chooses local vs phone vs cloud, keeps latency &lt;~300 ms for common actions, and manages <strong>KV-caches/streaming</strong> so it feels instant.</p></li></ul><blockquote><p>Accessibility side note (from Knowlab): integrated <strong>AI vision aids</strong> - scene description, live captions - plus research using <strong>retinal implants + AI glasses</strong> shows non-gadget, life-changing impact.</p></blockquote><div><hr></div><h2>6) Market momentum (signals from the field)</h2><ul><li><p><strong>Meta &#215; Ray-Ban</strong> validated <strong>style + utility</strong> can move units (and eyewear equities). New Display variant and neural wristband push input UX forward.</p></li><li><p><strong>Samsung/Google</strong> are seeding an <strong>Android XR</strong> runway with fashion partners (Warby Parker, Gentle Monster) and <strong>Gemini</strong> on-device.</p></li><li><p><strong>Apple</strong> is likely to prime the pump with a <strong>dev-first</strong> reveal ahead of launch, mirroring Vision Pro&#8217;s ramp.</p></li><li><p><strong>Alibaba, Xiaomi</strong> show consumer price/feature elasticity at scale (translation, payments, live-stream) with aggressive weights (~40 g without lenses reported on some SKUs).</p></li><li><p><strong>Snap</strong> commits to 2026 consumer AR <strong>Specs</strong> after a decade+ of R&amp;D.</p></li></ul><p><strong>Forecasts (directionally)</strong> point to a steep curve: multi-tens of millions of units by decade&#8217;s end if the category holds its current trajectory; B2B roll-outs will subsidize the tech curve while consumer SKUs find their wedge.</p><div><hr></div><h2>7) Roadmap: what I think would tangibly change in the next &lt;5 years?</h2><ul><li><p><strong>Outdoor text readability becomes &#8220;good enough.&#8221;</strong> Brighter, higher-contrast overlays; smarter auto-dimming/contrast modes.</p></li><li><p><strong>Latency choreography tightens.</strong> Common asks (translate this sign, summarize this page, what&#8217;s that building?) return answers in ~1&#8211;2 s via smart local&#8596;phone&#8596;cloud routing.</p></li><li><p><strong>Quiet inputs</strong> mature: subtle frame swipes/pinch/eye-affirm beats &#8220;talk to your glasses in public.&#8221;</p></li><li><p><strong>Memory as a feature</strong> stabilizes: capture &#8594; auto-organize &#8594; &#8220;find again&#8221; by voice works reliably.</p></li><li><p><strong>Enterprise scale</strong>: four-figure deployments in field service, warehousing, assembly; AEC pilots widen; training becomes a horizontal SKU.</p></li><li><p><strong>Consumer wedge</strong> hardens around <strong>travel/translation + wayfinding + capture&#8594;memory</strong>, priced to move (and likely carrier-bundled).</p></li><li><p><strong>New business models.</strong> Especially for MR/VR headsets: As OEMs push hard to create mass replacement of desktops to MR/VR headsets, <strong>rental businesses</strong> (just like how companies rent laptops to large corporates) may start happening, starting from B2B side.</p></li></ul><p><strong>Signals to watch</strong></p><ul><li><p>Sub-<strong>150 g</strong> frames claiming <strong>4&#8211;6 h</strong> mixed use, explicit &#8220;<strong>outdoor-readable</strong> text.&#8221;</p></li><li><p>Phone OS releases with <strong>deeper handoff APIs</strong> for glasses.</p></li><li><p><strong>Carrier bundles</strong> and retail tie-ups.</p></li><li><p>Case studies with hard deltas: <strong>first-time-fix&#8593;</strong>, <strong>picks/hour&#8593;</strong>, <strong>rework&#8595;</strong>.</p></li></ul><div><hr></div><h2>8) Closing Answer</h2><p>So - can assistant-first smart glasses move from novelty to habit? <strong>Yes, if they embrace constraint.</strong> The winners won&#8217;t chase full 3D fantasies on a 40-gram frame. They&#8217;ll <strong>do less, better</strong>: instant translation, arrows when you need them, hands-free capture that turns into trustworthy memory, and quick &#8220;what&#8217;s that?&#8221; answers - <strong>fast, private, stylish</strong>. <em><s>(TMI: To me it&#8217;s interesting to see this trend happening across verticals - just as I explained in my prior article on &#8216;smaller&#8217; AI models. Techs shifting gear towards mass adoption and scale up tending to go smaller but better.)</s></em></p><p>Short-term, <strong>MR/VR headsets</strong> own 3D-heavy work (training, design, remote assist); <strong>AR glasses</strong> earn the consumer day by nailing eyes-up micro-tasks. </p><h4>As optics get brighter, NPUs get cheaper, and routing gets invisible, I am convinced that <strong>AR glasses trend toward &#8220;the next smartphone&#8221;</strong> - the thing you don&#8217;t leave home without - while <strong>MR/VR becomes the next laptop</strong> - heavier, but where deep work happens.</h4><h4>The bar is simple and ruthless: <strong>useful in under 10 seconds, without fuss</strong>. Teams that meet it - respecting human factors as much as silicon - will own the face, and the balance sheet that follows.</h4>]]></content:encoded></item><item><title><![CDATA[Deep-Tech Decoded: #5 Cloud Control Planes & DNS]]></title><description><![CDATA[Why One Region&#8217;s Hiccup Took Half the Internet Down]]></description><link>https://clairechoi616.substack.com/p/deep-tech-decoded-5-cloud-control</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/deep-tech-decoded-5-cloud-control</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Thu, 23 Oct 2025 14:00:33 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!7gA6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>One Big Question</h2><p><strong>How did a &#8220;routine&#8221; DNS issue in one AWS region ripple into banks, games, and government sites - and what designs keep the next hiccup from becoming everyone&#8217;s outage?</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9sht!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9sht!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp 424w, https://substackcdn.com/image/fetch/$s_!9sht!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp 848w, https://substackcdn.com/image/fetch/$s_!9sht!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp 1272w, https://substackcdn.com/image/fetch/$s_!9sht!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9sht!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp" width="1280" height="720" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/de25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Amazon outage, AWS outage, Amazon AWS logo&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Amazon outage, AWS outage, Amazon AWS logo" title="Amazon outage, AWS outage, Amazon AWS logo" srcset="https://substackcdn.com/image/fetch/$s_!9sht!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp 424w, https://substackcdn.com/image/fetch/$s_!9sht!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp 848w, https://substackcdn.com/image/fetch/$s_!9sht!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp 1272w, https://substackcdn.com/image/fetch/$s_!9sht!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde25acf6-93b4-4122-b7ca-12611b32cdf1_1280x720.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>TL;DR</h2><p>A DNS misfire for a <strong>DynamoDB</strong> endpoint in <strong>US-EAST-1</strong> triggered synchronized retries and service-chain stalls across thousands of apps. Resilience comes from <strong>multi-region control planes, DNS canaries/rollback, sane client retries + circuit breakers, and practiced failovers</strong>. Outages happen; internet-wide cascades don&#8217;t have to.</p><div><hr></div><h2>Cheat Sheet before Diving In</h2><ul><li><p><strong>DNS:</strong> name &#8594; IP. If it&#8217;s wrong/slow, everything above it wobbles.</p></li><li><p><strong>Control plane:</strong> orchestration brain - keep <strong>multi-region</strong> and <strong>partitioned</strong>.</p></li><li><p><strong>US-EAST-1:</strong> AWS&#8217;s gravity well - minimize hard dependencies.</p></li><li><p><strong>Retry storms:</strong> synchronized client retries that amplify outages&#8212;use <strong>backoff + jitter</strong>.</p></li><li><p><strong>Active-active:</strong> multiple regions serving at once; no cold glass to break.</p></li><li><p><strong>TTL:</strong> DNS cache time; balance agility vs stability.</p></li></ul><div><hr></div><h2>Decode the Basics </h2><p><strong>The cloud in three lines.</strong> You rent compute, storage, and platforms over the internet instead of buying servers. IaaS = raw machines; PaaS = managed building blocks (databases, queues, identity); SaaS = finished apps. The draw: speed, elasticity, and managed ops.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yswt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yswt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg 424w, https://substackcdn.com/image/fetch/$s_!yswt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg 848w, https://substackcdn.com/image/fetch/$s_!yswt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!yswt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yswt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg" width="1470" height="876" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:876,&quot;width&quot;:1470,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:105132,&quot;alt&quot;:&quot;Cloud service models: SaaS, PaaS and IaaS&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Cloud service models: SaaS, PaaS and IaaS" title="Cloud service models: SaaS, PaaS and IaaS" srcset="https://substackcdn.com/image/fetch/$s_!yswt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg 424w, https://substackcdn.com/image/fetch/$s_!yswt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg 848w, https://substackcdn.com/image/fetch/$s_!yswt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!yswt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F438aacc1-55f5-48f3-85b9-a2901d31b5c6_1470x876.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Stackscale</figcaption></figure></div><p><strong>Two planes, two risks.</strong> The <strong>control plane</strong> (the brain) creates/updates resources and enforces identity/policy. The <strong>data plane</strong> moves live traffic. If &#8220;global&#8221; control services quietly anchor to one region, a small fault can jam a lot.</p><p><strong>DNS is the phonebook.</strong> It translates names (e.g., an API endpoint) to IPs. If DNS lies or times out, clients can&#8217;t find the door. Cache settings (TTLs) and client behavior decide whether you recover quickly&#8212;or stampede the network.</p><blockquote><p><strong>One-screen mental model</strong></p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oLU7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oLU7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png 424w, https://substackcdn.com/image/fetch/$s_!oLU7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png 848w, https://substackcdn.com/image/fetch/$s_!oLU7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png 1272w, https://substackcdn.com/image/fetch/$s_!oLU7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oLU7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png" width="684" height="94.8956043956044" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:202,&quot;width&quot;:1456,&quot;resizeWidth&quot;:684,&quot;bytes&quot;:11296,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/176780453?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oLU7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png 424w, https://substackcdn.com/image/fetch/$s_!oLU7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png 848w, https://substackcdn.com/image/fetch/$s_!oLU7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png 1272w, https://substackcdn.com/image/fetch/$s_!oLU7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d40a085-18a3-4b47-ace8-e99b0d5d71a5_2048x284.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div></blockquote><div><hr></div><h2>What Just Happened (Oct 20, 2025)</h2><ul><li><p><strong>Where:</strong> <strong>US-EAST-1 (N. Virginia)</strong>, AWS&#8217;s busiest historical region.</p></li><li><p><strong>Trigger:</strong> A <strong>DNS resolution issue</strong> on the <strong>DynamoDB API endpoint</strong> after an update.</p></li><li><p><strong>Blast radius:</strong> &gt;1,000 companies impacted; consumer apps, games, banks, and gov portals showed failures.</p></li><li><p><strong>Why it spread:</strong> US-EAST-1 gravity + service chaining + synchronized retries = a mundane fault turned very public.</p></li><li><p><strong>Recovery:</strong> DNS was mitigated the same day; lingering queues/backlogs cleared afterward.</p></li><li><p><strong>Note:</strong> No signs of a cyberattack, this was configuration/infra behavior. And yes, <strong>&#8220;it&#8217;s always DNS&#8221;</strong> is more pattern than joke.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7gA6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7gA6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7gA6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7gA6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7gA6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7gA6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg" width="800" height="739" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:739,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;AWS Outage Knocks Out Major Services Like Snapchat and Alexa - Business  Insider&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="AWS Outage Knocks Out Major Services Like Snapchat and Alexa - Business  Insider" title="AWS Outage Knocks Out Major Services Like Snapchat and Alexa - Business  Insider" srcset="https://substackcdn.com/image/fetch/$s_!7gA6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7gA6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7gA6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7gA6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb289bc2b-cbaf-46c3-a4a4-02f406ea9fe2_800x739.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">AWS outage yesterday brought down the rest of the internet. Outage-tracking website DownDetector showed a fresh wave of outage reports for services including Amazon and Venmo later Monday morning.</figcaption></figure></div><div><hr></div><h2>Product &amp; Operations Lens: what actually reduces blast radius</h2><ol><li><p><strong>Multi-region by design.</strong> Treat identity, metadata, and routing as <strong>multi-region products</strong> (active-active for reads; queue + replay for writes). Avoid single-region anchors.</p></li><li><p><strong>DNS with brakes.</strong> Dual/secondary DNS for crown-jewel names, <strong>staged/canary changes</strong> with instant rollback, health-checked CNAMEs, thoughtful TTLs. Treat DNS edits like code.</p></li><li><p><strong>Clients are part of resilience.</strong> Ship <strong>exponential backoff + jitter</strong>, circuit breakers, idempotent writes, and bounded queues. Fail fast with clear UX instead of hot-looping retries.</p></li><li><p><strong>Practice the failure.</strong> Run <strong>DNS game-days</strong> and <strong>region-evac drills</strong> during low traffic. Simulate &#8220;endpoint unreachable.&#8221; Measure real RTO/RPO - not slideware.</p></li></ol><div class="pullquote"><p><strong>Anti-pattern &#8594; Better pattern</strong><br>Single-region IAM/metadata &#8594; <strong>Replicated/partitioned across regions</strong><br>Global tables anchored to one region &#8594; <strong>Multi-home critical keys</strong><br>Infinite instant retries &#8594; <strong>Backoff + jitter + circuit breaker</strong><br>&#8220;We&#8217;ll fail over in a crisis&#8221; &#8594; <strong>Quarterly evacuation game-days</strong></p></div><h2>What Providers Are Doing (and what you can lean on)</h2><ul><li><p><strong>Partitioned control planes / cell architectures</strong> to contain faults.</p></li><li><p><strong>Safer DNS pipelines</strong> (canaries, health checks, fast rollback).</p></li><li><p><strong>Stronger SDK defaults</strong> (built-in backoff, circuit breakers, idempotency).</p></li><li><p><strong>Decoupling &#8220;global&#8221; services</strong> from US-EAST-1.</p></li><li><p><strong>Security &amp; posture tooling</strong> (policy guardrails, config drift alerts, retry-storm detection).</p></li><li><p><strong>Dual-DNS / edge partnerships</strong> (Route 53 + Cloudflare/NS1/Akamai) for split authority.</p></li></ul><p><em>Goal: localized degrade, not headlines.</em></p><div><hr></div><h2>Market &amp; Strategy Lens</h2><ul><li><p><strong>Concentration risk &#8800; cloud is bad.</strong> It&#8217;s a <strong>design-for-failure</strong> problem.</p></li><li><p><strong>Multi-cloud helps only if</strong> shared dependencies (DNS, identity, data) are detangled; otherwise you just doubled contracts.</p></li><li><p><strong>Regulatory push</strong> (EU/UK) will ask banks/gov for evidence of <strong>DNS/identity/data redundancy</strong> and region independence.</p></li><li><p><strong>Economics:</strong> Compare <strong>$/minute downtime</strong> vs <strong>$/month resilience</strong>. DNS canaries, SDK backoff, and dual-DNS are cheap relative to lost revenue and trust.</p></li></ul><div><hr></div><h2>Signals to Watch</h2><ul><li><p><strong>Specific post-mortems:</strong> DNS canary/rollback SLAs; region-partitioned control planes.</p></li><li><p><strong>SDK updates you can feel:</strong> default backoff/jitter, circuit breakers, idempotent calls.</p></li><li><p><strong>Customer detangling:</strong> dual-DNS adoption; published region-evac runbooks; moving IAM/metadata off single-region anchors.</p></li><li><p><strong>Tooling uptick:</strong> DNS change-management/canary tools; multi-region data products.</p></li><li><p><strong>Regulatory guidance</strong> on cloud resilience audits that include <strong>name-resolution SLOs</strong>.</p></li></ul><div><hr></div><h2>What I&#8217;m Still Unsure About</h2><ul><li><p>How much &#8220;global&#8221; control still transits <strong>US-EAST-1</strong> under the hood - and the pace of unwinding it.</p></li><li><p>Whether customers will fund <strong>active-active + dual-DNS</strong> after the news cycle fades.</p></li><li><p>How far providers push <strong>SDK-level defaults</strong> to curb client-side storms.</p></li></ul><div><hr></div><h2>Closing Answer</h2><p><strong>Too many &#8220;global&#8221; paths still breathe through US-EAST-1, and naming is oxygen.</strong> When DNS for a core endpoint failed, clients all retried together, chains jammed, and the blast radius grew. The fix isn&#8217;t glamorous: multi-region control planes, DNS isolation with canaries, client backoff/circuit breakers, and regular game-days. <strong>You can&#8217;t stop hiccups, but you can keep them from becoming headlines.</strong></p>]]></content:encoded></item><item><title><![CDATA[Deep-Tech Decoded: #4 Humanoid Robots]]></title><description><![CDATA[If the last decade was about AI brains, the next is about AI bodies.]]></description><link>https://clairechoi616.substack.com/p/deep-tech-decoded-4-humanoid-robots</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/deep-tech-decoded-4-humanoid-robots</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Tue, 21 Oct 2025 15:02:52 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!E0pu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Are humanoid robots ready for their ChatGPT moment, or still a few hard steps away?</h1><p><strong>TL;DR:</strong> The spark is here; the wildfire starts on factory floors. As shared-space safety matures, reliability improves, and costs keep sliding, humanoids scale from narrow tasks to wider shifts - less a viral jump, more a compounding curve.</p><div><hr></div><h2>Why Humanoids, Why Now?</h2><p>Last time I wrote about a specific product within humanoids, but this time I&#8217;ll zoom out to explain about the humanoids overall.</p><p><strong>To the 20th century, robots were machines behind fences; to the 21st, they may stand beside us.</strong></p><p>Humanoid robots are no longer just science fiction or viral demo material. They&#8217;re being piloted in factories, warehouses, and even hospitals, aiming to fill labor gaps and automate tasks in spaces built for humans. Unlike the home-focused Figure 03, which spotlights imitation learning and consumer chores, this article zooms out: <strong>What makes humanoids possible, what are their industrial frontiers, and what will it take for them to scale?</strong></p><div><hr></div><h2>What Is a Humanoid Robot, Really?</h2><p>A humanoid robot is a machine designed to operate in human environments, with a body plan (two arms, two legs, a head) that lets it use our tools, open our doors, and navigate our world. But the real story is how these robots combine <strong>mechanical engineering, advanced sensors, and AI-driven control</strong> to move, perceive, and act with increasing autonomy.</p><p>But humanoid robots are not all the same - most real-world models fall into four broad categories, each with distinct roles and interaction styles:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kubm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kubm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png 424w, https://substackcdn.com/image/fetch/$s_!kubm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png 848w, https://substackcdn.com/image/fetch/$s_!kubm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png 1272w, https://substackcdn.com/image/fetch/$s_!kubm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kubm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png" width="1456" height="893" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:893,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:734138,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/176660937?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kubm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png 424w, https://substackcdn.com/image/fetch/$s_!kubm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png 848w, https://substackcdn.com/image/fetch/$s_!kubm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png 1272w, https://substackcdn.com/image/fetch/$s_!kubm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27eee1b9-b71f-4319-a2a9-623adf76a510_1464x898.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Categorization of humanoid robots (Data source: Standard robots)</figcaption></figure></div><p>This 4-part categorization helps clarify why robots like Figure 03 feel so different from, say, a logistics robot rolling through an assembly line: each is built for a different spectrum of engagement and capability.</p><div><hr></div><h2>The Anatomy of a Modern Humanoid</h2><ul><li><p><strong>Skeleton &amp; Actuators:</strong> Lightweight alloys and high-torque motors mimic bones and muscles, enabling walking, lifting, and balancing.</p></li><li><p><strong>Sensors:</strong> Cameras, LiDAR, tactile pads, and inertial sensors provide vision, touch, and balance - crucial for safe, adaptive movement.</p></li><li><p><strong>AI &#8220;Brain&#8221;:</strong> Onboard computers run perception, planning, and control algorithms, often powered by the same chips that drive generative AI.h-density batteries and efficient power management are essential for multi-hour operation.</p></li></ul><div><hr></div><h2>The Four Industrial Bridges</h2><p>While home robots like Figure 03 focus on learning from demonstration, industrial humanoids face a different set of hurdles:</p><ol><li><p><strong>Safety in Shared Spaces</strong></p><ul><li><p>Robots must work alongside people without cages, requiring robust perception, fast reflexes, and compliance with evolving safety standards.</p></li></ul></li><li><p><strong>Uptime and Reliability</strong></p><ul><li><p>Industrial buyers demand robots that can run for hours, self-diagnose faults, and recover from stumbles - no babysitting allowed.</p></li></ul></li><li><p><strong>Dexterity and Versatility</strong></p><ul><li><p>From handling car parts to sorting packages, humanoids need hands and arms that can adapt to a wide range of objects and tasks.</p></li></ul></li><li><p><strong>Cost Down, Scale Up</strong></p><ul><li><p>To move beyond pilots, costs must drop from six figures to the price of a car. This means modular designs, mass manufacturing, and a robust supply chain.</p></li></ul></li></ol><div><hr></div><h2>The Tech Stack: What&#8217;s Important and New?</h2><ul><li><p><strong>Actuators:</strong> Next-gen motors and gearboxes deliver human-like strength and speed, with fewer parts and less maintenance.</p></li><li><p><strong>Perception:</strong> Multi-modal sensor fusion (vision, touch, force) enables robots to &#8220;see&#8221; and &#8220;feel&#8221; their environment in real time.</p></li><li><p><strong>AI Control:</strong> Large models trained in simulation and real-world data allow robots to generalize across new tasks and settings - crucial for industrial flexibility.</p></li><li><p><strong>Cloud &amp; Edge:</strong> Many robots now blend onboard processing with cloud-based updates, letting them learn from each other and improve over time.</p></li></ul><div><hr></div><h2>Where Are Humanoids Working First?</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_FW6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_FW6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png 424w, https://substackcdn.com/image/fetch/$s_!_FW6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png 848w, https://substackcdn.com/image/fetch/$s_!_FW6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png 1272w, https://substackcdn.com/image/fetch/$s_!_FW6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_FW6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png" width="1456" height="1193" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1193,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:117699,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/176660937?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_FW6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png 424w, https://substackcdn.com/image/fetch/$s_!_FW6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png 848w, https://substackcdn.com/image/fetch/$s_!_FW6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png 1272w, https://substackcdn.com/image/fetch/$s_!_FW6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd02646c8-e050-4f31-9427-1b891ae162e9_1486x1218.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: McKinsey</figcaption></figure></div><p>Humanoid robots are found across sectors, but <strong>many pilots are found in manufacturing and logistics, mainly due to the fact that it is the only industry at this point that can justify the high initial investment into product &amp; installation</strong>. Looking more specifically into some of the sectors, below are the specific sub sectors that are increasingly trying out humanoids: </p><ul><li><p><strong>Automotive Plants:</strong> Robots handle repetitive, ergonomic tasks&#8212;moving parts, loading machines, and even quality checks.</p></li><li><p><strong>Warehouses:</strong> Early deployments focus on picking, packing, and moving goods in spaces designed for people, not conveyor belts.</p></li><li><p><strong>Healthcare &amp; Labs:</strong> Some humanoids assist with patient mobility or repetitive lab work, where adaptability and safety are paramount.</p></li></ul><div><hr></div><h2>Emerging Types of Humanoids</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E0pu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E0pu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp 424w, https://substackcdn.com/image/fetch/$s_!E0pu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp 848w, https://substackcdn.com/image/fetch/$s_!E0pu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp 1272w, https://substackcdn.com/image/fetch/$s_!E0pu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E0pu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp" width="1456" height="1105" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1105,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;The Current Generation of Humanoid Robots (2025)&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="The Current Generation of Humanoid Robots (2025)" title="The Current Generation of Humanoid Robots (2025)" srcset="https://substackcdn.com/image/fetch/$s_!E0pu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp 424w, https://substackcdn.com/image/fetch/$s_!E0pu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp 848w, https://substackcdn.com/image/fetch/$s_!E0pu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp 1272w, https://substackcdn.com/image/fetch/$s_!E0pu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4301e595-934f-4f87-97cc-297efbf981f5_2921x2216.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Voronoi; Date: Apr 2025</figcaption></figure></div><p>If Figure 03 (the one I introduced in my last product-decoded article) is one of the more approachable, home-oriented humanoids you&#8217;ll soon find around you, there are more ambitious types on the horizon - each opening new markets and technical frontiers.</p><ul><li><p><strong>Morphing and Multi-Modal Humanoids:</strong><br>This is the robot that made me write this article. A member from my Stanford Student Robotic&#8217;s Club uploaded this video on our discord, saying that our drones should be able to deliver to the doorstep via transformative robot modules. </p><p>Caltech&#8217;s X1 system merges a bipedal humanoid (Unitree G1) with a drone-like robot (M4) that can fly, drive, and roll. This hybrid can walk, deploy a flying scout, and traverse obstacles - showing how modular, multi-modal robots could tackle logistics, disaster response, and exploration in ways single-mode robots cannot.</p><div id="youtube2-F8DwBWCVZ0c" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;F8DwBWCVZ0c&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/F8DwBWCVZ0c?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div></li><li><p><strong>Industrial-Grade Heavy-Lift Humanoids:</strong><br>Robots like Tesla&#8217;s Optimus Gen 2 and NEURA Robotics&#8217; 4NE-1 are engineered for manufacturing, logistics, and construction, with reinforced frames and high payload capacity. These are designed for demanding environments and tasks that require both strength and adaptability.&#8203;</p><div id="youtube2-bOtiglveYyM" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;bOtiglveYyM&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/bOtiglveYyM?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div></li><li><p><strong>Swarm-Enabled and Collaborative Humanoids:</strong><br>Some startups are developing humanoids that work in coordinated teams, sharing data and dividing tasks - potentially transforming warehouse automation and large-scale logistics.</p><div id="youtube2-wc1UpmxkCwA" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;wc1UpmxkCwA&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/wc1UpmxkCwA?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div></li><li><p><strong>Specialized Healthcare and Research Humanoids:</strong><br>Models like Fourier Intelligence&#8217;s GR-1 are tailored for patient mobility, rehabilitation, and lab work, while others serve as platforms for advanced AI and human-robot interaction research.</p><div id="youtube2-KoAEaZm1Hw4" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;KoAEaZm1Hw4&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/KoAEaZm1Hw4?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div></li></ul><p>For venture capital, these emerging types represent not just incremental improvements, but new business models and market opportunities - modularity, collaboration, and autonomy are the next big bets.</p><div><hr></div><h2>The Business Case: Why Industry Cares</h2><ul><li><p><strong>Labor Shortages:</strong> Aging populations and tight labor markets make automation attractive, especially for dull, dirty, or dangerous jobs.</p></li><li><p><strong>Flexibility:</strong> Unlike fixed automation, humanoids can be re-tasked for new workflows without expensive retooling.</p></li><li><p><strong>ROI:</strong> As costs fall and reliability rises, the break-even point for humanoids in logistics and manufacturing is approaching fast.</p></li></ul><div><hr></div><h2>Roadblocks to Mass Adoption</h2><p>Despite the promise, mass adoption of humanoids faces deep technology and business roadblocks:</p><ul><li><p><strong>Dexterity and Manipulation:</strong><br>Most humanoids still lack the fine motor skills and adaptive control needed for unstructured environments. Human-level manipulation remains a massive engineering challenge, limiting real-world deployment to repetitive or highly structured tasks.</p></li><li><p><strong>Energy Efficiency and Runtime:</strong><br>Multi-jointed limbs and balancing systems consume significant power, resulting in short runtimes (often 2&#8211;4 hours per charge). This restricts use to short shifts or requires frequent battery swaps, making them less practical than task-specific robots.</p></li><li><p><strong>Speed and Payload:</strong><br>For safety and balance, humanoids move cautiously and are slower than both humans and industrial robots. Their payload capacity is also limited, making them unsuitable for high-speed, heavy-duty environments.</p></li><li><p><strong>Cost and Scalability:</strong><br>Building a humanoid robot can cost anywhere from $16,000 to $300,000+. The absence of a mature supply chain means companies often have to design and manufacture components from scratch. Achieving economies of scale is critical to bringing costs down to the $20,000&#8211;$50,000 range needed for mass adoption.</p></li><li><p><strong>Regulation and Liability:</strong><br>Regulatory frameworks for humanoids are lagging behind technology. Certification for workplace safety, liability in mixed human-robot environments, and ethical considerations are all unresolved, slowing large-scale deployment.</p></li><li><p><strong>Social and Political Resistance:</strong><br>Humanoids are designed to take over jobs currently done by humans, which can lead to resistance from workers, unions, and policymakers. Public acceptance and trust will be critical for adoption.</p></li><li><p><strong>Market Hype and Investor Pressure:</strong><br>Fast funding cycles and aggressive investor expectations can push companies to over-promise and under-deliver, risking market disappointment and a correction similar to the autonomous vehicle sector.</p></li></ul><div><hr></div><h2>How the Industry Is Tackling These Barriers</h2><p>The field is attacking these challenges with targeted innovation and collaboration:</p><ul><li><p><strong>Dexterity:</strong><br>Labs and companies are developing new actuators, soft robotics, and modular hands to mimic human flexibility. Imitation learning - robots learning by watching humans - improves generalization and reduces the need for hand-coded routines.</p></li><li><p><strong>Energy and Runtime:</strong><br>Higher-density batteries, swappable packs, and lightweight materials are being integrated into new models. Fast-charging infrastructure and energy-efficient actuators are extending operational hours.&#8203;</p></li><li><p><strong>Cost and Scale:</strong><br>Modular, standardized designs and mass production techniques, often borrowed from the automotive sector, are driving down costs. Open-source platforms and shared supply chains are enabling broader participation and faster iteration.</p></li><li><p><strong>Software and Reliability:</strong><br>Large, multimodal AI models trained on both simulation and real-world data are making robots more robust. Self-correcting algorithms and error recovery routines are being built in to handle unexpected situations.&#8203;</p></li><li><p><strong>Regulation and Safety:</strong><br>Industry groups and standards bodies are piloting new safety protocols, including real-time force limits and certified fall recovery. Companies are working with regulators to gather safety data and refine compliance pathways.</p></li><li><p><strong>Social Acceptance:</strong><br>User-friendly interfaces, transparent privacy policies, and collaborative training programs are being piloted to build trust and ease workforce integration.&#8203;</p></li></ul><p>The next breakthroughs will likely come from modular hardware, smarter AI, and real-world pilots that prove reliability, safety, and ROI at scale.</p><div><hr></div><h2>The Bottom Line</h2><p>Humanoid robots are poised to become the next industrial revolutionaries - not by mimicking every human gesture, but by automating the physical work that keeps our factories, warehouses, and hospitals running. The winners will be those who cross the four industrial bridges: <strong>safety, uptime, dexterity, and cost.</strong></p><p>The age of the embodied AI colleague is just beginning. The question is not if, but how fast - and who will lead the charge.</p>]]></content:encoded></item><item><title><![CDATA[Deep-tech Decoded: #3 'Smaller' AI models]]></title><description><![CDATA[Small Is the New Smart: Why AI Models Are Shrinking]]></description><link>https://clairechoi616.substack.com/p/deep-tech-decoded-3-smaller-ai-models</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/deep-tech-decoded-3-smaller-ai-models</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Sat, 18 Oct 2025 01:30:23 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!NAjY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h1>Small Is the New Smart: Why AI Models Are Shrinking</h1><p><strong>TL;DR:</strong> &#8220;Bigger = smarter&#8221; is giving way to &#8220;smarter per watt, per dollar, per millisecond.&#8221; Tricks like mixture-of-experts, distillation, and quantization - plus on-device chips - let smaller models feel fast, cheap, and (for many jobs) just as good.</p><div><hr></div><h2>The end of the size war</h2><p>Two or three years ago, the leaderboard felt simple: whoever had the most parameters seemed to have the best model. We spoke in billions like it was casual - 7B, 70B, 175B. OpenAI&#8217;s GPT-3 set the tone; GPT-4 raised it. Meta, Google, Anthropic, and others chased the same horizon. Scale looked like destiny.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://clairechoi616.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Claire&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>And then the bill arrived.</p><p>Training a frontier model isn&#8217;t a one-off indulgence. It&#8217;s a repeating nine-figure commitment, with power use big enough to light a small city &#8212; and rising per-query costs when you deploy the model. Reported numbers for the latest giants show <strong>steep</strong> increases in API cost vs. earlier models, alongside growing concern that scaling alone is hitting diminishing returns. </p><p>Even if you can afford it, users still judge you on something brutally simple: <strong>latency</strong>. If an answer takes two seconds instead of 300 milliseconds, it feels worse - no matter how clever it is.</p><p>That&#8217;s when the conversation shifted from <em>How big?</em> to <em>How efficient?</em></p><div><hr></div><h2>What changed: efficiency beats mass</h2><p>A smaller model doesn&#8217;t have to be a dumber one. Modern training and inference squeeze more work out of fewer parameters, and map cleanly to today&#8217;s hardware.</p><ul><li><p><strong>Mixture-of-Experts (MoE):</strong> Route each token to a couple of specialists, not the whole brain. Like pinging the right teammate instead of dragging the entire company into a meeting.</p></li><li><p><strong>Distillation:</strong> A compact &#8220;student&#8221; copies a large &#8220;teacher&#8217;s&#8221; habits - how it balances options, how it handles tricky prompts - without carrying every neuron the teacher has.</p></li><li><p><strong>Quantization:</strong> Store numbers in fewer bits (8-bit, even 4-bit). That slashes memory traffic - the quiet killer of inference cost - and boosts throughput on modern accelerators.</p></li></ul><p>Add pruning (drop low-value connections), smarter decoding (draft-and-verify, prefix reuse), and tidier memory handling (KV-caches that don&#8217;t thrash), and you get a surprise: <strong>small models that feel fast, cheap, and, for many tasks, good enough.</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NAjY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NAjY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NAjY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NAjY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NAjY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NAjY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg" width="680" height="455" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:455,&quot;width&quot;:680,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NAjY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg 424w, https://substackcdn.com/image/fetch/$s_!NAjY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg 848w, https://substackcdn.com/image/fetch/$s_!NAjY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!NAjY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9ac27bff-a2a4-418a-9e2b-dce0c9b4e1d4_680x455.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Artificial Analysis</figcaption></figure></div><p>You can see the industry leaning in. OpenAI&#8217;s <strong>GPT-4o mini</strong> launched at <strong>$0.15 per million input tokens</strong> and <strong>$0.60 per million output</strong> - more than <strong>60% cheaper</strong> than GPT-3.5 Turbo - a clear signal that efficiency matters as much as peak IQ. Meta&#8217;s <strong>Llama 3.1</strong> shows how far algorithmic efficiency has come: its largest variant competes with top proprietary models at lower runtime cost, and it&#8217;s open. </p><div><hr></div><h2>Why on-device matters</h2><p>Running a model on your phone or laptop used to be a parlor trick. Now it&#8217;s a business decision. Device-class <strong>NPUs</strong> (neural processing units) make local inference practical. The payoff is threefold:</p><ul><li><p><strong>Speed:</strong> No round-trip to the cloud; sub-second replies become the default.</p></li><li><p><strong>Privacy:</strong> Sensitive inputs can stay on your device.</p></li><li><p><strong>Cost control:</strong> Not every tap hits a cloud GPU.</p></li></ul><p>Apple planted a flag with &#8220;Apple Intelligence.&#8221; Qualcomm, AMD, and Intel are shipping chips with AI blocks for everyday apps. Once people feel <strong>zero-lag</strong> assistants and private summarization, it&#8217;s hard to go back.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DUhc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DUhc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp 424w, https://substackcdn.com/image/fetch/$s_!DUhc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp 848w, https://substackcdn.com/image/fetch/$s_!DUhc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp 1272w, https://substackcdn.com/image/fetch/$s_!DUhc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DUhc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp" width="700" height="499" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:499,&quot;width&quot;:700,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:17378,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/176463133?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DUhc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp 424w, https://substackcdn.com/image/fetch/$s_!DUhc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp 848w, https://substackcdn.com/image/fetch/$s_!DUhc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp 1272w, https://substackcdn.com/image/fetch/$s_!DUhc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbd26929-dd07-47bb-8abb-2911b0c5d87a_700x499.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">At WWDC 2025, Apple said it would move AI onto devices, not just the cloud. Running AI on-device can be faster and keeps personal data local.</figcaption></figure></div><div><hr></div><h2>The new architecture: fleets, not monoliths</h2><p>In the scale era, teams tried to do everything with one colossal model. In the efficiency era, teams run <strong>fleets</strong>:</p><ul><li><p>a tiny router that figures out what you&#8217;re asking,</p></li><li><p>a fast generalist (think 3&#8211;9B parameters) for common queries,</p></li><li><p>and a few specialists - vision, code, deeper reasoning - that spin up only when needed.</p></li></ul><p>You don&#8217;t bring an orchestra to play a doorbell chime. You ring the bell. If it turns into a symphony, then you hire the strings.</p><p>This is where <strong>retrieval</strong> earns its keep. Instead of cramming the entire internet into weights, you fetch what&#8217;s needed - docs, tables, recent emails - and let a smaller core reason over fresh context. In practice, <strong>memory bandwidth and search</strong> (not raw parameter count) become your real constraints.</p><div><hr></div><h2>What &#8220;good&#8221; looks like in products</h2><p>If you&#8217;re shipping, a few truths tend to hold:</p><ul><li><p><strong>Latency wins hearts.</strong> Sub-300 ms replies feel magical.</p></li><li><p><strong>Quality-per-token beats peak scores.</strong> A distilled ~9B with retrieval and guardrails often beats a giant model that costs 10&#215; and responds slower.</p></li><li><p><strong>Privacy is a feature.</strong> Run local for personal data; escalate to cloud only when you truly need the bigger brain.</p></li><li><p><strong>The ops bill is a design input.</strong> Quantization, fused kernels, small prompts, and tight KV-cache handling aren&#8217;t &#8220;optimizations&#8221;; they&#8217;re part of the product.</p></li></ul><p>Failure modes have changed, too. It&#8217;s common to see a tiny model wrapped in a giant prompt, saving on model size but overspending on context. Or a pipeline that&#8217;s compute-fast but I/O-bound. The fixes are mostly discipline: slimmer prompts, smarter retrieval, and kernels that keep data close to compute.</p><div><hr></div><h2>How the business map shifts</h2><p>Value moves when constraints move. Shrinking models and growing NPUs shift power in a few places:</p><ul><li><p><strong>Cloud &#8594; Edge balance.</strong> You still need the cloud for heavy training and some inference, but more experiences happen locally. That cuts variable cloud cost and enables &#8220;always-on&#8221; features without scary bills.</p></li><li><p><strong>Model providers &#8594; solution providers.</strong> The moat isn&#8217;t &#8220;biggest model&#8221; anymore; it&#8217;s <strong>best model for the job</strong>, packaged with retrieval, tools, and guardrails.</p></li><li><p><strong>Hardware differentiation.</strong> Device makers who pair capable NPUs with clean developer stacks will pull ahead. The best apps will feel &#8220;instant&#8221; because they&#8217;re local.</p></li><li><p><strong>Metrics that matter.</strong> Accuracy still matters, but operators care just as much about <strong>latency, tokens-per-dollar, and energy-per-query</strong>. That&#8217;s why cheaper, capable &#8220;mini&#8221; and open models are gaining steam - they&#8217;re usable and affordable at scale. </p></li></ul><div><hr></div><h2>Why &#8220;smaller&#8221; now: two simple realities</h2><p><strong>1) Diminishing returns from brute-force scaling.</strong><br>Recent reports on the newest giants show incremental quality gains at <strong>much</strong> higher compute and energy, plus fragile behavior that comes with complex routing. In short: scaling still helps, just not enough to justify the cost in many use cases. </p><p><strong>2) Better outcomes from &#8220;doing more with less.&#8221;</strong><br>Smaller, purpose-built models can match (or surpass) larger ones on targeted tasks when trained on focused data, and they&#8217;re faster to train, deploy, and run, often on local devices. That&#8217;s good for product teams and the planet. </p><div><hr></div><h2>Where this goes next</h2><p>Expect routing and sparsity to get smarter; expect quantization to be the <strong>default</strong>, not an exotic trick. Expect more apps to <strong>start local</strong> and escalate only when necessary. And expect &#8220;model fleets&#8221; to become normal infrastructure, not bespoke wizardry.</p><p>There are open questions. How far can a small model go, even with great retrieval, before you truly need frontier-level depth? Will on-device ecosystems converge (easy for developers) or fragment (porting pain)? As agents take on longer, tool-heavy workflows, does the pendulum swing back toward bigger centralized brains - or do routers and specialists keep the upper hand?</p><div><hr></div><h2>The answer, finally</h2><p>This isn&#8217;t a detour. It&#8217;s a <strong>structural turn</strong>. Frontier models still matter for pushing the ceiling, but most of the daily value - what users feel and businesses pay for - will be delivered by <strong>smaller models that are fast, local, and paired with the right data</strong>.</p><p>You don&#8217;t just save money. You ship more features, to more people, in more places. The steady drop in &#8220;mini&#8221; model prices and the rise of credible open options are the tells. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://clairechoi616.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Claire&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Product Decoded #2: Figure 03 - From Language to Action]]></title><description><![CDATA[Why &#8220;show, don&#8217;t code&#8221; could be the home robot unlock]]></description><link>https://clairechoi616.substack.com/p/product-decoded-2-figure-03-from</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/product-decoded-2-figure-03-from</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Mon, 13 Oct 2025 13:50:46 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!kbEu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kbEu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kbEu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif 424w, https://substackcdn.com/image/fetch/$s_!kbEu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif 848w, https://substackcdn.com/image/fetch/$s_!kbEu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif 1272w, https://substackcdn.com/image/fetch/$s_!kbEu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kbEu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif" width="800" height="445" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:445,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1870746,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/176013242?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kbEu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif 424w, https://substackcdn.com/image/fetch/$s_!kbEu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif 848w, https://substackcdn.com/image/fetch/$s_!kbEu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif 1272w, https://substackcdn.com/image/fetch/$s_!kbEu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0faffbae-fa49-4659-8ba7-b912361b7b23_800x445.gif 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Figure</figcaption></figure></div><h2>TL;DR</h2><ul><li><p><strong>What shipped:</strong> Figure unveiled <strong>Figure 03</strong>, a humanoid doing multi-step chores (tidy toys, rinse &amp; load, laundry + detergent) while running on an <strong>in-house robot brain</strong> (no longer piggy-backing on ChatGPT).</p></li><li><p><strong>The big shift:</strong> Instead of &#8220;talk smart,&#8221; Figure is going after &#8220;<strong>move smart</strong>&#8221;- training robots by <strong>watching</strong> humans (imitation learning) so you <strong>teach by doing</strong>, not by programming.</p></li><li><p><strong>Why this matters:</strong> This could be the <strong>ChatGPT moment for robots</strong>: the day regular people become robot &#8220;teachers,&#8221; the market moves from factory lines to living rooms.</p></li><li><p><strong>Reality check:</strong> Motions are slow, scenes look controlled, and rivals (Optimus/Atlas) exist. But the <strong>direction of travel</strong> - from expert code &#8594; everyday demonstration - creates the flywheel.</p></li><li><p><strong>What to watch:</strong> the <strong>data engine</strong> (how much/high-quality demo data), <strong>policy architecture</strong> (how the brain learns long, smooth actions), <strong>tactile sensing &amp; safety</strong>, and <strong>real-home benchmarks</strong> that generalize beyond the lab.</p></li></ul><div><hr></div><h2>Quick rewind: &#8220;Goodbye to the borrowed brain&#8221;</h2><p>Earlier demos used a <strong>borrowed brain</strong> (ChatGPT-assisted behaviors). With <strong>Figure 03</strong>, the company says the brain is now <strong>in-house</strong>, trained end-to-end specifically for <strong>seeing &#8594; deciding &#8594; moving</strong>. The timeline is fast (months, not years), which suggests this wasn&#8217;t a scramble post-split; it was a <strong>planned blueprint</strong>.</p><p><strong>Why change the brain?</strong> Because charming conversation doesn&#8217;t wash dishes. <strong>Household value = safe, repeatable motion</strong>. That&#8217;s the product bet.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://clairechoi616.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Claire&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rxCN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rxCN!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif 424w, https://substackcdn.com/image/fetch/$s_!rxCN!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif 848w, https://substackcdn.com/image/fetch/$s_!rxCN!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif 1272w, https://substackcdn.com/image/fetch/$s_!rxCN!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rxCN!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif" width="800" height="446" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:446,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:12839633,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/176013242?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rxCN!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif 424w, https://substackcdn.com/image/fetch/$s_!rxCN!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif 848w, https://substackcdn.com/image/fetch/$s_!rxCN!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif 1272w, https://substackcdn.com/image/fetch/$s_!rxCN!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc38c4ffb-600e-4b21-bd04-298e10d9a54b_800x446.gif 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Figure</figcaption></figure></div><p></p><div><hr></div><h2>Product decoded: how a home robot actually &#8220;thinks&#8221;</h2><p>Below is a simplified &#8220;stack&#8221; you can keep in mind while watching any home-robot demo:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZgY-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZgY-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png 424w, https://substackcdn.com/image/fetch/$s_!ZgY-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png 848w, https://substackcdn.com/image/fetch/$s_!ZgY-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!ZgY-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZgY-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png" width="1024" height="1536" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1536,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2235212,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/176013242?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZgY-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png 424w, https://substackcdn.com/image/fetch/$s_!ZgY-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png 848w, https://substackcdn.com/image/fetch/$s_!ZgY-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!ZgY-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c911ce8-8e2d-4d6b-ac45-ffe42570dd40_1024x1536.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol><li><p><strong>Sensing</strong></p><ul><li><p><strong>Eyes:</strong> multiple cameras (likely RGB + depth).</p></li><li><p><strong>Hands/skin:</strong> <strong>tactile pads</strong> to feel slip/pressure (so plates don&#8217;t drop and detergents don&#8217;t spill).</p></li><li><p><strong>Inner ear:</strong> IMUs for balance; joint sensors for knowing where limbs are.</p></li></ul></li><li><p><strong>Perception (&#8220;What am I seeing?&#8221;)</strong></p><ul><li><p>Detects <strong>objects</strong> (plate, cup, knob), <strong>places</strong> (sink, rack), <strong>affordances</strong> (&#8220;this part is graspable&#8221;), and <strong>states</strong> (door closed/open, water on/off).</p></li></ul></li><li><p><strong>Policy (&#8220;How do I move?&#8221;)</strong></p><ul><li><p>The brain learns to map camera/touch inputs <strong>directly to arm/hand trajectories</strong>.</p></li><li><p>In practice this looks like <strong>imitation learning</strong>: humans demonstrate tasks; the robot learns <strong>smooth, long-horizon</strong> behavior (e.g., through <strong>transformer</strong> or <strong>diffusion</strong> policies known to produce stable motions).</p></li></ul></li><li><p><strong>Planner (&#8220;What&#8217;s the next small goal?&#8221;)</strong></p><ul><li><p>Strings skills into <strong>routines</strong>: <em>pick plate &#8594; rinse &#8594; load &#8594; close door</em>.</p></li><li><p>Handles <strong>retries</strong> (slipped plate? regrasp).</p></li></ul></li><li><p><strong>Control &amp; safety</strong></p><ul><li><p><strong>Speed caps</strong>, force limits, stop-on-contact, and geofences so the robot moves carefully around people and fragile stuff.</p></li></ul></li></ol><p>The robot learns to <strong>see like a person, move like an apprentice</strong>, and follow a <strong>checklist</strong> to finish the chore without breaking anything.</p><div><hr></div><h2>Why imitation learning is the unlock</h2><p>Historically, robots repeated motions that an expert <strong>coded</strong>. To change the task, you hired another expert.</p><p><strong>Imitation learning flips it:</strong> you <strong>show</strong> the task; the robot <strong>learns</strong> the pattern. That means:</p><ul><li><p><strong>Teaching scales</strong>: Any competent adult can become a robot &#8220;teacher.&#8221;</p></li><li><p><strong>Upgrades are data</strong>: Performance improves by collecting <strong>more/better demonstrations</strong>, not rewriting code.</p></li><li><p><strong>Generalization</strong>: Training across many homes (different dishwashers, knobs, lighting) makes the brain robust, just like how humans learn.</p></li></ul><blockquote><p>Analogy: ChatGPT moved AI from &#8220;build a model&#8221; to <strong>&#8220;just type.&#8221;</strong><br>Figure wants robots to move from &#8220;write motion code&#8221; to <strong>&#8220;just show.&#8221;</strong></p></blockquote><div><hr></div><h2>What the demo showed - and what it didn&#8217;t</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gIOq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gIOq!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif 424w, https://substackcdn.com/image/fetch/$s_!gIOq!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif 848w, https://substackcdn.com/image/fetch/$s_!gIOq!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif 1272w, https://substackcdn.com/image/fetch/$s_!gIOq!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gIOq!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif" width="800" height="446" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c685d387-91d4-4521-8045-01d15f63cd37_800x446.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:446,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:12312616,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/176013242?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gIOq!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif 424w, https://substackcdn.com/image/fetch/$s_!gIOq!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif 848w, https://substackcdn.com/image/fetch/$s_!gIOq!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif 1272w, https://substackcdn.com/image/fetch/$s_!gIOq!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc685d387-91d4-4521-8045-01d15f63cd37_800x446.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Figure</figcaption></figure></div><p><strong>Showed:</strong></p><ul><li><p>Multi-step sequences: <em>find &#8594; grasp &#8594; manipulate &#8594; place &#8594; start machine</em>.</p></li><li><p>Hand articulations that look <strong>gentle</strong> (no crushed plates).</p></li><li><p>A run without obvious teleoperation (at least in the published cut).</p></li></ul><p><strong>Didn&#8217;t show (yet):</strong></p><ul><li><p><strong>Speed</strong> near human level (today it&#8217;s cautious).</p></li><li><p><strong>Edge-case chaos</strong> (crowded sinks, unfamiliar detergent caps, small sponges, wet/slippery surfaces).</p></li><li><p><strong>Recovery</strong> from mistakes (mis-grasp, soap spill).</p></li><li><p><strong>Long runs</strong> (hours of mixed chores), or <strong>family safety</strong> with kids/pets wandering in and out.</p></li></ul><p>That&#8217;s fine for a public first look - but it defines the <strong>next milestones</strong>.</p><div><hr></div><h2>Data is the new &#8220;robot oil&#8221;</h2><p>For action learning, the moat is a <strong>data engine</strong>, not just a motor:</p><ul><li><p><strong>Where data comes from:</strong></p><ul><li><p><strong>Teleoperation rigs</strong> (humans drive arms in VR or with exoskeleton gloves).</p></li><li><p><strong>In-home pilots</strong> (privacy-aware capture of household tasks).</p></li><li><p><strong>Synthetic/augmentation</strong> (camera angles, lighting, distractors).</p></li></ul></li><li><p><strong>What &#8220;good&#8221; data means:</strong></p><ul><li><p><strong>Coverage:</strong> many layouts and brands (Bosch, LG, Samsung dishwashers, different detergent bottles).</p></li><li><p><strong>Tactile variety:</strong> slick plates vs. textured mugs; soft clothing vs. heavy towels.</p></li><li><p><strong>Long sequences:</strong> so the robot learns <em>persistence</em> (open, place, close, press).</p></li></ul></li><li><p><strong>Why it compounds:</strong></p><ul><li><p>More homes &#8594; fewer surprises &#8594; fewer failure modes &#8594; <strong>faster</strong> and <strong>cheaper</strong> chores.</p></li></ul></li></ul><div><hr></div><h2>Speed vs. safety: how this actually improves</h2><p>Today&#8217;s careful pace is normal. Speed comes from four levers:</p><ol><li><p><strong>Confidence</strong>: as the brain sees more homes, it hesitates less.</p></li><li><p><strong>Tactile + vision fusion</strong>: hands that <em>feel</em> reduce re-tries.</p></li><li><p><strong>Trajectory quality</strong>: better planners make fewer stop-start micro-moves.</p></li><li><p><strong>Hardware headroom</strong>: most arms can move far faster than demos show; software uncaps <strong>gradually</strong>, gated by safety.</p></li></ol><p>Expect <strong>step-wise unlocks</strong> (e.g., a &#8220;kitchen beta&#8221; that&#8217;s deliberately slower around glass, faster around laundry).</p><div><hr></div><h2>Where Figure 03 could beat - or lag - rivals</h2><p><strong>Versus Tesla Optimus</strong></p><ul><li><p><strong>Tesla&#8217;s edge:</strong> enormous <strong>data operations</strong>, in-house silicon, and a factory playground (repeatable tasks).</p></li><li><p><strong>Figure&#8217;s angle:</strong> prioritize <strong>household generalization</strong> and <strong>imitation UX</strong> (teach by doing) earlier.</p></li><li><p><strong>Watch:</strong> who shows <strong>transfer</strong> first - &#8220;new house, new dishwasher, still works&#8221; at acceptable speed.</p></li></ul><p><strong>Versus Boston Dynamics (Atlas)</strong></p><ul><li><p><strong>BD&#8217;s edge:</strong> <strong>athletic mobility</strong> and world-class hardware/control.</p></li><li><p><strong>Figure&#8217;s angle:</strong> <strong>chore competence</strong> over parkour; <strong>hands &amp; dishwashers</strong> over jumps and vaults.</p></li><li><p><strong>Watch:</strong> hand dexterity and <strong>tactile</strong> progress (opening detergent caps is harder than it looks).</p></li></ul><div><hr></div><h2>What needs to be true for product-market fit (PMF) at home</h2><ul><li><p><strong>Reliability:</strong> &#8805;95% success on common chores; low <strong>oops</strong> rate (breakage, spills).</p></li><li><p><strong>Safety:</strong> predictable force limits and <strong>fast stop</strong>; child/pet awareness.</p></li><li><p><strong>Speed:</strong> near human for routine steps; slower for delicate ones is okay.</p></li><li><p><strong>Economics:</strong> cost of the robot + service <strong>beats hiring</strong> for the target buyer (or delivers unique value like independence for seniors).</p></li><li><p><strong>Delight:</strong> setup feels like an <strong>appliance</strong>, not a research project; teaching a new routine is <strong>minutes</strong>, not hours.</p></li></ul><div><hr></div><h2>Skill store: how features might ship</h2><p>Think in <strong>skills</strong> (each taught by demos, upgraded by data):</p><ul><li><p><strong>Kitchen set:</strong> rinse &amp; rack, cutlery sort, wipe, trash, recycling.</p></li><li><p><strong>Laundry set:</strong> load, detergent, run cycle, transfer, fold basics.</p></li><li><p><strong>Tidying set:</strong> find toys, sort to bins, stack books.</p></li><li><p><strong>Errands set (later):</strong> bring water/snacks, door answer, basic fetch.</p></li></ul><p>Each skill advertises: <strong>success rate</strong>, <strong>typical time</strong>, <strong>safe zones</strong>, and <strong>known caveats</strong> (e.g., &#8220;works with front-loaders only&#8221;).</p><div><hr></div><h2>A non-engineer&#8217;s guide to evaluating the next demo</h2><p>When the next Figure/Optimus video drops, score it on these <strong>five</strong>:</p><ol><li><p><strong>Generalization:</strong> new rooms or props vs. the same studio.</p></li><li><p><strong>Contact quality:</strong> smooth grasps, no plate clatter, firm but gentle pushes.</p></li><li><p><strong>Recovery:</strong> visible retries when things go off-plan&#8212;did it fix itself?</p></li><li><p><strong>Pacing:</strong> fewer micro-pauses; more continuous motion.</p></li><li><p><strong>End-to-end task close:</strong> did it actually finish (door closed, cycle started)?</p></li></ol><div><hr></div><h2>Risks &amp; open questions</h2><ul><li><p><strong>Data privacy:</strong> in-home data is precious - <strong>opt-in</strong>, on-device processing, and clear deletion are table stakes.</p></li><li><p><strong>Edge-case explosion:</strong> homes are unpredictable; the <strong>long tail</strong> is the hardest part.</p></li><li><p><strong>Maintenance:</strong> hands/joints wear; who services and how often?</p></li><li><p><strong>False sense of competence:</strong> polished cuts can mask supervision - ask for <strong>continuous runs</strong> and <strong>third-party evals</strong>.</p></li></ul><div><hr></div><h2>Why this still feels like a &#8220;ChatGPT moment&#8221;</h2><p>ChatGPT wasn&#8217;t perfect; it <strong>proved possibility</strong>. Figure 03 does the same for <strong>action</strong>: it reframes robots as <strong>things you teach</strong>, not <strong>things you program</strong>. If that unlocks even a narrow set of chores first - and those skills improve with data - we won&#8217;t just get a cool demo. We&#8217;ll get the first real <strong>consumer robot category</strong> since the vacuum.</p><div><hr></div><h2>What I&#8217;m watching next (3 signals)</h2><ol><li><p><strong>Benchmarks that matter:</strong> standardized <strong>household task suites</strong> (multiple homes, brands, lighting) with third-party scoring.</p></li><li><p><strong>Brain disclosures:</strong> even short notes on <strong>policy type</strong> (diffusion vs transformer), <strong>tactile use</strong>, and <strong>memory</strong> show technical maturity.</p></li><li><p><strong>Speed unlocks with safety:</strong> transparent reports on <strong>incidents per hour</strong>, plate break rate, and how software limits relax over time.</p></li></ol><div><hr></div><h2>Closing thought</h2><p>We don&#8217;t need perfect; we need <strong>proven possible</strong>. Figure 03 shows a credible path: <strong>teach by doing</strong>. If the company can turn that into a repeatable <strong>data engine</strong> and a safety-first <strong>skill store</strong>, it won&#8217;t just move robots out of factories - it will move them into the rhythm of everyday life.</p><div><hr></div><h3>Appendix</h3><ul><li><p><strong>Imitation learning:</strong> robots learn by watching human demos; you <strong>show</strong>, they <strong>copy</strong>.</p></li><li><p><strong>Policy:</strong> the learned &#8220;brain&#8221; that turns camera/touch into arm/hand motion.</p></li><li><p><strong>Generalization:</strong> works in <strong>new, unseen homes</strong> without reprogramming.</p></li><li><p><strong>Tactile sensing:</strong> &#8220;robot fingertips&#8221; that feel slip/pressure.</p></li><li><p><strong>Recovery behavior:</strong> how the robot fixes mistakes mid-task.</p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://clairechoi616.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Claire&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Strategy Decoded: #1 OpenAI DevDay: From “App” to “AI OS”]]></title><description><![CDATA[OpenAI DevDay: From &#8220;App&#8221; to &#8220;AI OS&#8221;]]></description><link>https://clairechoi616.substack.com/p/decoding-strategy-1-openai-devday</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/decoding-strategy-1-openai-devday</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Wed, 08 Oct 2025 15:26:20 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!eLWM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>OpenAI DevDay: From &#8220;App&#8221; to &#8220;AI OS&#8221;</h2><p><strong>TL;DR</strong>: <a href="http://OpenAI DevDay 2025  OpenAI https://openai.com &#8250; devday">OpenAI&#8217;s DevDay</a> pushed three big ideas:</p><ol><li><p><strong>ChatGPT as an operating system</strong>: apps run <em>inside</em> the chat, invoked by intent, not icons.</p></li><li><p><strong>Agent creation for everyone</strong>: a no-code &#8220;Agent Kit&#8221; that assembles useful agents like Lego.</p></li><li><p><strong>Media as programmable</strong>: Sora 2 turns sketches and phone clips into cinematic assets; voice becomes the default interface.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eLWM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eLWM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg 424w, https://substackcdn.com/image/fetch/$s_!eLWM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg 848w, https://substackcdn.com/image/fetch/$s_!eLWM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!eLWM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!eLWM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg" width="666" height="500" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:500,&quot;width&quot;:666,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;20251006_100248&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="20251006_100248" title="20251006_100248" srcset="https://substackcdn.com/image/fetch/$s_!eLWM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg 424w, https://substackcdn.com/image/fetch/$s_!eLWM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg 848w, https://substackcdn.com/image/fetch/$s_!eLWM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!eLWM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F94f0ed78-cbf8-4467-8366-63a6dea5f804_666x500.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: OpenAI Community</figcaption></figure></div><h3>What they showed </h3><ul><li><p><strong>Apps inside ChatGPT (Apps SDK).</strong><br>Demos stitched Coursera, Canva, and Zillow directly into the chat. You don&#8217;t &#8220;open&#8221; apps; you describe what you want. ChatGPT routes your intent to the right app, pins a video, edits a design, or shows housing inventory - all without leaving the chat window. It&#8217;s not <em>just</em> integration; it&#8217;s co-working: the model and the app share context in the same space.</p></li><li><p><strong>Build agents in minutes (Agent Kit).</strong><br>On stage, a developer assembled an agent for the DevDay website with drag-and-drop blocks: a classifier, session-data connector, some conditional logic, guardrails, and UI widgets&#8212;no code. The agent went live on the site and answered questions (&#8220;Which session for learning agent building?&#8221; &#8594; &#8220;11:15 a.m.&#8221;).</p></li><li><p><strong>New model + new modalities.</strong><br>OpenAI announced <strong>GPT-5 Pro</strong> (positioned as its most capable model yet), <strong>Real-Time Mini</strong> (the voice model at ~70% cheaper while keeping quality close), and the <strong>Sora 2 API</strong> for stateful, directed video. Sora 2 can expand a phone clip into a cinematic wide shot, or turn designer sketches into 3D product mockups (a demo built with Mattel).</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!huwF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!huwF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp 424w, https://substackcdn.com/image/fetch/$s_!huwF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp 848w, https://substackcdn.com/image/fetch/$s_!huwF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp 1272w, https://substackcdn.com/image/fetch/$s_!huwF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!huwF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Interface view of a customer service automation flow in a visual builder tool. The canvas shows connected nodes labeled Start, Jailbreak guardrail, Classification agent, If/else, Return agent, Retention agent, Information agent, Hallucination guardrail, and End. A sidebar on the left lists available node types such as Agent, Note, File search, Guardrails, MCP, and User approval. Top controls include options for Evaluate, Code, Preview, and Publish.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Interface view of a customer service automation flow in a visual builder tool. The canvas shows connected nodes labeled Start, Jailbreak guardrail, Classification agent, If/else, Return agent, Retention agent, Information agent, Hallucination guardrail, and End. A sidebar on the left lists available node types such as Agent, Note, File search, Guardrails, MCP, and User approval. Top controls include options for Evaluate, Code, Preview, and Publish." title="Interface view of a customer service automation flow in a visual builder tool. The canvas shows connected nodes labeled Start, Jailbreak guardrail, Classification agent, If/else, Return agent, Retention agent, Information agent, Hallucination guardrail, and End. A sidebar on the left lists available node types such as Agent, Note, File search, Guardrails, MCP, and User approval. Top controls include options for Evaluate, Code, Preview, and Publish." srcset="https://substackcdn.com/image/fetch/$s_!huwF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp 424w, https://substackcdn.com/image/fetch/$s_!huwF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp 848w, https://substackcdn.com/image/fetch/$s_!huwF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp 1272w, https://substackcdn.com/image/fetch/$s_!huwF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25eede49-705a-405c-ae47-7bba85e28357_3840x2160.webp 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: OpenAI, Agent Kit</figcaption></figure></div><h3>Strategy: Why this matters</h3><ul><li><p><strong>From icons to intents.</strong><br>Smartphones trained us to tap icons. ChatGPT is training us to declare intentions (&#8220;teach me ML,&#8221; &#8220;make a poster,&#8221; &#8220;show Pittsburgh listings&#8221;). When the <em>intent</em> is the starting point, the operating system is the conversation - and <strong>distribution shifts to who owns that conversational surface</strong>.</p></li><li><p><strong>Lock-in 2.0 (the &#8220;AI OS&#8221; moat).</strong><br>If users stay in one conversational surface for learning, designing, shopping, and booking, that surface becomes the new home screen. The more third-party apps wire in, the stronger the gravity and the harder it is to leave. This echoes Apple&#8217;s ecosystem lock-in, but at the <strong>workflow</strong> level instead of the device level.</p></li><li><p><strong>Agents become the new &#8220;apps.&#8221;</strong><br>The Agent Kit reframes &#8220;an app&#8221; as a directed workflow (classify &#8594; fetch &#8594; constrain &#8594; render) composed by anyone. Expect an explosion of micro-agents pinned to websites, docs, and support portals. The battle will move from <strong>who can code</strong> to <strong>who can design guardrailed workflows</strong> that are trustworthy and fast.</p></li><li><p><strong>Voice as the default interface.</strong><br>With Real-Time Mini, OpenAI is pushing cost down on live conversational models. When voice becomes the cheapest frictionless path, usage frequency climbs and multi-turn context deepens - another flywheel for platform lock-in.</p></li><li><p><strong>Media becomes programmable.</strong><br>Sora 2 treats video like a stateful object you can direct. That compresses time-to-asset from weeks to minutes and blurs the line between concept, storyboard, and render. The winners will be those who <strong>chain</strong> Sora-like tools into product pipelines (marketing, industrial design, training content).</p></li></ul><h3>What wasn&#8217;t said (and still matters)</h3><ul><li><p><strong>The device question.</strong><br>Jony Ive and Sam Altman discussed their AI device - no reveals. Hints: many concept variants, human-centered philosophy (&#8220;joy,&#8221; &#8220;connection&#8221;), speculation of no screen, context-aware hardware. Target: <strong>late 2026</strong>. Translation: they&#8217;re optimizing for <strong>how it feels to live with</strong>, not specs. If the AI OS lives in the conversation, the hardware&#8217;s job is ambient capture, presence, and gentle delivery - not app grids.</p></li></ul><h3>What to watch next</h3><ol><li><p><strong>Developer economics</strong>: How will Apps SDK and agents monetize - usage rev-share, marketplace listings, or enterprise bundles?</p></li><li><p><strong>Trust &amp; safety by design</strong>: Guardrails were in the demo. Expect compliance, redaction, and auditability to become standard &#8220;features&#8221; of agent building.</p></li><li><p><strong>Workflow incumbents</strong>: Design suites, CRMs, and vertical SaaS will choose between deep embedding vs. building their own agentic surfaces.</p></li><li><p><strong>Default distribution</strong>: Whoever becomes the &#8220;first conversational stop&#8221; for common tasks (learn, plan, buy, fix) will accumulate the most valuable context graph.</p></li></ol><h3>Practical playbook for Founder/PMs</h3><ul><li><p><strong>Ship one high-value agent, not ten features.</strong> Pick a painful workflow and compose an agent with clear guardrails and measurable success criteria (time saved, error rate).</p></li><li><p><strong>Exploit context hand-off.</strong> Use Apps SDK-style integrations to accept state from ChatGPT (user profile, current intent) and return structured results ready for the next step.</p></li><li><p><strong>Design for voice first.</strong> Make sure your agent&#8217;s prompts and outputs degrade gracefully from voice &#8596; text &#8596; UI, without losing state.</p></li></ul><h3>A last word on &#8220;AI building AI&#8221;</h3><p>Altman framed this as moving from a lone developer writing code to <strong>AI teams</strong> collaborating. The shift is not just faster coding; it&#8217;s <em>who orchestrates workflows</em>. If the OS is a conversation, the new &#8220;programming language&#8221; is intent, constraints, and policy - assembled by anyone who can think in systems.</p>]]></content:encoded></item><item><title><![CDATA[Product Decoded: #1 CUDA ]]></title><description><![CDATA[How NVIDIA turned GPUs into a product system and where they are headed for]]></description><link>https://clairechoi616.substack.com/p/product-decoded-1-cuda</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/product-decoded-1-cuda</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Sat, 04 Oct 2025 19:08:58 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!sJeh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h2>TL;DR </h2><ul><li><p><strong>CUDA</strong> is the &#8220;operating manual&#8221; that lets a GPU act like a <strong>huge team of helpers</strong> doing tiny tasks at the same time. Most AI software quietly depends on it.</p></li><li><p><strong>What&#8217;s new:</strong> a <strong>tile-based way of writing code</strong> (think &#8220;work on this small chunk, repeat&#8221;), a cleaner <strong>one-toolkit</strong> story for servers and devices, better <strong>math libraries</strong>, and easier <strong>Python</strong>.</p></li><li><p><strong>New chips (Blackwell) + tiny numbers (FP4/NVFP4)</strong> = more work per dollar and per watt for AI inference&#8212;if your model operators are supported.</p></li><li><p><strong>Why NVIDIA still wins:</strong> not just fast chips, but <strong>mature libraries, tools, examples, and a giant talent base</strong> that have compounding effects.</p></li><li><p><strong>What could change:</strong> rising <strong>portability</strong> efforts (UXL/oneAPI), <strong>AMD ROCm</strong>, and code-conversion tools are improving. Track them, but switch only when <strong>your</strong> economics say so.</p></li></ul><div><hr></div><h2>Tiny glossary before diving in (10 seconds each)</h2><ul><li><p><strong>CUDA</strong>: The software layer that tells thousands of GPU helpers what to do.</p></li><li><p><strong>Tile-based model</strong>: Work on a small chunk &#8594; repeat everywhere.</p></li><li><p><strong>Tensor Core</strong>: A special math unit for AI&#8217;s matrix math.</p></li><li><p><strong>FP4 / NVFP4</strong>: Extra-small numbers that let more data flow through without wrecking accuracy.</p></li><li><p><strong>Operator coverage</strong>: Whether your model&#8217;s steps use the fast libraries or slow fallbacks.</p></li><li><p><strong>Nsight</strong>: NVIDIA&#8217;s performance magnifying glass.</p></li></ul><div><hr></div><h2>A kitchen, not a single chef</h2><p>Imagine running a restaurant. A <strong>CPU</strong> is one brilliant chef cooking dishes one by one. A <strong>GPU</strong> is a kitchen <strong>with thousands of line cooks</strong> chopping, stirring, and plating <strong>in parallel</strong>.<br><strong>CUDA</strong> is the binder of recipes and checklists that tells all those line cooks <strong>exactly</strong> what to do, in what order, and how to share the tools and pans without colliding.</p><p>This is why GPUs crush tasks like image recognition or LLMs: lots of <strong>similar</strong> steps repeated over and over is where a big team shines.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://clairechoi616.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Claire&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sJeh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sJeh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!sJeh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!sJeh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!sJeh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sJeh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2333513,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/175287820?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sJeh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!sJeh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!sJeh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!sJeh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0981e5fb-f903-41c6-b1da-7e3fd9dbf338_1536x1024.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><div><hr></div><h2>What actually changed in Aug 2025 (and why you should care)</h2><p>Think of three upgrades:</p><h3>1) &#8220;Tiles&#8221; &#8594; a friendlier way to program the kitchen</h3><p>Old CUDA asked you to micromanage workers and storage (who stirs when, which shelf holds onions). The <strong>tile-based model</strong> says:</p><blockquote><p>&#8220;Work on <strong>this small square of the job</strong>, then repeat everywhere.&#8221;<br>CUDA&#8217;s compiler now handles much of the fussy scheduling. </p><p><strong>Result:</strong> faster developer cycles, fewer foot-guns, and performance that&#8217;s closer to what the hardware can really do.</p></blockquote><h3>2) One toolkit across servers and devices</h3><p>Whether you deploy on big servers or on smaller Arm-based devices (Jetson/edge boxes), it now <strong>feels like the same toolkit</strong>. That reduces &#8220;it worked on the DGX but broke on the robot&#8221; moments.</p><h3>3) Python is no longer second-class</h3><p>You can stay in Python for most of your work (via <strong>CUDA Python</strong> and <strong>NV Math for Python</strong>) and only drop into low-level code for the truly hot parts. </p><p><strong>Translation:</strong> quicker prototypes, less glue code, happier teams.</p><div><hr></div><h2>Why the newest chips matter </h2><p>New NVIDIA chips (the <strong>Blackwell</strong> family) unlock two practical things for AI inference:</p><ol><li><p><strong>Smaller numbers that still work (FP4 / NVFP4).</strong><br>Think of this as <strong>zipping</strong> your data so it takes less space and moves faster. With the right calibration, models keep accuracy while your <strong>throughput jumps</strong>.</p></li><li><p><strong>Better &#8220;kitchen logistics.&#8221;</strong><br>Blackwell adds tricks that keep the &#8220;line cooks&#8221; busy without running back and forth to the pantry:</p></li></ol><ul><li><p><strong>Two teams can work as one</strong> on big batches (higher utilization).</p></li><li><p>There&#8217;s a small <strong>private shelf</strong> next to the stove for ingredients used every few seconds (less back-and-forth to the main storage).</p></li><li><p><strong>Wider scoops</strong> move more data in one go (fewer trips).</p></li></ul><p><strong>Net effect:</strong> For the same quality target, you often need <strong>fewer GPUs</strong> or can serve <strong>more requests per second</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UlWm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UlWm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg 424w, https://substackcdn.com/image/fetch/$s_!UlWm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg 848w, https://substackcdn.com/image/fetch/$s_!UlWm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!UlWm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UlWm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;NVIDIA Blackwell: Born for Extreme-Scale AI Inference | NVIDIA Blog&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="NVIDIA Blackwell: Born for Extreme-Scale AI Inference | NVIDIA Blog" title="NVIDIA Blackwell: Born for Extreme-Scale AI Inference | NVIDIA Blog" srcset="https://substackcdn.com/image/fetch/$s_!UlWm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg 424w, https://substackcdn.com/image/fetch/$s_!UlWm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg 848w, https://substackcdn.com/image/fetch/$s_!UlWm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!UlWm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7d307f1-fedf-446d-ae90-6709a73a595c_2048x1152.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: NVIDIA</figcaption></figure></div><div><hr></div><h2>How teams actually build on CUDA (a 4-step loop)</h2><p>Whether someone&#8217;s doing vision, LLMs, or simulations, this is how it generally works:</p><ol><li><p><strong>Start with the libraries</strong> (fastest wins).</p><ul><li><p>Math: <strong>cuBLAS/cuSOLVER/cuTENSOR/cuFFT</strong></p></li><li><p>Deep learning: <strong>cuDNN</strong> (training), <strong>TensorRT</strong> (fast inference)</p></li><li><p>Python access: <strong>NV Math for Python</strong><br>These are heavily optimized and usually beat custom code.</p></li></ul></li><li><p><strong>Profile before you tinker.</strong><br>Use <strong>Nsight</strong> tools to see what&#8217;s slow: is it waiting on memory, not using the GPU fully, or tripping on data layout?</p></li><li><p><strong>Turn the big knobs first.</strong></p><ul><li><p><strong>Precision</strong>: FP16 &#8594; FP8 &#8594; <strong>FP4</strong> where supported.</p></li><li><p><strong>Operator coverage</strong>: make sure your model ops map to fast, fused kernels (green paths) rather than slow fallbacks (gray paths).</p></li><li><p><strong>Data layout &amp; fusion</strong>: keep data in the right shape and <strong>combine steps</strong> so you touch slow memory less.</p></li></ul></li><li><p><strong>Write custom kernels last.</strong><br>If a crucial operation still isn&#8217;t covered or fast enough, <strong>start with the tile approach</strong>. Only drop to the old low-level style when you must.</p></li></ol><div><hr></div><h2>The few knobs that move real money</h2><ul><li><p><strong>Precision</strong>: Smaller formats (down to FP4) = <strong>more work per dollar</strong> - as long as accuracy stays acceptable.</p></li><li><p><strong>Operator coverage</strong>: If TensorRT/cuDNN fully support your model, you fly. If they don&#8217;t, you pay.</p></li><li><p><strong>Memory locality</strong>: Keep &#8220;hot&#8221; data close; batch it; move it in <strong>wider chunks</strong>.</p></li><li><p><strong>Concurrency</strong>: Overlap data transfers with compute (like prepping the next plate while the sauce reduces).</p></li><li><p><strong>Fusion</strong>: Do multiple small steps in one pass to avoid extra memory trips.</p></li></ul><div><hr></div><h2>Where NVIDIA&#8217;s moat really is (beyond fast chips)</h2><p>Picture a four-layer stack:</p><ol><li><p><strong>Hardware cadence</strong> (new chips every cycle)</p></li><li><p><strong>Libraries</strong> (math + AI) that hide complexity</p></li><li><p><strong>Tools</strong> (profilers, debuggers) that speed iteration</p></li><li><p><strong>Ecosystem gravity</strong>: tons of examples, courses, and engineers who already know CUDA</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Kbzd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Kbzd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!Kbzd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!Kbzd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!Kbzd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Kbzd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1628070,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/175287820?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Kbzd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!Kbzd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!Kbzd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!Kbzd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F85537ce4-fcdc-45c7-b494-6887c91ca520_1024x1024.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Each layer reinforces the others. That&#8217;s why it&#8217;s hard for alternatives to catch up: they have to match <strong>all four</strong> at once.</p><div><hr></div><h2>What could chip away at the moat</h2><ul><li><p><strong>Portability efforts</strong> (like UXL/oneAPI) want one common way to program <strong>any</strong> accelerator.</p></li><li><p><strong>AMD&#8217;s ROCm</strong> keeps improving, especially for mainstream AI workloads.</p></li><li><p><strong>Code-conversion tools</strong> can move a lot of CUDA code automatically, though the <strong>last mile still needs hand work</strong>.</p></li></ul><p><strong>Practical advice:</strong> Don&#8217;t &#8220;bet the company&#8221; on a switch. <strong>Pilot</strong> one service on an alternative stack, gather real <strong>cost-at-SLO</strong> numbers, and expand only if the numbers hold.</p><div><hr></div><h2>What to watch next (if you only track three things)</h2><ol><li><p><strong>How quickly FP4 support</strong> spreads across the libraries you use (TensorRT, cuDNN).</p></li><li><p><strong>Real gains from tiles</strong> in everyday models (fewer code hacks, same or better performance).</p></li><li><p><strong>Credible parity</strong> on a non-CUDA stack for your workload - at equal SLOs and total cost.</p></li></ol><div><hr></div><h2>Closing thought</h2><p>CUDA isn&#8217;t one trick; it&#8217;s a <strong>product system</strong> where <strong>friendlier programming (tiles), smarter chips (Blackwell), tiny numbers (FP4), and deep libraries</strong> all stack up. If you&#8217;re building today, start with the libraries, measure with Nsight, and pull the big knobs - <strong>precision, coverage, locality, fusion</strong> - before touching custom kernels. Hedge with a portability pilot if it helps you sleep at night, but make every platform decision answer the only question that matters: <strong>does this lower my cost to hit the same SLOs?</strong></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://clairechoi616.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Claire&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Deep-tech Decoded: #2 Metamaterial]]></title><description><![CDATA[Investigating behind the hype of the ingredient to invisible cloaks.]]></description><link>https://clairechoi616.substack.com/p/deep-tech-decoded-2-metamaterial</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/deep-tech-decoded-2-metamaterial</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Tue, 30 Sep 2025 14:02:53 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!vWwI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vWwI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vWwI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vWwI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vWwI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vWwI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vWwI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg" width="600" height="356" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:356,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;(&#49324;&#51652;=&#54532;&#47536;&#49828;&#53556; &#45824;&#54617;&#44284; &#50892;&#49905;&#53556; &#45824;&#54617;&#51032; &#50672;&#44396;&#50896;&#46308;&#51060; &#44405;&#51008; &#49548;&#44552; &#50508;&#44081;&#51060; &#53356;&#44592;&#51032; &#52488;&#49548;&#54805; &#52852;&#47700;&#46972;&#47484; &#44060;&#48156;&#54664;&#45796;. &#51060; &#49884;&#49828;&#53596;&#51008; 160&#47564; &#44060;&#51032; &#50896;&#53685;&#54805; &#44592;&#46181;&#51060; &#48149;&#54784; &#51080;&#44256; &#52980;&#54504;&#53552; &#52841;&#52376;&#47100; &#49373;&#49328;&#46112; &#49688; &#51080;&#45716; &#47700;&#53440;&#54364;&#47732;&#51060;&#46972;&#45716; &#44592;&#49696;&#51012; &#49324;&#50857;&#54664;&#45796;. Princeton University, Sharlach)&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="(&#49324;&#51652;=&#54532;&#47536;&#49828;&#53556; &#45824;&#54617;&#44284; &#50892;&#49905;&#53556; &#45824;&#54617;&#51032; &#50672;&#44396;&#50896;&#46308;&#51060; &#44405;&#51008; &#49548;&#44552; &#50508;&#44081;&#51060; &#53356;&#44592;&#51032; &#52488;&#49548;&#54805; &#52852;&#47700;&#46972;&#47484; &#44060;&#48156;&#54664;&#45796;. &#51060; &#49884;&#49828;&#53596;&#51008; 160&#47564; &#44060;&#51032; &#50896;&#53685;&#54805; &#44592;&#46181;&#51060; &#48149;&#54784; &#51080;&#44256; &#52980;&#54504;&#53552; &#52841;&#52376;&#47100; &#49373;&#49328;&#46112; &#49688; &#51080;&#45716; &#47700;&#53440;&#54364;&#47732;&#51060;&#46972;&#45716; &#44592;&#49696;&#51012; &#49324;&#50857;&#54664;&#45796;. Princeton University, Sharlach)" title="(&#49324;&#51652;=&#54532;&#47536;&#49828;&#53556; &#45824;&#54617;&#44284; &#50892;&#49905;&#53556; &#45824;&#54617;&#51032; &#50672;&#44396;&#50896;&#46308;&#51060; &#44405;&#51008; &#49548;&#44552; &#50508;&#44081;&#51060; &#53356;&#44592;&#51032; &#52488;&#49548;&#54805; &#52852;&#47700;&#46972;&#47484; &#44060;&#48156;&#54664;&#45796;. &#51060; &#49884;&#49828;&#53596;&#51008; 160&#47564; &#44060;&#51032; &#50896;&#53685;&#54805; &#44592;&#46181;&#51060; &#48149;&#54784; &#51080;&#44256; &#52980;&#54504;&#53552; &#52841;&#52376;&#47100; &#49373;&#49328;&#46112; &#49688; &#51080;&#45716; &#47700;&#53440;&#54364;&#47732;&#51060;&#46972;&#45716; &#44592;&#49696;&#51012; &#49324;&#50857;&#54664;&#45796;. Princeton University, Sharlach)" srcset="https://substackcdn.com/image/fetch/$s_!vWwI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vWwI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vWwI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vWwI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a65b3a-064a-4f24-91ae-3727d275d31c_600x356.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Question: Metamaterials</strong> have lived through two hype cycles already - invisibility cloaks in the 2000s, 5G antennas in the 2010s. Is the 2020s their third chance at bat - <strong>and this time, can they finally make it out of the lab and onto the balance sheet? </strong></p><div><hr></div><h1>Decode the Basics</h1><ol><li><p><strong>What exactly is a metamaterial, and how is it different from normal materials?</strong></p><p>Most materials behave the way they do because of chemistry - the atoms and molecules they&#8217;re made from. A metamaterial is different: its properties come from geometry, not chemistry. </p><p>Imagine a chessboard patterned into a material at a scale smaller than the wavelength of light or radio waves. Those repeating structures act like &#8220;knobs&#8221; that change how waves pass through. A normal glass slab might just let light through. A metamaterial slab with engineered patterns can bend, focus, or twist that same light in ways nature never allowed.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JIh4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JIh4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!JIh4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!JIh4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!JIh4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JIh4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1518522,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/174806178?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JIh4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!JIh4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!JIh4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!JIh4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f763a01-7fae-43ba-a43c-7269fc322014_1024x1024.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Illustration of what I explained. Generated by Gemini.</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CujV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CujV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png 424w, https://substackcdn.com/image/fetch/$s_!CujV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png 848w, https://substackcdn.com/image/fetch/$s_!CujV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png 1272w, https://substackcdn.com/image/fetch/$s_!CujV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CujV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png" width="1008" height="568" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:568,&quot;width&quot;:1008,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:807256,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/174806178?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CujV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png 424w, https://substackcdn.com/image/fetch/$s_!CujV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png 848w, https://substackcdn.com/image/fetch/$s_!CujV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png 1272w, https://substackcdn.com/image/fetch/$s_!CujV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65b37921-e6a7-4846-a248-a793c72ff7f5_1008x568.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Examples of the nano patterns on a Metamaterial (in this case, Metalens.) Source: POSTECH</figcaption></figure></div><p></p></li><li><p><strong>How do metamaterials actually &#8220;bend&#8221; light, sound, or radio waves?</strong><br>The pop-sci gateway drug was <strong>negative-index refraction</strong> - the idea (Veselago, 1968) that if you engineer both permittivity and permeability to be negative, light can bend the &#8220;wrong&#8221; way. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Yy6Y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Yy6Y!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Yy6Y!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Yy6Y!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Yy6Y!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Yy6Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg" width="985" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:985,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:108276,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/174806178?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F032ed901-57aa-45bc-97c9-c60e429b33bd_1086x1000.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Yy6Y!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Yy6Y!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Yy6Y!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Yy6Y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8877f72c-91db-4ded-8825-41ac44df2375_985x1086.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Illustration of what &#8220;Negative index Refraction&#8221; is</figcaption></figure></div><p></p><p>And how does that type of unreal refraction happen in Metamaterial? Think of a wavefront - a line of marching soldiers all stepping in sync. Now put a fence in front of them where each gate slows the soldiers down by a slightly different amount. When they regroup on the other side, the line is tilted or curved. That&#8217;s how metamaterials work: each tiny patterned cell adds a controlled <strong>phase delay</strong>. By carefully varying those delays across a surface, engineers can <strong>steer beams, focus waves, or cancel reflections</strong>. It&#8217;s the physics of delays, not bulk shape, that reshapes the wave.</p></li><li><p><strong>What are the canonical demos out there: cool science vs. practical products?</strong><br>If you&#8217;ve seen the headlines about &#8220;invisibility cloaks&#8221; or &#8220;perfect lenses,&#8221; you&#8217;ve seen the hype side of metamaterials. In 2006, a Duke team demoed a rudimentary &#8220;<strong>invisibility cloak</strong>&#8221; in 2006. Below is a more recent version of it.</p></li></ol><div id="youtube2-pZMyWEWHCTM" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;pZMyWEWHCTM&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/pZMyWEWHCTM?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>It worked by warping light around an object rather than reflecting it back - but only in narrow bands, with loss and noticeable distortion, so it never became a product. Still, those cloaks (and &#8220;perfect lenses&#8221;) became cultural icons that proved metamaterials can do things nature doesn&#8217;t. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://clairechoi616.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Claire&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Frsn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Frsn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Frsn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Frsn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Frsn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Frsn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg" width="951" height="999" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:999,&quot;width&quot;:951,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:112109,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/174806178?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa74633ef-a17b-4a31-9f43-78bd16916634_1001x1000.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Frsn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Frsn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Frsn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Frsn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b6a061c-f449-47e7-8331-3576cba8af80_951x999.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Illustration of how Metamaterial can work as an invisible cloak</figcaption></figure></div><p></p><p></p><p>The practical pivot was to <strong>metasurfaces</strong>: ultra-thin, patterned sheets that steer, focus, or filter waves with semiconductor-style manufacturing. These can replace or shrink bulky lenses and antennas and are already on commercialization paths - unlike the headline cloaks.</p><div><hr></div><h1>Where It&#8217;s Heading</h1><ol start="4"><li><p><strong>What changed in the last 5&#8211;10 years to make metamaterials more viable?</strong><br>For years, making metamaterials was like doing art with an electron beam: slow, tiny, and expensive. That changed thanks to three big shifts:</p></li></ol><ul><li><p><strong>Deep-UV lithography (DUV)</strong>: the same technology used in chip fabs now prints metasurface patterns across 8&#8211;12 inch wafers. Companies like STMicroelectronics are already running meta-optics designs on 300 mm lines - the same fabs that churn out semiconductors.</p></li><li><p><strong>Nanoimprint lithography (NIL)</strong>: think of stamping patterns instead of drawing them. Modern NIL can press nanostructures efficiently, even with roll-to-roll approaches, opening a path to low-cost, mass production of lenses and films.</p></li><li><p><strong>Grayscale lithography (GSL)</strong>: instead of binary on/off patterning, GSL can sculpt smooth height profiles in one shot. That allows broadband, high-efficiency lenses without stacking multiple layers.</p></li></ul><p>Together, these methods are <strong>moving metasurfaces from lab demos to something fabs and foundries can scale</strong>. Add in lower-loss dielectrics and AI-assisted design, and the technology is no longer hand-crafted art.</p><ol start="5"><li><p><strong>What are the leading application frontiers now?</strong><br>Two stand out. </p></li></ol><ul><li><p>First is <strong>RF and antennas</strong>: metamaterials enable flat, electronically steerable arrays for satellites, 5G/6G repeaters, and compact radars. Startups like Pivotal Commware, Echodyne, and Kymeta are already shipping. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!f2t-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!f2t-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png 424w, https://substackcdn.com/image/fetch/$s_!f2t-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png 848w, https://substackcdn.com/image/fetch/$s_!f2t-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png 1272w, https://substackcdn.com/image/fetch/$s_!f2t-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!f2t-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png" width="220" height="261" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:261,&quot;width&quot;:220,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Metamaterial Based Patch Antenna with Broad Bandwidth Designed by COMSOL  Multiphysics&#174; Software&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Metamaterial Based Patch Antenna with Broad Bandwidth Designed by COMSOL  Multiphysics&#174; Software" title="Metamaterial Based Patch Antenna with Broad Bandwidth Designed by COMSOL  Multiphysics&#174; Software" srcset="https://substackcdn.com/image/fetch/$s_!f2t-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png 424w, https://substackcdn.com/image/fetch/$s_!f2t-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png 848w, https://substackcdn.com/image/fetch/$s_!f2t-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png 1272w, https://substackcdn.com/image/fetch/$s_!f2t-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F86b0f7f4-85b8-4d96-9186-ef55abaf10a2_220x261.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Example of a Metamaterial based patch antenna. Source: COSMOL</figcaption></figure></div></li><li><p>Second is <strong>optics</strong>: so-called &#8220;metalenses&#8221; - flat, patterned sheets that can replace multi-element glass - are now licensed into consumer and automotive supply chains through players like Metalenz.</p><p></p></li></ul><ol start="5"><li><p><strong>What concrete inflection points are on the horizon?</strong></p></li></ol><ul><li><p><strong>6G standards</strong> in the late 2020s will require antennas that can steer higher-frequency signals more flexibly.</p></li><li><p><strong>Defense procurement</strong> is looking for compact counter-drone radars and low-SWaP (size, weight, power) sensors, where metamaterials are competitive.</p></li><li><p><strong>Consumer optics</strong> could break out if the first smartphone or AR device adopts meta-optics at volume.</p></li></ul><ol start="7"><li><p><strong>What&#8217;s the realistic timeline for commercial scale?</strong><br>RF metamaterials are already commercial - you can buy a Kymeta satellite terminal or see an Echodyne radar deployed today. The next 1&#8211;3 years are about scaling in defense and telecom. Optics are ramping in the next 3&#8211;5 years as Tier-1 suppliers like ST integrate them into sensing modules for phones, cars, and AR glasses.</p></li></ol><div><hr></div><h1>Zooming in to the not-there-yet product: Meta Optics</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!L2ee!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!L2ee!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png 424w, https://substackcdn.com/image/fetch/$s_!L2ee!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png 848w, https://substackcdn.com/image/fetch/$s_!L2ee!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png 1272w, https://substackcdn.com/image/fetch/$s_!L2ee!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!L2ee!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png" width="1026" height="688" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:688,&quot;width&quot;:1026,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:802540,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/174806178?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!L2ee!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png 424w, https://substackcdn.com/image/fetch/$s_!L2ee!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png 848w, https://substackcdn.com/image/fetch/$s_!L2ee!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png 1272w, https://substackcdn.com/image/fetch/$s_!L2ee!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e925fdb-0695-4b82-8183-3d9b04101b9b_1026x688.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Moxtek</figcaption></figure></div><ol start="8"><li><p><strong>What can a metalens do that glass or plastic can&#8217;t?</strong><br>A traditional lens bends light because of its shape: thicker in the middle, thinner on the edges. A <strong>metalens</strong> is flat, but patterned with nanostructures that do the same bending trick - and more. It can focus, correct aberrations, and even filter polarization in one wafer-thin layer. That means fewer stacked elements, slimmer modules, and potentially cheaper assembly.</p></li><li><p><strong>What are the tradeoffs?</strong></p></li></ol><ul><li><p>Pros: compactness, multifunctionality, semiconductor-style repeatability.</p></li><li><p>Cons: lower efficiency compared to high-end glass, bandwidth limits, and fabrication challenges at visible wavelengths.</p></li></ul><ol start="10"><li><p><strong>What does &#8220;good&#8221; look like in real products?</strong></p></li></ol><ul><li><p>In <strong>imaging sensors</strong>, a metalens needs high resolution (measured by modulation transfer function), low aberration, and thin form factors.</p></li><li><p>In <strong>lidar and 3D sensing</strong>, beam shaping accuracy and efficiency are key.</p></li><li><p>In <strong>phone cameras</strong>, success means shaving off millimeters of thickness without sacrificing low-light performance or adding ghosting.</p></li></ul><p>Metalenz (to elaborate later) and STMicroelectronics are already pushing toward these metrics in smartphone and automotive applications. Major players in optics like Sony or Samsung also have several patents.</p><ol start="11"><li><p><strong>What are the common failure modes, and how do you fix them?</strong></p></li></ol><ul><li><p><strong>Losses</strong>: use better dielectric materials to reduce absorption.</p></li><li><p><strong>Fabrication defects</strong>: fix with tighter process control and wafer metrology.</p></li><li><p><strong>Thermal drift or chromatic issues</strong>: mitigate with hybrid stacks or multi-layer designs.</p></li></ul><div><hr></div><h1>Zooming out again: The market opportunities of metamaterials</h1><ol start="12"><li><p><strong>Who are the anchor buyers?</strong><br>Defense agencies (radars, stealth), telecom and satellite operators (steerable antennas), and consumer electronics makers (smartphones, AR, automotive sensors). Each brings different price tolerance and volume potential.</p></li><li><p><strong>Which incumbents are experimenting, and how does it affect startups?</strong><br>Big defense primes and telco OEMs are running pilots. On the consumer side, STMicroelectronics has licensed meta-optics IP. This is double-edged: incumbents validate the field, but once they master the process, they can pressure startup margins.</p></li><li><p><strong>What near-term signals would show breakout?</strong></p></li></ol><ul><li><p>A smartphone shipping with a meta-lens.</p></li><li><p>6G field trials citing metasurface antennas.</p></li><li><p>Multi-year defense production contracts for compact radars.</p></li><li><p>Satellite terminals with metamaterial beam-steering achieving broad operator adoption.</p></li></ul><ol start="15"><li><p>Who are the startups to keep an eye on in this area ?</p></li></ol><ul><li><p><strong>Metalenz (Boston, 2016)</strong> &#8211; Probably the most visible player in meta-optics, and also the one I often encountered during my prior projects. They&#8217;ve taken Harvard lab tech into the supply chain by licensing flat &#8220;metalenses&#8221; to STMicro, aiming to slim down smartphone 3D sensing modules. Per what I heard from my friends researching in optics, they&#8217;ve been quite actively collaborating with big techs (e.g. Apple, Google) to jointly develop products that would fit into their consumer products (e.g. Smartphone, MX device.) Wonder how they are doing with those projects. Their upside to me: they have a fab-ready IP and an actual path to volume in consumer electronics. But there is still a question mark: can they can hit efficiency and bandwidth targets before handset OEMs lose patience.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bg4g!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bg4g!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png 424w, https://substackcdn.com/image/fetch/$s_!bg4g!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png 848w, https://substackcdn.com/image/fetch/$s_!bg4g!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png 1272w, https://substackcdn.com/image/fetch/$s_!bg4g!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bg4g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png" width="1456" height="714" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:714,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;conventional vs. metalenz illustration&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="conventional vs. metalenz illustration" title="conventional vs. metalenz illustration" srcset="https://substackcdn.com/image/fetch/$s_!bg4g!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png 424w, https://substackcdn.com/image/fetch/$s_!bg4g!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png 848w, https://substackcdn.com/image/fetch/$s_!bg4g!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png 1272w, https://substackcdn.com/image/fetch/$s_!bg4g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F35a9cecd-74a4-408a-b8f1-d55d519ac23f_1536x753.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Metalens based imaging sensor. Source: Metalenz</figcaption></figure></div></li><li><p><strong>Imagia (Silicon Valley, 2022)</strong> &#8211; A seed-stage rival coming at optics from a different angle: building lenses like chips, patterned directly on LEDs or sensors. If it works, AR/VR hardware could suddenly shrink. The bet here is paradigm shift; the risk is classic deep-tech: long road to scale, and Metalenz already has a head start.</p></li><li><p><strong>Pivotal Commware (Kirkland, 2016)</strong> &#8211; A 5G infrastructure startup pushing &#8220;holographic beamforming&#8221; panels that make mmWave signals bend around corners and extend coverage. The win condition: delivering phased-array-like performance at a fraction of the cost and power. The risk: convincing conservative carriers to deploy at scale, in a market crowded with cheaper repeaters and legacy vendors.</p></li><li><p><strong>Echodyne (Kirkland, 2014)</strong> &#8211; Their MESA radars use metamaterials to do electronically scanned arrays without the massive price tag. That makes compact, low-power radars viable for drones, perimeter security, and defense. The upside: they already have TRL-9 deployments with defense and government customers. The risk: radar budgets are cyclical, and AESA incumbents won&#8217;t stand still.</p></li><li><p><strong>Radi-Cool (Boulder, 2016)</strong> &#8211; Less hyped but fascinating: a metamaterial film that cools surfaces under direct sun by radiating heat into space. Think passive air conditioning for roofs and cars. The upside: a massive climate-tech TAM. The risk: durability and adoption in the conservative building materials industry.</p></li><li><p><strong>Multiwave (Geneva, 2015)</strong> &#8211; Applying metamaterials to MRI. Their coils boost signal quality and enable low-field, portable MRI machines for point-of-care diagnostics. The upside: democratizing MRI in places where it&#8217;s nonexistent. The risk: brutal medical device regulatory cycles and competition from incumbent imaging players.</p></li><li><p><strong>PlanOpSim (Belgium, ~2018)</strong> &#8211; Less visible but strategic: they make the design software for metasurfaces, letting engineers plug flat optics into standard CAD workflows. The upside: every meta-optics team needs this bridge to real products. The risk: niche EDA markets scale only if the hardware ecosystem does.</p></li></ul><p></p><div><hr></div><h2>Closing Answer</h2><p><strong>So - is this hype cycle #3, or something real?</strong><br>To me, this isn&#8217;t hype-cycle #3 - it&#8217;s graduation, with guardrails. Metamaterials won&#8217;t replace glass or phased arrays across the board, but they&#8217;ll win, and stick, where thin, passive, fab-friendly parts beat incumbents on size, power, and integration. RF already crossed that line in select radars and satcom; optics follows as 300 mm meta-optics prove yield and efficiency. Watch for three tells in the next 24 months: a tier-1 handset/AR design-win, a multi-year defense production award, and 6G pilots standardizing metasurface options. Hit two of three, and we&#8217;re not in a lucky up-cycle - we&#8217;re watching <strong>niche tech harden into a moat.</strong></p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://clairechoi616.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Claire&#8217;s Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Decoding tech for Non-Engineers]]></title><description><![CDATA[As long as I can remember my childhood, my grandfather&#8217;s lab students used to wander into our family dinners, chopstick debates turning into excited riffs about paradigm-shifting tech - flying cars, man made clouds, and robots that might someday empty the dishwasher.]]></description><link>https://clairechoi616.substack.com/p/decoding-tech-for-non-engineers</link><guid isPermaLink="false">https://clairechoi616.substack.com/p/decoding-tech-for-non-engineers</guid><dc:creator><![CDATA[Deep Tech for the Non Tech]]></dc:creator><pubDate>Sat, 27 Sep 2025 06:09:50 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/fec5e10e-9105-4167-8de5-94861078da23_6141x4096.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>As long as I can remember my childhood, my grandfather&#8217;s lab students used to wander into our family dinners, chopstick debates turning into excited riffs about paradigm-shifting tech - flying cars, man made clouds, and robots that might someday empty the dishwasher. I loved being in the splash zone&#8230; and then - classic - I veered away from science toward the commercial side. I thought I&#8217;d start a business, got confused again, and fell into consulting.</p><p>Luckily consulting turned out to be a quite of a front-row seat. Over four years at McKinsey, I worked across a wide spectrum of technologies - from Asian semiconductors to an ingredient for Harry Potter&#8217;s invisible cloak - that felt one experiment away from science fiction. That kept me enthusiastic: helping technical visionaries turn lab brilliance into products and markets. I&#8217;ve helped CEOs think about partnerships, built go-to-market strategies, and modeled multi-billion-dollar autonomy roadmaps. I sometimes got feedback that I dove <strong>too</strong> deep - cornering friends to dissect a white paper diagram, or getting along a little <em>too</em> well with the engineering team. On nights and weekends, I built scrappy consumer apps - a spam-filter that actually worked and a wellness app that quietly found real users. </p><p>This curiosity and early exposure helped me earn the trust of skeptical engineers and hard-bar product leaders - and it still opens the door to the whiteboard. But I&#8217;ve also felt the pull to go deeper: to understand the bones and flesh of the tech and, more importantly, to nurture a product myself from day one. </p><p><strong>This Substack is how I sharpen the muscles for that in public.</strong> I want to be closer to products that touch the real world - that means explaining complex systems clearly, making trade-offs explicit, and developing a point of view on where value will accrue.I don&#8217;t want to be the PM who only echoes what engineers say. I don&#8217;t want to be the investor who just follows hype. So I&#8217;m learning in public.</p><p><strong>This will be part study journal, part translation service.</strong> I&#8217;ll take one hard concept at a time - from robotic stacks, invisible cloaks to innovative XR devices and services  - and break it down into something we can actually understand. On top of the technical juice, I&#8217;ll add a product lens (user pain, MVP, roadmap, adoption) and a market lens (market structure, moats, where value captures).</p><p><strong>I&#8217;m not pretending to be a deep-tech professor. I&#8217;m writing because I learn best by explaining - and because curiosity is more fun with company.</strong></p><p>If that sounds useful, subscribe and join me. I&#8217;ll ask the confused questions so you don&#8217;t have to - and I&#8217;ll do the work to turn the answers into clear, practical takeaways.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iXo_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iXo_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg 424w, https://substackcdn.com/image/fetch/$s_!iXo_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg 848w, https://substackcdn.com/image/fetch/$s_!iXo_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!iXo_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iXo_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:8092699,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://clairechoi616.substack.com/i/174670814?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!iXo_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg 424w, https://substackcdn.com/image/fetch/$s_!iXo_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg 848w, https://substackcdn.com/image/fetch/$s_!iXo_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!iXo_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F44058018-8692-422e-a4e2-c48f1f27eedf_6141x4096.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div>]]></content:encoded></item></channel></rss>