<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Machine Yearning]]></title><description><![CDATA[AI, without the hype. Candid commentary and analyses from lab to market.]]></description><link>https://www.machineyearning.io</link><image><url>https://substackcdn.com/image/fetch/$s_!-RAu!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39397fed-3de8-46df-ab35-4f48dc5edf4e_300x300.png</url><title>Machine Yearning</title><link>https://www.machineyearning.io</link></image><generator>Substack</generator><lastBuildDate>Fri, 01 May 2026 21:50:16 GMT</lastBuildDate><atom:link href="https://www.machineyearning.io/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Ryan Cunningham]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[machineyearning@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[machineyearning@substack.com]]></itunes:email><itunes:name><![CDATA[Ryan Cunningham]]></itunes:name></itunes:owner><itunes:author><![CDATA[Ryan Cunningham]]></itunes:author><googleplay:owner><![CDATA[machineyearning@substack.com]]></googleplay:owner><googleplay:email><![CDATA[machineyearning@substack.com]]></googleplay:email><googleplay:author><![CDATA[Ryan Cunningham]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Silicon Vanguard: Ranking China's Domestic Chip Leaders]]></title><description><![CDATA[meet the hardware powering its sovereign AI ecosystem]]></description><link>https://www.machineyearning.io/p/chinas-silicon-vanguard</link><guid isPermaLink="false">https://www.machineyearning.io/p/chinas-silicon-vanguard</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Thu, 11 Sep 2025 17:16:16 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/0649d2d7-c2a5-45c5-a6c0-fd032f8ab9bb_2912x2096.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>When I first started on this project, it was mostly just scratch notes from in-country conversations with Chinese semiconductor experts and investors meant to inform our hardware theses at <a href="https://edgerunner.io">Edgerunner Ventures</a>. But this kind of took on a life of its own, and I have other things I&#8217;d like to write on&#8230; so publishing it below now in full.</em></p><p><em>I&#8217;m also releasing v0.1 of the <a href="https://docs.google.com/spreadsheets/d/1vnS6Rlazkwpg7yBqsD9YE2eoVlAKFLG2vdOoPkiCU5c/edit?gid=57675061#gid=57675061">&#8221;Silicon Vanguard&#8221; dataset</a>, which powers the bulk of this analysis. Chinese chip companies aren&#8217;t as forthcoming about individual unit specs, so it&#8217;s difficult to find performance data from reputable sources. I decided to compile my own from securities filings, archived company pages, and other firsthand sources, and I will be adding more to it over time.</em></p><p><em>This is my longest post yet - 13,000 words - so fair warning, instead of just diving in, I&#8217;d recommend the following order:</em></p><p><em>POLICYMAKERS / BIG PICTURE FOLKS:</em></p><ul><li><p><em><a href="https://www.machineyearning.io/i/170939417/executive-summary">Executive Summary</a></em></p></li><li><p><em><a href="https://www.machineyearning.io/i/170939417/company-profiles">Skim Profiles</a></em></p></li><li><p><em><a href="https://docs.google.com/spreadsheets/d/1vnS6Rlazkwpg7yBqsD9YE2eoVlAKFLG2vdOoPkiCU5c/edit?gid=57675061#gid=57675061">Bookmark the Silicon Vanguard dataset</a></em></p></li><li><p><em><a href="https://www.machineyearning.io/i/170939417/industry-certifications">Industry Certifications</a></em></p></li><li><p><em><a href="https://www.machineyearning.io/i/170939417/strategic-analysis">Strategic Analysis</a> &#8594; <a href="https://www.machineyearning.io/i/170939417/entity-list-impact">Entity List Impact</a></em></p></li><li><p><em><a href="https://www.machineyearning.io/i/170939417/strategic-analysis">Strategic Analysis</a> &#8594; <a href="https://www.machineyearning.io/i/170939417/strategic-backing">Strategic Backing</a></em></p></li></ul><p><em>TECH-FORWARD ANALYSTS:</em></p><ul><li><p><em><a href="https://www.machineyearning.io/i/170939417/executive-summary">Executive Summary</a></em></p></li><li><p><em><a href="https://www.machineyearning.io/i/170939417/company-profiles">Skim Profiles</a></em></p></li><li><p><em><a href="https://docs.google.com/spreadsheets/d/1vnS6Rlazkwpg7yBqsD9YE2eoVlAKFLG2vdOoPkiCU5c/edit?gid=57675061#gid=57675061">Bookmark the Silicon Vanguard dataset</a></em></p></li><li><p><em><a href="https://www.machineyearning.io/i/170939417/data-and-methodology">Data &amp; Methodology (all)</a></em></p></li><li><p><em><a href="https://www.machineyearning.io/i/170939417/technical-analysis">Technical Analysis (all)</a></em></p></li><li><p><em><a href="https://www.machineyearning.io/i/170939417/strategic-analysis">Strategic Analysis</a> &#8594; <a href="https://www.machineyearning.io/i/170939417/entity-list-impact">Entity List Impact</a></em></p></li><li><p><em><a href="https://www.machineyearning.io/i/170939417/strategic-analysis">Strategic Analysis</a> &#8594; <a href="https://www.machineyearning.io/i/170939417/commercial-adoption">Commercial Adoption</a></em></p></li></ul><p><em>While the content itself is written by me, this was a valuable stress-test on modern LLM limitations for deep-tech, bilingual industry analyses, revealing fault lines on citations and hallucinations. Western model providers almost never use firsthand Chinese-language sources, but handle mixed-language PDFs okay. DeepSeek, Kimi K2, and Chinese models are way better at sourcing, but suffer from more serious hallucinations in long context windows, requiring a lot of triple-checking and filter-maxxing. To this end, if anything is inaccurate, please comment with other firsthand sources! Will edit and gladly attribute the correction.</em></p><p><em>Lastly, my friend Lesley Gao (author of <a href="https://theshearforce.substack.com/?utm_campaign=profile_chips">Shear Force</a>) was incredibly helpful in sourcing additional materials to pad out financials and leadership information, as well as contributing to general takeaways. Her recent post, <a href="https://theshearforce.substack.com/p/how-a-1-lighter-defied-inflation">How a $1 Lighter Defied Inflation for 20 Years</a>, is an excellent primer to Chinese industrial clusters, and as a framework carries serious relevance for all kinds of industrial policymaking.</em></p><div><hr></div><h1><strong>Introduction</strong></h1><p>If you ask the question &#8220;how far behind is China on leading edge semiconductors,&#8221; you&#8217;ll get as many different answers as the number of people you ask. 20 years<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>, 10 years<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>, 5 years<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>, 2 years.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> If you ask &#8220;who will become China&#8217;s NVIDIA,&#8221; you may most commonly hear &#8220;Huawei,&#8221; or more recently &#8220;Cambricon.&#8221;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p><p>To me, all variants of these questions are unsatisfactory. Focusing solely on the immediate, visible events - the &#8220;leading edge&#8221; - overlooks underlying systems that generate and sustain technoeconomic momentum. After all, the Titanic didn&#8217;t strike the ice above the surface&#8230; it struck the entire iceberg underneath. This is why technological change often feels gradual, then all of a sudden.</p><p>Additionally, while most domestic chips lag foreign competitors in raw performance, obsessing over isolated stats ignores an objectively rapid pace of development. Their performance on energy-compute metrics, I believe, are more germane to questions on sovereign AI.</p><p>A more nuanced understanding of China&#8217;s chip ecosystem is therefore warranted.</p><p>In this post, I&#8217;ll pull back the veil on 16 companies we&#8217;re tracking in this space, and attempt to make sense of their relative positioning in the market. These breakouts are finding new ways to bypass tech bottlenecks with heterogeneous approaches, deep cross-sector collaborations, and novel design paradigms. There are also clear losers which highlight limits to a state-led innovation ideology. Truly there are dozens of companies worth mentioning, but only a handful have made meaningful contributions to the country&#8217;s sovereign chip ecosystem thus far.</p><p>To be clear, we&#8217;re focusing solely on the fabless accelerator (GPU and ASIC) chip designers of today. We&#8217;ll save CPUs, FPGAs, and memory chip IDMs like SMIC, CXMT, and YMTC - all critical parts of the value chain - for a future deep-dive.</p><p>It&#8217;s fair to argue this analysis is incomplete without fully assessing domestic ability to reliably <em>fabricate</em> these chips (e.g. SMIC, YMTC, CXMT), but as we&#8217;ll see, even if you hold domestic yields frozen at current levels, you have to admit designers have made considerable strides along the tech tree.</p><div><hr></div><h1>Executive Summary</h1><p>I&#8217;ll summarize the findings of this analysis up front, and lay out its implications for American technologists and policymakers. Read on for fuller details on individual companies, performance specs, and qualitative measurements.</p><h2>Takeaways</h2><p><strong>First</strong>, while U.S. entity listings may have had a deleterious effect on some individual first movers (e.g. <strong>Biren, Moore Threads</strong>), this strategy undoubtedly catalyzed domestic chip and software innovations bypassing one-off tech bans. As an example, this month <strong>Huawei HiSilicon</strong> plans to open-source its Unified Cache Manager (UCM), an AI inference acceleration toolkit which shards model memory workloads across different memory types (HBM, DRAM, even SSDs), reducing latency by up to 90% and improving system throughput by up to 22x.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> This would significantly blunt the impact of HBM restrictions on sanctioned entities, which experts agree is a critical bottleneck for domestic accelerators.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a></p><p><strong>Second,</strong> the domestic ecosystem is converging on model, hardware, and precision standards meant to establish &#8220;good enough&#8221; performance for large-scale deployments. DeepSeek has also announced its support for UE8M0 FP8 precision as a performance standard for domestic chips, and chipmakers are proclaiming FP8 support for their newest SKUs (<strong>Cambricon</strong> &#24605;&#20803;690, <strong>Enflame</strong> &#36995;&#24605;L600<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a>, <strong>SOPHGO</strong> SC11-FP300<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a>). CAICT recently began issuing &#8220;AI Chip and Large Model Adaptation Test Certificates&#8221; to chipmakers (<strong>Moore Threads, SOPHGO</strong>) that have demonstrated &#8220;passable&#8221; inference performance for full-blooded DeepSeek R1 671B. This third-party baseline establishes acceptability thresholds for performance and energy efficiency, and provides clarity to buyers and sellers in the procurement process for multi-million dollar chip contracts.</p><p><strong>Third,</strong> hardware-specific investments are bearing fruit in sparse computing and in-memory / near-memory (PIM / NDP) designs. <strong>Moffett AI</strong>&#8217;s first-gen chips have already out-performed the NVIDIA H100 in MLPerf Inference benchmarks, achieving &gt;1.6x performance throughput at a ~3x smaller energy footprint, yielding ~5x greater tokens / joule in single-card and supernode environments.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a> Continued advancements in sparsity, analog computing, and heterogeneous computing techniques are improving energy-compute yields and reducing total cost of ownership.</p><p><strong>Finally</strong>, the last pillar may be NVIDIA&#8217;s software moat (CUDA), though this too may be about to crack. While CUDA (and even AMD&#8217;s ROCm) are still preferred by the majority of Chinese AI engineers, leading chipmakers (<strong>Baidu Kunlunxin, Moore Threads, MetaX, Enflame</strong>) have announced top-level priorities to ensure full CUDA compatibility for their newest chips, and vendor-specific transcompilers are seeing remarkable improvements in CUDA translation performance (<strong>Cambricon&#8217;s</strong> Qimeng-XPiler yielding 95%+ accuracy and &lt;5 hours debugging time)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a>. This is likely a greater threat to CUDA dominance than a head-to-head CUDA vs. CANN open-source developer battle.</p><p>In sum, we&#8217;re witnessing an inflection point in domestic Chinese chip capabilities. Despite lagging behind trailing-edge NVIDIA chips, domestic hardware is evolving beyond mere sufficiency for powering a sovereign AI ecosystem - it may soon be sufficient to begin exporting the Chinese AI stack.</p><h2>Implications</h2><p>In case I haven&#8217;t made this abundantly clear&#8230; the horse has left the barn. Continued attempts to control, deny, or retard domestic Chinese progress will only accelerate this now inevitable transition. Cold War-era technology restrictions are completely at odds with how international developer ecosystems function, and in the modern era create antithetical outcomes to their stated objective.</p><p>Senior AI Policy Advisor Sriram Krishnan articulates this reality well, stating the U.S. needs to maximize developer market share of the &#8220;American AI Stack&#8221; - the models, chips, and software being used to train and run them. It&#8217;s a classic developer flywheel.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a></p><p>We are already in the first bouts of this &#8220;AI Stack&#8221; competition. Here in the Valley, most developers and startups I know are using Chinese models in some capacity (Martin Casado of a16z estimates 80%)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a>. And who can blame them - these models are open-source, meet or exceed American model quality, and are 10x-30x cheaper when hosting for inference.</p><p>Meanwhile, domestic first-movers OpenAI<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-14" href="#footnote-14" target="_self">14</a> and Anthropic<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-15" href="#footnote-15" target="_self">15</a> are implementing restrictions to manage inference costs, nerfing performance, throughput, and output quality in the process. This is a fast track to AI platform decay - or &#8220;enshittification&#8221; (&#24179;&#21488;&#23630;&#21270;, <em>p&#237;ngt&#225;i sh&#464; hu&#224;</em>) - and discourages developers from building on volatile infrastructure.</p><p>This reality demands a strategic recalibration. While a holistic approach to building the American AI Stack is a starting point, evidence from China&#8217;s Silicon Vanguard demonstrates that energy-compute optimization is becoming the decisive factor in at-scale deployment of AI systems&#8230; and will come to define sovereignty in the new world order.</p><p>Containment is no longer an option. The only option is to compete.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Enjoying it so far? Subscribe to Machine Yearning and don&#8217;t miss the next drop. </p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1><strong>The Players</strong></h1><p><em>This won&#8217;t be an all-inclusive list. There&#8217;s a lot of investment dollars going into this sector as China rapidly builds up its domestic semiconductor manufacturing and design muscle.</em></p><p>Here are the names of the 16 entities we&#8217;ll be covering.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Swmu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Swmu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png 424w, https://substackcdn.com/image/fetch/$s_!Swmu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png 848w, https://substackcdn.com/image/fetch/$s_!Swmu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png 1272w, https://substackcdn.com/image/fetch/$s_!Swmu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Swmu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png" width="898" height="1038" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1038,&quot;width&quot;:898,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:325406,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Swmu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png 424w, https://substackcdn.com/image/fetch/$s_!Swmu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png 848w, https://substackcdn.com/image/fetch/$s_!Swmu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png 1272w, https://substackcdn.com/image/fetch/$s_!Swmu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9eb35da9-ff07-48c3-93e7-3fc8507e50c6_898x1038.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Below are other logic chip designers primarily building CPUs and FPGAs. They aren&#8217;t the subject of this post, but we&#8217;re tracking them nonetheless.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Rruj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Rruj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png 424w, https://substackcdn.com/image/fetch/$s_!Rruj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png 848w, https://substackcdn.com/image/fetch/$s_!Rruj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png 1272w, https://substackcdn.com/image/fetch/$s_!Rruj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Rruj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:64736,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Rruj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png 424w, https://substackcdn.com/image/fetch/$s_!Rruj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png 848w, https://substackcdn.com/image/fetch/$s_!Rruj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png 1272w, https://substackcdn.com/image/fetch/$s_!Rruj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c880833-05e1-4399-aaa4-d1aed73aa51b_714x281.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h2>Groupings</h2><p>We categorize these 16 players out into 6 groups based on their stages of development, adoption, and inherent competitive advantages.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wlim!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wlim!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png 424w, https://substackcdn.com/image/fetch/$s_!wlim!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png 848w, https://substackcdn.com/image/fetch/$s_!wlim!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png 1272w, https://substackcdn.com/image/fetch/$s_!wlim!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wlim!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png" width="731" height="668" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:668,&quot;width&quot;:731,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:117009,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wlim!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png 424w, https://substackcdn.com/image/fetch/$s_!wlim!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png 848w, https://substackcdn.com/image/fetch/$s_!wlim!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png 1272w, https://substackcdn.com/image/fetch/$s_!wlim!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14ea8c8-2aac-457d-b981-717c1c6c4497_731x668.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>7 of these 16 are on the U.S. Entity List (indicated by &#128681;), which restricts their access to foreign-produced wafers and high-bandwidth memory (HBM) solutions. Current entity listings are public at <a href="https://www.ecfr.gov/current/title-15/subtitle-B/chapter-VII/subchapter-C/part-744/appendix-Supplement%20No.%204%20to%20Part%20744">eCFR.gov</a>.</em></figcaption></figure></div><h2>Tier List</h2><p>Since most people are going to skip to this section anyway, I&#8217;ll tee up my domestic tier list for the players we&#8217;ve evaluated. This was inspired by Nathan Lambert in his <a href="https://www.interconnects.ai/p/chinas-top-19-open-model-labs">China Research Lab breakdown</a>, but is by no means an authoritative declaration - it&#8217;s subjective and shouldn&#8217;t be taken too seriously.</p><p>To be clear: this is strictly a <strong>domestic</strong> tier list. We&#8217;ll include comparisons to China-available NVIDIA and AMD chips later on, but including them here would pollute the purpose of the exercise.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ck0U!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ck0U!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png 424w, https://substackcdn.com/image/fetch/$s_!ck0U!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png 848w, https://substackcdn.com/image/fetch/$s_!ck0U!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png 1272w, https://substackcdn.com/image/fetch/$s_!ck0U!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ck0U!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png" width="1200" height="1202.249297094658" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png&quot;,&quot;srcNoWatermark&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d6a93d3b-97a6-4254-9c40-c80d7af9f88b_1067x1069.png&quot;,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:1069,&quot;width&quot;:1067,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:309095,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178dd1f8-ac4f-4245-ab6d-38f66a8fd4d9_1067x1073.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ck0U!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png 424w, https://substackcdn.com/image/fetch/$s_!ck0U!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png 848w, https://substackcdn.com/image/fetch/$s_!ck0U!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png 1272w, https://substackcdn.com/image/fetch/$s_!ck0U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2cc857ad-1cc5-4b04-ba07-ace8b0e5a8c4_1067x1069.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>S through D tiers (along with the &#29399;&#23630; bottom tier) are ranked based on a rubric assessing company performance across 5 dimensions: product, leadership, developer adoption, commercial adoption, and strategic backing.</p><p>A separate tier, &#8220;Edgerunner,&#8221; is reserved for players pursuing novel hardware designs that effectively bypass systemic energy-compute bottlenecks. They may have either limited or unavailable information on commercial deployments. This is not an ordinal ranking <em>per se</em>, but a callout that we should pay close attention to progress on these design paradigms.</p><h3>Ranking Rubric</h3><p>The analyses sections go into plenty of detail on evaluations - the rubric is here for reference.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3cnx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3cnx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png 424w, https://substackcdn.com/image/fetch/$s_!3cnx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png 848w, https://substackcdn.com/image/fetch/$s_!3cnx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png 1272w, https://substackcdn.com/image/fetch/$s_!3cnx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3cnx!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png" width="1200" height="925.3218884120172" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c3b92d61-315d-438a-925c-613ba9da4b26_699x539.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:539,&quot;width&quot;:699,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:123838,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3cnx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png 424w, https://substackcdn.com/image/fetch/$s_!3cnx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png 848w, https://substackcdn.com/image/fetch/$s_!3cnx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png 1272w, https://substackcdn.com/image/fetch/$s_!3cnx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3b92d61-315d-438a-925c-613ba9da4b26_699x539.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Company Profiles</h1><div><hr></div><h2>Taxonomy Primer</h2><p><em>Skip if you&#8217;re already familiar.</em></p><p>In semiconductor coverage, a bunch of acronyms can get confusing. It also doesn&#8217;t help when technical and marketing terms are often conflated (e.g. Google &#8220;TPU&#8221; brand, SOPHGO&#8217;s TPUs, and Zhonghao Xinying&#8217;s &#8220;GPTPUs&#8221;). This confuses retail investors, policymakers, and seasoned analysts alike.</p><p>To keep it simple, here&#8217;s a primer for logic chip taxonomy, as well as where each of our companies fit in that schema. If the entity designs chips in multiple categories, it&#8217;s mentioned in each.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RCZL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RCZL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png 424w, https://substackcdn.com/image/fetch/$s_!RCZL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png 848w, https://substackcdn.com/image/fetch/$s_!RCZL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png 1272w, https://substackcdn.com/image/fetch/$s_!RCZL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RCZL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png" width="724" height="687.5919540229885" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:661,&quot;width&quot;:696,&quot;resizeWidth&quot;:724,&quot;bytes&quot;:149092,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!RCZL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png 424w, https://substackcdn.com/image/fetch/$s_!RCZL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png 848w, https://substackcdn.com/image/fetch/$s_!RCZL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png 1272w, https://substackcdn.com/image/fetch/$s_!RCZL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96820f47-51f5-4bc5-bb08-3c739240c674_696x661.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>Italicized names are not covered in this post.</em></p><p>As a general rule of thumb, this list is in order of <em>generalizability </em>and <em>energy efficiency</em> for AI workloads. GPUs are general-purpose and strike a good balance between performance and efficiency, but are outclassed by ASICs on energy-compute terms. ASICs have slower iteration times due to hardware life cycles and a more limited customer base, but are often the logic chip of choice for large hyperscalers who are their own customers. Lastly, PIM / NDP chips are the most energy-efficient logic chips by orders of magnitude, but are still in an experimental stage of development. Data on their performance in production environments is somewhat limited.</p><div><hr></div><h2>&#129470; Heavyweights</h2><blockquote><p>Well-funded silicon teams wholly-owned by a major Chinese technology incumbent, with access to top talent, recurring revenue, and AI cloud real estate for deployments and co-designs. Strategy is generally to win at rack-scale, not at card-scale - custom interconnects a major boon.</p></blockquote><h3><strong>Huawei HiSilicon (&#21326;&#20026;&#28023;&#24605;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qLeS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qLeS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg 424w, https://substackcdn.com/image/fetch/$s_!qLeS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg 848w, https://substackcdn.com/image/fetch/$s_!qLeS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!qLeS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qLeS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg" width="724" height="483.04375" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:427,&quot;width&quot;:640,&quot;resizeWidth&quot;:724,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Huawei debuts Nvidia's supernode rival at WAIC 2025 as local firms display  advanced intelligent computing solutions&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Huawei debuts Nvidia's supernode rival at WAIC 2025 as local firms display  advanced intelligent computing solutions" title="Huawei debuts Nvidia's supernode rival at WAIC 2025 as local firms display  advanced intelligent computing solutions" srcset="https://substackcdn.com/image/fetch/$s_!qLeS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg 424w, https://substackcdn.com/image/fetch/$s_!qLeS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg 848w, https://substackcdn.com/image/fetch/$s_!qLeS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!qLeS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b2c9d16-9bc8-4939-8257-59cd4d8aefac_640x427.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>1991 | <a href="https://www.hisilicon.com/en">hisilicon.com</a> | Private | Entity Listed &#128681;</p><p>China&#8217;s most vertically integrated NVIDIA alternative: Da Vinci core-based 910A/B/C parts bound into Atlas boards/servers and rack-scale CloudMatrix systems, with CANN software and optical backplanes. The 910C centers on dual-chiplet packaging (~53 B transistors, ~64 AI cores) with 3D die-to-die fabric and HBM3, serving as the building block for CloudMatrix 384 across 16 racks.</p><p>Strategy is &#8220;win at rack scale,&#8221; trading device-level peak for throughput-per-rack and supply security; pricing (&#8776;&#165;110k for 910B; &#8776;&#165;180&#8211;200k for 910C) underscores the TCO pitch versus scarce H100/H200. Adoption is broad across state-linked telcos, finance, and internet platforms; some 910C deliveries push into late-2025 as capacity ramps.</p><h3><strong>T-HEAD / Alibaba (&#24179;&#22836;&#21733;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7YA7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7YA7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png 424w, https://substackcdn.com/image/fetch/$s_!7YA7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png 848w, https://substackcdn.com/image/fetch/$s_!7YA7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png 1272w, https://substackcdn.com/image/fetch/$s_!7YA7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7YA7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Alibaba T-Head Works with China's Leading Smart Voice Chip Supplier,  Allwinner, to Launch New Computing Chips - Pandaily&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Alibaba T-Head Works with China's Leading Smart Voice Chip Supplier,  Allwinner, to Launch New Computing Chips - Pandaily" title="Alibaba T-Head Works with China's Leading Smart Voice Chip Supplier,  Allwinner, to Launch New Computing Chips - Pandaily" srcset="https://substackcdn.com/image/fetch/$s_!7YA7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png 424w, https://substackcdn.com/image/fetch/$s_!7YA7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png 848w, https://substackcdn.com/image/fetch/$s_!7YA7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png 1272w, https://substackcdn.com/image/fetch/$s_!7YA7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb172ac15-ac6a-49de-995c-f32f50510ba6_2000x1041.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>2018 | <a href="https://t-head.cn/">t-head.cn</a> | Private</p><p>Alibaba&#8217;s chip arm merging C-SKY + DAMO chip team: open-source Xuantie RISC-V cores on one flank, Hanguang NPUs and Yitian CPUs on the other. Hanguang 800 (12 nm; large on-chip SRAM, flexible precision) was architected as a high-efficiency inference engine for Alibaba workloads (images/video, recommendation, &#8220;City Brain&#8221;). Patent surveys point to a new phase of chiplets and dataflow-centric fabrics with adaptive tiling and aggressive mixed precision. chiplet-based, dataflow-centric fabrics with adaptive tiling and aggressive mixed precision.</p><p>Adoption is primarily internal/partner-cloud: Panjiu/HPN7.0 fabric interconnects thousands &#8594; hundreds-of-thousands of accelerators; migrations from DCN+ to HPN show material end-to-end training gains. ALink aims to NVLink-like scale for both international and domestic chips - a vertically integrated path to cost/latency control rather than chasing peak single-card specs.</p><h3><strong>Kunlunxin / Baidu (&#26118;&#20177;&#33455;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!A88P!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!A88P!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png 424w, https://substackcdn.com/image/fetch/$s_!A88P!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png 848w, https://substackcdn.com/image/fetch/$s_!A88P!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png 1272w, https://substackcdn.com/image/fetch/$s_!A88P!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!A88P!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png" width="1080" height="603" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:603,&quot;width&quot;:1080,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:457314,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!A88P!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png 424w, https://substackcdn.com/image/fetch/$s_!A88P!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png 848w, https://substackcdn.com/image/fetch/$s_!A88P!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png 1272w, https://substackcdn.com/image/fetch/$s_!A88P!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0c3a09b-1c51-4cb8-b089-582628d9eba2_1080x603.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2011; spun-out 2019 | <a href="https://kunlunxin.com">kunlunxin.com</a> | Private</p><p>Baidu-born XPU line integrated with Baidu Cloud + PaddlePaddle: 1st-gen Samsung 14 nm shipped widely; Kunlun II (7 nm, XPU-R) expanded the base; P800 (XPU-P) targets LLM training and 8-bit inference with rack-scale &#36229;&#33410;&#28857; cabinets. Advantage is tight alignment with Baidu&#8217;s software and services.</p><p>Adoption proof points include China Merchants Bank&#8217;s AI chip project (P800 backing full-fat Qwen/DeepSeek variants on 8-card nodes and clusters) and Baidu&#8217;s own &#19975;&#21345;&#8594;&#19977;&#19975;&#21345; P800 fleet growth in-cloud.</p><div><hr></div><h2>&#128009; Four Little Dragons (&#22235;&#23567;&#40857;)</h2><blockquote><p>An informal industry term granted to four well-funded, novel fabless semiconductor companies. Prospects are generally positive. All have announced their intent to IPO or have already filed prospectuses.</p></blockquote><h3><strong>Enflame (&#29159;&#21407;&#31185;&#25216;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Wmxi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Wmxi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Wmxi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Wmxi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Wmxi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Wmxi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg" width="3790" height="2842" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2842,&quot;width&quot;:3790,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2196636,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd687f8b4-edba-42e1-83c9-05b249cbac5a_5712x4284.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Wmxi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Wmxi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Wmxi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Wmxi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd33690aa-d7a0-43b0-95a3-4121d002049c_3790x2842.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2018 | <a href="https://www.enflame-tech.com/">enflame-tech.com</a> | Private </p><p>Shanghai ASIC house led by AMD alumni (Zhao Lidong, Zhang Yalin), optimized for scale-out systems over single-chip hero numbers. Tencent is both anchor investor and proving ground; Big Fund participation adds patient capital. 2024&#8211;25 pushed mass inference (S60) while refreshing training (L600 &#35757;&#25512;&#19968;&#20307; with higher on-package memory/interconnect).</p><p>Traction is real: ~70k S60 units shipped; the Gansu Qingyang 10,016-card S60 cluster under &#19996;&#25968;&#35199;&#31639; is a marquee win; earlier provincial clusters (e.g., Yichang) and wide Tencent workload coverage bolster credibility. The software ecosystem (&#8220;&#39533;&#31639;/&#37492;&#31639;&#8221;) still trails CUDA, but the company is investing to close op coverage and porting costs. WAIC 2025 showcased &#8220;DeepSeek-ready&#8221; all-in-ones for R1-671B, doubling down on visible, workload-level validation.</p><h3><strong>Moore Threads (&#25705;&#23572;&#32447;&#31243;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!a0BQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!a0BQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg 424w, https://substackcdn.com/image/fetch/$s_!a0BQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg 848w, https://substackcdn.com/image/fetch/$s_!a0BQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!a0BQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!a0BQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Moore Threads Unveils MTT S4000 GPU: Equipped With 48 GB Memory, 200 TOPS  AI Compute, Gen5 Ready&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Moore Threads Unveils MTT S4000 GPU: Equipped With 48 GB Memory, 200 TOPS  AI Compute, Gen5 Ready" title="Moore Threads Unveils MTT S4000 GPU: Equipped With 48 GB Memory, 200 TOPS  AI Compute, Gen5 Ready" srcset="https://substackcdn.com/image/fetch/$s_!a0BQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg 424w, https://substackcdn.com/image/fetch/$s_!a0BQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg 848w, https://substackcdn.com/image/fetch/$s_!a0BQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!a0BQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15e235ca-3153-4e45-ab01-53193e343c5d_2560x1707.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2020 | <a href="https://www.mthreads.com/">mthreads.com</a> | Private (<a href="https://static.sse.com.cn/stock/disclosure/announcement/c/202506/002098_20250630_D74O.pdf">IPO Prospectus</a>) | Entity Listed &#128681;</p><p>Beijing fabless GPU vendor founded by ex-NVIDIA China GM Zhang Jianzhong; heavily backed (ByteDance, Tencent, China Mobile equity arm, Lenovo Capital) and fast-iterating across &#33487;&#22564;&#8594;&#26149;&#26195;&#8594;&#26354;&#38498; with the MUSA stack; shifted from TSMC to SMIC post-Entity List.</p><p>Adoption signal is improving: CAICT validated the MTT S4000 for large-model inference; deployments span Kuaishou (1,000-GPU cluster), the big three telcos, and university/energy partners. Datacenter SKUs (S3000/S4000) anchor B2B, while the E300 edge module (AB100 AI SoC, ~50 TOPS INT8) shows consumer/edge ambition. Performance lags NVIDIA&#8217;s top end but distribution, capital, and ecosystem deals give it staying power domestically.</p><h3><strong>MetaX (&#27792;&#26342;&#38598;&#25104;&#30005;&#36335;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!txBH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!txBH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg 424w, https://substackcdn.com/image/fetch/$s_!txBH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg 848w, https://substackcdn.com/image/fetch/$s_!txBH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!txBH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!txBH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg" width="3024" height="1583" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1583,&quot;width&quot;:3024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:828909,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3645493b-00c0-4a23-a0b0-0ea005abd4e0_4032x3024.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!txBH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg 424w, https://substackcdn.com/image/fetch/$s_!txBH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg 848w, https://substackcdn.com/image/fetch/$s_!txBH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!txBH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb29efc81-5407-49aa-b2b8-c70ee2624555_3024x1583.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2020 | <a href="https://www.metax-tech.com/">metax-tech.com</a> | Private (<a href="https://static.sse.com.cn/stock/disclosure/announcement/c/202506/002078_20250630_VAWJ.pdf">IPO Prospectus</a>)</p><p>Shanghai GPGPU startup (AMD-veteran founding team) with three series&#8212;&#26342;&#24605; N (inference), &#26342;&#20113; C (training/compute), &#26342;&#24425; G (graphics)&#8212;plus CUDA-like MXMACA and MetaXLink interconnect. STAR Market IPO prospectus accepted 2025-06-30; &gt;25k units shipped; next-gen dual-chiplet C600 touts 144 GB HBM3e.</p><p>Commercial motion leans on operator/provincial compute platforms and national distributors: disclosed multi-billion-RMB orders via integrators, Lingang partnerships, and &#8220;&#21315;&#21345;&#32423;&#8221; clusters. Earlier parts relied on overseas HBM/foundry; newer N300/C600 are billed as domestic supply chain supported (&#8220;&#22522;&#20110;&#22269;&#20135;&#20379;&#24212;&#38142;&#8221;) signaling a risk-mitigation pivot while keeping cost-right inference as the spearhead.</p><h3><strong>Biren (&#22721;&#20190;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9Umo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9Umo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9Umo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9Umo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9Umo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9Umo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg" width="986" height="760" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:760,&quot;width&quot;:986,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;New Chinese Biren BR100 GPGPU apparently beats Nvidia's Ampere A100 -  NotebookCheck.net News&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="New Chinese Biren BR100 GPGPU apparently beats Nvidia's Ampere A100 -  NotebookCheck.net News" title="New Chinese Biren BR100 GPGPU apparently beats Nvidia's Ampere A100 -  NotebookCheck.net News" srcset="https://substackcdn.com/image/fetch/$s_!9Umo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9Umo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9Umo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9Umo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8de5a721-6c68-4066-a6f7-88ff63dcbb40_986x760.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2019 | <a href="https://birentech.com">birentech.com</a> | Private | Entity Listed &#128681;</p><p>Early bright star in domestic semiconductor design, Shanghai-based player built around the BR100 7nm chip with a star lineup of ex-NVIDIA/AMD/Huawei/Alibaba talent; later leadership churn followed Entity List actions and a pivot from TSMC to SMIC (N+2) with simplified derivatives (BiLi 106/110/166). Software stack (BIRENSUPA) is AI-first, not consumer graphics, and does not seem CUDA-friendly.</p><p>Commercially, Biren is lagging newer dragons, but has nonetheless landed state-linked telcos/clouds (China Mobile/Telecom), SenseTime, State Grid, and Shanghai AI Lab (1,000-GPU cluster). Newer liquid-cooled OAM lines and the &#8220;LightSphere X&#8221; photonic SuperNode (with Xizhi &amp; ZTE) target density/efficiency at 2,000-GPU scale; first stop is the Shanghai INESA Intelligent Computing Center.</p><div><hr></div><h2><strong>&#127941; Public Champions</strong></h2><blockquote><p>Publicly traded firms with considerable success in commercial deployments.</p></blockquote><h3><strong>Cambricon (&#23506;&#27494;&#32426;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UvPD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UvPD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg 424w, https://substackcdn.com/image/fetch/$s_!UvPD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg 848w, https://substackcdn.com/image/fetch/$s_!UvPD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!UvPD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UvPD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg" width="1440" height="810" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:810,&quot;width&quot;:1440,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Investors bet on Cambricon to be China's next AI champion&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Investors bet on Cambricon to be China's next AI champion" title="Investors bet on Cambricon to be China's next AI champion" srcset="https://substackcdn.com/image/fetch/$s_!UvPD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg 424w, https://substackcdn.com/image/fetch/$s_!UvPD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg 848w, https://substackcdn.com/image/fetch/$s_!UvPD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!UvPD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F493d3fad-be4a-4548-8468-70f74666614d_1440x810.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>2016 | <a href="http://cambricon.com">cambricon.com</a> | 688256.SH | </strong>Entity Listed &#128681;</p><p>CAS/USTC child prodigies spun early domestic accelerator research (DianNao) into a commercial ASIC line; after early highs (Kirin 970 era) and a slump (Entity List, lost smartphone socket), the Siyuan 590 (domestic 7 nm) catalyzed a resurgence with A100-class training in some metrics. H1&#8217;25 revenue spiked ($400M) but is highly concentrated (top-5 &#8776;95%, one &#8776;79%&#8212;rumored ByteDance).</p><p>Cambricon is again relevant in training-class deployments (Siyuan 590 now, 690 next), fueled by heavy R&amp;D (&#8776;&#165;1.07 B in 2024, ~91% of revenue). Concentration risk is real, but the technical IP, government alignment, and renewed customer pull have shifted the narrative from &#8220;fallen star&#8221; to &#8220;second act.&#8221;</p><h3><strong>Hygon (&#28023;&#20809;&#20449;&#24687;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bPcX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bPcX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg 424w, https://substackcdn.com/image/fetch/$s_!bPcX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg 848w, https://substackcdn.com/image/fetch/$s_!bPcX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!bPcX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bPcX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg" width="1456" height="757" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:757,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Leading Chinese CPU Firm Hygon Listed to Shanghai's STAR Market - Pandaily&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Leading Chinese CPU Firm Hygon Listed to Shanghai's STAR Market - Pandaily" title="Leading Chinese CPU Firm Hygon Listed to Shanghai's STAR Market - Pandaily" srcset="https://substackcdn.com/image/fetch/$s_!bPcX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg 424w, https://substackcdn.com/image/fetch/$s_!bPcX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg 848w, https://substackcdn.com/image/fetch/$s_!bPcX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!bPcX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7fc79482-4736-4a7c-9800-c42d569eefa8_2000x1040.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>2014 | <a href="http://hygon.cn">hygon.cn</a> | 688041.SH | </strong>Entity Listed &#128681;</p><p>Originally established as a joint-venture between AMD, Chengdu Haiguang Microelectronics, and Chengu Haiguang Integrated Circuit Design. Best known for CPUs, Hygon has been reorganizing into accelerators with DCU-branded GPGPU-like parts (CUDA-compatible environment). DCU8100 accelerator reports parity with A100/MI100 in some precisions. A major 2025 move was the announced merger with Sugon (&#26329;&#20809;) to tighten the HPC stack.</p><p>Current accelerator push seems to be riding existing HPC/government channels while CPUs remain the revenue bedrock. Sugon tie-up could accelerate system-level offerings as domestic alternatives mature, especially in heterogeneous computing systems.</p><div><hr></div><h2><strong>&#129514; Specialists</strong></h2><blockquote><p>A mixture of startups and established players pursuing ASICs and other specialized hardware paradigms. While the Ascend, Hanguang 800, and Cambricon MLU lines are technically ASICs, their incumbency separates them from this bucket.</p></blockquote><h3><strong>SOPHGO / SOPHON (&#31639;&#33021;&#31185;&#25216;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!l0rh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!l0rh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg 424w, https://substackcdn.com/image/fetch/$s_!l0rh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg 848w, https://substackcdn.com/image/fetch/$s_!l0rh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!l0rh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!l0rh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg" width="2602" height="781" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:781,&quot;width&quot;:2602,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:479163,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96d01f54-b560-4209-b7fe-248a25145e3a_4032x3024.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!l0rh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg 424w, https://substackcdn.com/image/fetch/$s_!l0rh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg 848w, https://substackcdn.com/image/fetch/$s_!l0rh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!l0rh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F81ad15db-40fc-4002-a3ef-08e249ed0140_2602x781.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2016 | <a href="https://sophgo.com/">sophgo.com</a> | Private | Entity Listed &#128681;</p><p>Bitmain-lineage consolidation of SOPHON (cloud/edge TPUs) + CVITEK (edge vision SoCs) into an end-to-end AI stack spanning BM1684X/1688 to the new BM1690 datacenter TPU, plus SG-series RISC-V CPUs and MLOps/toolchains. Recent flagship SC11-FP300 (BM1690, 256 GB LPDDR5X, ~1.1 TB/s) and a 128-TPU liquid-cooled &#8220;SuperNode&#8221; target LLM inference at scale; the SophNet service fronts popular domestic models.</p><p>Historically strongest in video analytics and city/industrial AI; 2025 marks a pivot to mainstream LLMs. CAICT validated SC11-FP300 against DeepSeek-R1-671B inference; broader hyperscaler uptake remains to be proven, but product cadence and platformization are heading in the right direction.</p><h3><strong>Zhonghao Xinying (&#20013;&#26122;&#33455;&#33521;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!w1Op!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!w1Op!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg 424w, https://substackcdn.com/image/fetch/$s_!w1Op!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg 848w, https://substackcdn.com/image/fetch/$s_!w1Op!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!w1Op!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!w1Op!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg" width="960" height="503" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:503,&quot;width&quot;:960,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:93286,&quot;alt&quot;:&quot;10.000-Wort-R&#252;ckblick: Erste China AI Computing Power Conference -  Aufschlussreiche Vortr&#228;ge von &#252;ber 15 Prominenten!&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="10.000-Wort-R&#252;ckblick: Erste China AI Computing Power Conference -  Aufschlussreiche Vortr&#228;ge von &#252;ber 15 Prominenten!" title="10.000-Wort-R&#252;ckblick: Erste China AI Computing Power Conference -  Aufschlussreiche Vortr&#228;ge von &#252;ber 15 Prominenten!" srcset="https://substackcdn.com/image/fetch/$s_!w1Op!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg 424w, https://substackcdn.com/image/fetch/$s_!w1Op!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg 848w, https://substackcdn.com/image/fetch/$s_!w1Op!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!w1Op!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa6289d4c-95fa-4da6-a317-ff7340ecd40e_960x503.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2018 | <a href="http://www.zhcltech.com">zhcltech.com</a> | 605255.SH (Pending SSE inquiry)</p><p>Hangzhou TPU upstart founded by a Google TPU alum (worked on v2/v3/v4); markets &#8220;GPTPU&#8221; training chips (&#21049;&#37027;/Chana) with 1,024-chip scaling and a first &#8220;Taize&#8221; cluster concept. Positioning is assertive on perf/$ versus &#8220;overseas GPUs.&#8221;</p><p>Pilots are tangible: Guangdong Unicom&#8217;s initial 32-node TPU center slated to grow, China Mobile Tianjin&#8217;s TPU intelligent computing center with Taiji servers, and a Zhejiang University research platform. But corporate maneuvering (reverse-merger path, unusual trading signals) and aggressive revenue claims raise a lot of red flags.</p><h3><strong>Moffett AI (&#22696;&#33455;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!e-ag!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!e-ag!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp 424w, https://substackcdn.com/image/fetch/$s_!e-ag!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp 848w, https://substackcdn.com/image/fetch/$s_!e-ag!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp 1272w, https://substackcdn.com/image/fetch/$s_!e-ag!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!e-ag!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp" width="1080" height="463" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:463,&quot;width&quot;:1080,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;MLPerf&#25918;&#27036;&#65292;&#20013;&#22269;AI&#33455;&#29255;&#20844;&#21496;&#20877;&#33719;&#19990;&#30028;&#31532;&#19968;&#65281;&#22823;&#27169;&#22411;&#25512;&#29702;&#19977;&#39033;&#20896;&#20891;&#65292;&#24615;&#33021;&#36229;&#36234;H100 - &#26234;&#28304;&#31038;&#21306;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="MLPerf&#25918;&#27036;&#65292;&#20013;&#22269;AI&#33455;&#29255;&#20844;&#21496;&#20877;&#33719;&#19990;&#30028;&#31532;&#19968;&#65281;&#22823;&#27169;&#22411;&#25512;&#29702;&#19977;&#39033;&#20896;&#20891;&#65292;&#24615;&#33021;&#36229;&#36234;H100 - &#26234;&#28304;&#31038;&#21306;" title="MLPerf&#25918;&#27036;&#65292;&#20013;&#22269;AI&#33455;&#29255;&#20844;&#21496;&#20877;&#33719;&#19990;&#30028;&#31532;&#19968;&#65281;&#22823;&#27169;&#22411;&#25512;&#29702;&#19977;&#39033;&#20896;&#20891;&#65292;&#24615;&#33021;&#36229;&#36234;H100 - &#26234;&#28304;&#31038;&#21306;" srcset="https://substackcdn.com/image/fetch/$s_!e-ag!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp 424w, https://substackcdn.com/image/fetch/$s_!e-ag!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp 848w, https://substackcdn.com/image/fetch/$s_!e-ag!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp 1272w, https://substackcdn.com/image/fetch/$s_!e-ag!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1046bd0-3a4f-4f6b-8b59-99cdf1917366_1080x463.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2018 | <a href="https://moffettai.com">moffettai.com</a> | Private</p><p>Silicon Valley-founded, now Shenzhen-based. Highly promising sparse-first accelerator company (Antoum chip/S4/S10/S30) focused on algorithm-hardware co-design for high-efficiency inference; adjacent work touches PIM/NDP and compiler/runtime research. Public technical presence includes Hot Chips, MLPerf, and a long paper trail on sparsity/near-data computation.</p><p>Large-scale hyperscaler adoption not yet substantiated, but public inferencing benchmarks support SparseOne cards for large model deployments. The bet is that structurally sparse LLMs + memory-centric designs can deliver outsized perf/W once software maturity and workloads align.</p><h3><strong>InnoStar Semiconductor (&#26133;&#21407;&#21322;&#23548;&#20307;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UBju!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UBju!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png 424w, https://substackcdn.com/image/fetch/$s_!UBju!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png 848w, https://substackcdn.com/image/fetch/$s_!UBju!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png 1272w, https://substackcdn.com/image/fetch/$s_!UBju!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UBju!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png" width="928" height="346" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/af9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:346,&quot;width&quot;:928,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!UBju!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png 424w, https://substackcdn.com/image/fetch/$s_!UBju!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png 848w, https://substackcdn.com/image/fetch/$s_!UBju!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png 1272w, https://substackcdn.com/image/fetch/$s_!UBju!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf9d3ee1-9465-4c14-9f48-22979a36a0cd_928x346.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2019 | <a href="https://www.innostar-semi.com/">innostar-semi.com</a> | Private</p><p>Interesting ReRAM/PIM specialist tackling the memory wall by colocating compute with storage (&#8220;ATOM&#8221; memory-compute). Backed by a mix of global and domestic capital (e.g., KPCB, Lam Research, Ant Group, ByteDance), reflecting both technical ambition and strategic relevance.</p><p>This is a platform-stage play where success hinges on reliable ReRAM arrays, compiler/tiling toolchains, and marrying the parts to transformer-era dataflows. As performance data is not available and commercial adoption has yet to be publicized, the inclusion is thematic more than a endorsement of their specific offering.</p><div><hr></div><h2><strong>&#128034; Trailing Incumbents</strong></h2><blockquote><p>Established or older GPU incumbents whose commercial adoption is trailing newer competitors.</p></blockquote><h3><strong>Iluvatar CoreX (&#22825;&#25968;&#26234;&#33455;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Jclm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Jclm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Jclm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Jclm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Jclm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Jclm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg" width="1200" height="675" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:675,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Jclm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Jclm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Jclm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Jclm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d08dcc-6cd6-4242-bc81-18dada1167c3_1200x675.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2015 | <a href="https://www.iluvatar.com/">iluvatar.com</a> | Private</p><p>Early GPGPU entrant (founded 2015) with IxRT SDK and the Ti&#257;ng&#257;i (training)/Zh&#236;k&#462;i (inference) lines; leadership turbulence (chair/CEO shuffle, probe) but continued technical progress and financing. Ti&#257;ng&#257;i 100 was among the first 7 nm domestic training GPUs; follow-ons target parity with A100-class baselines in select tasks. &#22825;&#35813;150 barely outside the A100 performance and efficiency window in energy-compute analysis.</p><p>Adoption has skewed to consortiums and research: BAAI training &gt;70B-param models and integration at the National Supercomputing Center in Wuxi show capability; within heterogeneous clouds, presence is modest. Strategy favors ecosystem building and pilots over mega single-tenant wins.</p><h3><strong>Denglin Technology (&#30331;&#20020;&#31185;&#25216;)</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!i2Cm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79d49899-a541-4599-bc85-518915018e44_732x455.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!i2Cm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79d49899-a541-4599-bc85-518915018e44_732x455.png 424w, https://substackcdn.com/image/fetch/$s_!i2Cm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79d49899-a541-4599-bc85-518915018e44_732x455.png 848w, https://substackcdn.com/image/fetch/$s_!i2Cm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79d49899-a541-4599-bc85-518915018e44_732x455.png 1272w, https://substackcdn.com/image/fetch/$s_!i2Cm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79d49899-a541-4599-bc85-518915018e44_732x455.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!i2Cm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79d49899-a541-4599-bc85-518915018e44_732x455.png" width="732" height="455" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/79d49899-a541-4599-bc85-518915018e44_732x455.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:455,&quot;width&quot;:732,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;GPU builder Denglin Technology gets investment from China &#8211; Jon Peddie  Research&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="GPU builder Denglin Technology gets investment from China &#8211; Jon Peddie  Research" title="GPU builder Denglin Technology gets investment from China &#8211; Jon Peddie  Research" srcset="https://substackcdn.com/image/fetch/$s_!i2Cm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79d49899-a541-4599-bc85-518915018e44_732x455.png 424w, https://substackcdn.com/image/fetch/$s_!i2Cm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79d49899-a541-4599-bc85-518915018e44_732x455.png 848w, https://substackcdn.com/image/fetch/$s_!i2Cm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79d49899-a541-4599-bc85-518915018e44_732x455.png 1272w, https://substackcdn.com/image/fetch/$s_!i2Cm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79d49899-a541-4599-bc85-518915018e44_732x455.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2017 | <a href="https://denglinai.com/">denglinai.com</a> | Private</p><p>Niche GPU/accelerator vendor with the Goldwasser II family (from 15&#8211;150 W SKUs covering MXM to PCIe). Positioning emphasizes INT8/FP16 density at low power envelopes rather than head-to-head datacenter training.</p><p>Commercially, the most visible activity is an &#8220;AI+ PC&#8221; tie-up with Lenovo; beyond that, evidence of large-scale traction is limited. Reads as an edge/embedded play more than a hyperscale contender.</p><div><hr></div><h2>&#128169; &#29399;&#23630;</h2><blockquote><p>Companies that have thus far failed to demonstrate any commercial viability for AI accelerators whatsoever, but tenure in the industry forces a mention.</p></blockquote><h3>Jingjia Micro (&#26223;&#22025;&#24494;)</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dRmh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dRmh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dRmh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dRmh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dRmh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dRmh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg" width="700" height="378" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:378,&quot;width&quot;:700,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Jingjia Micro JM9 GPU Series Targeting GTX 1080 Performance Tapes Out |  TechPowerUp&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Jingjia Micro JM9 GPU Series Targeting GTX 1080 Performance Tapes Out |  TechPowerUp" title="Jingjia Micro JM9 GPU Series Targeting GTX 1080 Performance Tapes Out |  TechPowerUp" srcset="https://substackcdn.com/image/fetch/$s_!dRmh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dRmh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dRmh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dRmh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F559409ec-3b87-41d4-a236-14471005aee1_700x378.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>2016 | <a href="http://www.jingjiamicro.com">jingjiamicro.com</a> | 300474.SZ | Entity Listed &#128681;</p><p>Changsha-based, defense-rooted fabless GPU/SoC designer and one of China&#8217;s longest-standing GPU houses; a designated &#8220;National Specialist, Unique, and New Key Little Giant.&#8221; The Little Giant status delivers both hard benefits (non-dilutive grants, priority &#8220;Big Fund&#8221; access, policy-bank credit) and soft advantages (procurement preference, reputational signaling). Founders/executives came from CETC 38th Institute with NUDT pedigrees.</p><p>Adoption concentrates in avionics/radar and government PC/workstation localization&#8212;little evidence of modern AI workloads; H1&#8217;25 revenue fell ~45% YoY with GPU segment down ~63%, indicating continued reliance on PLA/SOE demand. Has been completely unable to materialize early leads or government advantages in product development and commercial deployments for AI workloads. JM11 chip line seems to be a last-ditch effort to remain in the race.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Machine Yearning! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1><strong>Data &amp; Methodology</strong></h1><p>I&#8217;ll summarize the scope of this analysis in the below table, then expand on some areas I know will have the comments section fraught with whataboutisms.</p><p>The attributes I care about revolve around energy-compute fundamentals, commercial traction, and leadership acumen. In absence of demonstrable commercial progress, strategic backing or announced collaborations in high-profile &#8220;Eastern Data, Western Compute&#8221; (&#19996;&#25968;&#35199;&#31639;)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-16" href="#footnote-16" target="_self">16</a> and Intelligent Computing Center projects are helpful, but not a substitute.</p><h2>Introducing the &#8220;Silicon Vanguard&#8221; Dataset</h2><p>I&#8217;ve compiled first-party and trusted third-party resources on foreign and domestic chip performance specs into a single artifact I&#8217;m calling the Silicon Vanguard dataset (for lack of a better name for now).</p><p>This initial version, which I&#8217;m <a href="https://docs.google.com/spreadsheets/d/1vnS6Rlazkwpg7yBqsD9YE2eoVlAKFLG2vdOoPkiCU5c/edit?usp=sharing">sharing publicly</a>, combines performance data from company resources, securities filings, semiconductor conference presentations, reputable equity researchers and industry analysts, and more. Third-hand data reported by tech bloggers or general tech media is more speculative than helpful, so I&#8217;ve ignored it at the expense of relative completeness of the dataset.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0vWr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0vWr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png 424w, https://substackcdn.com/image/fetch/$s_!0vWr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png 848w, https://substackcdn.com/image/fetch/$s_!0vWr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png 1272w, https://substackcdn.com/image/fetch/$s_!0vWr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0vWr!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png" width="1200" height="862.0879120879121" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png&quot;,&quot;srcNoWatermark&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/56c347a2-eefc-4d82-ae75-b57432e0777f_1553x1116.png&quot;,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:1046,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:421856,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56c347a2-eefc-4d82-ae75-b57432e0777f_1553x1116.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0vWr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png 424w, https://substackcdn.com/image/fetch/$s_!0vWr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png 848w, https://substackcdn.com/image/fetch/$s_!0vWr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png 1272w, https://substackcdn.com/image/fetch/$s_!0vWr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1efce4f-e438-4956-8cc4-23077bd79f3e_1553x1116.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Source: <a href="https://docs.google.com/spreadsheets/u/0/d/1vnS6Rlazkwpg7yBqsD9YE2eoVlAKFLG2vdOoPkiCU5c/edit">Silicon Vanguard dataset (Machine Yearning)</a>.</p><p>In addition to basic specs, there are energy-compute metrics and hand-calc&#8217;d theoretical inference throughput for various cases. This will be expanded in future versions.</p><p>For now, let&#8217;s take a quick peek at some of the raw performance figures.</p><h3><strong>Raw Performance</strong></h3><p>On a gross basis, the gap in performance between American and Chinese accelerators is pretty stark. At a median computational throughput of 96 TFLOPS, Chinese accelerators trail American ones by a whopping 722 TFLOPS - about the equivalent of a Tesla V100, which precedes current cards by 4 generations. Chinese accelerators in the upper crust of performance beats out an A100, but not an H100 without using sparsification techniques.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yJz_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yJz_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png 424w, https://substackcdn.com/image/fetch/$s_!yJz_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png 848w, https://substackcdn.com/image/fetch/$s_!yJz_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png 1272w, https://substackcdn.com/image/fetch/$s_!yJz_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yJz_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png" width="730" height="270" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:270,&quot;width&quot;:730,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:29961,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yJz_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png 424w, https://substackcdn.com/image/fetch/$s_!yJz_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png 848w, https://substackcdn.com/image/fetch/$s_!yJz_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png 1272w, https://substackcdn.com/image/fetch/$s_!yJz_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5a2a3f6-a747-485f-81cd-21f266ff750d_730x270.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JT4_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JT4_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png 424w, https://substackcdn.com/image/fetch/$s_!JT4_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png 848w, https://substackcdn.com/image/fetch/$s_!JT4_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png 1272w, https://substackcdn.com/image/fetch/$s_!JT4_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JT4_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png" width="908" height="1588" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png&quot;,&quot;srcNoWatermark&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65f0087f-5510-4517-80a0-18dea2d70736_908x1588.png&quot;,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1588,&quot;width&quot;:908,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JT4_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png 424w, https://substackcdn.com/image/fetch/$s_!JT4_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png 848w, https://substackcdn.com/image/fetch/$s_!JT4_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png 1272w, https://substackcdn.com/image/fetch/$s_!JT4_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff1a1ca26-015a-4ff3-a19a-3d43d2ad8586_908x1588.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Memory bandwidth speeds tell a similar story. On strictly a hardware basis, HBM restrictions hold back most Chinese chips on memory performance, though upper tier accelerators are landing somewhere around an A100 in performance. Inference acceleration toolkits are attempting to get around this problem while CXMT improves domestic HBM production quality.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JvOb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JvOb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png 424w, https://substackcdn.com/image/fetch/$s_!JvOb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png 848w, https://substackcdn.com/image/fetch/$s_!JvOb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png 1272w, https://substackcdn.com/image/fetch/$s_!JvOb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JvOb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png" width="723" height="261" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:261,&quot;width&quot;:723,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:31307,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JvOb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png 424w, https://substackcdn.com/image/fetch/$s_!JvOb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png 848w, https://substackcdn.com/image/fetch/$s_!JvOb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png 1272w, https://substackcdn.com/image/fetch/$s_!JvOb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F19224d0a-8f29-48ad-bf0d-af45b42cd0bb_723x261.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jzmo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf873bb7-4818-480f-9553-8659c68f66f0_908x1588.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jzmo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf873bb7-4818-480f-9553-8659c68f66f0_908x1588.png 424w, https://substackcdn.com/image/fetch/$s_!jzmo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf873bb7-4818-480f-9553-8659c68f66f0_908x1588.png 848w, https://substackcdn.com/image/fetch/$s_!jzmo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf873bb7-4818-480f-9553-8659c68f66f0_908x1588.png 1272w, https://substackcdn.com/image/fetch/$s_!jzmo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf873bb7-4818-480f-9553-8659c68f66f0_908x1588.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jzmo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf873bb7-4818-480f-9553-8659c68f66f0_908x1588.png" width="908" height="1588" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/af873bb7-4818-480f-9553-8659c68f66f0_908x1588.png&quot;,&quot;srcNoWatermark&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0fd5386a-db3f-452c-ad1a-f3842a25649c_908x1588.png&quot;,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1588,&quot;width&quot;:908,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jzmo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf873bb7-4818-480f-9553-8659c68f66f0_908x1588.png 424w, https://substackcdn.com/image/fetch/$s_!jzmo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf873bb7-4818-480f-9553-8659c68f66f0_908x1588.png 848w, https://substackcdn.com/image/fetch/$s_!jzmo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf873bb7-4818-480f-9553-8659c68f66f0_908x1588.png 1272w, https://substackcdn.com/image/fetch/$s_!jzmo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf873bb7-4818-480f-9553-8659c68f66f0_908x1588.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Of the Chinese accelerator cards for which we have data, the best performing families seem to come from <strong>Huawei</strong>, <strong>Enflame</strong>, and <strong>MetaX</strong>. We haven&#8217;t yet found performance data on <strong>Cambricon&#8217;s</strong> latest accelerator, the &#24605;&#20803;590, but industry adoption trends suggest this would be a top performer as well. <strong>Biren</strong> is excluded from this list as recorded performance data is from their TSMC-produced cards, which are no longer accessible since being added to the entity list.</p><h3><strong>Energy-Compute Performance</strong></h3><p>We start by looking at two primary energy-compute metrics - <em>computational energy efficiency</em> and <em>memory energy efficiency</em>. These are measured in FLOPS / Watt and bytes per joule, respectively. Both of these metrics give an indication for what kind of performance we can expect in a limited energy envelope.</p><p>The objective for energy-compute practitioners is to <strong>maximize tokens per joule</strong> (subject to quality and throughput thresholds). AI accelerator cards don&#8217;t need to have the absolute best stats - beyond a certain threshold of size, speed, and power, they just need to be good enough to handle industry standard models. Hence the Chinese ecosystem&#8217;s convergence on DeepSeek / UE8M0 FP8 precision as a newly established model performance standard.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-17" href="#footnote-17" target="_self">17</a></p><p>So, with these data, we can hand-calc tokens-per-joule by calculating the number of operations required to run a specific model (in this case, Llama 3.3 8B) at a specific precision (FP16, so 2 bytes per parameter). After determining whether the operation is compute-bound or memory-bound, we can solve for the time it takes to generate a single token during the decode sequence (seconds / token), and flip it to get the peak theoretical throughput (tokens / second). Based on the hardware&#8217;s energy consumption, we finally solve for tokens / joule.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-18" href="#footnote-18" target="_self">18</a></p><p>Alas, when translated into production workloads, both advantages compound for American chips. In a theoretical 8B parameter model inference run, American accelerators can hit around 200 tokens / second. Most Chinese accelerators can only hit about a quarter of that.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UCR_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UCR_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png 424w, https://substackcdn.com/image/fetch/$s_!UCR_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png 848w, https://substackcdn.com/image/fetch/$s_!UCR_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png 1272w, https://substackcdn.com/image/fetch/$s_!UCR_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UCR_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png" width="744" height="265" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:265,&quot;width&quot;:744,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:32492,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UCR_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png 424w, https://substackcdn.com/image/fetch/$s_!UCR_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png 848w, https://substackcdn.com/image/fetch/$s_!UCR_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png 1272w, https://substackcdn.com/image/fetch/$s_!UCR_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbacfb3db-f286-49de-acb7-30b323c1c5e0_744x265.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!de78!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!de78!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png 424w, https://substackcdn.com/image/fetch/$s_!de78!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png 848w, https://substackcdn.com/image/fetch/$s_!de78!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png 1272w, https://substackcdn.com/image/fetch/$s_!de78!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!de78!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png" width="908" height="1588" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png&quot;,&quot;srcNoWatermark&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/29338e52-16c2-4daa-9c17-39533a6755ab_908x1588.png&quot;,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1588,&quot;width&quot;:908,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!de78!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png 424w, https://substackcdn.com/image/fetch/$s_!de78!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png 848w, https://substackcdn.com/image/fetch/$s_!de78!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png 1272w, https://substackcdn.com/image/fetch/$s_!de78!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4eab0188-82c0-4fd3-9ebb-fc305b0d48c5_908x1588.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The gap in energy efficiency between American and Chinese chips is narrower than the raw performance gap, but still there. Across the board, American chips yield about 58% more tokens per joule than Chinese chips do. Peak throughput significantly affects this figure - time is energy, after all.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fixx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fixx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png 424w, https://substackcdn.com/image/fetch/$s_!fixx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png 848w, https://substackcdn.com/image/fetch/$s_!fixx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png 1272w, https://substackcdn.com/image/fetch/$s_!fixx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fixx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png" width="742" height="262" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:262,&quot;width&quot;:742,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:32231,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fixx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png 424w, https://substackcdn.com/image/fetch/$s_!fixx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png 848w, https://substackcdn.com/image/fetch/$s_!fixx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png 1272w, https://substackcdn.com/image/fetch/$s_!fixx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa34d16e6-9e05-4495-b66e-71b658b4eee6_742x262.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Interestingly enough, of the American chip line the NVIDIA H20 is the most energy-efficient at 1.1 tokens / joule, even beating out the Blackwell lineup at 0.95. The Ascend 910C is China&#8217;s most energy efficient chip at 0.86 tokens / joule, with the exception of <strong>Moffett AI</strong>&#8217;s SparseOne S30 (which natively supports up to 32x sparsity), and yields a whopping maximum of 2.72 tokens / joule. Aside from Moffett, nearly all GPUs and TPUs in this dataset are stuck underneath an efficiency frontier of ~1 token / joule.</p><p>If the Ascend 910D performs as rumored - up to 900 TFLOPS (assuming at FP16)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-19" href="#footnote-19" target="_self">19</a>, 4,800 GBps of memory bandwidth in a 350W package - this would push its peak throughput to ~333 tokens/sec at an energy efficiency rating of 1.27 tokens / joule.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-20" href="#footnote-20" target="_self">20</a> That would place it in the 75th percentile of American chip inference performance while also making it the most energy efficient chip on the market, without sparsity.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bhHy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bhHy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png 424w, https://substackcdn.com/image/fetch/$s_!bhHy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png 848w, https://substackcdn.com/image/fetch/$s_!bhHy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png 1272w, https://substackcdn.com/image/fetch/$s_!bhHy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bhHy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png" width="908" height="1588" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png&quot;,&quot;srcNoWatermark&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2f00a7dd-365c-426f-ac5e-15bd82fddf51_908x1588.png&quot;,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1588,&quot;width&quot;:908,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bhHy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png 424w, https://substackcdn.com/image/fetch/$s_!bhHy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png 848w, https://substackcdn.com/image/fetch/$s_!bhHy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png 1272w, https://substackcdn.com/image/fetch/$s_!bhHy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc500225-c6a0-462a-9d60-61923ea51b8a_908x1588.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>FLOPS Are (Not) All You Need</h2><p>Ending the analysis here would be pretty unsatisfactory. Rumors on the Ascend 910D are fine, but we&#8217;re concerned with discerning the ability for China to domestically serve its inference needs <strong>today</strong>. We can&#8217;t let perfect - or in this case, leading edge - be the enemy of &#8220;good enough.&#8221;</p><p>Why don&#8217;t I care as much about raw FLOPS? Well for one, it wouldn&#8217;t be as fun an analysis if I did, and two, that&#8217;s a misleading indicator for the overall utility of an accelerator card. Hardware is just one piece of the puzzle.</p><p><strong>Past a certain point, we do not care if an accelerator card is best-in-class. We care only if it gets the job done.</strong></p><p>I already know this statement is going to ruffle some feathers. It would sound like cope coming from a Chinese commentator. It&#8217;s well-understood that most developers would prefer to use the best chips and software available to them. But since the US seems keen on alienating NVIDIA from Chinese buyers as much as possible (Secretary Lutnick&#8217;s insistence on &#8220;addicting&#8221; Chinese engineers to America&#8217;s technology stack is just <strong>ridiculously </strong>Opium War-coded&#8230; it&#8217;s hard to believe that isn&#8217;t somewhat intentional<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-21" href="#footnote-21" target="_self">21</a>), we might as well try to figure out what constitutes a passable domestic replacement.</p><p>With this in mind, how do we determine what is considered &#8220;good enough&#8221; for the Chinese AI ecosystem?</p><h2>Industry Certifications</h2><p>Without applying some guardrails to it, that could be an entirely vibes-based debate. We could set an arbitrary threshold like NVIDIA A100-grade, but this still feels loose.</p><p>One of the takeaways Paul and I had from WAIC was the universal convergence on DeepSeek R1 671B as the effective <a href="https://en.wikipedia.org/wiki/Wintel">&#8220;Wintel&#8221;</a> standard for the AI era. This time around, instead of &#8220;Intel Inside&#8221; stickers, WAIC booths for vendors like SOPHGO, Enflame, and Moore Threads all featured all-in-one systems (supernodes) which they loudly proclaimed capable of running &#8220;full-blooded&#8221; DeepSeek R1, that is, the single-precision 671B parameter model. Seriously, the branding was everywhere.</p><p>When these chips costs tens of thousands of RMB per card, should unwitting procurement teams simply take these vendors at their word? This is a major question that even American neoclouds are grappling with - not just whether the chips work, but whether I can make a profit by hosting them.</p><p>Luckily, a third-party certification ecosystem is emerging to provide clarity. The China Academy of Information and Communications Technology (CAICT), a subordinate of the Ministry of Industry and Information Technology (MIIT), is in the business of conducting third-party benchmarking, assessments, and certifications for all kinds of hardware and software applications.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-22" href="#footnote-22" target="_self">22</a> Recently, it seems they&#8217;ve begun issuing certificates for &#8220;AI Chip and Large Model Adaptation Tests,&#8221; verifying that applicant vendors&#8217; hardware can passably run inference for DeepSeek R1 671B.</p><p>This is useful for a few reasons:</p><ol><li><p>It confirms beyond a reasonable doubt that DeepSeek R1 isn&#8217;t just a marketing push, it&#8217;s actually being adopted into industry baseline benchmarking</p></li><li><p>It contributes to our understanding of how the private and public sector in China can provide operational clarity to both vendors and purchasers</p></li><li><p>It gives us a concrete &#8220;good enough&#8221; threshold to conduct our own assessment</p></li></ol><h3>SOPHGO SC11-FP300 (Issued 2025-06-26)</h3><p>The SC11 FP300 is SOPHGO&#8217;s latest TPU chip - the BM1690 - in a PCIe form factor. It uses LPDDR5x memory, and is much more optimized for energy efficiency than a GPGPU. This emphasis on energy efficiency is part of SOPHGO&#8217;s heritage as Bitmain&#8217;s original ASIC business for cryptocurrency mining.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!f-Ty!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!f-Ty!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg 424w, https://substackcdn.com/image/fetch/$s_!f-Ty!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg 848w, https://substackcdn.com/image/fetch/$s_!f-Ty!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!f-Ty!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!f-Ty!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg" width="1456" height="1092" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1092,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:4021332,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!f-Ty!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg 424w, https://substackcdn.com/image/fetch/$s_!f-Ty!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg 848w, https://substackcdn.com/image/fetch/$s_!f-Ty!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!f-Ty!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4e8066-38f3-4190-8b6a-3662bf7dc943_5712x4284.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: WAIC 2025. The SGM7-40 is a server node containing 8 SC11-FP300 cards in either an air-cooled (right) or direct-to-chip liquid-cooled (left) form factor.</figcaption></figure></div><p>Unlike the MTT S4000, the SC11 FP300 is confirmed to work with FP8 precision workloads - very important for future training and inference needs.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!a5lf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!a5lf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp 424w, https://substackcdn.com/image/fetch/$s_!a5lf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp 848w, https://substackcdn.com/image/fetch/$s_!a5lf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp 1272w, https://substackcdn.com/image/fetch/$s_!a5lf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!a5lf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp" width="782" height="1102" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1102,&quot;width&quot;:782,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!a5lf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp 424w, https://substackcdn.com/image/fetch/$s_!a5lf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp 848w, https://substackcdn.com/image/fetch/$s_!a5lf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp 1272w, https://substackcdn.com/image/fetch/$s_!a5lf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55753e0b-322d-4cb2-8372-0397dc385a86_782x1102.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Company <a href="https://mp.weixin.qq.com/s/36fbnRALFduG0atq0SRdOg">WeChat channel</a>.</figcaption></figure></div><blockquote><p><em>Advanced Computing Products</em></p><p><em>AI Chip and Large Model Adaptation</em></p><p><em>Test Certificate</em></p><p><em>Xiamen Suanneng Technology Co., Ltd.</em></p><p><em>Room 702-01, Xinghui Building, No. 9 Zengcuo'an North Road, Software Park, Xiamen Torch High-tech Zone</em></p><p><em>Product Name: Suanneng SC11 FP300 Computing Card</em></p><p><em>Product Model: SOPHON SC11 FP300</em></p><p><em>After testing by the China Academy of Information and Communications Technology, your SOPHON SC11 FP300 computing card has passed.</em></p><p><em>"AI Chip and Large Model Adaptability Passability Assessment Software and Hardware Environment and Test Details" (FT-L06-0196-01)</em></p><p><em>This certificate specifies the passability adaptation requirements for inference scenarios. The adapted large model is the DeepSeek-R1 671B large model developed by Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.</em></p><p><em>This certificate is hereby issued.</em></p></blockquote><h3>Moore Threads MTT S4000 (Issued 2025-04-30)</h3><p>The MTT S4000 is the latest data center card in the Moore Threads lineup. It&#8217;s a general-purpose GPU (GPGPU) that uses the ChunXiao chip architecture, Moore Threads&#8217; 2nd gen chip.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E8jP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E8jP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg 424w, https://substackcdn.com/image/fetch/$s_!E8jP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg 848w, https://substackcdn.com/image/fetch/$s_!E8jP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!E8jP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E8jP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg" width="1456" height="1941" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1941,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3806189,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!E8jP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg 424w, https://substackcdn.com/image/fetch/$s_!E8jP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg 848w, https://substackcdn.com/image/fetch/$s_!E8jP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!E8jP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F710af104-8917-47ea-aae7-df825efb6164_5712x4284.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: WAIC 2025. The MTT KUAE is a server node based on the MTT S4000 GPU.</figcaption></figure></div><p>It comes with 48GB of GDDR6 memory, the same kind you&#8217;ll usually find in NVIDIA consumer GPUs. In addition to the KUAE, Moore Threads had an OAM module and edge device on display at their booth.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ODib!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0370638-7c31-4cdf-937e-8bae052e643f_600x831.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ODib!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0370638-7c31-4cdf-937e-8bae052e643f_600x831.png 424w, https://substackcdn.com/image/fetch/$s_!ODib!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0370638-7c31-4cdf-937e-8bae052e643f_600x831.png 848w, https://substackcdn.com/image/fetch/$s_!ODib!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0370638-7c31-4cdf-937e-8bae052e643f_600x831.png 1272w, https://substackcdn.com/image/fetch/$s_!ODib!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0370638-7c31-4cdf-937e-8bae052e643f_600x831.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ODib!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0370638-7c31-4cdf-937e-8bae052e643f_600x831.png" width="724" height="1002.74" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f0370638-7c31-4cdf-937e-8bae052e643f_600x831.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:831,&quot;width&quot;:600,&quot;resizeWidth&quot;:724,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ODib!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0370638-7c31-4cdf-937e-8bae052e643f_600x831.png 424w, https://substackcdn.com/image/fetch/$s_!ODib!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0370638-7c31-4cdf-937e-8bae052e643f_600x831.png 848w, https://substackcdn.com/image/fetch/$s_!ODib!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0370638-7c31-4cdf-937e-8bae052e643f_600x831.png 1272w, https://substackcdn.com/image/fetch/$s_!ODib!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0370638-7c31-4cdf-937e-8bae052e643f_600x831.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Source: <a href="https://finance.sina.com.cn/tech/roll/2025-05-07/doc-inevtyis9418933.shtml">Sina Weibo</a>.</p><blockquote><p><em>Advanced Computing Products</em></p><p><em>AI Chip and Large Model Adaptation</em></p><p><em>Test Certificate</em></p><p><em>MooreThread Intelligent Technology (Beijing) Co., Ltd., Building 3, Wangjing International R&amp;D Park, Chaoyang District, Beijing. Product Name: MTT S4000 Training-Initialization Computing Card</em></p><p><em>Product Model: MooreThread MTT S4000</em></p><p><em>After testing by the China Academy of Information and Communications Technology, your company's training-initialization computing card, the MooreThread MTT S4000, has passed the inference scenario passability requirements of the "AI Chip and Large Model Adaptability Passability Assessment Software and Hardware Environment and Test Details" (FT-L06-0196-01). The adapted large model is the DeepSeek-R1 671B developed by Hangzhou DeepSeek Artificial Intelligence Basic Technology Research Co., Ltd.</em></p><p><em>This certificate is hereby issued.</em></p></blockquote><p>As the rest of the Four Little Dragons seek a public offering, I expect we&#8217;ll see similar CAICT certifications as a positive signal for commercial adoption in the IPO process.</p><h2>Calculating &#8220;Good Enough&#8221; Performance</h2><p>The MTT S4000 and SC11-FP300 aren&#8217;t the best chips in our domestic chipmaker inventory, but they do provide useful examples to determine our &#8220;good enough&#8221; threshold. We can update this prior as new CAICT certifications become public.</p><p>Strictly on an energy-compute basis, both chips still fall short of NVIDIA&#8217;s A100 series from 5 years ago. But they offer decent enough performance to warrant a closer look.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eHOT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eHOT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png 424w, https://substackcdn.com/image/fetch/$s_!eHOT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png 848w, https://substackcdn.com/image/fetch/$s_!eHOT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png 1272w, https://substackcdn.com/image/fetch/$s_!eHOT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!eHOT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png" width="723" height="512" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:512,&quot;width&quot;:723,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:59834,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!eHOT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png 424w, https://substackcdn.com/image/fetch/$s_!eHOT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png 848w, https://substackcdn.com/image/fetch/$s_!eHOT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png 1272w, https://substackcdn.com/image/fetch/$s_!eHOT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F92d84107-e2fb-4aad-8c12-31ad04505c94_723x512.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Asterisks indicate value was derived from a confirmed performance figure cast into target precision (e.g. 400 TOPS INT8 == 200 FLOPS FP16). Checkmark indicates compatibility with the precision format.</figcaption></figure></div><p>First, we need to assess what takes longer: processing the entire prompt, or moving the weights from memory to the logic chip for processing. This determines whether the prefill stage is compute-bound or memory bound, respectively. We take the greater of those two values (in seconds), and add that to the total time it takes to decode our output sequence. The decode stage is almost always memory-bound.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DNZ2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DNZ2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png 424w, https://substackcdn.com/image/fetch/$s_!DNZ2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png 848w, https://substackcdn.com/image/fetch/$s_!DNZ2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png 1272w, https://substackcdn.com/image/fetch/$s_!DNZ2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DNZ2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png" width="750" height="430" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:430,&quot;width&quot;:750,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:65530,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DNZ2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png 424w, https://substackcdn.com/image/fetch/$s_!DNZ2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png 848w, https://substackcdn.com/image/fetch/$s_!DNZ2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png 1272w, https://substackcdn.com/image/fetch/$s_!DNZ2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8252acd8-1cbe-4a6e-a5f5-9b7510a2968a_750x430.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>From this limited dataset, it seems that CAICT deems peak throughput of ~47.6 tokens / sec <strong>passable</strong> for an 8B parameter model on a single card. Keep in mind that all tested hardware come in multi-card server node configurations - it&#8217;s likely the CAICT tests were conducted on those systems rather than single-card inference.</p><p>Nearly 13 seconds of inference time for an 8B model is a little long for my taste, and even a 3-generation old card like the A100 can do that in half the time at much better energy efficiency. Regardless, this is as good a starting point as any.</p><p>We&#8217;ll therefore set our passability threshold, as suggested by CAICT, for FP16 inference on an 8B model at:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\begin{array}{|l|l|}\n\\hline\n\\textbf{Metric} &amp; \\textbf{Requirement} \\\\\n\\hline\n\\text{Peak Throughput } (\\tau_{sec}) &amp; \\geq 47.6 \\\\\n\\hline\n\\text{Tokens / Joule } (\\eta) &amp; \\geq 0.14 \\\\\n\\hline\n\\end{array}&quot;,&quot;id&quot;:&quot;NEWWHRREZX&quot;}" data-component-name="LatexBlockToDOM"></div><p>With those requirements in place, we can now conduct a more informed analysis than just raw performance specs would have afforded.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Wait, there&#8217;s more? You&#8217;re kidding right?</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1><strong>Technical Analysis</strong></h1><p>As a refresher, the <strong>triple-product advantage</strong> is the compounding effect that moderate gains in individual vectors (energy efficiency, throughput speed, model quality) create when multiplied together.</p><p>This means that even if hardware is only &#8220;good enough&#8221; on efficiency and throughput speed, collaborative innovations in distributed inference, sparse memory operations, mixed precision, and so on can reduce computational complexity to fit within those limitations. Not to mention having plenty of surplus power to support initial energy footprints.</p><p>While NVIDIA and AMD are undoubtedly pushing out high-quality hardware, these systems have been stuck on the same iso-efficiency frontier: ~0.52 (&#177; 0.21) tokens / joule since 2016. Newer systems boast incredibly powerful computational performance and boosted memory speeds, but without leveling off power draw for new chips, their energy efficiency hasn&#8217;t improved that much.</p><p>If an ecosystem is successfully co-designing models, hardware, and inference environments, then native participants of that ecosystem stand to benefit the most. Put another way, innovation is happening on all fronts.</p><h2>1. Product Performance</h2><p><strong>Standouts: Huawei, Enflame, MetaX, Moffett AI.</strong></p><div class="pullquote"><p><em>CAICT certification provides a useful benchmark for industry adoption and acceptability. <strong>Huawei, Enflame, Moore Threads, MetaX, Hygon, Iluvatar CoreX, SOPHGO, </strong>and<strong> Moffett AI </strong>each have at least one &#8220;passable&#8221; card based on inferred CAICT guidance (tokens / joule &gt;= 0.14, peak throughput &gt;= 47.6 on Llama 3.3 8B).</em></p><p><em>Only <strong>Huawei, Enflame, MetaX, </strong>and<strong> Moffett AI </strong>have cards within spitting distance of A100 inference performance.</em></p></div><p>We&#8217;ll start by plotting the SKUs of all vendors with available data on a throughput vs. power scatterplot. This highlights both accelerator performance and energy efficiency characteristics.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TyGQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TyGQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png 424w, https://substackcdn.com/image/fetch/$s_!TyGQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png 848w, https://substackcdn.com/image/fetch/$s_!TyGQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png 1272w, https://substackcdn.com/image/fetch/$s_!TyGQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TyGQ!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png" width="1200" height="897.5274725274726" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png&quot;,&quot;srcNoWatermark&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1ff37172-67bb-4176-bd83-4b8ee5b38fc2_4481x3351.png&quot;,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:1089,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:1134081,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ff37172-67bb-4176-bd83-4b8ee5b38fc2_4481x3351.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!TyGQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png 424w, https://substackcdn.com/image/fetch/$s_!TyGQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png 848w, https://substackcdn.com/image/fetch/$s_!TyGQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png 1272w, https://substackcdn.com/image/fetch/$s_!TyGQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdee617b0-6f41-4873-a373-c8033ffc3dab_4481x3351.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As a reminder, our &#8220;good enough&#8221; energy-compute thresholds for Llama 3.3 8B inference in FP16 precision are:</p><ul><li><p><strong>Throughput: </strong><em>&#964;_sec</em> &gt;= 47.6 tokens / sec</p></li><li><p><strong>Energy Efficiency: </strong><em>&#951;</em> &gt;= 0.14 tokens / joule</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aqWx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aqWx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png 424w, https://substackcdn.com/image/fetch/$s_!aqWx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png 848w, https://substackcdn.com/image/fetch/$s_!aqWx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png 1272w, https://substackcdn.com/image/fetch/$s_!aqWx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aqWx!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png" width="1200" height="897.5274725274726" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png&quot;,&quot;srcNoWatermark&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/77fc2682-3d1e-4d15-866c-9d9372c4757a_4481x3351.png&quot;,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:1089,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:1303405,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77fc2682-3d1e-4d15-866c-9d9372c4757a_4481x3351.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aqWx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png 424w, https://substackcdn.com/image/fetch/$s_!aqWx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png 848w, https://substackcdn.com/image/fetch/$s_!aqWx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png 1272w, https://substackcdn.com/image/fetch/$s_!aqWx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F79c62ace-88c4-4d92-b25e-802b030c688f_4481x3351.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>Energy-Compute Matrix</strong> based on Llama 3.3 8B case. Our peak throughput and energy efficiency boundaries intersect at the MTT S4000 data point. Red zones indicate the SKU does not meet throughput minimum, energy efficiency minimum, or both. Everything to the upper-left of this intersection is &#8220;good enough.&#8221;</figcaption></figure></div><p>Readers may also remember the &#8220;Ubiquitous Edge Intelligence&#8221; milestone (the purple zone) discussed at WAIC 2025. This is the north star for domestic model-chip co-designs, defined as the following:</p><ul><li><p><strong>Throughput: </strong><em>&#964;_sec</em> &gt;= 100 tokens / sec</p></li><li><p><strong>Energy Footprint: </strong><em>E</em> &lt; 20 watts</p></li><li><p><strong>Energy Efficiency: </strong><em>&#951;</em> &gt;= 20 tokens / joule</p></li></ul><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6aff6fa1-68eb-46fe-a039-fec68e3119ab&quot;,&quot;caption&quot;:&quot;So, GPT-5 was released a few days ago.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Energy-Compute Theory: China's New Objective Function&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:2295132,&quot;name&quot;:&quot;Ryan Cunningham&quot;,&quot;bio&quot;:&quot;energy-compute and technoeconomics &#8226; founder @ Edgerunner Ventures &#8226; ex-Uber&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!cF6f!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e40b64a-002f-4b1b-bc62-16df254e2f7b_995x995.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2025-08-14T14:15:08.832Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/36bd065e-07d8-4a4a-9ef0-3ebc1e2c93b2_2912x2096.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.machineyearning.io/p/energy-compute-theory-chinas-new&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:170203312,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:49,&quot;comment_count&quot;:8,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;Machine Yearning&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-RAu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39397fed-3de8-46df-ab35-4f48dc5edf4e_300x300.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>Applying those restraints leaves 10 of 16 vendors having at least one SKU which makes the cut. Insufficient information on Denglin Technology, Zhonghao Xinying, Innostar Semiconductor, and T-HEAD prevents placing them on this graph, though Innostar and T-HEAD&#8217;s approaches to on-chip memory would likely place them in an energy-efficient tier.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!A9O5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!A9O5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png 424w, https://substackcdn.com/image/fetch/$s_!A9O5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png 848w, https://substackcdn.com/image/fetch/$s_!A9O5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png 1272w, https://substackcdn.com/image/fetch/$s_!A9O5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!A9O5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png" width="722" height="949" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:949,&quot;width&quot;:722,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:137740,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!A9O5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png 424w, https://substackcdn.com/image/fetch/$s_!A9O5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png 848w, https://substackcdn.com/image/fetch/$s_!A9O5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png 1272w, https://substackcdn.com/image/fetch/$s_!A9O5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc3298a5c-cd09-47b1-898c-4ece61c18db8_722x949.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>However, <strong>Biren</strong> and <strong>Cambricon</strong>&#8217;s passable products are from an older generation of chips, back before they were added to the U.S. Entity List in 2022. Both have since shifted wafer production to SMIC, and performance data has been harder to come by for their newer lineups. So for fairness, we should remove these from the list. This doesn&#8217;t mean that newer chips powering Biren&#8217;s &#22721;&#30782;166 lineup or Cambricon&#8217;s &#24605;&#20803;590 series aren&#8217;t passable - we just don&#8217;t have the data for that yet.</p><p>That leaves 8 / 12 companies - <strong>Huawei, Enflame, Moore Threads, MetaX, Hygon, Iluvatar CoreX, SOPHGO, </strong>and<strong> Moffett AI </strong>- still in play.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_Xha!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_Xha!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png 424w, https://substackcdn.com/image/fetch/$s_!_Xha!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png 848w, https://substackcdn.com/image/fetch/$s_!_Xha!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png 1272w, https://substackcdn.com/image/fetch/$s_!_Xha!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_Xha!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png" width="1200" height="897.5274725274726" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png&quot;,&quot;srcNoWatermark&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c712ead1-ba9a-4be2-ab45-628ad8cf24eb_8962x6702.png&quot;,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:1089,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:2560362,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc712ead1-ba9a-4be2-ab45-628ad8cf24eb_8962x6702.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_Xha!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png 424w, https://substackcdn.com/image/fetch/$s_!_Xha!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png 848w, https://substackcdn.com/image/fetch/$s_!_Xha!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png 1272w, https://substackcdn.com/image/fetch/$s_!_Xha!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4252c27d-9de7-437e-a8d6-f8242b3b80c7_8962x6702.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>Energy-Compute Matrix</strong> with A100 performance filter. Represented by the blue diamond, the NVIDIA A100 provides a useful, albeit trailing, performance milestone.</figcaption></figure></div><p>Speaking to the considerable performance gaps, most vendors fall out of favor if we raise the passability threshold to within striking distance of an A100. Of the 39 original accelerator cards we have data for, only 4 remain: the <strong>Huawei Ascend 910C, Enflame &#20113;&#29159;T20, MetaX &#26342;&#20113;C500, </strong>and the <strong>Moffett AI SparseOne </strong>series<strong>.</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NfHl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NfHl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png 424w, https://substackcdn.com/image/fetch/$s_!NfHl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png 848w, https://substackcdn.com/image/fetch/$s_!NfHl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png 1272w, https://substackcdn.com/image/fetch/$s_!NfHl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NfHl!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png" width="1200" height="354.1218637992832" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:247,&quot;width&quot;:837,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:44560,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NfHl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png 424w, https://substackcdn.com/image/fetch/$s_!NfHl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png 848w, https://substackcdn.com/image/fetch/$s_!NfHl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png 1272w, https://substackcdn.com/image/fetch/$s_!NfHl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9986aa-24ee-47dc-85f3-6a4ac2c7fecd_837x247.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I&#8217;m including Moffett AI in the mix as hardware-specific innovations have enabled high sparsification factors of up to 32x which, on &lt;250W form factors, yield incredibly energy efficient computation. While the 32x case is visible, even 16x and 8x are sufficient to meet the CAICT bar.</p><p>Notably, MetaX is the only general-purpose GPU (GPGPU) in this list. The others are ASICs!</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9zpc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9zpc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9zpc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9zpc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9zpc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9zpc!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg" width="1200" height="618.7658495350803" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:1830,&quot;width&quot;:3549,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:1181616,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f1d7c13-318f-428f-b6fc-5e331c0f5e7b_4032x3024.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9zpc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg 424w, https://substackcdn.com/image/fetch/$s_!9zpc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg 848w, https://substackcdn.com/image/fetch/$s_!9zpc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!9zpc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F668652a2-76f6-47c5-85d2-72f2aaf01150_3549x1830.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: WAIC 2025. The MetaX &#26342;&#20113;C500 chip along with its successor, the &#26342;&#20113;C600. The C600 appears to be a dual-chiplet design, similar to the Huawei Ascend 910C and NVIDIA Blackwell B200.</figcaption></figure></div><h3><strong>Memory Comparison</strong></h3><p>Note that with the exception of Moore Threads, SOPHGO, and Moffett AI, nearly all accelerators seem to require at least HBM2E-tier memory to make this list.</p><p>Most domestic chip designers use HBM2E or earlier, with some using GDDR6 or LPDDR5x memory. <strong>Cambricon, Enflame, Biren, MetaX, </strong>and <strong>Iluvatar CoreX </strong>all use HBM2E in their latest chips, with Enflame&#8217;s upcoming <strong>&#36995;&#24605;L600</strong> rumored to be an early candidate for HBM3.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-23" href="#footnote-23" target="_self">23</a> After its initial entity listing and until its <strong>&#24605;&#20803;590</strong> chip (2022-2024), Cambricon used LPDDR5 memory, as does <strong>Moffett AI</strong> (LPDDR4x) for its low-powered SparseOne cards. Finally, <strong>Kunlunxin </strong>and <strong>Moore Threads </strong>all use GDDR6 memory.</p><p>The thermals, tradeoffs, and physical limitations of memory tech will be the subject of a future post. For now, it&#8217;s safe to assume that domestic chip designers will eventually have to align with a sovereign domestic supply chain for memory chips, which at the moment, remains a bottleneck to more advanced chip production.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-24" href="#footnote-24" target="_self">24</a></p><h3><strong>Sparse Computing</strong></h3><p>Critical to Moffett AI&#8217;s outperformance is its approach to sparsity. In neuroscience, it&#8217;s widely understood that human wetware leverages a high degree of sparsity in computations - in fact, less than 2% of neurons actually fire for any given input. This is a major contributor to the human brain&#8217;s considerable computational energy efficiency.</p><p>&#8220;Sparsity&#8221; in AI workloads refers to the proportion of zeroes (or near-zero) values in a neural network&#8217;s parameters or activations - the more zeroes, the fewer operations and data to have to work with. At most, typical GPUs like NVIDIA&#8217;s lineup can only natively support 50% (2x) sparsity.</p><p>Sparsity is represented in either percentages or multiples (e.g. 50% of weights set to zero = 2x sparsity, 87.5% = 8x, 96.9% = 32x). Some research has shown that LLMs can be sparsified to high degrees (often 50-90% sparsity) with minimal loss in accuracy when done properly.</p><p>A few years ago, Ian Chen and Zhibin Xiao, co-founders of Moffett, published a paper outlining their approach to this problem. The Antoum chip forms the core of their hardware lineup, the SparseOne cards, which are capable of up to 32x sparsification. While there is an inherent accuracy-speed tradeoff with sparsification (you are losing some information after all), the team found considerable speedups in throughput (and therefore, energy efficiency) with minimal accuracy loss.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ghH6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ghH6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png 424w, https://substackcdn.com/image/fetch/$s_!ghH6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png 848w, https://substackcdn.com/image/fetch/$s_!ghH6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png 1272w, https://substackcdn.com/image/fetch/$s_!ghH6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ghH6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png" width="724" height="687.6702508960574" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:530,&quot;width&quot;:558,&quot;resizeWidth&quot;:724,&quot;bytes&quot;:115617,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ghH6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png 424w, https://substackcdn.com/image/fetch/$s_!ghH6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png 848w, https://substackcdn.com/image/fetch/$s_!ghH6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png 1272w, https://substackcdn.com/image/fetch/$s_!ghH6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F473eb120-890f-48b6-878c-fb01eec11f1b_558x530.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <a href="https://arxiv.org/pdf/2207.08006https://arxiv.org/pdf/2207.08006">Moffett AI (arXiv, 2022).</a></figcaption></figure></div><p>The aforementioned S4 in their paper only draws 70W of power. The SparseOne S30, a 250W variant, previously demonstrated production-grade performance on large 100B+ parameter models like BLOOM-176B - 432 tokens / second in an 8-card deployment.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-25" href="#footnote-25" target="_self">25</a></p><p>Moffett AI has also repeatedly participated in MLCommons&#8217; MLPerf Inference benchmarks, receiving consistently high marks for its SparseOne lineup. The S30 in particular nearly doubled H100 throughput on common LLM workloads on a 2.8x smaller energy footprint, yielding a ~5x boost in energy efficiency with virtually no loss in output quality.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-26" href="#footnote-26" target="_self">26</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!502A!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!502A!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!502A!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!502A!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!502A!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!502A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;13.png&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="13.png" title="13.png" srcset="https://substackcdn.com/image/fetch/$s_!502A!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png 424w, https://substackcdn.com/image/fetch/$s_!502A!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png 848w, https://substackcdn.com/image/fetch/$s_!502A!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!502A!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74e8bc9b-4103-4984-b88f-2a8e7ed6e2c7_1920x1080.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <a href="https://www.moffettai.com/xin-wen-zhong-xin/mlperf-shou-ci-da-mo-xing-tui-li-ce-ping-fang.html">Moffett AI, MLCommons</a>.</figcaption></figure></div><p>Finally, that Moffett&#8217;s Antoum cards were deployed on nodes from reputable OEMs (H3C and Inspur) demonstrated minimal or limited customizations required to run these hardware, hinting at commercial viability. Inspur is also an investor in the company.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-27" href="#footnote-27" target="_self">27</a></p><p>Future Moffett cards are confirmed to support FP8 precision workloads.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-28" href="#footnote-28" target="_self">28</a></p><h2>2. Developer Adoption</h2><p><em>Standouts:<strong> Cambricon, Moore Threads, Huawei, Enflame, MetaX.</strong></em></p><p><em>Laggards: <strong>Iluvatar CoreX, Biren.</strong></em></p><div class="pullquote"><p><em>NVIDIA&#8217;s CUDA ecosystem retains top-billing in developer preferences, and CUDA-like compatibility is a positive adoption factor for GPGPU designers like <strong>Enflame, Moore Threads,</strong> and <strong>MetaX</strong>.</em></p><p><em>Significant investment in transcompilation tech a positive tailwind for custom silicon designers like <strong>Cambricon</strong>. Non-heavyweights pushing their own custom languages, like <strong>Iluvatar</strong> and <strong>Biren</strong>, are struggling.</em></p><p><em>Heterogeneous frameworks, transcompilation, and open-source strategies will contribute to CUDA moat erosion over time.</em></p></div><p>Before you ask, I&#8217;m not going to give a definitive answer on if and when NVIDIA&#8217;s CUDA moat gives way to domestic innovation. But I&#8217;ll highlight some contributing factors we can track.</p><p>Hardware isn&#8217;t useful if developers hate having to work with it. During our China trip, we spoke to a host of Chinese cloud providers asking about developer adoption of these various SDKs. Universally, preferences for CUDA (and even ROCm) remain over domestic alternatives thanks to their much larger extant developer and troubleshooting communities - a valuable moat to have.</p><p>Domestic players recognize this and attempting one of three paths:</p><ol><li><p><strong>Seek CUDA compatibility. </strong>Suiyuan plans to invest 20% of its IPO fundraising to develop a CUDA-compatible tool chain, with a goal of achieving 90% operator compatibility by 2025, and the migration cost will be reduced to 40 man hours per person. MetaX has also announced that its C500 series (and all future cards) would be CUDA compatible. Kunlunxin&#8217;s lineup is also reportedly CUDA compatible.</p></li><li><p><strong>Press advantages into software. </strong>Huawei recently announced it would be open-sourcing CANN to help build up its developer ecosystem. Older players like Iluvatar CoreX and Biren are pushing their own custom programming languages, but may be facing adoption headwinds as a result.</p></li><li><p><strong>Build out transcompilation libraries. </strong>Both Cambricon and Moore Threads have invested considerable resources into their in-house transcompilation libraries, QiMeng-Xpiler and MUSIFY, which convert CUDA into their native programming languages (BANG C and MUSA).</p></li></ol><p><strong>Programming Stacks &amp; CUDA Compatibility Roadmaps</strong></p><p><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-29" href="#footnote-29" target="_self">29</a></p><p>Of course, Heavyweights don&#8217;t <em>have</em> to worry about developer adoption, since they have the cloud real estate to justify significant hardware R&amp;D investments. Third-party players, like the little dragons, have to partner with hyperscalers in some capacity if they want to see widespread adoption, and can grease the wheel with CUDA compatibility. But Huawei wants to establish itself as the de facto leader of the sovereign semiconductor ecosystem, and to do that, they need developers - lots of developers - to use CANN.</p><p>During this transitional stage, an &#8220;abstraction layer&#8221; market seems to exist for building transcompilation libraries, converting code written in CUDA into hardware-specific accelerator code. NVIDIA obviously wants to restrict this as much as possible, and has banned such translation layers in its licensing terms since 2021.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-30" href="#footnote-30" target="_self">30</a></p><p>Nonetheless, CUDA optimization and translation remains a big market nonetheless. Readers may remember Sakana AI (Japan&#8217;s sovereign research lab) announced, retracted, and re-launched its own CUDA engineer,<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-31" href="#footnote-31" target="_self">31</a> and Y Combinator featured a similar request in its Spring 2025 &#8220;Request for Startups.&#8221;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-32" href="#footnote-32" target="_self">32</a> In China, Xcore Sigma (&#20013;&#31185;&#21152;&#31166;) is one such example.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-33" href="#footnote-33" target="_self">33</a></p><p>The challenge with this approach is that third-party optimization libraries will always be a few steps behind first-party equivalents. Without a tight relationship between the software and hardware designers, it takes time for optimizations against those changes to diffuse.</p><p>Naturally, this routes highly competitive developers and downstream markets towards natively supported solutions that have the most up-to-date acceleration. This is why for domestic chip designers, reliable transcompilation between common libraries and custom hardware is a must-have. That&#8217;s basic network effects.</p><h3><strong>Moore Threads - MUSIFY</strong></h3><p>As an example, Moore Threads is promoting its MUSIFY toolkit to convert CUDA code into MUSA (Moore Threads Unified System Architecture), its native programming language. If it works, this would significantly reduce the technical barriers and time costs of switching platforms. Users speculate that it works similarly to <a href="https://github.com/vosen/ZLUDA">ZLUDA</a>, which translates PTX code at runtime.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-34" href="#footnote-34" target="_self">34</a></p><h3><strong>Cambricon - Transcompilation</strong></h3><p>A joint team between Cambricon (including the two co-founders) and ICT-CAS co-developed a transpiler called QiMeng-Xpiler, which converts between more commonly supported libraries (NVIDIA CUDA, AMD HIP, and Intel VNNI) and BANG C, Cambricon&#8217;s proprietary C-like language. It can handle these transcompilations with an average accuracy of 95% (admittedly weighed down by Intel), which outperforms both traditional rules-based methods and AI-native approaches.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MUIM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MUIM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png 424w, https://substackcdn.com/image/fetch/$s_!MUIM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png 848w, https://substackcdn.com/image/fetch/$s_!MUIM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png 1272w, https://substackcdn.com/image/fetch/$s_!MUIM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MUIM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png" width="1014" height="592" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:592,&quot;width&quot;:1014,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:158403,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MUIM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png 424w, https://substackcdn.com/image/fetch/$s_!MUIM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png 848w, https://substackcdn.com/image/fetch/$s_!MUIM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png 1272w, https://substackcdn.com/image/fetch/$s_!MUIM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ce18d6a-b446-4820-a8a5-6a586b5bfb64_1014x592.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Source: <a href="https://www.usenix.org/system/files/osdi25-dong.pdf">USENIX</a>, Cambricon.</p><p>The last thing that developers want to do is debug, hence the preference for well-supported languages like CUDA. Plus, transcompilation can take hours - 3.7 on average - which is a long time to wait before realizing your code is riddled with errors. Even a few percentage points difference in final compilation accuracy can mean hours, if not days, of manual debugging.</p><p>Fortunately for Cambricon, while QiMeng failed to generate a functional-program the first time for its most challenging operation in this study (Deformable Attention, ~200 lines of code) around, it didn&#8217;t require long for programmers to debug it: half an hour for &#8220;senior coders&#8221; (software engineers), and 3 hours for &#8220;junior coders&#8221; (masters students). Plus the original transcompilation time of 4.5 hours, that&#8217;s about a day&#8217;s work compared to a week&#8217;s work of manual transcompilation - a ~20-30x productivity improvement when deployed alongside professional software engineers.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Zpa3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Zpa3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png 424w, https://substackcdn.com/image/fetch/$s_!Zpa3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png 848w, https://substackcdn.com/image/fetch/$s_!Zpa3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png 1272w, https://substackcdn.com/image/fetch/$s_!Zpa3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Zpa3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png" width="698" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:698,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Zpa3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png 424w, https://substackcdn.com/image/fetch/$s_!Zpa3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png 848w, https://substackcdn.com/image/fetch/$s_!Zpa3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png 1272w, https://substackcdn.com/image/fetch/$s_!Zpa3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e9c5a5-0bff-40ae-9c35-e0f16d978106_698x314.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h1>Strategic Analysis</h1><p>Beyond individual product performance, there are qualitative factors to examine: strategic and financial backers, leadership teams, and commercial track records all contribute to relative standing. But the backdrop for much of this is the fallout and catalytic effects that American export controls have had on the ecosystem. To paint the picture, we&#8217;ll start with the entity list impact.</p><div><hr></div><h2>1. Entity List Impact</h2><p>While being named to the U.S. Entity List has hit some first movers like Biren hard, for others they have been more speed bumps than Great Walls.</p><p>Second mover advantages for the other three dragons - Enflame, Moore Threads, and MetaX - seem to exist, as they&#8217;ve had an easier time finding commercial adoption from large hyperscalers.</p><p>Specifically, HBM is a critical bottleneck holding back domestic chip performance&#8230; for now. Time will tell how quickly CXMT / YMTC can ramp up HBM3+ production. Heterogeneous computing and software-first innovations are beginning to evade these bottlenecks somewhat successfully.</p><p>For now, poorer SMIC yields (~40%) are contributing to higher unit costs for domestic chips than NVIDIA-produced chips (TSMC at 80-90% yield depending on process node). However, as this improves, unit costs for domestic chips will systemically improve.</p><h3><strong>Leadership Turmoil: Biren</strong></h3><p>Founded in 2019, while Biren wasn&#8217;t the first domestic GPU company, it was widely considered to be an early darling.</p><p>The founder &amp; CEO, <strong>Zhang Wen (&#24352;&#25991;)</strong>, is the former president of SenseTime and holds a JD from Harvard University + an MBA from Columbia University.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-35" href="#footnote-35" target="_self">35</a> Notably, he does not come from a technical background, but he is an excellent dealmaker and headhunter: his extensive experience enabled him to assemble an Avengers-class team of founders and executives from Huawei, Alibaba, Qualcomm, and more.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-36" href="#footnote-36" target="_self">36</a> On the team alone, Biren was able to secure significant amounts of capital - nearly $700M USD and establish early partnerships with other hyperscalers.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-37" href="#footnote-37" target="_self">37</a></p><p>In 2021, Zhang added another heavyweight to the roster - <strong>Li Xinrong (&#26446;&#26032;&#33635;)</strong>, former AMD executive and its head of China R&amp;D. Curiously, he was added as &#8220;Co-CEO,&#8221; likely to reinforce the technical leadership acumen Zhang Wen lacked in the executive role.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-38" href="#footnote-38" target="_self">38</a></p><p>Biren debuted its flagship chip, the BR100, in 2022 at Hot Chips 34 and positioned it as a domestic alternative to NVIDIA&#8217;s A100/H100 class, emphasizing strong BF16/INT8 throughput, flexible precision handling, and speedy interconnect bandwidth. This was objectively an impressive chip at the time, eclipsing the A100 (released just a year earlier) in several performance metrics.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-39" href="#footnote-39" target="_self">39</a></p><p>However, that conference may have attracted unwanted attention from Biden-era policymakers. Months later, Biren would be added to the U.S. Entity List, starving it from access to TSMC process nodes (and later, HBM from SK Hynix / Samsung) for fabrication.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-40" href="#footnote-40" target="_self">40</a> Zhang would also go on to lose both his co-founders - Xu Lingjie (&#24464;&#20940;&#26480;) and Jiao Guofang (&#28966;&#22269;&#26041;) - in the ensuing fallout. Biren has yet to recover its pole position in the domestic market.</p><p>There&#8217;s a Chinese idiom &#26641;&#22823;&#25307;&#39118; (<em>sh&#249; d&#224; zh&#257;o f&#275;ng</em>, &#8220;a tall tree attracts the wind&#8221;) which captures this problem. This is also probably why most Chinese chip designers are incredibly tight-lipped about performance specs for their product lineups.</p><h3><strong>Memory Bottlenecking: Moore Threads</strong></h3><p>Being named to the U.S. Entity List hasn&#8217;t helped Moore Threads&#8217; HBM prospects, either - entity-listed or sanctioned companies are typically barred from using HBM products from SK Hynix, Samsung, and Micron, the international memory triumvirate.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-41" href="#footnote-41" target="_self">41</a> This means their primary options for domestic memory IDMs are Yangtze Memory Technologies Corp (YMTC) and ChangXin Memory Technologies (CXMT).</p><p>The majority of leading edge chips from NVIDIA and AMD use HBM3E memory, which is more advanced than the memory types that sanctioned domestic chipmakers have access to. More specifically, HBM3E achieves total package memory bandwidth speeds of over 1.2 terabytes per second (TBps) per memory stack, roughly 3x faster than HBM2E. Advanced cards use multiple stacks of HBM - 4, 6, or even up to 12.</p><p>Moore Threads uses GDDR6, rated at or around 512 GB/s. GDDR is the kind of memory you&#8217;d find in prosumer gaming GPUs - still good quality, but pales in comparison to HBM.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XosS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XosS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png 424w, https://substackcdn.com/image/fetch/$s_!XosS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png 848w, https://substackcdn.com/image/fetch/$s_!XosS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png 1272w, https://substackcdn.com/image/fetch/$s_!XosS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XosS!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png" width="1200" height="666.0980810234541" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:781,&quot;width&quot;:1407,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:474276,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XosS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png 424w, https://substackcdn.com/image/fetch/$s_!XosS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png 848w, https://substackcdn.com/image/fetch/$s_!XosS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png 1272w, https://substackcdn.com/image/fetch/$s_!XosS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb887e379-c6e5-42aa-8371-ed8ffcf185d7_1407x781.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Source: Pacific Securities, <a href="https://blog.csdn.net/GPT20236688/article/details/135957588">CSDN</a>.</p><p>Reportedly, CXMT has already been producing HBM2 since mid-2024, and has begun testing HBM3 with select industry partners.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-42" href="#footnote-42" target="_self">42</a> They expect to enter mass production for HBM3 and HBM3E grade memory chips in 2026-2027. Interestingly, YMTC - which has been producing stacked NAND chips for years - seems to be reaching across the aisle to assist CXMT with hybrid bonding techniques for more reliable stacking and thermal dissipation.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-43" href="#footnote-43" target="_self">43</a></p><p>It could also be the case that since Moore Threads manufactures both consumer gaming GPUs and datacenter cards alike, the decision to use GDDR6 rather than HBM2 may be to simplify its supply chain. Undoubtedly GDDR has pretty steep tradeoffs in comparison. Whatever the case may be, entity listing has definitely impacted the competitiveness of Moore Threads cards against non-sanctioned domestic and foreign competitors.</p><p>For more on HBM, I strongly recommend Ray Wang&#8217;s guest piece on Nomad Semi.</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:164567572,&quot;url&quot;:&quot;https://www.nomadsemi.com/p/deep-dive-on-hbm&quot;,&quot;publication_id&quot;:2511102,&quot;publication_name&quot;:&quot;Nomad Semi&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!dg1m!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F892a41a7-f352-4776-831e-e30f65605552_1024x1024.png&quot;,&quot;title&quot;:&quot;Deep Dive on HBM&quot;,&quot;truncated_body_text&quot;:&quot;This deep dive on HBM is written jointly with Ray Wang&quot;,&quot;date&quot;:&quot;2025-06-03T14:32:09.259Z&quot;,&quot;like_count&quot;:123,&quot;comment_count&quot;:10,&quot;bylines&quot;:[{&quot;id&quot;:42239,&quot;name&quot;:&quot;Moore Morris&quot;,&quot;handle&quot;:&quot;nomadsemi&quot;,&quot;previous_name&quot;:&quot;Moore &amp; Morris&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/05f2cfe7-8d8c-4bc8-a746-96da5693d538_1024x1024.png&quot;,&quot;bio&quot;:&quot;Former portfolio manager with >10 years of experience investing in semiconductor stocks. I seek to provide a global perspective for a comprehensive understanding of this dynamic industry.&quot;,&quot;profile_set_up_at&quot;:&quot;2022-11-04T03:25:08.877Z&quot;,&quot;reader_installed_at&quot;:&quot;2022-11-04T03:24:29.543Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2541906,&quot;user_id&quot;:42239,&quot;publication_id&quot;:2511102,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:2511102,&quot;name&quot;:&quot;Nomad Semi&quot;,&quot;subdomain&quot;:&quot;nomadsemi&quot;,&quot;custom_domain&quot;:&quot;www.nomadsemi.com&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Dive into the global semiconductor investment landscape&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/892a41a7-f352-4776-831e-e30f65605552_1024x1024.png&quot;,&quot;author_id&quot;:42239,&quot;primary_user_id&quot;:42239,&quot;theme_var_background_pop&quot;:&quot;#B599F1&quot;,&quot;created_at&quot;:&quot;2024-04-12T08:04:14.314Z&quot;,&quot;email_from_name&quot;:&quot;Nomad Semi&quot;,&quot;copyright&quot;:&quot;Moore Morris&quot;,&quot;founding_plan_name&quot;:null,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;disabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:&quot;magaziney&quot;,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null,&quot;status&quot;:{&quot;bestsellerTier&quot;:null,&quot;subscriberTier&quot;:5,&quot;leaderboard&quot;:null,&quot;vip&quot;:false,&quot;badge&quot;:{&quot;type&quot;:&quot;subscriber&quot;,&quot;tier&quot;:5,&quot;color&quot;:null}}},{&quot;id&quot;:205724729,&quot;name&quot;:&quot;Ray Wang&quot;,&quot;handle&quot;:&quot;raywang2&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!qMX_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F811a7523-2557-44c0-8ee5-e2c26f6e61b3_3170x3170.jpeg&quot;,&quot;bio&quot;:&quot;Ray Wang is a Research Director, Semiconductors, Supply Chain, and Emerging Technology at The Futurum Group. Wang previously based in Taipei and Seoul. Focus on Semiconductors and AI, sometime macro.&quot;,&quot;profile_set_up_at&quot;:&quot;2024-06-14T14:31:24.135Z&quot;,&quot;reader_installed_at&quot;:&quot;2025-01-18T11:40:15.414Z&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;status&quot;:{&quot;bestsellerTier&quot;:null,&quot;subscriberTier&quot;:null,&quot;leaderboard&quot;:null,&quot;vip&quot;:false,&quot;badge&quot;:null}}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://www.nomadsemi.com/p/deep-dive-on-hbm?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!dg1m!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F892a41a7-f352-4776-831e-e30f65605552_1024x1024.png" loading="lazy"><span class="embedded-post-publication-name">Nomad Semi</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">Deep Dive on HBM</div></div><div class="embedded-post-body">This deep dive on HBM is written jointly with Ray Wang&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">a year ago &#183; 123 likes &#183; 10 comments &#183; Moore Morris and Ray Wang</div></a></div><h3><strong>Lower Yields, Higher Unit Costs</strong></h3><p>One area where the Entity List is having a measurable systemic impact is domestic chip unit costs. Sometimes, Chinese companies will list more granular cost of goods sold for specific line items in their reporting materials. For our analysis, we have a useful comparison: two Little Dragons, one entity-listed (Moore Threads) and one not (MetaX).</p><p>To proxy our deployment costs, $ / TFLOPS is a helpful metric for training, but less informative for inference. We&#8217;ll use a dollar cost ratio of memory bandwidth speeds.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8ByW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8ByW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png 424w, https://substackcdn.com/image/fetch/$s_!8ByW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png 848w, https://substackcdn.com/image/fetch/$s_!8ByW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png 1272w, https://substackcdn.com/image/fetch/$s_!8ByW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8ByW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png" width="724" height="299.26704545454544" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:291,&quot;width&quot;:704,&quot;resizeWidth&quot;:724,&quot;bytes&quot;:35713,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8ByW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png 424w, https://substackcdn.com/image/fetch/$s_!8ByW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png 848w, https://substackcdn.com/image/fetch/$s_!8ByW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png 1272w, https://substackcdn.com/image/fetch/$s_!8ByW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3ca0ee-684e-4122-9383-3b3acaea9b16_704x291.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">H100: Secondhand reported Raymond James estimate by Tae Kim, author of &#8220;The NVIDIA Way.&#8221; MetaX: Not yet entity-listed. C-Series cards likely being fabbed by TSMC, SMIC transition in progress. Moore Threads: Entity-listed, using SMIC.</figcaption></figure></div><p>The H100 edges out the C500 in memory bandwidth ROI by a small margin, owing to its more advanced HBM3 memory stack vs. the C500&#8217;s HBM2E, though the C500 costs ~28% less to produce than the H100.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-44" href="#footnote-44" target="_self">44</a></p><p>Despite lower performance stats, the MTT data center card lineup costs more than 2.5x as much as an H100, and 11x as much on a $ / GB/s basis.</p><p>Since Moore Threads is entity-listed (and MetaX is not), its chips - despite lower performance - cost considerably more than MetaX&#8217;s competing chip line (2.5x gross, 11x worse ROI). That puts Moore Threads at a severe disadvantage on hardware alone.</p><p>This is an illustrative impact of entity listing on available process nodes to a downstream customer. Yield rates - the number of successful dies cut from a wafer divided by total potential dies - amortize the cost of wafer production over all resulting chips sold. It&#8217;s generally understood that for more advanced process nodes, SMIC yields are currently between 30-40% compared to 80-90% at TSMC. All else being equal, TSMC is able to spread its wafer production costs over twice as many chips sold as SMIC can.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-45" href="#footnote-45" target="_self">45</a></p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-Q7W!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-Q7W!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png 424w, https://substackcdn.com/image/fetch/$s_!-Q7W!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png 848w, https://substackcdn.com/image/fetch/$s_!-Q7W!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png 1272w, https://substackcdn.com/image/fetch/$s_!-Q7W!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-Q7W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png" width="727" height="210.92636229749633" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:197,&quot;width&quot;:679,&quot;resizeWidth&quot;:727,&quot;bytes&quot;:22358,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-Q7W!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png 424w, https://substackcdn.com/image/fetch/$s_!-Q7W!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png 848w, https://substackcdn.com/image/fetch/$s_!-Q7W!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png 1272w, https://substackcdn.com/image/fetch/$s_!-Q7W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a5c850f-b044-4a10-a1c4-9ef5b6101877_679x197.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Source: <a href="https://brief.bismarckanalysis.com/p/chinas-struggle-to-manufacture-advanced">Bismarck Analysis</a>, <a href="https://www.granitefirm.com/blog/us/2022/05/13/yield-rate-comparison/">Granite Firm</a>, <a href="https://semiwiki.com/forum/threads/huawei-the-leader-in-chinese-semiconductor-development%E2%80%A6-%E2%80%98life-or-death%E2%80%99-for-smic-5nm-mass-production-next-year.22690/">SemiWiki</a></p><p>A deeper-dive on the domestic logic and memory chip fabrication supply chain will be the subject of a future post.</p><h3><strong>Unintended Consequences of Myopia (UCM)</strong></h3><p>Undeniably, the lack of leading-edge HBM from foreign suppliers is hindering domestic chip competitiveness. However, there are ways around this issue. In a dramatic reveal on August 12 2025, Huawei revealed a new software tool called the Unified Cache Manager (UCM) as a way to accelerate training and inference workloads without access to HBM.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-46" href="#footnote-46" target="_self">46</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!G-kO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!G-kO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg 424w, https://substackcdn.com/image/fetch/$s_!G-kO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg 848w, https://substackcdn.com/image/fetch/$s_!G-kO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!G-kO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!G-kO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg" width="727" height="390.7625" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:344,&quot;width&quot;:640,&quot;resizeWidth&quot;:727,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!G-kO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg 424w, https://substackcdn.com/image/fetch/$s_!G-kO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg 848w, https://substackcdn.com/image/fetch/$s_!G-kO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!G-kO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04deff72-c72e-4c60-84c8-7c74687ef551_640x344.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Specifically, UCM builds a three-tiered storage architecture to split up the KV cache of LLMs across different memory types:</p><ul><li><p>HBM for extremely common data, accessed in real-time and at high frequency</p></li><li><p>DRAM for a balanced approach, storing data with moderate frequency</p></li><li><p>SSDs for low-frequency data. This evades the bottleneck of VRAM capacity in a chip, and accesses much cheaper, more readily available storage for low-frequency values.</p></li></ul><p>This would unlock astoundingly long context windows without the need for bigger HBM - critical for reasoning models with ballooning token volumes, and overcoming the HBM denial hurdle while CXMT ramps up production.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-47" href="#footnote-47" target="_self">47</a></p><p>The net effect is a triple-purpose optimization across performance, energy, <em>and</em> financial costs:</p><ol><li><p>First token latency is <strong>reduced by up to 90%</strong></p></li><li><p>Sharding KV cache across multiple memory types <strong>10x&#8217;s the inference context window</strong></p></li><li><p>Intelligent routing of data &#8220;heat&#8221; <strong>improves tokens-per-second by 2-22x</strong></p></li></ol><p>Huawei intends to open-source UCM this month, which would provide a systemic boost to all domestic chip companies currently gated by HBM restrictions. While benefits on single cards may be nominal, supernodes (multiple-card clusters) with multiple memory types (SSDs, DRAM, SRAM) would see substantial benefits. Not to mention cloud-scale deployments.</p><h3><strong>Heterogeneous Computing</strong></h3><p>In the same way that UCM leverages relative strengths of different memory types, heterogeneous computing optimizes and distributed workloads according to the relative strengths of different chips.</p><p>Paul Triolo and I have written about advancements in heterogeneous computing in the past. I&#8217;ll simply restate that there are large-scale projects sponsored by major hyperscalers and research labs achieving better training and inference performance through heterogeneous (multiple-vendor) computing clusters than homogenous ones.</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:171271836,&quot;url&quot;:&quot;https://pstaidecrypted.substack.com/p/china-ai-update-innovation-across&quot;,&quot;publication_id&quot;:2296890,&quot;publication_name&quot;:&quot;AIStackDecrypted&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!pvMv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png&quot;,&quot;title&quot;:&quot;China AI update: Innovation across a distributed and heterogeneous AI stack heats up&quot;,&quot;truncated_body_text&quot;:&quot;After a two-week swing through the Chinese AI sector in late July that included detailed discussions with Chinese AI players at the World AI Conference (WAIC) and visits to many individual companies, plus discussions with domestic and international investors, I was struck by the dynamism of the companies involved, and the out-of-the-box thinking on AI m&#8230;&quot;,&quot;date&quot;:&quot;2025-08-22T14:53:51.981Z&quot;,&quot;like_count&quot;:16,&quot;comment_count&quot;:1,&quot;bylines&quot;:[{&quot;id&quot;:18097050,&quot;name&quot;:&quot;Paul Triolo&quot;,&quot;handle&quot;:&quot;pstasiatech&quot;,&quot;previous_name&quot;:&quot;Paul T&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ae5afe75-2e43-4924-9013-5e457f8c73c4_400x400.jpeg&quot;,&quot;bio&quot;:&quot;Long time civil servant now swimming in the private sector &quot;,&quot;profile_set_up_at&quot;:&quot;2021-12-05T16:30:20.359Z&quot;,&quot;reader_installed_at&quot;:&quot;2024-03-11T01:49:34.656Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2316045,&quot;user_id&quot;:18097050,&quot;publication_id&quot;:2296890,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:2296890,&quot;name&quot;:&quot;AIStackDecrypted&quot;,&quot;subdomain&quot;:&quot;pstaidecrypted&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;My personal Substack devoted to AI Stack issues and US China relations&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png&quot;,&quot;author_id&quot;:18097050,&quot;primary_user_id&quot;:18097050,&quot;theme_var_background_pop&quot;:&quot;#00C2FF&quot;,&quot;created_at&quot;:&quot;2024-01-28T02:17:27.339Z&quot;,&quot;email_from_name&quot;:null,&quot;copyright&quot;:&quot;Paul Triolo&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:&quot;newspaper&quot;,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null,&quot;status&quot;:{&quot;bestsellerTier&quot;:null,&quot;subscriberTier&quot;:10,&quot;leaderboard&quot;:null,&quot;vip&quot;:false,&quot;badge&quot;:{&quot;type&quot;:&quot;subscriber&quot;,&quot;tier&quot;:10,&quot;accent_colors&quot;:null}}},{&quot;id&quot;:2295132,&quot;name&quot;:&quot;Ryan Cunningham&quot;,&quot;handle&quot;:&quot;machineyearning&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!cF6f!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e40b64a-002f-4b1b-bc62-16df254e2f7b_995x995.png&quot;,&quot;bio&quot;:&quot;energy-compute and technoeconomics &#8226; founder @ Edgerunner Ventures &#8226; ex-Uber&quot;,&quot;profile_set_up_at&quot;:&quot;2022-01-31T20:24:54.099Z&quot;,&quot;reader_installed_at&quot;:&quot;2022-03-11T16:39:03.951Z&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;status&quot;:{&quot;bestsellerTier&quot;:null,&quot;subscriberTier&quot;:1,&quot;leaderboard&quot;:null,&quot;vip&quot;:false,&quot;badge&quot;:{&quot;type&quot;:&quot;subscriber&quot;,&quot;tier&quot;:1,&quot;accent_colors&quot;:null}},&quot;primaryPublicationId&quot;:108589,&quot;primaryPublicationName&quot;:&quot;Machine Yearning&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://www.machineyearning.io&quot;,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://pstaidecrypted.substack.com/p/china-ai-update-innovation-across?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!pvMv!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png" loading="lazy"><span class="embedded-post-publication-name">AIStackDecrypted</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">China AI update: Innovation across a distributed and heterogeneous AI stack heats up</div></div><div class="embedded-post-body">After a two-week swing through the Chinese AI sector in late July that included detailed discussions with Chinese AI players at the World AI Conference (WAIC) and visits to many individual companies, plus discussions with domestic and international investors, I was struck by the dynamism of the companies involved, and the out-of-the-box thinking on AI m&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">8 months ago &#183; 16 likes &#183; 1 comment &#183; Paul Triolo and Ryan Cunningham</div></a></div><p>This is a key part of the Shanghai municipal government&#8217;s &#8220;AI+Manufacturing&#8221; plan - building a low-latency, distributed industrial &#8220;smart compute cloud.&#8221; <strong>Sugon</strong> - the supercomputing company recently acquired by <strong>Hygon</strong> - also recently announced a high-profile partnership with 20 other hardware companies, OEMs, and research labs, with the intent to develop and promote large-scale heterogeneous computing systems.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-48" href="#footnote-48" target="_self">48</a> Sugon has already launched a new supercluster product to demonstrate these capabilities.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-49" href="#footnote-49" target="_self">49</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0mfV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0mfV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png 424w, https://substackcdn.com/image/fetch/$s_!0mfV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png 848w, https://substackcdn.com/image/fetch/$s_!0mfV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png 1272w, https://substackcdn.com/image/fetch/$s_!0mfV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0mfV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png" width="1080" height="462" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:462,&quot;width&quot;:1080,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:502551,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0mfV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png 424w, https://substackcdn.com/image/fetch/$s_!0mfV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png 848w, https://substackcdn.com/image/fetch/$s_!0mfV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png 1272w, https://substackcdn.com/image/fetch/$s_!0mfV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbfa1e926-e794-4a86-936c-372be3b8cedc_1080x462.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Source: <a href="https://mp.weixin.qq.com/s/069VzI0wd9FJRKh9H4Bq_Q">Sugon WeChat channel</a>.</p><blockquote><p>&#8220;Compared with closed systems, Sugon AI super cluster system not only works as efficiently as a single computer through its tightly coupled design, but also supports multi-brand AI accelerator cards and is compatible with mainstream software ecosystems such as CUDA, providing users with more open choices and significantly reducing hardware costs and software development and adaptation costs, thus protecting initial investments.&#8221;</p></blockquote><p>That excerpt from the Sugon announcement is critical. Functional heterogeneity should not just be considered in the context of entity listing and trade wars. It is an inherently <strong>anti-fragile</strong> economic paradigm that will reduce unit costs for training and inference workloads overtime.</p><div><hr></div><h2>2. Strategic Backing</h2><p><em>Standouts: <strong>Huawei, T-HEAD, Enflame, Moore Threads</strong></em></p><p><em>Laggards: <strong>Iluvatar CoreX, Biren</strong></em></p><p><em>Losers: <strong>Jingjia Micro, Denglin Technology</strong></em></p><div class="pullquote"><p><em>In our analysis, government funding does not play nearly as important a role as does creating a strategic and/or financial relationship with a pre-existing hyperscaler (Tencent, Alibaba, ByteDance, etc.).</em></p><p><em>Hyperscaler or CVC partnerships can pave the way for later commercial adoption, as evidenced by the tight-knit relationship between <strong>Enflame </strong>and Tencent. While <strong>Moore Threads</strong> has the highest private valuation in the list, Enflame&#8217;s commercial adoption has been more voluminous given Tencent (and Meitu&#8217;s) endorsements.</em></p><p><em><strong>Biren </strong>and <strong>MetaX </strong>do not have explicit hyperscaler financial backing, though both have a significant relationship with SMIC&#8217;s CVC which could mean expedited iteration cycles. However, Biren has seen an objectively slow turnaround since its entity listing in 2022.</em></p><p><em><strong>Iluvatar CoreX</strong> and <strong>Denglin </strong>lack significant strategic backers, which may contribute to more muted adoption.</em></p><p><em>Finally, <strong>Moffett AI </strong>and <strong>InnoStar</strong> share Ant Group (and indirectly Alibaba) as a strategic backer. InnoStar also includes ByteDance and Lam Research on its capitalization table.</em></p></div><p>The &#8220;Made in China 2025&#8221; industrial policy is often cited as the blueprint for state-led &#8220;winner-picking&#8221; in high-tech sectors, semiconductors among them. In this limited analysis, that shorthand does not hold up.</p><p>First, many of these players share the same funding entities on the cap table (like the Big Fund), which eliminates any idiosyncratic advantage. Second, not all kinds of government capital in China are created equal. And third, an overly activist government presence on the executive team seems to slow things down faster than it can open doors for adoption.</p><p>Therefore it should not be taken on faith that Chinese government favorability or backing is a guarantor of success. Instead, we assign a higher weight to strategic or financial relationships with established technology incumbents. These players generally have the most sophisticated products, much faster iteration cycles, and strongest commercial adoption thus far.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZDSs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZDSs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png 424w, https://substackcdn.com/image/fetch/$s_!ZDSs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png 848w, https://substackcdn.com/image/fetch/$s_!ZDSs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png 1272w, https://substackcdn.com/image/fetch/$s_!ZDSs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZDSs!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png" width="996" height="1798.8" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:1499,&quot;width&quot;:830,&quot;resizeWidth&quot;:996,&quot;bytes&quot;:164081,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZDSs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png 424w, https://substackcdn.com/image/fetch/$s_!ZDSs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png 848w, https://substackcdn.com/image/fetch/$s_!ZDSs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png 1272w, https://substackcdn.com/image/fetch/$s_!ZDSs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9dff6516-0c9c-40b0-a3e2-acb418936726_830x1499.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Source: Crunchbase, Bloomberg, company announcements.</p><p><em>* Zhonghao Xinying is navigating a potentially botched attempt to go public via reverse merger with a publicly traded tire and rubber company. Source: <a href="https://www.moomoo.com/news/post/58230191/tempu-co-ltd-has-faced-seven-consecutive-inquiries-from-the">Tempu Co., Ltd. has faced seven consecutive inquiries from the Shanghai Stock Exchange, leading to a halt in leveraged buyouts, with funds reportedly "in transit."</a></em></p><h3><strong>The Role of &#8220;Patient Capital&#8221;</strong></h3><p>&#8220;Patient capital&#8221; is a colloquial term used to describe investment funds (typically state-connected) with much longer horizons for expected returns, usually targeting sectors deemed critical for economic growth or industrial self-sufficiency. This is not to say that profitability and ROI don&#8217;t matter, just that that is second to longer-term goals.</p><h4><strong>National Team</strong></h4><p>The national team, led by the China Integrated Circuit Industry Investment Fund (&#8220;Big Fund&#8221;), focuses on strengthening national strategic industries, filling gaps in the supply chain, and anchoring private capital around key technologies.</p><p>Financial returns are not the priority. The goal is to build industrial security.</p><p>As an example, Cambricon received early funding from the Big Fund. The company originated from the Institute of Computing Technology at the Chinese Academy of Sciences, making it a textbook case of industry-academia-research integration.</p><p>The Big Fund&#8217;s investment was both a bet on a promising semiconductor startup and an endorsement of the pathway from state laboratories to commercial application, aimed at encouraging more frontier research to be translated into market-ready products that serve national strategy.</p><h4><strong>Local Funds</strong></h4><p>Local funds, by contrast, are closely tied to regional agendas. They nurture companies that place headquarters, research centers, or production facilities in their jurisdictions in order to strengthen the local industry ecosystem and reinforce the value chain.</p><p>For example, the Shanghai IC Fund&#8217;s mission is to consolidate and expand the city&#8217;s leadership in China&#8217;s semiconductor sector by building a complete industrial cluster that spans design, manufacturing, packaging and testing. They frequently act as bridge investors, supporting promising firms in early rounds before inviting the Big Fund to co-invest, creating &#8220;national plus local&#8221; synergies.</p><p>With the support of both the Shanghai and Beijing IC funds, these two cities are emerging as distinct national leaders: Shanghai as the hub of the full semiconductor value chain with champions such as SMIC (Semiconductor Manufacturing International Corporation), AMEC (Advanced Micro-Fabrication Equipment), MetaX, and Enflame.</p><p>Beijing is establishing itself as the capital of R&amp;D and design, represented by companies like Cambricon, Moore Threads, and Hygon.</p><h4><strong>SOE Venture Arms</strong></h4><p>SOE venture arms sit somewhere in between. They pursue some of the same financial goals as commercial vc firms, but their deeper value lies in the strategic resources they provide.</p><p>Startups gain access to procurement pipelines, pilot projects, and anchor customers within the SOE system. They also benefit from credibility and political backing that smooth the path for future fundraising, bank creditline, and government project participation.</p><p>On top of that, SOE venture arms open doors to industrial expertise, supply chain partners, and even manufacturing capacity. Unlike financial investors, they are often more patient, willing to stay with companies through multiple growth cycles.</p><p>All four of the dragons have attracted both SOE venture arms and local funds, showing how deeply these funding vehicles are embedded in the country&#8217;s industrial rise. Together, national direction, local execution, SOE backing, and corporate upgrading create a layered investment ecosystem that de-risks financing while guiding capital recipients towards real use cases and sustained demand.</p><h3><strong>Drawbacks of State-Led Innovation</strong></h3><p>Patient capital is best when passive. Companies with the strongest connections to government are actually among the worst performers.</p><p>Despite a major head start, deep connections to China Electronics Technology Group Corporation (CETC), government, and PLA contract revenues, <strong>Jingjia Micro (&#26223;&#22025;&#24494;)</strong> has been wholly unable to transition their product line towards AI accelerators, seeing their GPU revenues completely collapse by 40-70% YoY from 2021 to present as new competitors have come online.</p><p>While <strong>Iluvatar CoreX</strong> (another early entrant) was founded by an Oracle veteran, in 2021 they appointed &#8203;&#8203;<strong>Diao Shijing</strong> (&#20993;&#30707;&#20140;), former Head of the Ministry of Industry and Information Technology, to the position. Though the company has had some success with their training card accelerator lineup, order volumes for shared heterogeneous computing projects are much smaller than competitors&#8217; by an order of magnitude (100 inference cards vs 1000 Moore Threads GPUs, 2000 Huawei Ascend 910Bs, 3000 MetaX GPUs).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-50" href="#footnote-50" target="_self">50</a> It seems Iluvatar has had a slower time transitioning from R&amp;D into widespread commercialization.</p><h3><strong>Hyperscaler Tagalongs</strong></h3><p>Based on the rate of commercial deployments, there seem to be two winning strategies for domestic chip designers: either you are the customer and you build it yourself (<strong>Huawei, T-HEAD, Kunlunxin</strong>), or you team up with an incumbent hyperscaler.</p><p>Even with advancements in heterogeneous computing technology, it seems far more difficult to go it alone without a close alliance to a major hyperscaler. <strong>Biren </strong>and <strong>Iluvatar CoreX, </strong>despite strong teams and early traction, have yet to see large-scale adoption in hyperscalar clouds. <strong>Jingjia Micro</strong> and <strong>Denglin Technology</strong> are completely absent from any major commercial rollouts.</p><p>It should come as no surprise that China&#8217;s hyperscalers - Huawei, Alibaba, Baidu, Tencent - have immense incentive to manage their cloud infrastructure costs, for the same reason that Western equivalents all have internal silicon teams (Meta: MTIA, Microsoft: Maia 100, Google: TPU, Amazon: Trainium/Inferentia). The first three have their own wholly-owned fabless design firms, while Tencent has facilitated close relationships with at least two little dragons, Enflame and Moore Threads, through several rounds of funding.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-51" href="#footnote-51" target="_self">51</a></p><p>This close relationship with Tencent is significantly paying off for Enflame, which is reporting the largest volume of commercial adoption for their S60 inference cards compared to other independent fabless firms. Even in the more hands-off approach that Tencent uses, it still leveraged Enflame&#8217;s expertise to co-design its custom Zixiao AI inference chip for Tencent Cloud.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-52" href="#footnote-52" target="_self">52</a></p><div><hr></div><h2>3. Leadership</h2><p><em>Standouts: <strong>Huawei, Cambricon, Enflame, Moore Threads, MetaX.</strong></em></p><p><em>Laggards: <strong>Biren</strong> (though initially promising, has suffered greatly from post-entity list turmoil).</em></p><div><hr></div><div class="pullquote"><p><em>&#8220;Silicon Valley DNA + Localized Innovation&#8221; is a common turn of phrase for successful company leadership. Teams with executive tenure at major SV institutions (NVIDIA and AMD especially) have some of the best performing chips, though Huawei diaspora is also quite strong.</em></p><p><em>In general, engineers leading engineers seems the clearest path to victory. Government or ministry presence in leadership is actually a major detractor from progress.</em></p></div><p>The prevailing pattern for domestic chip leaders seems to be &#8220;&#30789;&#35895;&#22522;&#22240;+&#26412;&#22303;&#21270;&#21019;&#26032;&#8221;, or &#8220;Silicon Valley DNA + localized innovation.&#8221; Many of the top performers in the space have executives with deep experience in Silicon Valley companies, but have been implementing those solutions in the local ecosystem.</p><p>The phrase &#25216;&#32780;&#20248;&#21017;&#31649; (<em>j&#236; &#233;r y&#333;u z&#233; gu&#462;n, </em>literally &#8220;those whose technical skill is excellent then manage&#8221;) accurately describes the best performing companies on this list. It more or less means &#8220;promote the best engineers to lead engineers,&#8221; and is a common saying in Chinese tech circles to insist that the person running an engineering team should be an engineer, not an MBA.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-53" href="#footnote-53" target="_self">53</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Dp9c!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Dp9c!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png 424w, https://substackcdn.com/image/fetch/$s_!Dp9c!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png 848w, https://substackcdn.com/image/fetch/$s_!Dp9c!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png 1272w, https://substackcdn.com/image/fetch/$s_!Dp9c!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Dp9c!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png" width="1060" height="1174" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1174,&quot;width&quot;:1060,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:196080,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Dp9c!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png 424w, https://substackcdn.com/image/fetch/$s_!Dp9c!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png 848w, https://substackcdn.com/image/fetch/$s_!Dp9c!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png 1272w, https://substackcdn.com/image/fetch/$s_!Dp9c!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51d126f9-7a0e-4d61-9de2-4d9ee4fc0c1e_1060x1174.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>* Org structure slightly unclear. Eddie Wu was appointed CEO in 2023, replacing Daniel Zhang. He centralized the cloud business under his reporting line, and has been orienting Alibaba towards artificial intelligence through significant capital expenditures and strategic investments in research labs like Moonshot, MiniMax, and <a href="http://z.ai">Z.AI</a>. Presumably he has ownership over the semiconductor division, but team-specific leadership is not yet known. <a href="https://en.wikipedia.org/wiki/Eddie_Wu">https://en.wikipedia.org/wiki/Eddie_Wu</a></em></figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ckqc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ckqc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png 424w, https://substackcdn.com/image/fetch/$s_!ckqc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png 848w, https://substackcdn.com/image/fetch/$s_!ckqc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png 1272w, https://substackcdn.com/image/fetch/$s_!ckqc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ckqc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png" width="1060" height="985" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:985,&quot;width&quot;:1060,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:166347,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ckqc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png 424w, https://substackcdn.com/image/fetch/$s_!ckqc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png 848w, https://substackcdn.com/image/fetch/$s_!ckqc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png 1272w, https://substackcdn.com/image/fetch/$s_!ckqc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56a233b0-74e6-458b-a4fc-c0094d3f4315_1060x985.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c_mX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c_mX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png 424w, https://substackcdn.com/image/fetch/$s_!c_mX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png 848w, https://substackcdn.com/image/fetch/$s_!c_mX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png 1272w, https://substackcdn.com/image/fetch/$s_!c_mX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c_mX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png" width="1060" height="672" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:672,&quot;width&quot;:1060,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:105525,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!c_mX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png 424w, https://substackcdn.com/image/fetch/$s_!c_mX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png 848w, https://substackcdn.com/image/fetch/$s_!c_mX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png 1272w, https://substackcdn.com/image/fetch/$s_!c_mX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8f2b9658-6b6e-46bc-8f63-d0deb9af2dc1_1060x672.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The three best performing little dragons - <strong>Enflame</strong>, <strong>Moore Threads</strong>, and <strong>MetaX</strong> - all have &#30789;&#35895;&#22522;&#22240; in founding leadership. Enflame and MetaX have decade-plus AMD veterans with a long history of working together, and Moore Threads is effectively carved out from the NVIDIA China team.</p><p>Exceptions to this rule would be among the Chinese Heavyweights, where leadership is cultivated and promoted from within, and <strong>Cambricon</strong>. Cambricon is a unique story where the two founding brothers, both STEM prodigies, were fast-tracked into a Chinese Academy of Science-sponsored ecosystem through a program for gifted youths. Their prowess in mathematics and computer architecture, along with the CAS resources at their disposal, eventually led to the founding of Cambricon out of their research into custom accelerator hardware.</p><p>Aside from that, we note that an outsized government or ministry presence on the leadership team does not seem to be associated with better product performance (no surprise) or commercial adoption. Examples here would be <strong>Iluvatar CoreX</strong> and <strong>Jingjia Micro.</strong> While government connections may seem helpful in large-scale intelligent cluster contracts, the basis for procurement seems to be more meritocratic.</p><div><hr></div><h2>4. Commercial Adoption</h2><p><em>Standouts: <strong>Huawei, Kunlunxin, T-HEAD, Enflame, Cambricon, Hygon</strong></em></p><p><em>Honorable Mention: <strong>SOPHGO</strong>.</em></p><div><hr></div><p><em>The most widely supported domestic chip designers to date include all three heavyweights (<strong>HiSilicon, Kunlunxin, T-HEAD</strong>), two of the four little dragons (<strong>Enflame, Biren</strong>), both public champions (<strong>Cambricon, Hygon</strong>) and finally <strong>Iluvatar CoreX.</strong></em></p><p>The Chinese hyperscaler market is dominated by the Big Three: Huawei, Tencent, and Alibaba. Baidu AI Cloud, Kingsoft Cloud, and ByteDance (Volcengine) are also notable.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-54" href="#footnote-54" target="_self">54</a> Additionally, there are the state-owned Chinese telecommunications companies China Unicom (&#20013;&#22269;&#32852;&#36890;), China Mobile (&#8203;&#8203;&#20013;&#22269;&#31227;&#21160;&#8203;&#8203;), and China Telecom (&#20013;&#22269;&#30005;&#20449;&#8203;&#8203;) to consider.</p><p>Alibaba is the cloud computing leader in terms of market share, and provides the richest detail on supported accelerator programming languages and hardware platforms. NVIDIA and AMD-based computing instances are supported by nearly all Chinese hyperscalers - this is not the case for its domestic chip designers.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ez7Z!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ez7Z!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin 424w, https://substackcdn.com/image/fetch/$s_!Ez7Z!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin 848w, https://substackcdn.com/image/fetch/$s_!Ez7Z!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin 1272w, https://substackcdn.com/image/fetch/$s_!Ez7Z!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ez7Z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin" width="724" height="473.53821907013395" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:830,&quot;width&quot;:1269,&quot;resizeWidth&quot;:724,&quot;bytes&quot;:null,&quot;alt&quot;:1,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="1" title="1" srcset="https://substackcdn.com/image/fetch/$s_!Ez7Z!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin 424w, https://substackcdn.com/image/fetch/$s_!Ez7Z!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin 848w, https://substackcdn.com/image/fetch/$s_!Ez7Z!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin 1272w, https://substackcdn.com/image/fetch/$s_!Ez7Z!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4200e0b6-3131-4c86-84fd-8fef3e99c8a7_1269x830.bin 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Source: <a href="https://www.alibabacloud.com/blog/ai-model-inference-service-an-overview_602002">Alibaba Cloud</a>.</p><p>Until recently Chinese telco server procurement did include solutions from NVIDIA, AMD, and Intel, but the MIIT has specifically instructed the telcos to phase out foreign processor usage by 2027, accelerated by looming export controls and a push for domestic self-sufficiency. This instruction is limited to CPUs at the moment (hitting Intel and AMD), but a similar instruction for GPUs has not been ruled out.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-55" href="#footnote-55" target="_self">55</a></p><p>The below table highlights public confirmations of domestic chip adoption in high-workload commercial sectors. The heavyweights, little dragons, and public champions dominate the upper rankings. This is not all-inclusive since there is a long tail of neoclouds that are not the subject of this analysis.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8zpg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8zpg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png 424w, https://substackcdn.com/image/fetch/$s_!8zpg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png 848w, https://substackcdn.com/image/fetch/$s_!8zpg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png 1272w, https://substackcdn.com/image/fetch/$s_!8zpg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8zpg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png" width="724" height="695.2317880794702" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:580,&quot;width&quot;:604,&quot;resizeWidth&quot;:724,&quot;bytes&quot;:58848,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170939417?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8zpg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png 424w, https://substackcdn.com/image/fetch/$s_!8zpg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png 848w, https://substackcdn.com/image/fetch/$s_!8zpg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png 1272w, https://substackcdn.com/image/fetch/$s_!8zpg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e4da056-61fc-4580-a0b2-43ee031d1c3e_604x580.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In a major order from China Mobile, <strong>Huawei&#8217;s Ascend</strong> line won over 70% of the contract, and Baidu will be supplying its <strong>Kunlunxin</strong> processor line to many systems integrators in that contract.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-56" href="#footnote-56" target="_self">56</a></p><p><strong>Enflame</strong> has reportedly shipped over 70,000 S60 inference cards to date, mostly to Tencent computing clusters. Outside of Huawei and Cambricon, this is actually the largest recorded figure of commercial shipments for the newer entrants.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-57" href="#footnote-57" target="_self">57</a></p><p>Finally, in other cloud contracts with multiple vendors, we&#8217;ve seen comparable deployment figures for <strong>MetaX</strong>, <strong>Moore Threads</strong>, and Ascend accelerator cards in the thousands, with additional (though meager) participation from <strong>Biren</strong> and <strong>Iluvatar</strong>.</p><p>Deployment breakdowns are not always available by vendor or SKU. We&#8217;ll attempt to track shipment figures in future versions of the Silicon Vanguard dataset. For now, what we generally observe is that the Heavyweights are well-positioned given their cloud computing footprint, Enflame seems to be leading the weyr of dragons, and <strong>Cambricon</strong> + <strong>Hygon</strong> are making strides.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">If you&#8217;ve read all the way to the end, thank you. Truly appreciate it. Drop a comment and let me know more of what you want to see</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Ray Wang. <em>&#8220;According to Goldman Sachs, it suggests China&#8217;s lithography progress is 20 years behind based on the roadmap position of Chinese companies and ASML.&#8221;</em>. X (formerly Twitter). <a href="https://x.com/rwang07/status/1962440362024456616">https://x.com/rwang07/status/1962440362024456616</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p><em>China is 10 years behind Taiwan on chips: NSTC</em>. (2024, October 1). Taipei Times. <a href="https://www.taipeitimes.com/News/taiwan/archives/2024/10/01/2003824622">https://www.taipeitimes.com/News/taiwan/archives/2024/10/01/2003824622</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Ezell, S. (2024, August 19). <em>How innovative is China in semiconductors?</em> Information Technology and Innovation Foundation. <a href="https://itif.org/publications/2024/08/19/how-innovative-is-china-in-semiconductors/">https://itif.org/publications/2024/08/19/how-innovative-is-china-in-semiconductors/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Sharwood, S. (2025, June 20). <em>US tech Czar: China just two years behind on chip design</em>. The Register. <a href="https://www.theregister.com/2025/06/20/china_us_chip_competition/">https://www.theregister.com/2025/06/20/china_us_chip_competition/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Jukanlosreve. <em>China&#8217;s Answer to NVIDIA? How Struggling Cambricon Pulled Off a Dramatic Comeback [Deep Dive]</em>. X (formerly Twitter). <a href="https://x.com/Jukanlosreve/status/1961619800427511990">https://x.com/Jukanlosreve/status/1961619800427511990</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Weixin_44005328. <em>&#21326;&#20026;UCM&#25216;&#26415;&#31616;&#20171;</em>. CSDN&#21338;&#23458;-&#19987;&#19994;IT&#25216;&#26415;&#21457;&#34920;&#24179;&#21488;. <a href="https://blog.csdn.net/weixin_44005328/article/details/150305332">https://blog.csdn.net/weixin_44005328/article/details/150305332</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>Patel, D., Kourabi, A. J., Xie, M., &amp; Koch, J. (2025, September 8). <em>Huawei ascend production ramp: Die banks, TSMC continued production, HBM is the bottleneck</em>. SemiAnalysis. <a href="https://semianalysis.com/2025/09/08/huawei-ascend-production-ramp/">https://semianalysis.com/2025/09/08/huawei-ascend-production-ramp/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p><a href="https://www.trendforce.com/news/2025/07/29/news-chinese-ai-chip-unicorns-enflame-metax-unveil-next-gen-chips-shortly-after-nvidias-h20-return/">https://www.trendforce.com/news/2025/07/29/news-chinese-ai-chip-unicorns-enflame-metax-unveil-next-gen-chips-shortly-after-nvidias-h20-return/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p><a href="https://www.sophgo.com/sophon-u/product/introduce/sc11_fp300.html?locale=zh_CN">https://www.sophgo.com/sophon-u/product/introduce/sc11_fp300.html?locale=zh_CN</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p><a href="https://www.moffettai.com/xin-wen-zhong-xin/mlperf-shou-ci-da-mo-xing-tui-li-ce-ping-fang.html">https://www.moffettai.com/xin-wen-zhong-xin/mlperf-shou-ci-da-mo-xing-tui-li-ce-ping-fang.html</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p><a href="https://arxiv.org/abs/2505.02146">https://arxiv.org/abs/2505.02146</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p><a href="https://x.com/sriramk/status/1961072926561550366">https://x.com/sriramk/status/1961072926561550366</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p><a href="https://www.economist.com/business/2025/08/21/china-is-quietly-upstaging-america-with-its-open-models">https://www.economist.com/business/2025/08/21/china-is-quietly-upstaging-america-with-its-open-models</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-14" href="#footnote-anchor-14" class="footnote-number" contenteditable="false" target="_self">14</a><div class="footnote-content"><p><a href="https://www.machineyearning.io/i/170203312/open-source-continues-to-raise-qmin">https://www.machineyearning.io/i/170203312/open-source-continues-to-raise-qmin</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-15" href="#footnote-anchor-15" class="footnote-number" contenteditable="false" target="_self">15</a><div class="footnote-content"><p><a href="https://simonwillison.net/2025/Aug/30/claude-degraded-quality/">https://simonwillison.net/2025/Aug/30/claude-degraded-quality/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-16" href="#footnote-anchor-16" class="footnote-number" contenteditable="false" target="_self">16</a><div class="footnote-content"><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:145046153,&quot;url&quot;:&quot;https://sinocities.substack.com/p/how-is-chinas-eastern-data-western&quot;,&quot;publication_id&quot;:244069,&quot;publication_name&quot;:&quot;Sinocities&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!xnPI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d4ea1ae-9808-493f-b5ad-caf23cc6de7d_1280x1280.png&quot;,&quot;title&quot;:&quot;How is China's \&quot;Eastern Data Western Compute\&quot;&#65288;&#19996;&#25968;&#35199;&#31639;) developing?&quot;,&quot;truncated_body_text&quot;:&quot;In 2018, Apple moved most of its data for China-based users to cloud computing servers in southwestern Guizhou&#8217;s Gui&#8217;an New Area. This decision reflected a reality since China&#8217;s 2016 Cybersecurity Law, that in order for foreign technology firms to operate in China they would have to store user data within the country. Apple no doubt saw this as a necess&#8230;&quot;,&quot;date&quot;:&quot;2024-05-28T14:03:08.816Z&quot;,&quot;like_count&quot;:27,&quot;comment_count&quot;:5,&quot;bylines&quot;:[{&quot;id&quot;:18320749,&quot;name&quot;:&quot;Andrew Stokols&quot;,&quot;handle&quot;:&quot;sinocities&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4bcf3948-a878-4ab3-a9e6-626fd6e0f7b9_786x784.jpeg&quot;,&quot;bio&quot;:&quot;Phd researcher at MIT Department of Urban Studies and Planning. \nWriting about cities, China/SE Asia, geopolitics, future cities and architecture.&quot;,&quot;profile_set_up_at&quot;:&quot;2021-05-24T05:40:04.286Z&quot;,&quot;reader_installed_at&quot;:&quot;2024-01-29T17:31:55.870Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:68347,&quot;user_id&quot;:18320749,&quot;publication_id&quot;:244069,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:244069,&quot;name&quot;:&quot;Sinocities&quot;,&quot;subdomain&quot;:&quot;sinocities&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Cities, China, Geopolitics, Technology, and everything in between.\nBy: Andrew Stokols ; Phd, MIT Department of Urban Studies &amp; Planning, Masters in Urban Planning, Harvard.&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d4ea1ae-9808-493f-b5ad-caf23cc6de7d_1280x1280.png&quot;,&quot;author_id&quot;:18320749,&quot;primary_user_id&quot;:18320749,&quot;theme_var_background_pop&quot;:&quot;#EA82FF&quot;,&quot;created_at&quot;:&quot;2020-12-22T09:28:15.902Z&quot;,&quot;email_from_name&quot;:null,&quot;copyright&quot;:&quot;Andrew Stokols&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:&quot;magaziney&quot;,&quot;is_personal_mode&quot;:false}}],&quot;twitter_screen_name&quot;:&quot;astoks&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null,&quot;status&quot;:{&quot;bestsellerTier&quot;:null,&quot;subscriberTier&quot;:1,&quot;leaderboard&quot;:{&quot;ranking&quot;:&quot;trending&quot;,&quot;rank&quot;:64,&quot;publicationName&quot;:&quot;Sinocities&quot;,&quot;label&quot;:&quot;International&quot;,&quot;categoryId&quot;:51282},&quot;vip&quot;:false,&quot;badge&quot;:{&quot;type&quot;:&quot;subscriber&quot;,&quot;tier&quot;:1,&quot;color&quot;:null}}}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://sinocities.substack.com/p/how-is-chinas-eastern-data-western?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!xnPI!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d4ea1ae-9808-493f-b5ad-caf23cc6de7d_1280x1280.png" loading="lazy"><span class="embedded-post-publication-name">Sinocities</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">How is China's "Eastern Data Western Compute"&#65288;&#19996;&#25968;&#35199;&#31639;) developing?</div></div><div class="embedded-post-body">In 2018, Apple moved most of its data for China-based users to cloud computing servers in southwestern Guizhou&#8217;s Gui&#8217;an New Area. This decision reflected a reality since China&#8217;s 2016 Cybersecurity Law, that in order for foreign technology firms to operate in China they would have to store user data within the country. Apple no doubt saw this as a necess&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">2 years ago &#183; 27 likes &#183; 5 comments &#183; Andrew Stokols</div></a></div></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-17" href="#footnote-anchor-17" class="footnote-number" contenteditable="false" target="_self">17</a><div class="footnote-content"><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:171903403,&quot;url&quot;:&quot;https://pstaidecrypted.substack.com/p/assessing-chinas-support-for-open&quot;,&quot;publication_id&quot;:2296890,&quot;publication_name&quot;:&quot;AIStackDecrypted&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!pvMv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png&quot;,&quot;title&quot;:&quot;Assessing China's support for open weight models, and the real meaning of DeepSeek's latest release&quot;,&quot;truncated_body_text&quot;:&quot;Much breathless prose has been written about China and open source or, more precisely, open weight AI models. Chinese companies now sport four of the top five or so open such models, with commentators claiming these models being free means that China will lead on AI and that the US government should support more open source/weight models. Now is the tim&#8230;&quot;,&quot;date&quot;:&quot;2025-08-30T16:00:20.654Z&quot;,&quot;like_count&quot;:10,&quot;comment_count&quot;:3,&quot;bylines&quot;:[{&quot;id&quot;:18097050,&quot;name&quot;:&quot;Paul Triolo&quot;,&quot;handle&quot;:&quot;pstasiatech&quot;,&quot;previous_name&quot;:&quot;Paul T&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ae5afe75-2e43-4924-9013-5e457f8c73c4_400x400.jpeg&quot;,&quot;bio&quot;:&quot;Long time civil servant now swimming in the private sector &quot;,&quot;profile_set_up_at&quot;:&quot;2021-12-05T16:30:20.359Z&quot;,&quot;reader_installed_at&quot;:&quot;2024-03-11T01:49:34.656Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2316045,&quot;user_id&quot;:18097050,&quot;publication_id&quot;:2296890,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:2296890,&quot;name&quot;:&quot;AIStackDecrypted&quot;,&quot;subdomain&quot;:&quot;pstaidecrypted&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;My personal Substack devoted to AI Stack issues and US China relations&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png&quot;,&quot;author_id&quot;:18097050,&quot;primary_user_id&quot;:18097050,&quot;theme_var_background_pop&quot;:&quot;#00C2FF&quot;,&quot;created_at&quot;:&quot;2024-01-28T02:17:27.339Z&quot;,&quot;email_from_name&quot;:null,&quot;copyright&quot;:&quot;Paul Triolo&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:&quot;newspaper&quot;,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null,&quot;status&quot;:{&quot;bestsellerTier&quot;:null,&quot;subscriberTier&quot;:10,&quot;leaderboard&quot;:null,&quot;vip&quot;:false,&quot;badge&quot;:{&quot;type&quot;:&quot;subscriber&quot;,&quot;tier&quot;:10,&quot;color&quot;:null}}}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://pstaidecrypted.substack.com/p/assessing-chinas-support-for-open?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!pvMv!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png" loading="lazy"><span class="embedded-post-publication-name">AIStackDecrypted</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">Assessing China's support for open weight models, and the real meaning of DeepSeek's latest release</div></div><div class="embedded-post-body">Much breathless prose has been written about China and open source or, more precisely, open weight AI models. Chinese companies now sport four of the top five or so open such models, with commentators claiming these models being free means that China will lead on AI and that the US government should support more open source/weight models. Now is the tim&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">8 months ago &#183; 10 likes &#183; 3 comments &#183; Paul Triolo</div></a></div></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-18" href="#footnote-anchor-18" class="footnote-number" contenteditable="false" target="_self">18</a><div class="footnote-content"><p>For more on this math, check out <a href="https://www.tensoreconomics.com/">Tensor Economics</a> - this will be a recurring theme in Machine Yearning and energy-compute research.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-19" href="#footnote-anchor-19" class="footnote-number" contenteditable="false" target="_self">19</a><div class="footnote-content"><p><a href="https://x.com/rwang07/status/1918483860498555143">https://x.com/rwang07/status/1918483860498555143</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-20" href="#footnote-anchor-20" class="footnote-number" contenteditable="false" target="_self">20</a><div class="footnote-content"><p><a href="https://en.eeworld.com.cn/mp/XSY/a400532.jspx">https://en.eeworld.com.cn/mp/XSY/a400532.jspx</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-21" href="#footnote-anchor-21" class="footnote-number" contenteditable="false" target="_self">21</a><div class="footnote-content"><p><a href="https://www.ft.com/content/b8e30c54-b71c-4113-8b3e-8f54bc36587d">https://www.ft.com/content/b8e30c54-b71c-4113-8b3e-8f54bc36587d</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-22" href="#footnote-anchor-22" class="footnote-number" contenteditable="false" target="_self">22</a><div class="footnote-content"><p><a href="https://service.caict.ac.cn/">https://service.caict.ac.cn/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-23" href="#footnote-anchor-23" class="footnote-number" contenteditable="false" target="_self">23</a><div class="footnote-content"><p><a href="https://m.mp.oeeee.com/a/BAAFRD0000202507271106701.html">https://m.mp.oeeee.com/a/BAAFRD0000202507271106701.html</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-24" href="#footnote-anchor-24" class="footnote-number" contenteditable="false" target="_self">24</a><div class="footnote-content"><p><a href="https://semianalysis.com/2025/09/08/huawei-ascend-production-ramp/">https://semianalysis.com/2025/09/08/huawei-ascend-production-ramp/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-25" href="#footnote-anchor-25" class="footnote-number" contenteditable="false" target="_self">25</a><div class="footnote-content"><p><a href="https://www.moffettai.com/xin-wen-zhong-xin/qian-yi-ji-mo-xin-gong-bu-zui-xin-da-mo-xing-.html">https://www.moffettai.com/xin-wen-zhong-xin/qian-yi-ji-mo-xin-gong-bu-zui-xin-da-mo-xing-.html</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-26" href="#footnote-anchor-26" class="footnote-number" contenteditable="false" target="_self">26</a><div class="footnote-content"><p><a href="https://mlcommons.org/benchmarks/inference-datacenter/">https://mlcommons.org/benchmarks/inference-datacenter/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-27" href="#footnote-anchor-27" class="footnote-number" contenteditable="false" target="_self">27</a><div class="footnote-content"><p><a href="https://www.crunchbase.com/organization/inspur-group/financial_details">https://www.crunchbase.com/organization/inspur-group/financial_details</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-28" href="#footnote-anchor-28" class="footnote-number" contenteditable="false" target="_self">28</a><div class="footnote-content"><p>https://arxiv.org/pdf/2312.05725</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-29" href="#footnote-anchor-29" class="footnote-number" contenteditable="false" target="_self">29</a><div class="footnote-content"><p>Sources for table:</p><ul><li><p>https://eu.36kr.com/en/p/3411091131567747</p></li><li><p>https://csi-nn2.opensource.alibaba.com/blog/deploy%20on%20th1520</p></li><li><p>https://github.com/XUANTIE-RV/csi-nn2</p></li><li><p>https://www.paddlepaddle.org.cn/documentation/docs/en/install/Tables_en.html</p></li><li><p>https://forum.cambricon.com/uploadfile/user/file/20201125/1606289569710855.pdf</p></li><li><p>https://www.usenix.org/system/files/osdi25-dong.pdf</p></li><li><p>https://docs.gpustack.ai/0.5/tutorials/running-inference-with-hygon-dcus/</p></li><li><p>https://www.alibabacloud.com/blog/ai-model-inference-service-an-overview_602002</p></li><li><p>https://www.tomshardware.com/pc-components/gpus/chinas-moore-threads-polishes-homegrown-cuda-alternative-musa-supports-porting-cuda-code-using-musify-toolkit</p></li><li><p>https://www.dramx.com/News/IC/20230614-34241.html</p></li><li><p>https://www.birentech.com/product_details/1.html</p></li><li><p>https://www.kisacoresearch.com/sites/default/files/documents/moffett_ai_s4_accelerator_datasheet.pdf</p></li><li><p>https://doc.sophgo.com/sdk-docs/v23.05.01/docs_latest_release/docs/SophonSDK_doc/en/html/sdk_intro/1_intro.html</p></li><li><p>https://www.tomshardware.com/news/chinese-gpu-developer-gets-government-funds</p></li><li><p>https://watermelonwater.tech/archives/%20Efficient%20AI%20Model%20Inference%20with%20Paddle%20Lite%20on%20JM9230%20GPU</p></li></ul></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-30" href="#footnote-anchor-30" class="footnote-number" contenteditable="false" target="_self">30</a><div class="footnote-content"><p>https://www.tomshardware.com/pc-components/gpus/nvidia-bans-using-translation-layers-for-cuda-software-to-run-on-other-chips-new-restriction-apparently-targets-zluda-and-some-chinese-gpu-makers</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-31" href="#footnote-anchor-31" class="footnote-number" contenteditable="false" target="_self">31</a><div class="footnote-content"><p>https://sakana.ai/ai-cuda-engineer/</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-32" href="#footnote-anchor-32" class="footnote-number" contenteditable="false" target="_self">32</a><div class="footnote-content"><p>https://www.youtube.com/shorts/W3keOVTy2yE</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-33" href="#footnote-anchor-33" class="footnote-number" contenteditable="false" target="_self">33</a><div class="footnote-content"><p>https://xcoresigma.com/productinfo</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-34" href="#footnote-anchor-34" class="footnote-number" contenteditable="false" target="_self">34</a><div class="footnote-content"><p>https://technews.tw/2025/04/15/moore-threads-musify-toolkit-beat-cuda/</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-35" href="#footnote-anchor-35" class="footnote-number" contenteditable="false" target="_self">35</a><div class="footnote-content"><p>https://baike.baidu.com/item/%E5%BC%A0%E6%96%87/59985602</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-36" href="#footnote-anchor-36" class="footnote-number" contenteditable="false" target="_self">36</a><div class="footnote-content"><p>https://archive.md/gFn3O</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-37" href="#footnote-anchor-37" class="footnote-number" contenteditable="false" target="_self">37</a><div class="footnote-content"><p>https://www.sohu.com/a/803570474_121922042</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-38" href="#footnote-anchor-38" class="footnote-number" contenteditable="false" target="_self">38</a><div class="footnote-content"><p>https://www.birentech.com/news/118.html</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-39" href="#footnote-anchor-39" class="footnote-number" contenteditable="false" target="_self">39</a><div class="footnote-content"><p>https://hc34.hotchips.org/assets/program/conference/day1/GPU%20HPC/HC2022.BirenTech.MikeHong.LingjieXu.v01.pdf</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-40" href="#footnote-anchor-40" class="footnote-number" contenteditable="false" target="_self">40</a><div class="footnote-content"><p>https://www.federalregister.gov/documents/2023/10/19/2023-23048/entity-list-additions</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-41" href="#footnote-anchor-41" class="footnote-number" contenteditable="false" target="_self">41</a><div class="footnote-content"><p>https://www.bis.gov/press-release/commerce-strengthens-export-controls-restrict-chinas-capability-produce-advanced-semiconductors-military</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-42" href="#footnote-anchor-42" class="footnote-number" contenteditable="false" target="_self">42</a><div class="footnote-content"><p>https://www.tomshardware.com/tech-industry/chinas-cxmt-begins-mass-producing-hbm2-memory-well-ahead-of-schedule-2026-was-the-previously-telegraphed-target</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-43" href="#footnote-anchor-43" class="footnote-number" contenteditable="false" target="_self">43</a><div class="footnote-content"><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:161434996,&quot;url&quot;:&quot;https://www.chinatalk.media/p/mapping-chinas-hbm-advancement&quot;,&quot;publication_id&quot;:4220,&quot;publication_name&quot;:&quot;ChinaTalk&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!6mVK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9b5dde60-871d-48d4-9c21-e4f434b3f3c1_256x256.png&quot;,&quot;title&quot;:&quot;Mapping China's HBM Advances&quot;,&quot;truncated_body_text&quot;:&quot;Ray Wang is a Washington-based analyst formerly based in Taipei and Seoul. He focuses on U.S.-China economic and technological statecraft, Chinese foreign policy, and the semiconductor and AI industry in China, South Korea, and Taiwan. You can read more of his writing on his Substack:&quot;,&quot;date&quot;:&quot;2025-04-17T11:03:50.911Z&quot;,&quot;like_count&quot;:35,&quot;comment_count&quot;:3,&quot;bylines&quot;:[{&quot;id&quot;:205724729,&quot;name&quot;:&quot;Ray Wang&quot;,&quot;handle&quot;:&quot;raywang2&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/15d3547f-7053-4ed7-8296-e0d44100b611_1024x1024.png&quot;,&quot;bio&quot;:&quot; I am a semiconductor analyst covering the core of silicon&#8212;logic and memory&#8212;alongside the ecosystem that makes it possible (EDA, semicaps, packaging). I also track the Asia hardware supply chain in detail, particularly Taiwan, South Korea, and China.&quot;,&quot;profile_set_up_at&quot;:&quot;2024-06-14T14:31:24.135Z&quot;,&quot;reader_installed_at&quot;:&quot;2025-01-18T11:40:15.414Z&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;status&quot;:{&quot;bestsellerTier&quot;:null,&quot;subscriberTier&quot;:null,&quot;leaderboard&quot;:null,&quot;vip&quot;:false,&quot;badge&quot;:null}},{&quot;id&quot;:38373023,&quot;name&quot;:&quot;Lily Ottinger&quot;,&quot;handle&quot;:&quot;voidpoliticstaiwan&quot;,&quot;previous_name&quot;:&quot;Lydia Hansen&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F665640fe-9378-4101-9962-9cbfcd59a82c_1080x1080.jpeg&quot;,&quot;bio&quot;:&quot;Political economy, authoritarianism, and second language acquisition. Editor and Taiwan correspondent for ChinaTalk.&quot;,&quot;profile_set_up_at&quot;:&quot;2023-06-10T07:57:04.620Z&quot;,&quot;reader_installed_at&quot;:&quot;2024-02-29T13:38:18.667Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2403514,&quot;user_id&quot;:38373023,&quot;publication_id&quot;:4220,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:4220,&quot;name&quot;:&quot;ChinaTalk&quot;,&quot;subdomain&quot;:&quot;chinatalk&quot;,&quot;custom_domain&quot;:&quot;www.chinatalk.media&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Deep coverage of technology, China, and US policy. We feature original analysis alongside interviews with leading thinkers and policymakers.&quot;,&quot;logo_url&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/9b5dde60-871d-48d4-9c21-e4f434b3f3c1_256x256.png&quot;,&quot;author_id&quot;:1145,&quot;primary_user_id&quot;:1145,&quot;theme_var_background_pop&quot;:&quot;#ff9900&quot;,&quot;created_at&quot;:&quot;2018-12-17T01:44:27.292Z&quot;,&quot;email_from_name&quot;:&quot;ChinaTalk&quot;,&quot;copyright&quot;:&quot;Jordan Schneider&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member Plan&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:&quot;magaziney&quot;,&quot;is_personal_mode&quot;:false}},{&quot;id&quot;:1702440,&quot;user_id&quot;:38373023,&quot;publication_id&quot;:1722898,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:1722898,&quot;name&quot;:&quot;Void Politics Taiwan&quot;,&quot;subdomain&quot;:&quot;voidpoliticstaiwan&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Political economy, authoritarianism, and more&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ebafe62a-b68c-424a-9189-8978bd9d8eac_144x144.png&quot;,&quot;author_id&quot;:38373023,&quot;primary_user_id&quot;:null,&quot;theme_var_background_pop&quot;:&quot;#0068EF&quot;,&quot;created_at&quot;:&quot;2023-06-10T07:57:24.867Z&quot;,&quot;email_from_name&quot;:null,&quot;copyright&quot;:&quot;Lily Ottinger&quot;,&quot;founding_plan_name&quot;:null,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;disabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:&quot;newspaper&quot;,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100,&quot;status&quot;:{&quot;bestsellerTier&quot;:100,&quot;subscriberTier&quot;:1,&quot;leaderboard&quot;:null,&quot;vip&quot;:false,&quot;badge&quot;:{&quot;type&quot;:&quot;bestseller&quot;,&quot;tier&quot;:100}}}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://www.chinatalk.media/p/mapping-chinas-hbm-advancement?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!6mVK!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9b5dde60-871d-48d4-9c21-e4f434b3f3c1_256x256.png" loading="lazy"><span class="embedded-post-publication-name">ChinaTalk</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">Mapping China's HBM Advances</div></div><div class="embedded-post-body">Ray Wang is a Washington-based analyst formerly based in Taipei and Seoul. He focuses on U.S.-China economic and technological statecraft, Chinese foreign policy, and the semiconductor and AI industry in China, South Korea, and Taiwan. You can read more of his writing on his Substack&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">a year ago &#183; 35 likes &#183; 3 comments &#183; Ray Wang and Lily Ottinger</div></a></div></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-44" href="#footnote-anchor-44" class="footnote-number" contenteditable="false" target="_self">44</a><div class="footnote-content"><p>https://x.com/firstadopter/status/1691877797487165443</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-45" href="#footnote-anchor-45" class="footnote-number" contenteditable="false" target="_self">45</a><div class="footnote-content"><p>https://www.granitefirm.com/blog/us/2022/05/13/yield-rate-comparison/</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-46" href="#footnote-anchor-46" class="footnote-number" contenteditable="false" target="_self">46</a><div class="footnote-content"><p>https://blog.csdn.net/weixin_44005328/article/details/150305332</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-47" href="#footnote-anchor-47" class="footnote-number" contenteditable="false" target="_self">47</a><div class="footnote-content"><p>The literal translation for the tool - &#8220;&#25512;&#29702;&#35760;&#24518;&#25968;&#25454;&#31649;&#29702;&#22120;&#8221; (<em>&#8220;Reasoning Memory Data Manager&#8221;</em>) all but confirms it&#8217;s intended to meet the needs of these new workloads.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-48" href="#footnote-anchor-48" class="footnote-number" contenteditable="false" target="_self">48</a><div class="footnote-content"><p>https://mp.weixin.qq.com/s/HHh0kXdOC8EkDDm7HLEiEA</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-49" href="#footnote-anchor-49" class="footnote-number" contenteditable="false" target="_self">49</a><div class="footnote-content"><p>https://mp.weixin.qq.com/s/069VzI0wd9FJRKh9H4Bq_Q</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-50" href="#footnote-anchor-50" class="footnote-number" contenteditable="false" target="_self">50</a><div class="footnote-content"><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:171271836,&quot;url&quot;:&quot;https://pstaidecrypted.substack.com/p/china-ai-update-innovation-across&quot;,&quot;publication_id&quot;:2296890,&quot;publication_name&quot;:&quot;AIStackDecrypted&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!pvMv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png&quot;,&quot;title&quot;:&quot;China AI update: Innovation across a distributed and heterogeneous AI stack heats up&quot;,&quot;truncated_body_text&quot;:&quot;After a two-week swing through the Chinese AI sector in late July that included detailed discussions with Chinese AI players at the World AI Conference (WAIC) and visits to many individual companies, plus discussions with domestic and international investors, I was struck by the dynamism of the companies involved, and the out-of-the-box thinking on AI m&#8230;&quot;,&quot;date&quot;:&quot;2025-08-22T14:53:51.981Z&quot;,&quot;like_count&quot;:16,&quot;comment_count&quot;:1,&quot;bylines&quot;:[{&quot;id&quot;:18097050,&quot;name&quot;:&quot;Paul Triolo&quot;,&quot;handle&quot;:&quot;pstasiatech&quot;,&quot;previous_name&quot;:&quot;Paul T&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ae5afe75-2e43-4924-9013-5e457f8c73c4_400x400.jpeg&quot;,&quot;bio&quot;:&quot;Long time civil servant now swimming in the private sector &quot;,&quot;profile_set_up_at&quot;:&quot;2021-12-05T16:30:20.359Z&quot;,&quot;reader_installed_at&quot;:&quot;2024-03-11T01:49:34.656Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2316045,&quot;user_id&quot;:18097050,&quot;publication_id&quot;:2296890,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:2296890,&quot;name&quot;:&quot;AIStackDecrypted&quot;,&quot;subdomain&quot;:&quot;pstaidecrypted&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;My personal Substack devoted to AI Stack issues and US China relations&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png&quot;,&quot;author_id&quot;:18097050,&quot;primary_user_id&quot;:18097050,&quot;theme_var_background_pop&quot;:&quot;#00C2FF&quot;,&quot;created_at&quot;:&quot;2024-01-28T02:17:27.339Z&quot;,&quot;email_from_name&quot;:null,&quot;copyright&quot;:&quot;Paul Triolo&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:&quot;newspaper&quot;,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null,&quot;status&quot;:{&quot;bestsellerTier&quot;:null,&quot;subscriberTier&quot;:10,&quot;leaderboard&quot;:null,&quot;vip&quot;:false,&quot;badge&quot;:{&quot;type&quot;:&quot;subscriber&quot;,&quot;tier&quot;:10,&quot;accent_colors&quot;:null}}},{&quot;id&quot;:2295132,&quot;name&quot;:&quot;Ryan Cunningham&quot;,&quot;handle&quot;:&quot;machineyearning&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!cF6f!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e40b64a-002f-4b1b-bc62-16df254e2f7b_995x995.png&quot;,&quot;bio&quot;:&quot;energy-compute and technoeconomics &#8226; founder @ Edgerunner Ventures &#8226; ex-Uber&quot;,&quot;profile_set_up_at&quot;:&quot;2022-01-31T20:24:54.099Z&quot;,&quot;reader_installed_at&quot;:&quot;2022-03-11T16:39:03.951Z&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:null,&quot;status&quot;:{&quot;bestsellerTier&quot;:null,&quot;subscriberTier&quot;:1,&quot;leaderboard&quot;:null,&quot;vip&quot;:false,&quot;badge&quot;:{&quot;type&quot;:&quot;subscriber&quot;,&quot;tier&quot;:1,&quot;accent_colors&quot;:null}},&quot;primaryPublicationId&quot;:108589,&quot;primaryPublicationName&quot;:&quot;Machine Yearning&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://www.machineyearning.io&quot;,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://pstaidecrypted.substack.com/p/china-ai-update-innovation-across?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!pvMv!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png" loading="lazy"><span class="embedded-post-publication-name">AIStackDecrypted</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">China AI update: Innovation across a distributed and heterogeneous AI stack heats up</div></div><div class="embedded-post-body">After a two-week swing through the Chinese AI sector in late July that included detailed discussions with Chinese AI players at the World AI Conference (WAIC) and visits to many individual companies, plus discussions with domestic and international investors, I was struck by the dynamism of the companies involved, and the out-of-the-box thinking on AI m&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">8 months ago &#183; 16 likes &#183; 1 comment &#183; Paul Triolo and Ryan Cunningham</div></a></div></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-51" href="#footnote-anchor-51" class="footnote-number" contenteditable="false" target="_self">51</a><div class="footnote-content"><p>https://www.scmp.com/tech/tech-war/article/3276213/chinas-nvidia-wannabe-tencent-backed-ai-chip-start-enflame-flags-ipo-intention</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-52" href="#footnote-anchor-52" class="footnote-number" contenteditable="false" target="_self">52</a><div class="footnote-content"><p>https://www.reuters.com/technology/tencent-backed-ai-chip-startup-enflame-raises-27-bln-state-linked-investors-2023-09-28/</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-53" href="#footnote-anchor-53" class="footnote-number" contenteditable="false" target="_self">53</a><div class="footnote-content"><p><a href="https://www.cyjiaomu.com/article/f26di">https://www.cyjiaomu.com/article/f26di</a> This is a deliberate twist on an older saying, &#23398;&#32780;&#20248;&#21017;&#20181; (xu&#233; &#233;r y&#333;u z&#233; sh&#236;, literally &#8220;those who excel at study become officials&#8221;).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-54" href="#footnote-anchor-54" class="footnote-number" contenteditable="false" target="_self">54</a><div class="footnote-content"><p>https://www.yicaiglobal.com/news/bytedances-volcano-engine-starts-price-war-in-chinas-enterprise-llms-sector</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-55" href="#footnote-anchor-55" class="footnote-number" contenteditable="false" target="_self">55</a><div class="footnote-content"><p>https://mp.weixin.qq.com/s?__biz=MjM5OTM2NjgxMw==&amp;mid=2652667795&amp;idx=4&amp;sn=f587658582fce8b5902de822c184a880&amp;chksm=bd0f0d5e6112154b313ac48c832e5418e3373f2ad173140649197158ae097d372cf185f656b3#rd</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-56" href="#footnote-anchor-56" class="footnote-number" contenteditable="false" target="_self">56</a><div class="footnote-content"><p>https://www.reuters.com/technology/baidu-chip-design-unit-kunlunxin-wins-over-139-million-orders-china-mobile-2025-08-22/</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-57" href="#footnote-anchor-57" class="footnote-number" contenteditable="false" target="_self">57</a><div class="footnote-content"><p>https://www.eet-china.com/news/202507309451.html</p></div></div>]]></content:encoded></item><item><title><![CDATA[Energy-Compute Theory: China's New Objective Function]]></title><description><![CDATA[The first principles playbook for the new economy]]></description><link>https://www.machineyearning.io/p/energy-compute-theory-chinas-new</link><guid isPermaLink="false">https://www.machineyearning.io/p/energy-compute-theory-chinas-new</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Thu, 14 Aug 2025 14:15:08 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/36bd065e-07d8-4a4a-9ef0-3ebc1e2c93b2_2912x2096.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>So, GPT-5 was released a few days ago.</p><p>Anyway, back to Machine Yearning, where every week is infrastructure week.</p><div><hr></div><p>Last week, <a href="https://pstaidecrypted.substack.com/">Paul Triolo</a> and I went on <a href="https://www.sinicapodcast.com/p/the-world-ai-conference-in-shanghai">the Sinica Podcast</a> with to tell Kaiser what we saw at Shanghai&#8217;s World AI Conference.</p><p>In it, I mentioned one of the keynote speeches at WAIC by Professor Wang Yu (&#27754;&#29577;), Chairman of the Department of Electrical Engineering at Tsinghua University, who laid out the blueprint for his philosophy towards AI infrastructure development.</p><p>I also talked about my investment philosophy - an emerging framework called &#8220;watt-to-bit&#8221; in some fringe Western investor circles.</p><p>Turns out, China&#8217;s consensus AI strategy is mapping precisely to this investment doctrine. So it&#8217;s probably time we pull back the curtain on it.</p><p>In this week&#8217;s post, I&#8217;ll cover:</p><ol><li><p><strong>The What:</strong> Introduce Energy-Compute Theory and its primary inputs</p></li><li><p><strong>The How:</strong></p><ol><li><p>Dissect Professor Wang Yu&#8217;s keynote as an application of energy-compute theory</p></li><li><p>Identify implications for model and semiconductor development</p></li></ol></li><li><p><strong>The Why:</strong></p><ol><li><p>Assess conditions for AI&#8217;s diffusion into the Chinese economy</p></li><li><p>Lay out the implications for both China and US AI ecosystems</p></li></ol></li></ol><h3>Key Takeaways to Expect</h3><ol><li><p><strong>Energy is the constraint</strong>: Watts are the ultimate scarce resource; compute capacity and intelligence output scale with how efficiently energy is converted into tokens.</p></li><li><p><strong>China&#8217;s objective function</strong>: China is explicitly reframing AI and industrial policy around energy-compute optimization, treating it as a physics-based equation rather than a vague race for &#8220;AI leadership.&#8221;</p></li><li><p><strong>A first-principles playbook</strong>: Grounding strategy in watts, tokens, and physics creates a clearer, more durable framework for competing in the new economy.</p></li><li><p><strong>The West&#8217;s blind spot</strong>: U.S. narratives fixate on talent, chips, or qualitative intelligence thresholds, ignoring the underlying energy-compute substrate.</p></li><li><p><strong>Winning advantage</strong>: Nations and firms that master energy-compute efficiency will dominate AI capability, industrial output, and ultimately geopolitical power.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.sinicapodcast.com/p/the-world-ai-conference-in-shanghai" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wqin!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f835dbb-59b3-47ab-9b42-9fef074f772e_1400x1000.png 424w, https://substackcdn.com/image/fetch/$s_!wqin!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f835dbb-59b3-47ab-9b42-9fef074f772e_1400x1000.png 848w, https://substackcdn.com/image/fetch/$s_!wqin!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f835dbb-59b3-47ab-9b42-9fef074f772e_1400x1000.png 1272w, https://substackcdn.com/image/fetch/$s_!wqin!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f835dbb-59b3-47ab-9b42-9fef074f772e_1400x1000.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wqin!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f835dbb-59b3-47ab-9b42-9fef074f772e_1400x1000.png" width="433" height="309.2857142857143" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3f835dbb-59b3-47ab-9b42-9fef074f772e_1400x1000.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1000,&quot;width&quot;:1400,&quot;resizeWidth&quot;:433,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.sinicapodcast.com/p/the-world-ai-conference-in-shanghai&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wqin!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f835dbb-59b3-47ab-9b42-9fef074f772e_1400x1000.png 424w, https://substackcdn.com/image/fetch/$s_!wqin!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f835dbb-59b3-47ab-9b42-9fef074f772e_1400x1000.png 848w, https://substackcdn.com/image/fetch/$s_!wqin!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f835dbb-59b3-47ab-9b42-9fef074f772e_1400x1000.png 1272w, https://substackcdn.com/image/fetch/$s_!wqin!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f835dbb-59b3-47ab-9b42-9fef074f772e_1400x1000.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Thanks again for having me on, Kaiser! <a href="https://www.sinicapodcast.com/p/the-world-ai-conference-in-shanghai">https://www.sinicapodcast.com/p/the-world-ai-conference-in-shanghai</a></figcaption></figure></div><h1>Watts and Bits</h1><p>Watt-to-bit frameworks merge power generation, storage, compute, cooling, and other AI infrastructure into a single asset class: &#8220;energy-compute infrastructure.&#8221;</p><p>Maximizing ROI with energy-compute infrastructure requires optimizing models and hardware along a 2-dimensional plane of intelligence throughput (tokens-per-second) and energy inputs (power), rather than strictly deploying the best models or chips.</p><p>This is not yet a mainstream philosophy in Silicon Valley group chats. Today, most Westerners are &#8220;scale-pilled&#8221;, meaning our prevailing strategy is to throw as many resources at a compute problem as possible, assume that the model will disproportionately improve, and the earliest movers will capture most of the profits, justifying incredibly high sticker prices.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>&#8220;Resources&#8221; in this context could mean data, chips, capital, or even talent (see &#8220;The Sovereign AI Trap&#8221; in my previous post on DeepSeek).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8732c9d6-7469-461c-bc69-5160c5614368&quot;,&quot;caption&quot;:&quot;I've spent the past few days drafting this essay, and it's the longest I've published to date. Budget ~20-30 minutes to digest it, best read over coffee.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;DeepSeek and the End of an Era&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:2295132,&quot;name&quot;:&quot;Ryan Cunningham&quot;,&quot;bio&quot;:&quot; https://edgerunner.io&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5e40b64a-002f-4b1b-bc62-16df254e2f7b_995x995.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2025-01-31T19:34:38.453Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0fffb4c5-ce48-4cc2-bdd5-1a554c6fdab5_2912x2096.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.machineyearning.io/p/deepseek-and-the-end-of-an-era&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:156186035,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:3,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;Machine Yearning&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-RAu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39397fed-3de8-46df-ab35-4f48dc5edf4e_300x300.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>Common refrains of this strategy are the Blitzscaling playbooks of the preceding tech boom and Sam Altman&#8217;s famous call to action to &#8220;capture the light cone of all future value in the universe&#8221; via AGI.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> The latter is probably the most literal interpretation of the <em>leading sector innovation</em> strategy Jeffrey Ding argues against in his debut book, &#8220;Technology and the Rise of Great Powers.&#8221;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a></p><p>I&#8217;m an accelerationist, but scale-pilled thinking is a memetic hazard. It dangerously restrains intellectual and industrial discourse to a Thucydides Trap<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> where the &#8220;race to AGI&#8221; must be won by any means&#8230; without an empirical measure for what success means in the first place.</p><p>Also, it&#8217;s abundantly clear that the Chinese AI ecosystem doesn&#8217;t operate on that paradigm anyway.</p><h1>Introducing Energy-Compute Theory</h1><p>This isn&#8217;t a treatise or an econ lecture. It&#8217;s just a quick intro to energy-compute theory as I&#8217;ve been drafting it - an &#8220;efficiency-pilled&#8221; counter to the &#8220;scale-pilled&#8221; doctrine dominating accelerationist dialogue. It&#8217;s also a work in progress as my investment theses evolve overtime.</p><h2>&#129752; BeanChina vs. BeanUSA</h2><p>Imagine two rival coffee shop chains, <strong>BeanChina</strong> and <strong>BeanUSA</strong>. Every day they try to deliver as many high-quality lattes as possible. Both stores have 3 levers they can pull, and are subject to a quality constraint.</p><ol><li><p><strong>Number of espresso machines on the counter.</strong> More machines = more cups you can produce simultaneously. It also means more energy.</p></li><li><p><strong>Energy per machine.</strong> Machines with higher power ratings can generally brew faster, serving more customers.</p></li><li><p><strong>Shots per bag of beans</strong>. How many shots you&#8217;re able to stretch out of a given bean quantity.</p></li><li><p><strong>Taste score the barista must hit</strong>. This is your minimum flavor standard<em>. </em>Drinks below this quality won&#8217;t sell. Tastier drinks can be more expensive.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!W3Nm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!W3Nm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!W3Nm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!W3Nm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!W3Nm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!W3Nm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!W3Nm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!W3Nm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!W3Nm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!W3Nm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0477e578-c4a1-459a-83a2-ea84f43903ad_1536x1024.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: It came to me in a dream.</figcaption></figure></div><h3>Throughput</h3><p>The stores pursue different strategies:</p><ul><li><p>BeanUSA&#8217;s top-of-the-line <strong>H200 coffee-makers </strong>can process 4.8 units of coffee slurry per second, but the store uses a two-shot-per-lattee recipe. Its lattes are generally rated as tastier - a 9/10 on the scale. The H200 coffee-makers cost $35,000 each and have a 3 year useful life.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a></p></li><li><p>BeanChina&#8217;s <strong>H20 coffee-makers</strong> are a bit less powerful at 4.0 units per second, but they buy <em>slightly</em> better grinders and experiment with single-shot recipes that customers still rate at least 8/10. The H20s cost $14,000 each and have a 3 year useful life. Additionally, BeanChina has ~40% cheaper power on its side of the street. <a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a></p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!t6lz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!t6lz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png 424w, https://substackcdn.com/image/fetch/$s_!t6lz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png 848w, https://substackcdn.com/image/fetch/$s_!t6lz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png 1272w, https://substackcdn.com/image/fetch/$s_!t6lz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!t6lz!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png" width="1200" height="536.4406779661017" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:844,&quot;width&quot;:1888,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:1785106,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd96bfa39-9cb6-4a3c-ba5a-d14c12366450_1888x844.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!t6lz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png 424w, https://substackcdn.com/image/fetch/$s_!t6lz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png 848w, https://substackcdn.com/image/fetch/$s_!t6lz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png 1272w, https://substackcdn.com/image/fetch/$s_!t6lz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F59e29114-130d-4c27-8543-6f916fc9bba2_1888x844.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Given the above, we estimate their respective throughputs at <strong>60 cups (BeanUSA) and 100 cups (BeanChina) per-machine-hour. </strong>With cups per hour as brewing bandwidth &#247; (shots per latte):</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sHmT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sHmT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png 424w, https://substackcdn.com/image/fetch/$s_!sHmT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png 848w, https://substackcdn.com/image/fetch/$s_!sHmT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png 1272w, https://substackcdn.com/image/fetch/$s_!sHmT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sHmT!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png" width="1200" height="230.76923076923077" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:280,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:508825,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sHmT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png 424w, https://substackcdn.com/image/fetch/$s_!sHmT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png 848w, https://substackcdn.com/image/fetch/$s_!sHmT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png 1272w, https://substackcdn.com/image/fetch/$s_!sHmT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2f4d3db-cadd-4971-8151-8a96f714f80b_1892x364.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Even with slower silicon in their coffee-makers, BeanChina can still serve ~67% more lattes per machine than BeanUSA.</p><h3>Daily P&amp;L</h3><p>Now let&#8217;s see what that looks like on an income statement:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7GB6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7GB6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png 424w, https://substackcdn.com/image/fetch/$s_!7GB6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png 848w, https://substackcdn.com/image/fetch/$s_!7GB6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png 1272w, https://substackcdn.com/image/fetch/$s_!7GB6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7GB6!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png" width="1200" height="632.1428571428571" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:767,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:1324455,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7GB6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png 424w, https://substackcdn.com/image/fetch/$s_!7GB6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png 848w, https://substackcdn.com/image/fetch/$s_!7GB6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png 1272w, https://substackcdn.com/image/fetch/$s_!7GB6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F72403f0a-3e59-4837-9e2e-7b360722d8c7_1880x990.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>BeanChina sells <strong>~70% more lattes,</strong> while its <strong>energy cost per cup is ~3.3&#215; lower (</strong>0.28 cups per watt-hour vs. 0.09 cups per watt-hour)<strong>,</strong> and its <strong>all-in cost per cup is roughly half</strong> of BeanUSA&#8217;s - even though the drink scores just one point lower on taste.</p><h3>Corporate Innovation</h3><p>BeanChina charges less per cup than BeanUSA, but makes virtually the same amount of profit.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a> BeanChina invests the margin into one of three options:</p><ul><li><p><strong>Efficiency gains.</strong> R&amp;D for much denser bean variants. Further grinder technique refinements. At best, BeanChina now only needs 7 grams of beans per cup - and early taste tests indicate their lattes are even tastier than before.</p></li><li><p><strong>More machines. </strong>A linear increase to throughput.</p></li><li><p><strong>Hybrid strategy. </strong>A combination of greater per-cup efficiency and volume increase.</p></li></ul><p>BeanUSA tries to make up for this by buying more of its H200 coffee-makers, but without changing the underlying recipe, their unit costs remain the same.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Mw8i!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Mw8i!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png 424w, https://substackcdn.com/image/fetch/$s_!Mw8i!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png 848w, https://substackcdn.com/image/fetch/$s_!Mw8i!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png 1272w, https://substackcdn.com/image/fetch/$s_!Mw8i!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Mw8i!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png" width="1200" height="663.4615384615385" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:805,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:1341951,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Mw8i!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png 424w, https://substackcdn.com/image/fetch/$s_!Mw8i!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png 848w, https://substackcdn.com/image/fetch/$s_!Mw8i!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png 1272w, https://substackcdn.com/image/fetch/$s_!Mw8i!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9793754b-0f7c-47aa-9177-ebb40060b67a_1878x1038.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>After the investments in bean R&amp;D, BeanChina maintains its energy cost per cup advantage, while simultaneously reducing bean costs. Unit costs are now just 34% of BeanUSA&#8217;s.</p><p>In strats B and C, the two stores are still making roughly the same amount of profit, but BeanUSA had to shell out $350K in upfront capital to do it - $210K more than BeanChina&#8217;s incremental capex.</p><h3>The DeepBean Price War</h3><p>Finally, BeanChina&#8217;s scientists unveil a new recipe, the <strong>DeepBean-R1</strong>, which is rated as slightly tastier than BeanUSA&#8217;s most popular brew: a 9.2/10. And what&#8217;s more, DeepBean-R1 is available for just $2.25 / cup vs. BeanUSA&#8217;s $3.00.</p><p>Curious customers swamp BeanChina&#8217;s counters, and BeanUSA panics, slashing its own price to $2.25 while scrambling for a new recipe.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PtPe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PtPe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png 424w, https://substackcdn.com/image/fetch/$s_!PtPe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png 848w, https://substackcdn.com/image/fetch/$s_!PtPe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png 1272w, https://substackcdn.com/image/fetch/$s_!PtPe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PtPe!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png" width="1200" height="557.1428571428571" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:676,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1200,&quot;bytes&quot;:1157893,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!PtPe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png 424w, https://substackcdn.com/image/fetch/$s_!PtPe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png 848w, https://substackcdn.com/image/fetch/$s_!PtPe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png 1272w, https://substackcdn.com/image/fetch/$s_!PtPe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1cd0758b-23d6-43cb-b7ac-c22695623dd5_1878x872.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>BeanUSA&#8217;s daily profit drops by about <strong>$11K</strong>. BeanChina now clears <strong>$52K</strong>, a gap of <strong>$23K</strong> despite only spending 40% of BeanUSA&#8217;s capex.</p><h3>&#129380; Takeaways</h3><p>BeanChina might have started at a disadvantage on machines and taste, but several key investments helped close the gap:</p><ul><li><p><strong>Scarcity-driven innovation. </strong>Invested in key efficiency gains (the single-shot recipe, finer grinders) which materially lowered both their energy cost per bean, and the beans required per cup.</p></li><li><p><strong>Reinvestment in R&amp;D. </strong>Overtime, investments in bean engineering yielded material improvements to taste that closed the quality gap with <strong>BeanPT-5 </strong>on a much lower cost basis.</p></li><li><p><strong>Energy abundance. </strong>Greater energy abundance on the China side of the street meant <em>both</em> lower energy costs <em>and</em> more power available for coffee-makers. BeanChina has plenty of runway to expand to larger store footprints.</p></li></ul><p>In response, BeanUSA tried to maintain its lead by buying more coffee-makers. But starting from a higher cost basis put them at a long-term disadvantage, especially once BeanChina closed the quality gap and started a price war. BeanUSA is speeding up their recipe development cycle, but is now in a defensive position.</p><p><strong>This is energy-compute theory in a nutshell.</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Still with me? Subscribe if you&#8217;re enjoying this so far</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2>Doctrine</h2><p>Energy-compute theory prioritizes the diffusion of valuable, affordable intelligence into sectors of the economy where those qualities are paramount for adoption.</p><ul><li><p>Value is determined by <em>human-level performance</em> (e.g. tasks completed) at or above a pre-determined quality constraint (yield, defect rate, MCAT score)</p></li><li><p>Affordability is determined by <em>energy efficiency</em> (e.g. FLOPS / Watt, tokens / joule) as a proxy for cost. Importantly, we use physical energy units rather than fiat currency units to remove noise from market saturation, trade policies, or general animal spirits.</p></li></ul><h2>Empiricism</h2><p>In plain English, your objective is to maximize <strong>valuable intelligence per energy unit</strong>, subject to a <strong>minimum</strong> <strong>IQ score</strong> and <strong>token throughput speed.</strong></p><p>Here&#8217;s how we capture those values and constraints:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\begin{aligned}\n\\text{Daily tokens} \\;(\\mathcal{T})\n&amp;= E \\times \\eta \\\\[4pt]\n\\text{Instant throughput} \\;(\\tau_{\\text{sec}})\n&amp;= \\frac{E}{S}\\times\\eta\n&amp;&amp;\\bigl(S = 86{,}400\\ \\text{sec / day}\\bigr) \\\\[6pt]\n\\text{s.t.}\\quad\n&amp;Q \\;\\ge\\; Q_{\\min},\n\\quad\n\\tau_{\\text{sec}} \\;\\ge\\; \\tau_{\\text{sec,min}}\n\\end{aligned}&quot;,&quot;id&quot;:&quot;NDIWJSTHWQ&quot;}" data-component-name="LatexBlockToDOM"></div><p>Where, for a given combination of model(s), chip(s), and cooling infrastructure:</p><ul><li><p><em>E</em> = compute budget in joules (e.g. 1 megawatt-hour = 3.6 x 10^9 joules)</p></li><li><p><em>eta</em> = token efficiency (tokens per joule). This is a combination metric which is based on the specs of your model and the chip it&#8217;s deployed on.</p></li><li><p><em>S</em> = your time conversion factor, in this case seconds to days</p></li><li><p><em>Q</em> = model intelligence (e.g. MMLU, HLE, GPQA-diamond scores)</p></li><li><p><em>Q_min</em> = model intelligence threshold (defined as GPT-4o equivalent, DeepSeek-V3 equivalent, Llama 2 7B equivalent, etc. on some task-defined benchmark)</p></li><li><p><em>tau_sec</em> = real-time token throughput (tokens per second)</p></li><li><p><em>tau_sec,min</em> = Some throughput SLA (e.g. median 100 tokens / second in production)</p></li></ul><p>Let&#8217;s break out each of those pieces.</p><h3>Power (<em>E</em>)</h3><p>This one is easy. What is your total budget of power over time (watt-hours, joules) for your compute load? This can be scaled up to a national level (a country&#8217;s total datacenter terawatt-hours per year)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a>, down to individual units (a Tesla Optimus 2.3kWh battery)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a>, or even at Kardashev scales (Type II civilizations = 4 x 10^26 watts, the total luminosity of our sun).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-14" href="#footnote-14" target="_self">14</a></p><h3>Efficiency (<em>eta</em>)</h3><p>Efficiency is derived from calculating token throughput of a given model-chip combination (tokens / second), subject to the total envelope of power for the hardware and cooling infrastructure. Using our coffee example, this is a combination of the coffee slurry your machine can process per second, and the beans required per cup.</p><p>Why does this matter? If you have a limited power budget - whether for edge robotics or a single data center - your output potential is governed by your token throughput on that budget. That could be revenue and breakeven prices if you&#8217;re a neocloud, or task completions if you&#8217;re a robot.</p><p>Fortunately, all model-chip deployment combinations can derive these values, provided you have the requisite specs. A recent survey from researchers at Shanghai Jiao Tong University found a range of energy efficiency values for different deployment combinations of 7B models, from 0.0167 tokens/J at the low end to as high as 46.66 tokens/J.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-15" href="#footnote-15" target="_self">15</a></p><p>Those highest values, deployed on PIM/NDP hardware, are no slouches on throughput speed either, achieving 481 to 1998 tokens/sec on a modest 12 to 43 watt power budget - see below.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!V-E_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!V-E_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png 424w, https://substackcdn.com/image/fetch/$s_!V-E_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png 848w, https://substackcdn.com/image/fetch/$s_!V-E_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png 1272w, https://substackcdn.com/image/fetch/$s_!V-E_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!V-E_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png" width="1310" height="1174" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1174,&quot;width&quot;:1310,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:233549,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!V-E_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png 424w, https://substackcdn.com/image/fetch/$s_!V-E_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png 848w, https://substackcdn.com/image/fetch/$s_!V-E_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png 1272w, https://substackcdn.com/image/fetch/$s_!V-E_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F77703b6c-5c4a-4e5f-a311-8120a55e7262_1310x1174.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: &#8220;<a href="https://arxiv.org/pdf/2410.04466v2">Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective</a>&#8221;, Jinhao Li et al., 2024.</figcaption></figure></div><p>You&#8217;ll notice that clusters of different hardware types - FPGAs, ASICs, PIM/NDP - are shifting upwards and to the left on this chart, compared to the left-skewed distribution of GPU values.</p><p>There are physical limits to how much power can be deployed into a single chip governed by its thermal resistance and offtake from cooling equipment. Since nearly all energy into a chip is converted into heat, past a certain point, the chip will literally melt.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-16" href="#footnote-16" target="_self">16</a></p><p>So yielding better performance-per-chip as we approach that thermal wall is of critical importance.</p><p>Investing in different combinations of model and chip designs can greatly influence the <strong>affordability</strong> of your energy-compute infrastructure.</p><h3>Intelligence (<em>Q, Qmin</em>)</h3><p>But suppose you didn&#8217;t want just a 7B parameter model. Suppose you wanted GPT-4o or DeepSeek-V3, with hundreds of billions of parameters?</p><p>The above chart keeps model intelligence (<em>Q</em>) constant but varies the hardware to create efficiency frontiers. All else equal, larger and more intelligent models require more storage and more power to run, so you can&#8217;t just compare efficiency values - they must be subject to IQ constraints.</p><p>A small model might be fine for some tasks, and insufficient for others. This depends entirely on the context in which the model is deployed. That context determines your <em>Qmin</em>, the minimum acceptable IQ threshold.</p><p>Generally speaking, Qmin negatively impacts energy efficiency. Memory footprints for larger models influence the size of the chips required to store them (more VRAM), and the computational load of inference operations (arithmetic intensity, or operations-per-byte).</p><p>Model architectural and inference advancements like sparsity, quantization, fast decoding, and more can significantly reduce your memory footprint and arithmetic intensity, letting you compress more capable models into smaller, more energy-efficient footprints.</p><p>Investments in all of these areas influence the <strong>value</strong> of your energy-compute infrastructure.</p><h3>Constraints (<em>tau_sec, tau_sec,min</em>)</h3><p>Human conversation and reading speeds typically fall between 4 and 8 tokens per second.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-17" href="#footnote-17" target="_self">17</a> That might be enough for basic chat, but it&#8217;s not a material improvement for productivity use cases. This is even more pronounced with reasoning models, which spend 10x more tokens &#8220;thinking&#8221; about their response before issuing a verdict.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-18" href="#footnote-18" target="_self">18</a></p><p>If an inference deployment wants the model to be considered useful, especially if the target for time between chat turns is between 240ms and 760ms, you need to significantly overshoot typical reading speeds.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-19" href="#footnote-19" target="_self">19</a> Artificial Analysis pegs some of the top performing API endpoints at a median of 100 to 500 tokens / second or greater.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-20" href="#footnote-20" target="_self">20</a></p><p>Furthermore, the longer your model takes to generate a response, the more energy (watt-seconds, joules) is being used in that response.</p><p>These constraints jointly govern <strong>perceived value</strong> and <strong>affordability </strong>of your energy-compute infrastructure.</p><h1>An Efficiency-Pilled Ecosystem</h1><p>With that summed up, let&#8217;s jump in to how the China AI ecosystem seems to be applying energy-compute theory at scale. The keynote speeches for the Shanghai State-Owned Capital Investment (SSCI) forum were live-streamed via the conference mobile app, so I was able to get the raw files, transcribe, and translate them while I was in-country.</p><p>What follows isn&#8217;t a one-for-one translation of the slide, nor is it all the slides. It&#8217;s just a narrative voiceover of what&#8217;s being discussed. Some of my translations might be off, so please correct me if I&#8217;m way off base on something.  </p><h2>China&#8217;s new objective function</h2><p>In his keynote, and supported by several other speakers in the SSCI (Shanghai State-Owned Capital Investment) forum, Prof. Yu lays out his vision for breaking free of one-dimensional thinking - reframing the goal from &#8220;race to AGI&#8221; to &#8220;ubiquitous edge intelligence for applications&#8221; (&#27867;&#31471;&#20391;&#26234;&#33021;&#24212;&#29992; - <em>f&#224;n du&#257;n c&#232; zh&#236;n&#233;ng y&#236;ngy&#242;ng</em>).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-21" href="#footnote-21" target="_self">21</a></p><p>To achieve this, Prof. Yu calls for merging intelligence capability and accessibility into a single objective function: <strong>maximizing inference energy efficiency subject to a given IQ threshold. </strong>That IQ constraint guarantees we are focused on what Prof. Yu calls <strong>&#8220;high quality (&#39640;&#36136;&#37327; - </strong><em><strong>g&#257;o zh&#236;li&#224;ng</strong></em><strong>) tokens.&#8221;</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HDmz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HDmz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png 424w, https://substackcdn.com/image/fetch/$s_!HDmz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png 848w, https://substackcdn.com/image/fetch/$s_!HDmz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png 1272w, https://substackcdn.com/image/fetch/$s_!HDmz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HDmz!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png" width="1180" height="692.1153846153846" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:854,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1180,&quot;bytes&quot;:1791044,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!HDmz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png 424w, https://substackcdn.com/image/fetch/$s_!HDmz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png 848w, https://substackcdn.com/image/fetch/$s_!HDmz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png 1272w, https://substackcdn.com/image/fetch/$s_!HDmz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30964609-3c5b-4c1e-b62e-7b61aba7228f_2032x1192.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">&#20160;&#20040;&#26159;&#39640;&#36136;&#37327; - what is high-quality &#8220;tokens/J&#8221;? &#20248;&#21270;&#30446;&#26631; - objective function, or &#8220;optimization goals&#8221;</figcaption></figure></div><p>Prof. Yu explains that typical energy efficiency metrics for hardware - like FLOPS per watt, or OPS per watt - do not appropriately capture this tradeoff.</p><ul><li><p>In the AI 1.0 era, speech, images, and text were stored in various kinds of data formats, which made comparative performance for hardware and models difficult to measure</p></li><li><p>In the AI 2.0 era however, newer multi-modal models are capable of converting all kinds of data inputs into <em>tokens</em>, then reformulating the problem into a next-token-prediction challenge</p></li></ul><p>Prof. Yu calls the token the &#8220;core production factor&#8221; in the era of artificial intelligence. (&#26680;&#24515;&#30340;&#29983;&#20135;&#35201;&#32032;<em> - h&#233;x&#299;n de sh&#275;ngch&#462;n y&#224;os&#249;</em>).</p><ul><li><p>Tokens are more generalizable than OPS for modern machine learning applications</p></li><li><p>They can be used to represent all kinds of encoded information - pixels, code, language, conditional motion sequences (see Waymo)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-22" href="#footnote-22" target="_self">22</a>, and so on</p></li></ul><p>As the core production factor, developers care greatly about our systems&#8217; ability to generate these quickly and efficiently.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pYhM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pYhM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png 424w, https://substackcdn.com/image/fetch/$s_!pYhM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png 848w, https://substackcdn.com/image/fetch/$s_!pYhM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png 1272w, https://substackcdn.com/image/fetch/$s_!pYhM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pYhM!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png" width="1182" height="686.7939560439561" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:846,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1182,&quot;bytes&quot;:4534200,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!pYhM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png 424w, https://substackcdn.com/image/fetch/$s_!pYhM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png 848w, https://substackcdn.com/image/fetch/$s_!pYhM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png 1272w, https://substackcdn.com/image/fetch/$s_!pYhM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0f7c404-8543-468d-b586-d02ace3f0994_3158x1836.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">&#20154;&#24037;&#26234;&#33021;&#26102;&#20195;&#26680;&#24515;&#30340;&#29983;&#20135;&#35201;&#32032;&#8211;&#8211;Token: &#8220;The core production factor in the era of artificial intelligence &#8211;&#8211; the Token&#8221;</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tHhV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tHhV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png 424w, https://substackcdn.com/image/fetch/$s_!tHhV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png 848w, https://substackcdn.com/image/fetch/$s_!tHhV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png 1272w, https://substackcdn.com/image/fetch/$s_!tHhV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tHhV!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png" width="1207" height="714.5837912087912" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:862,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1207,&quot;bytes&quot;:4825492,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tHhV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png 424w, https://substackcdn.com/image/fetch/$s_!tHhV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png 848w, https://substackcdn.com/image/fetch/$s_!tHhV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png 1272w, https://substackcdn.com/image/fetch/$s_!tHhV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7211ac-5d50-419e-bcd1-bf53c706355d_3248x1924.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Eagle-eyed readers might notice the chart Prof. Yu is using comes from the SJTU paper we covered earlier in the article. </figcaption></figure></div><p>Generating tokens requires energy. So let&#8217;s next assess our energy efficiency requirements (&#33021;&#25928;&#38656;&#27714;<em> - n&#233;ngxi&#224;o x&#363;qi&#250;</em>) across different intelligence levels.</p><p>According to Prof. Yu, he pre-defines target requirement zones (&#38656;&#27714;&#21306;&#22495;<em> - x&#363;qi&#250; q&#363;y&#249;</em>) in green polygons for efficiency targets in each intelligence level.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TdvS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TdvS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png 424w, https://substackcdn.com/image/fetch/$s_!TdvS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png 848w, https://substackcdn.com/image/fetch/$s_!TdvS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png 1272w, https://substackcdn.com/image/fetch/$s_!TdvS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TdvS!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png" width="1178" height="684.4697802197802" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/df9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:846,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1178,&quot;bytes&quot;:4608848,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!TdvS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png 424w, https://substackcdn.com/image/fetch/$s_!TdvS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png 848w, https://substackcdn.com/image/fetch/$s_!TdvS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png 1272w, https://substackcdn.com/image/fetch/$s_!TdvS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf9a0285-ae0e-42d6-af3d-009c40ab06a5_3158x1836.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol><li><p><strong>Level 1 Intelligence: Intelligent chatbots</strong> (&#26234;&#33021;&#23545;&#35805;&#21161;&#25163;<em> - zh&#236;n&#233;ng du&#236;hu&#224; zh&#249;sh&#466;u</em>). Some systems have been able to break the target zone, hitting 10 tokens / joule. So we&#8217;re making progress there.</p></li><li><p><strong>Level 2 Intelligence: Logical question-answering</strong> (&#36923;&#36753;&#38382;&#39064;&#35299;&#31572; <em>- lu&#243;j&#237; w&#232;nt&#237; ji&#283;d&#225;</em>). No systems, which include various configurations of Qwen-3 and DeepSeek-R1 models (the text is hard to make out) at this point have been able to breach the target eta of &gt;10 tokens / joule. In fact, we&#8217;re actually 1 to 2 orders of magnitude outside the target area.</p></li><li><p><strong>Level 3 Intelligence: Embodied AI systems</strong> (&#20855;&#36523;&#26234;&#33021;&#25511;&#21046;<em> - j&#249; sh&#275;n zh&#236;n&#233;ng k&#242;ngzh&#236;</em>). This is the realm of robotics deployments and visual-language-action (VLA) models. Here, the target area is further restricted to 20 tokens / joule, due to higher throughput (<em>&#21534;&#21520; - t&#363;nt&#468;</em>) and energy efficiency requirements. Currently, we&#8217;re even further behind in this realm, by 2-3 orders of magnitude off in energy efficiency.</p></li></ol><p><strong>This brings up a key challenge in energy-compute theory</strong>: currently, as IQ requirements increase, your inference energy efficiency decreases. But our energy efficiency requirements <strong>actually</strong> <strong>increase</strong> as we ascend the intelligence curve.</p><p>So how can we solve these two conflicting equations?</p><h2>Robotics as an energy-compute problem</h2><p>Prof. Yu asks the audience to consider the challenge of mobile robotics. A mobile robot does not have access to wired power, and cannot offload more complex queries to a centralized, larger model. Like humans, it must be able to think on the fly within a limited energy envelope.</p><h3>Background</h3><p>Power limits chips, chips limit memory and computational throughput, which limits token throughput. If we want robots to perform well in industrial environments (20+ hour operations), and current battery tech constrains unit lifecycles to 0.9 to 2.3 kilowatt-hours<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-23" href="#footnote-23" target="_self">23</a> (3.2 to 8.3 megajoules), we need to think critically about how to allocate that energy budget in computations.</p><ul><li><p>At best, with an energy-efficiency rating of 5 tokens per joule, an 8 MJ (think Tesla Optimus) form factor would have a budget of 40M tokens to process on a single charge, not accounting for idle power, locomotion, etc..</p></li><li><p>Given typical consumption of ~50K tokens for a reasoning run with thinking turned off, you might get 800 tasks completed with a reasoning model. Visual tokens are processed differently than text, but let&#8217;s simplify and avoid that for now.</p></li><li><p>Versus 500K to 2.5M tokens for end-to-end reasoning with thinking turned on, you may only get ~20 to 80 truly complex tasks completed in that budget.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-24" href="#footnote-24" target="_self">24</a></p></li><li><p>Speed is also a factor - robots must be able to not only think through the task, but complete it quickly. Is that range of completed tasks in a given power envelope faster or slower than human-level performance in an equivalent workday?</p></li></ul><h3>Modeling</h3><p>Let&#8217;s reframe this in the form of an energy-compute problem. Assuming:</p><ul><li><p>You&#8217;d only require a single VLA query per task (&#8220;What are my instructions for placing this box on the shelf&#8221;)</p></li><li><p>Subject to the following constraints:</p><ul><li><p>Energy used cannot exceed your energy budget</p></li><li><p>Tasks completed must meet or exceed human-level performance in the same time period</p></li></ul></li></ul><p>Your task completion ceiling is the lesser of your <em>token budget </em>and your <em>wall-clock time</em>.</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\begin{aligned}\n\\textbf{(1) Energy budget}:\\qquad\nE &amp;= \\bigl(B_{\\text{kWh}}\\times 3.6\\times10^{6}\\bigr)\n\\;-\\; P_{\\text{para}}\\;t_{\\text{op}}\n\\\\[6pt]\n\\textbf{(2) Token capacity}:\\qquad\n\\mathcal{T} &amp;= \\eta \\cdot E\n\\\\[6pt]\n\\textbf{(2a) Token-limited tasks}:\\qquad\nN_{\\text{tokens}} &amp;=\n\\biggl [ \\frac{\\mathcal{T}}{\\tau_{\\text{query}}} \\biggr ]\n\\\\[6pt]\n\\textbf{(3) Time-limited tasks}:\\qquad\nN_{\\text{time}} &amp;=\n\\biggl [\n\\frac{86\\,400}\n{\\,\\displaystyle\\frac{\\tau_{\\text{query}}}{\\tau_{\\text{sec}}}\n+ t_{\\text{exec}}}\\;\n\\biggr ]\n\\\\[8pt]\n\\textbf{(4) Daily task~budget}:\\qquad\nN_{\\text{tasks/day}} &amp;=\n\\min\\!\\bigl(N_{\\text{tokens}},\\,N_{\\text{time}}\\bigr)\n\\\\[4pt]\n\\text{s.t.}\\;&amp;\\;\nN_{\\text{tasks/day}} \\;\\ge\\; N_{\\text{human}}\n\\end{aligned}&quot;,&quot;id&quot;:&quot;ETUBIDUBNE&quot;}" data-component-name="LatexBlockToDOM"></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!u_IX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!u_IX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png 424w, https://substackcdn.com/image/fetch/$s_!u_IX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png 848w, https://substackcdn.com/image/fetch/$s_!u_IX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png 1272w, https://substackcdn.com/image/fetch/$s_!u_IX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!u_IX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png" width="1456" height="612" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:612,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:866559,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!u_IX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png 424w, https://substackcdn.com/image/fetch/$s_!u_IX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png 848w, https://substackcdn.com/image/fetch/$s_!u_IX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png 1272w, https://substackcdn.com/image/fetch/$s_!u_IX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1d97243-60e3-4ad1-857c-2508bf65630f_1900x798.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Results</h3><p>Without getting too much further into the weeds, Prof. Yu assumes that ~100-1000 tokens/sec throughput is about the baseline (<em>tau_sec,min</em>) for useful embodied AI systems. That&#8217;s not a problem for well-designed cloud systems - 8 H20s can run full-blooded DeepSeek R1 at up to 8,000 tokens/sec - but today, decent models on edge hardware achieve 5 to 25 tokens/sec. He derives these figures from a 2019 study on edge robotics requirements.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-25" href="#footnote-25" target="_self">25</a></p><p>In terms of speed, that puts us 2-3 orders of magnitude behind the usefulness threshold. We can&#8217;t yet clear our human-level completed tasks threshold.</p><p>This still doesn&#8217;t factor in the constraint of <em>high-quality</em> token throughput - that you&#8217;re not just completing tasks <em>faster</em> than humans, but qualitatively <em>better</em>. That could be measured in terms of yield, defect rate, intervention rate, etc..</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QEQV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QEQV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png 424w, https://substackcdn.com/image/fetch/$s_!QEQV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png 848w, https://substackcdn.com/image/fetch/$s_!QEQV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png 1272w, https://substackcdn.com/image/fetch/$s_!QEQV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QEQV!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png" width="996" height="590.3489010989011" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:863,&quot;width&quot;:1456,&quot;resizeWidth&quot;:996,&quot;bytes&quot;:4576467,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!QEQV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png 424w, https://substackcdn.com/image/fetch/$s_!QEQV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png 848w, https://substackcdn.com/image/fetch/$s_!QEQV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png 1272w, https://substackcdn.com/image/fetch/$s_!QEQV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F95387f27-72cc-46b3-bfc5-25206321f449_3246x1924.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">&#31471;&#20391;&#24212;&#29992;&#38656;&#27714;&#39044;&#35745;&#22312;100-1000 token/s&#65292;&#29616;&#26377;&#31995;&#32479;&#20165;&#33021;&#23454;&#29616;5-25 token/s: &#8220;Edge requirements: 100-1000 token/s, Current systems: 5-25 token/s.&#8221; &#20113;&#20391;&#31995;&#32479;&#29702;&#35770;&#19978;&#38480;&#22312;25-30k token/s/&#33410;&#28857;&#65292;&#29616;&#26377;&#23454;&#29616;&#20173;&#26377;2-3&#20493;&#20248;&#21270;&#31354;&#38388;: &#8220;Cloud-side theoretical max: 25-30k token/s/node, Current systems: 2-3&#215; optimization potential.&#8221;</figcaption></figure></div><h2>It&#8217;s not just about robotics</h2><p>Let&#8217;s zoom out for a second. Why the obsession with robotics here?</p><p>Robotics is not just a sexy category. It&#8217;s a forcing function for rapid economic diffusion. By solving for energy-efficiency, you&#8217;re also solving for affordability in the economy&#8217;s most cost-sensitive sectors.</p><p>So when Prof. Yu mentions &#8220;ubiquitous edge intelligence for applications,&#8221; he&#8217;s talking about all kinds of economic applications, not just robots.</p><p>Consider one of the other keynote speakers: She Ying, founder of Haizhi Online.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-26" href="#footnote-26" target="_self">26</a> Haizhi is a digital platform connecting global industrial buyers with ~300K small and mid-sized factories.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0eEY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0eEY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png 424w, https://substackcdn.com/image/fetch/$s_!0eEY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png 848w, https://substackcdn.com/image/fetch/$s_!0eEY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png 1272w, https://substackcdn.com/image/fetch/$s_!0eEY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0eEY!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png" width="1004" height="595.0906593406594" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:863,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1004,&quot;bytes&quot;:5270650,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0eEY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png 424w, https://substackcdn.com/image/fetch/$s_!0eEY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png 848w, https://substackcdn.com/image/fetch/$s_!0eEY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png 1272w, https://substackcdn.com/image/fetch/$s_!0eEY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b260678-94cc-404b-b7c2-f2b3e184b9ff_3246x1924.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Haizhi Online may only have ~300K vendor partners, but they&#8217;re a statistically significant representation of China&#8217;s 4 million small and medium sized factories. China is also the only country in the world to have factories handling ALL industrial categories in the United Nations.</figcaption></figure></div><p>In her world, AI&#8217;s (and technology in general) adoption by small and medium-sized manufacturing businesses is contingent on it being &#8220;understandable, useful, <strong>affordable, and effective.</strong>&#8221; That&#8217;s not a controversial statement, but it is rooted in customer empathy - all of her clients operate in intensely competitive local markets on razor-thin margins.</p><p>To that end, the concept of &#8220;SaaS subscription&#8221; as a revenue model does not really vibe in China. This is a big difference between American and Chinese consumers and businesses that&#8217;s often overlooked.</p><p>Many startups we spoke to are selling AI-enabled services by inking revenue-share contracts with their clients. Their strategy is to demonstrate that by partnering up, they can significantly boost their client&#8217;s sales, improving their competitiveness in highly crowded fields (manufacturing, textiles, etc.). Few, if any, were demanding up-front monthly subscription costs.</p><p>Incentives therefore are aligned for <strong>usefulness </strong>(high revenue growth for its partners) and <strong>affordability</strong> (to remain above breakeven), rather than passing usage costs onto customers.</p><p>For a great non-techie summary of this predisposition to ruthless optimization, check out <a href="https://x.com/PoeticJusticeHA">Lesley Gao</a>&#8217;s latest piece on Shaoyang and the &#165;1 disposable lighter.</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:170903501,&quot;url&quot;:&quot;https://theshearforce.substack.com/p/how-a-1-lighter-defied-inflation&quot;,&quot;publication_id&quot;:2614074,&quot;publication_name&quot;:&quot;Shear Force&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!rCS5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c88490-4d6b-4c5d-a08d-0d7d2484804b_1024x1024.png&quot;,&quot;title&quot;:&quot;How a $1 Lighter Defied Inflation for 20 Years&quot;,&quot;truncated_body_text&quot;:&quot;At a time when coffee costs $6 and even cup noodles have seen price hikes, a few products have managed to hold their prices steady for decades. Costco&#8217;s $1.50 hot dog is perhaps the best-known case, maintained at that price only because it is intentionally&quot;,&quot;date&quot;:&quot;2025-08-13T20:59:51.233Z&quot;,&quot;like_count&quot;:2,&quot;comment_count&quot;:0,&quot;bylines&quot;:[{&quot;id&quot;:1286302,&quot;name&quot;:&quot;Lesley Gao&quot;,&quot;handle&quot;:&quot;lesleygao&quot;,&quot;previous_name&quot;:&quot;China Decoded&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/018c997a-158f-449e-8756-1056df9d3dc0_403x403.jpeg&quot;,&quot;bio&quot;:&quot;Perspectives on China&#8217;s economy and policy from years in its factories, supply chains, and cross-border tech ventures.&quot;,&quot;profile_set_up_at&quot;:&quot;2022-05-11T01:13:23.279Z&quot;,&quot;reader_installed_at&quot;:&quot;2022-05-11T01:11:29.203Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2648971,&quot;user_id&quot;:1286302,&quot;publication_id&quot;:2614074,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:2614074,&quot;name&quot;:&quot;Shear Force&quot;,&quot;subdomain&quot;:&quot;theshearforce&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Shear Force offers deep analysis of China&#8217;s industrial policy and manufacturing architecture and explores the structural prerequisites for a plausible American reindustrialization.&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/31c88490-4d6b-4c5d-a08d-0d7d2484804b_1024x1024.png&quot;,&quot;author_id&quot;:1286302,&quot;primary_user_id&quot;:1286302,&quot;theme_var_background_pop&quot;:&quot;#E8B500&quot;,&quot;created_at&quot;:&quot;2024-05-11T04:06:52.186Z&quot;,&quot;email_from_name&quot;:null,&quot;copyright&quot;:&quot;Shear Force&quot;,&quot;founding_plan_name&quot;:null,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;disabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:&quot;newspaper&quot;,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://theshearforce.substack.com/p/how-a-1-lighter-defied-inflation?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!rCS5!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31c88490-4d6b-4c5d-a08d-0d7d2484804b_1024x1024.png" loading="lazy"><span class="embedded-post-publication-name">Shear Force</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">How a $1 Lighter Defied Inflation for 20 Years</div></div><div class="embedded-post-body">At a time when coffee costs $6 and even cup noodles have seen price hikes, a few products have managed to hold their prices steady for decades. Costco&#8217;s $1.50 hot dog is perhaps the best-known case, maintained at that price only because it is intentionally&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">9 months ago &#183; 2 likes &#183; Lesley Gao</div></a></div><h1>&#8220;Ubiquitous edge intelligence&#8221;</h1><p>Let&#8217;s wrap up the keynote by summarizing Prof. Yu&#8217;s calls to action. His milestone for ubiquitous edge intelligence requires the following:</p><ul><li><p>&#9989;   GPT-4o/o1 grade intellect</p></li><li><p>&#9989;   In a &lt;7B parameter (read: sub-8GB) form factor</p></li><li><p>&#9989;   At 100 - 2000 tokens/sec throughput (tau_sec)</p></li><li><p>&#9989;   With &gt;20 tokens/joule energy efficiency (eta)</p></li></ul><p>This succeeds a migration from fused, modular systems (&#27169;&#22359;&#21270;&#31995;&#32479;<em> - m&#243;ku&#224;i hu&#224; x&#236;t&#466;ng</em>) into a single end-to-end model (&#31471;&#21040;&#31471;&#27169;&#22411;<em> - du&#257;n d&#224;o du&#257;n m&#243;x&#237;ng</em>) which can handle a variety of tasks and challenges.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6ZZ9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6ZZ9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png 424w, https://substackcdn.com/image/fetch/$s_!6ZZ9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png 848w, https://substackcdn.com/image/fetch/$s_!6ZZ9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png 1272w, https://substackcdn.com/image/fetch/$s_!6ZZ9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6ZZ9!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png" width="962" height="559.625" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:847,&quot;width&quot;:1456,&quot;resizeWidth&quot;:962,&quot;bytes&quot;:2890578,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!6ZZ9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png 424w, https://substackcdn.com/image/fetch/$s_!6ZZ9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png 848w, https://substackcdn.com/image/fetch/$s_!6ZZ9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png 1272w, https://substackcdn.com/image/fetch/$s_!6ZZ9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc7416f3-7f49-43a3-b62e-9bef73c5252b_2516x1464.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">&#26426;&#22120;&#20154;&#31995;&#32479;&#33539;&#24335;&#21464;&#21270;&#23545;&#31471;&#20391;&#24179;&#21488;&#25552;&#20986;&#20102;&#26356;&#39640;&#30340;&#38656;&#27714;&#8203; &#8594; &#8220;The paradigm shift in robotic systems is driving increased requirements for edge computing platforms.&#8221; Prof. Yu calls for improving &#8220;high quality&#8221; inference energy efficiency to close a 1-2 order of magnitude gap.</figcaption></figure></div><p>In the upper-left quadrant there is is a low-power, high-throughput domain which no tested combinations have been able to breach to date, let alone to do so with GPT-4o grade intellect.</p><p>As we&#8217;ve discussed, at a given iso-efficiency contour (~10 <em>eta</em> in best-case deployments), scaling alone does not take you closer to the upper-left quadrant. It just makes you more power-hungry.</p><p>End-to-end VLA models therefore require investments in hardware, model, and hosting infrastructure designs that considerably depart from scaling law theory. We need order of magnitude improvements in model and chip efficiencies.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JKRg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JKRg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png 424w, https://substackcdn.com/image/fetch/$s_!JKRg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png 848w, https://substackcdn.com/image/fetch/$s_!JKRg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png 1272w, https://substackcdn.com/image/fetch/$s_!JKRg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JKRg!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png" width="1078" height="627.1057692307693" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:847,&quot;width&quot;:1456,&quot;resizeWidth&quot;:1078,&quot;bytes&quot;:2741766,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JKRg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png 424w, https://substackcdn.com/image/fetch/$s_!JKRg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png 848w, https://substackcdn.com/image/fetch/$s_!JKRg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png 1272w, https://substackcdn.com/image/fetch/$s_!JKRg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F55d3224b-03f3-45a9-9076-3d2ec8e56910_2516x1464.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>He groups these improvements into three buckets. They are not mutually exclusive, as each technique carries advantages in each bucket.</p><ol><li><p>&#8203;&#8203;&#21333;&#27425;&#25512;&#29702;&#8203;&#8203;: Single-step inference (Token-by-Token)</p></li><li><p>&#24605;&#32500;&#38142;&#25512;&#29702;: Chain-of-thought reasoning (Action-by-Action)</p></li><li><p>&#31639;&#21147;&#31995;&#32479;: Hardware or system-level optimizations</p></li></ol><h4><strong>Quantization (&#37327;&#21270; - li&#224;nghu&#224;)</strong></h4><p>Conserving memory bandwidth by casting model weights into reduced precision (e.g. FP32, FP16, integer-based formats, or new datatypes). Most hosts use quantization in inference today, but some research labs (like DeepSeek) are using mixed-precision in model training as well. Critical for fitting larger models into constrained memory footprints.</p><p>BinaryBERT was mentioned as an extreme demonstration.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-27" href="#footnote-27" target="_self">27</a></p><h4><strong>Sparsity (&#31232;&#30095;&#21270;&#8203;</strong>&#8203; - x&#299;sh&#363; hu&#224;)</h4><p>Inspired by biological neural networks. &#8220;Deactivating&#8221; unused parameter weights in neural network activations. Even if a weight is set to &#8220;0&#8221;, the operation of multiplying that weight by another still contributes to arithmetic intensity. Critical for reducing computational overhead, and therefore, energy consumption.</p><p>This was a major focus area at the conference. Prof. Yu emphasized spent a fair amount of time detailing the benefits (maximize resource savings) and challenges (minimize degradation) from sparsification methods, and credited <a href="https://arxiv.org/pdf/2301.00774">SparseGPT</a> and <a href="https://arxiv.org/abs/2405.04434">DeepSeek-V2</a> for demonstrating its potential in post-training and training-aware approaches.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-28" href="#footnote-28" target="_self">28</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-29" href="#footnote-29" target="_self">29</a></p><h4><strong>Fast Decoding</strong></h4><p>Speculative decoding and dynamic batching techniques to maximize token throughput within strict latency budgets. DeepSeek&#8217;s MLA attention mechanism is one such example. Critical for aligning with real-time requirements for inference with physical actuators.</p><p>In longer dialogues and super-long context windows (e.g. Kimi K2&#8217;s 1M token context window), inefficient decoding can carry serious memory loads and throughput penalties, making these techniques super valuable as user demands influence context length, memory, and so on.</p><h4><strong>Operator Optimization</strong></h4><p>Kernel fusion and hardware-aware operations (e.g., grouped GEMMs, hoisted activations) to minimize memory bottlenecks and exploit low-power silicon microarchitectures. Critical for achieving token-time and efficiency targets.</p><h4><strong>New Architectures</strong></h4><p>This could be either new chip designs like FPGA accelerators, neuromorphic chips, brain-inspired AI, and more.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-30" href="#footnote-30" target="_self">30</a>  New memory-efficient model architectures like state-space models and task-specific language models are also increasingly in favor, especially for long context windows.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-31" href="#footnote-31" target="_self">31</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-32" href="#footnote-32" target="_self">32</a></p><p>As you can tell from the earlier chart, novel chip designs (ASIC, PIM/NDP) are achieving considerably better token energy efficiency than general-purpose GPUs. However, they may not be suitable for all kinds of computations&#8230; which is why the next advancement is so important.</p><h4><strong>Heterogeneous Cooperation</strong></h4><p>Many research labs throughout China are now training or hosting models on heterogeneous rather than homogeneous chipsets. Distributing inference across hybrid chipsets balances the benefits of each platform, such as the flexibility of multi-core (&#22810;&#26680;) CPU with highly parallel (&#39640;&#24182;&#34892;) GPU throughput.</p><p>As an example, in this paper from the Shanghai AI Lab, the team demonstrates they can not only meet, but actually exceed token throughput for LLM training runs using mixed chip stacks:<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-33" href="#footnote-33" target="_self">33</a></p><blockquote><p>In practical production environments, lower-spec chips usually feature significantly lower pricing and reduced power consumption compared to their high-spec chips. By leveraging the HeteroPP framework for heterogeneous training, which integrates both lower-spec and high-spec chips, we can achieve training performance comparable to or even exceeding that of homogeneous high-spec setups, which preserves high performance and reduces overall training costs.</p><p>Ding Tang, et al., 2025.</p></blockquote><p>Successful implementations here not only yield energy efficiency improvements - they de-link China&#8217;s chip ecosystem from the chip export bottlenecks, as they&#8217;re no longer tied to performance of homogeneous leading-edge GPUs.</p><h1>Takeaways</h1><p>Alright, that was a lot. Let&#8217;s wind it down and bring it all back.</p><p>I started this post by introducing <strong>energy-compute theory, </strong>its inputs, and its constraints. Then, we investigated how China&#8217;s AI ecosystem has been leaning full-tilt into this development framework, in comparison to the US&#8217; preference for IQ-maxxing.</p><p>What are the implications if these ecosystems continue on those development tracks? Here are three to follow:</p><h2><strong>Open-source continues to raise </strong><em><strong>Qmin</strong>.</em></h2><p>Strategically, continuing to release more and more capable open-source models raises the baseline for expected intelligence levels in deployed models. But without commensurate investments in model or chip optimizations, energy efficiency will drop, and inference costs will rise.</p><p>GPT-5 may already be a casualty here. Some skeptics consider its launch a cost-cutting measure rather than a material improvement in capability - an ill-conceived attempt to abstract reasoning effort and save on inference costs by nerfing the consumer experience for 1BN users in the process.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-34" href="#footnote-34" target="_self">34</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NAGq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NAGq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png 424w, https://substackcdn.com/image/fetch/$s_!NAGq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png 848w, https://substackcdn.com/image/fetch/$s_!NAGq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png 1272w, https://substackcdn.com/image/fetch/$s_!NAGq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NAGq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png" width="568" height="526" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:526,&quot;width&quot;:568,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:70257,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/170203312?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!NAGq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png 424w, https://substackcdn.com/image/fetch/$s_!NAGq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png 848w, https://substackcdn.com/image/fetch/$s_!NAGq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png 1272w, https://substackcdn.com/image/fetch/$s_!NAGq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F662fb11b-64c1-4a7a-aa17-20e0e0d783c3_568x526.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <a href="https://community.openai.com/t/gpt-5-is-very-slow-compared-to-4-1-responses-api/1337859/4">OpenAI community forums,</a> accessed 8/12/2025.</figcaption></figure></div><h2><strong>Chip wars diverge national </strong><em><strong>eta</strong></em><strong> values.</strong></h2><p>Thanks to recent shenanigans, Chinese domestic semiconductor independence is now a matter of when, not if.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-35" href="#footnote-35" target="_self">35</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-36" href="#footnote-36" target="_self">36</a></p><p>As the market leader, NVIDIA will continue pushing out more capable chips, but every release since Volta has only averaged a 1.53x improvement in performance-per-watt. So it&#8217;s doubtful that homogeneous NVIDIA systems will achieve the energy efficiency values necessary to meet audacious economic diffusion targets (to say nothing about developer friendliness towards CUDA - NVIDIA still takes the crown there).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pawt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pawt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png 424w, https://substackcdn.com/image/fetch/$s_!pawt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png 848w, https://substackcdn.com/image/fetch/$s_!pawt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!pawt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pawt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;NVIDIA Data Center GPU FP16 Performance per Watt (GFLOPS/Watt) from V100 to Rubin (Corrected B300)&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="NVIDIA Data Center GPU FP16 Performance per Watt (GFLOPS/Watt) from V100 to Rubin (Corrected B300)" title="NVIDIA Data Center GPU FP16 Performance per Watt (GFLOPS/Watt) from V100 to Rubin (Corrected B300)" srcset="https://substackcdn.com/image/fetch/$s_!pawt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png 424w, https://substackcdn.com/image/fetch/$s_!pawt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png 848w, https://substackcdn.com/image/fetch/$s_!pawt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!pawt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F31f42bed-4369-4bc6-8f29-fbba859f39a1_2400x1600.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: NVIDIA data sheets, <a href="https://www.tomshardware.com/tech-industry/semiconductors/nvidia-enterprise-roadmap-rubin-rubin-ultra-feynman-and-silicon-photonics">Toms Hardware (Rubin, projected)</a>.</figcaption></figure></div><p>Hyperscalers have been hard at work developing their own custom silicon in order to reduce their total cost of ownership (TCO), of which energy use is a major factor.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-37" href="#footnote-37" target="_self">37</a> Even in the most resource-rich playgrounds, efficiency matters.</p><p>The post-WAIC establishment of Model-Chips Ecosystem Innovation Alliance is an encouraging signal that integrative R&amp;D circles in China may spur joint advancements in model and chip energy efficiencies. The initial listed members cut across each category:<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-38" href="#footnote-38" target="_self">38</a></p><ul><li><p>Models: StepFun</p></li><li><p>Infrastructure: Infinigence AI, SiliconFlow</p></li><li><p>Hardware: Huawei Ascend, MetaX, Biren, Enflame, Iluvatar CoreX, Cambricon, Moore Threads</p></li></ul><h2><strong>China surpasses the US in energy-compute budgets (</strong><em><strong>E</strong></em><strong>)</strong><em><strong>.</strong></em></h2><p>This will likely happen regardless. China already enjoys a 6,000 terawatt-hour (TWh) per year advantage over the United States today, and that may rise to 10,000 TWh by the end of the decade.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-39" href="#footnote-39" target="_self">39</a></p><p>While the US spends a larger proportion of power on data centers today at 327 TWh vs. China&#8217;s 163 TWh in 2024, that budget is impossible to ignore&#8230; it&#8217;s the equivalent of the cumulative TWh consumption of the entire OECD.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-40" href="#footnote-40" target="_self">40</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-41" href="#footnote-41" target="_self">41</a></p><h2>The Triple Product Advantage</h2><p>Single-dimension scaling has plateaued; the limits of pure parameter growth are now pretty much mainstream.</p><p>Unless the US keeps pace on all three fronts - novel model architectures, high-efficiency silicon, and plenty of power - China&#8217;s AI output will compound past ours in economic value.</p><p>It will be gradual at first, then sudden. In energy-compute theory these levers don&#8217;t add, they multiply: a modest gain in each lever (energy, efficiency, IQ) produces a polynomial jump in the total. This is called a <strong>triple-product advantage.</strong></p><p>Is this doom and gloom for the American AI ecosystem? Not at all, just a call to action. Perhaps we should be thinking less like software venture capitalists and more like energy companies.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-42" href="#footnote-42" target="_self">42</a> That&#8217;s what energy-compute theory is meant to capture.</p><p>Simply put, as model intelligence commoditizes thanks to scaling law limitations, we may finally be entering the era of ruthless optimization.</p><p>Some might call it an &#8220;AI Winter&#8221;&#8230; I call it the start of something great.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Machine Yearning! Subscribe for free to receive new posts and support what I do.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:149705443,&quot;url&quot;:&quot;https://www.dwarkesh.com/p/dylan-jon&quot;,&quot;publication_id&quot;:69345,&quot;publication_name&quot;:&quot;Dwarkesh Podcast&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!QEPJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F90fa9666-5b8b-4685-a8fb-4b64cb7e0333_1080x1080.png&quot;,&quot;title&quot;:&quot;Dylan Patel &amp; Jon (Asianometry) &#8211; How the Semiconductor Industry Actually Works&quot;,&quot;truncated_body_text&quot;:null,&quot;date&quot;:&quot;2024-10-02T14:19:08.659Z&quot;,&quot;like_count&quot;:84,&quot;comment_count&quot;:7,&quot;bylines&quot;:[{&quot;id&quot;:4281466,&quot;name&quot;:&quot;Dwarkesh Patel&quot;,&quot;handle&quot;:&quot;dwarkesh&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb715ffd1-f7d7-4755-af88-c48efe647f5b_400x400.jpeg&quot;,&quot;bio&quot;:&quot;Host of Dwarkesh Podcast&quot;,&quot;profile_set_up_at&quot;:&quot;2021-06-09T22:58:10.864Z&quot;,&quot;reader_installed_at&quot;:&quot;2022-04-03T20:37:19.142Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:246192,&quot;user_id&quot;:4281466,&quot;publication_id&quot;:69345,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:69345,&quot;name&quot;:&quot;Dwarkesh Podcast&quot;,&quot;subdomain&quot;:&quot;dwarkesh&quot;,&quot;custom_domain&quot;:&quot;www.dwarkesh.com&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Deeply researched interviews&quot;,&quot;logo_url&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/90fa9666-5b8b-4685-a8fb-4b64cb7e0333_1080x1080.png&quot;,&quot;author_id&quot;:4281466,&quot;primary_user_id&quot;:4281466,&quot;theme_var_background_pop&quot;:&quot;#D10000&quot;,&quot;created_at&quot;:&quot;2020-07-18T16:36:25.723Z&quot;,&quot;email_from_name&quot;:&quot;Dwarkesh Patel&quot;,&quot;copyright&quot;:&quot;Dwarkesh Patel&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:null,&quot;is_personal_mode&quot;:false}}],&quot;twitter_screen_name&quot;:&quot;dwarkesh_sp&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;podcast&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://www.dwarkesh.com/p/dylan-jon?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!QEPJ!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F90fa9666-5b8b-4685-a8fb-4b64cb7e0333_1080x1080.png" loading="lazy"><span class="embedded-post-publication-name">Dwarkesh Podcast</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title-icon"><svg width="19" height="19" viewBox="0 0 24 24" fill="none" xmlns="http://www.w3.org/2000/svg">
  <path d="M3 18V12C3 9.61305 3.94821 7.32387 5.63604 5.63604C7.32387 3.94821 9.61305 3 12 3C14.3869 3 16.6761 3.94821 18.364 5.63604C20.0518 7.32387 21 9.61305 21 12V18" stroke-linecap="round" stroke-linejoin="round"></path>
  <path d="M21 19C21 19.5304 20.7893 20.0391 20.4142 20.4142C20.0391 20.7893 19.5304 21 19 21H18C17.4696 21 16.9609 20.7893 16.5858 20.4142C16.2107 20.0391 16 19.5304 16 19V16C16 15.4696 16.2107 14.9609 16.5858 14.5858C16.9609 14.2107 17.4696 14 18 14H21V19ZM3 19C3 19.5304 3.21071 20.0391 3.58579 20.4142C3.96086 20.7893 4.46957 21 5 21H6C6.53043 21 7.03914 20.7893 7.41421 20.4142C7.78929 20.0391 8 19.5304 8 19V16C8 15.4696 7.78929 14.9609 7.41421 14.5858C7.03914 14.2107 6.53043 14 6 14H3V19Z" stroke-linecap="round" stroke-linejoin="round"></path>
</svg></div><div class="embedded-post-title">Dylan Patel &amp; Jon (Asianometry) &#8211; How the Semiconductor Industry Actually Works</div></div><div class="embedded-post-cta-wrapper"><div class="embedded-post-cta-icon"><svg width="32" height="32" viewBox="0 0 24 24" xmlns="http://www.w3.org/2000/svg">
  <path classname="inner-triangle" d="M10 8L16 12L10 16V8Z" stroke-width="1.5" stroke-linecap="round" stroke-linejoin="round"></path>
</svg></div><span class="embedded-post-cta">Listen now</span></div><div class="embedded-post-meta">2 years ago &#183; 84 likes &#183; 7 comments &#183; Dwarkesh Patel</div></a></div></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>&#8220;The proposal, put to co-founder Andrew Tulloch, included a billion-dollar package that could exceed $1.5 billion over six years with bonuses taken into account, sources told the <em>WSJ</em>.&#8221; <a href="https://www.itpro.com/technology/artificial-intelligence/deepmind-ceo-demis-hassabis-thinks-metas-multi-billion-dollar-hiring-spree-shows-its-scrambling-to-catch-up-in-the-ai-race">https://www.itpro.com/technology/artificial-intelligence/deepmind-ceo-demis-hassabis-thinks-metas-multi-billion-dollar-hiring-spree-shows-its-scrambling-to-catch-up-in-the-ai-race</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>&#8220;Blitzscaling&#8221;, Chris Yeh and Reid Hoffman, 2016. Strong recommend. <a href="https://hbr.org/2016/04/blitzscaling">https://hbr.org/2016/04/blitzscaling</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Specifically, he said that the opportunity with artificial general intelligence is so incomprehensibly enormous that if OpenAI manages to crack this particular nut, it could &#8220;maybe capture the light cone of all future value in the universe.&#8221; <a href="https://techcrunch.com/2019/05/18/sam-altmans-leap-of-faith/">https://techcrunch.com/2019/05/18/sam-altmans-leap-of-faith/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Jeffrey Ding, 2024. Read it! <a href="https://www.amazon.com/Technology-Rise-Great-Powers-International/dp/0691260346">https://www.amazon.com/Technology-Rise-Great-Powers-International/dp/0691260346</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p><a href="https://en.wikipedia.org/wiki/Thucydides_Trap">https://en.wikipedia.org/wiki/Thucydides_Trap</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>NVIDIA H200 datasheet. <a href="https://nvdam.widen.net/s/nb5zzzsjdf/hpc-datasheet-sc23-h200-datasheet-3002446">https://nvdam.widen.net/s/nb5zzzsjdf/hpc-datasheet-sc23-h200-datasheet-3002446</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>Derived from Viperatech listing. <a href="https://viperatech.com/product/nvidia-hgx-h20/">https://viperatech.com/product/nvidia-hgx-h20/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>We could add a line in here about how BeanUSA&#8217;s CEO just hired a Michelin-rated barista for a 6-year, $1.5B contract - you get the point.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>&#8220;OpenAI announces 80% price drop for o3, it&#8217;s most powerful reasoning model.&#8221; Venturebeat, 2025.<strong> </strong><a href="https://venturebeat.com/ai/openai-announces-80-price-drop-for-o3-its-most-powerful-reasoning-model/">https://venturebeat.com/ai/openai-announces-80-price-drop-for-o3-its-most-powerful-reasoning-model/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p>This story has way overstayed its welcome, but have fun with three what-if epilogues:</p><ul><li><p><strong>New product launch. </strong>BeanUSA unveils its successor brew, the <strong>BeanPT-5</strong>. Initial reviews are positive, but more customers are reporting its taste is somewhat diluted, and some brews take longer to make. Insiders later reveal than BeanUSA is algorithmically varying the &#8220;effort&#8221; of bean flavor in each cup depending on the customer&#8217;s tone at the counter. The net effect does lower unit costs, but the brew&#8217;s reputation suffers.</p></li><li><p><strong>Gridlock. </strong>BeanUSA files for a permit to triple its floorplan and boost foot traffic. The electric utility rejects the plan, stating the grid can&#8217;t support the extra power from the 40 new H200s, and it will take 3 to 8 years to get more power.</p></li><li><p><strong>Nuclear moonshot. </strong>BeanUSA announces they are bypassing the grid application process and are installing a small modular nuclear reactor (SMR) to support up to 1000 new H200s&#8230; but the SMR will take 7 years to license, develop, test, and install.</p></li></ul></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p>https://www.ciphernews.com/articles/the-u-s-and-china-drive-data-center-power-consumption/</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p>https://batteryswapcabinet.com/powering-the-future-how-the-humanoid-robot-boom-is-reshaping-the-lithium-battery-industry/</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-14" href="#footnote-anchor-14" class="footnote-number" contenteditable="false" target="_self">14</a><div class="footnote-content"><p>https://en.wikipedia.org/wiki/Kardashev_scale</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-15" href="#footnote-anchor-15" class="footnote-number" contenteditable="false" target="_self">15</a><div class="footnote-content"><p>LARGE LANGUAGE MODEL INFERENCE ACCELERATION: A COMPREHENSIVE HARDWARE PERSPECTIVE. Jinhao Li et al., 2024. <a href="https://arxiv.org/pdf/2410.04466v2">https://arxiv.org/pdf/2410.04466v2</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-16" href="#footnote-anchor-16" class="footnote-number" contenteditable="false" target="_self">16</a><div class="footnote-content"><p>They don&#8217;t melt in practice, but you won&#8217;t be getting anything out of them - most accelerators&#8217; software shuts down the card until it&#8217;s back in a safe operating temperature range. <a href="https://www.servermania.com/kb/articles/gpu-temperature-range-guide">https://www.servermania.com/kb/articles/gpu-temperature-range-guide</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-17" href="#footnote-anchor-17" class="footnote-number" contenteditable="false" target="_self">17</a><div class="footnote-content"><p>Average adults apparently read up to 350 wpm. We&#8217;ll say that&#8217;s 75th percentile. <a href="https://scholarwithin.com/average-reading-speed">https://scholarwithin.com/average-reading-speed</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-18" href="#footnote-anchor-18" class="footnote-number" contenteditable="false" target="_self">18</a><div class="footnote-content"><p>&#8220;&#8230;reasoning models produce responses 5 to 10 times longer than the similar sized instruct models, even for questions when both types of models can correctly solve.&#8220; CoThink: Token-Efficient Reasoning via Instruct Models Guiding Reasoning Models. Siqi Fan, et al., 2025. <a href="https://arxiv.org/pdf/2505.22017v1">https://arxiv.org/pdf/2505.22017v1</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-19" href="#footnote-anchor-19" class="footnote-number" contenteditable="false" target="_self">19</a><div class="footnote-content"><p>Human Latency Conversational Turns for Spoken Avatar Systems. Jacoby et al., 2024. <a href="https://arxiv.org/pdf/2404.16053v1">https://arxiv.org/pdf/2404.16053v1</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-20" href="#footnote-anchor-20" class="footnote-number" contenteditable="false" target="_self">20</a><div class="footnote-content"><p>Artificial Analysis Language Model API Endpoint Leaderboard, Medium models filter. Accessed 8/13/2025. <a href="https://artificialanalysis.ai/leaderboards/providers?size_class=medium">https://artificialanalysis.ai/leaderboards/providers?size_class=medium</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-21" href="#footnote-anchor-21" class="footnote-number" contenteditable="false" target="_self">21</a><div class="footnote-content"><p>Derived from &#8220;&#27867;&#31471;&#20391;&#26234;&#33021;&#24212;&#29992;&#8221; - &#8220;&#27867;" meaning &#8220;ubiquitous&#8221;, &#8220;&#31471;&#20391;&#8221; meaning edge-side or &#8220;edge&#8221;, &#26234;&#33021; derived from artificial intelligence (&#20154;&#24037;&#26234;&#33021;), and &#24212;&#29992; meaning &#8220;applications.&#8221;</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-22" href="#footnote-anchor-22" class="footnote-number" contenteditable="false" target="_self">22</a><div class="footnote-content"><p>Scaling Laws of Motion Forecasting and Planning: A Technical Report. Baniodeh et al., 2025. <a href="https://arxiv.org/pdf/2506.08228v1">https://arxiv.org/pdf/2506.08228v1</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-23" href="#footnote-anchor-23" class="footnote-number" contenteditable="false" target="_self">23</a><div class="footnote-content"><p>See <a href="https://batteryswapcabinet.com/powering-the-future-how-the-humanoid-robot-boom-is-reshaping-the-lithium-battery-industry/">BatterySwapCabinet</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-24" href="#footnote-anchor-24" class="footnote-number" contenteditable="false" target="_self">24</a><div class="footnote-content"><p>Benchmarking Reasoning Models: From Tokens to Answers. AMD, 2025. <a href="https://rocm.blogs.amd.com/artificial-intelligence/benchmark-reasoning-models/README.html">https://rocm.blogs.amd.com/artificial-intelligence/benchmark-reasoning-models/README.html</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-25" href="#footnote-anchor-25" class="footnote-number" contenteditable="false" target="_self">25</a><div class="footnote-content"><p>&#21069;&#27839;&#24212;&#29992;&#19968;&#26426;&#22120;&#20154;&#26234;&#33021;&#30340;&#22522;&#26412;&#38656;&#27714;&#65292;&#21442;&#32771;&#33258; (&#8220;The basic requirements for cutting-edge applications of robotic intelligence, refer to&#8221;) Deray, Jeremle, Joan Sola, and Juan Andrade-Cetto, "Joint on-manifold self-calibration of odometry model and sensor extrinsics using pre-Integration." 2019 European Conference on Mobile Robots (ECMR). IEEE, 2019. <a href="https://ieeexplore.ieee.org/document/8870942">https://ieeexplore.ieee.org/document/8870942</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-26" href="#footnote-anchor-26" class="footnote-number" contenteditable="false" target="_self">26</a><div class="footnote-content"><p>She Ying: Promote more small and medium-sized factories to be "seen" and "selected."<strong> </strong>INEWS, 2025. <a href="https://inf.news/en/economy/497374f93538b62172833997d324324e.html">https://inf.news/en/economy/497374f93538b62172833997d324324e.html</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-27" href="#footnote-anchor-27" class="footnote-number" contenteditable="false" target="_self">27</a><div class="footnote-content"><p>BinaryBERT: Pushing the Limit of BERT Quantization. Haoli Bai, et al., 2020. <a href="https://arxiv.org/abs/2012.15701">https://arxiv.org/abs/2012.15701</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-28" href="#footnote-anchor-28" class="footnote-number" contenteditable="false" target="_self">28</a><div class="footnote-content"><p>SparseGPT: Massive Language Models Can be Accurately Pruned in One-Shot. Frantar, Alistarh, 2023. <a href="https://arxiv.org/pdf/2301.00774">https://arxiv.org/pdf/2301.00774</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-29" href="#footnote-anchor-29" class="footnote-number" contenteditable="false" target="_self">29</a><div class="footnote-content"><p>DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model. DeepSeek-AI, et al., 2024. <a href="https://arxiv.org/abs/2405.04434">https://arxiv.org/abs/2405.04434</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-30" href="#footnote-anchor-30" class="footnote-number" contenteditable="false" target="_self">30</a><div class="footnote-content"><p>China AI-Brain Research. CSET, 2020. <a href="https://cset.georgetown.edu/publication/china-ai-brain-research/">https://cset.georgetown.edu/publication/china-ai-brain-research/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-31" href="#footnote-anchor-31" class="footnote-number" contenteditable="false" target="_self">31</a><div class="footnote-content"><p>Mamba: Linear-Time Sequence Modeling with Selective State Spaces. Albert Gu, Tri Dao, 2023. <a href="https://arxiv.org/abs/2312.00752">https://arxiv.org/abs/2312.00752</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-32" href="#footnote-anchor-32" class="footnote-number" contenteditable="false" target="_self">32</a><div class="footnote-content"><p>Fastino Launches TLMs (Task-Specific Language Models) with $17.5M Seed Round Led by Khosla Ventures. Yahoo! Finance, 2025. <a href="https://finance.yahoo.com/news/fastino-launches-tlms-task-specific-215600480.html">https://finance.yahoo.com/news/fastino-launches-tlms-task-specific-215600480.html</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-33" href="#footnote-anchor-33" class="footnote-number" contenteditable="false" target="_self">33</a><div class="footnote-content"><p>H2: Towards Efficient Large-Scale LLM Training on Hyper-Heterogeneous Cluster over 1,000 Chips. Ding Tang, et al., 2025. <a href="https://arxiv.org/pdf/2505.17548v1">https://arxiv.org/pdf/2505.17548v1</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-34" href="#footnote-anchor-34" class="footnote-number" contenteditable="false" target="_self">34</a><div class="footnote-content"><p>Hacker News, accessed 8/12/2025. <a href="https://news.ycombinator.com/item?id=44851557">https://news.ycombinator.com/item?id=44851557</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-35" href="#footnote-anchor-35" class="footnote-number" contenteditable="false" target="_self">35</a><div class="footnote-content"><p>NVIDIA and AMD to pay 15% of China chip sale revenues to US government. Financial Times, 8/10/2025. <a href="https://www.ft.com/content/cd1a0729-a8ab-41e1-a4d2-8907f4c01cac">https://www.ft.com/content/cd1a0729-a8ab-41e1-a4d2-8907f4c01cac</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-36" href="#footnote-anchor-36" class="footnote-number" contenteditable="false" target="_self">36</a><div class="footnote-content"><p>China cautions tech firms over NVIDIA H20 AI chip purchases, sources say. Reuters, 8/12/2025. <a href="https://www.reuters.com/world/china/china-cautions-tech-firms-over-nvidia-h20-ai-chip-purchases-sources-say-2025-08-12/">https://www.reuters.com/world/china/china-cautions-tech-firms-over-nvidia-h20-ai-chip-purchases-sources-say-2025-08-12/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-37" href="#footnote-anchor-37" class="footnote-number" contenteditable="false" target="_self">37</a><div class="footnote-content"><p>The Rise of Custom AI Chips: How Big Tech is Challenging NVIDIA&#8217;s Dominance. Aranca, 4/28/2025. <a href="https://www.aranca.com/knowledge-library/articles/investment-research/the-rise-of-custom-ai-chips-how-big-tech-is-challenging-nvidias-dominance">https://www.aranca.com/knowledge-library/articles/investment-research/the-rise-of-custom-ai-chips-how-big-tech-is-challenging-nvidias-dominance</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-38" href="#footnote-anchor-38" class="footnote-number" contenteditable="false" target="_self">38</a><div class="footnote-content"><p>Chinese semiconductor, AI firms form pact as NVIDIA faces inquiry on H20 chip&#8217;s security. SCMP, 7/31/2025. <a href="https://www.scmp.com/tech/tech-war/article/3320301/chinese-semiconductor-ai-firms-form-pact-nvidia-faces-inquiry-h20-chips-security">https://www.scmp.com/tech/tech-war/article/3320301/chinese-semiconductor-ai-firms-form-pact-nvidia-faces-inquiry-h20-chips-security</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-39" href="#footnote-anchor-39" class="footnote-number" contenteditable="false" target="_self">39</a><div class="footnote-content"><p>AsianPower, 8/7/2025. <a href="https://asian-power.com/news/chinas-power-consumption-breach-13000-twh-in-2030">https://asian-power.com/news/chinas-power-consumption-breach-13000-twh-in-2030</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-40" href="#footnote-anchor-40" class="footnote-number" contenteditable="false" target="_self">40</a><div class="footnote-content"><p>Monthly Electricity Statistics, IEA, 7/17/2025. <a href="https://www.iea.org/data-and-statistics/data-tools/monthly-electricity-statistics">https://www.iea.org/data-and-statistics/data-tools/monthly-electricity-statistics</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-41" href="#footnote-anchor-41" class="footnote-number" contenteditable="false" target="_self">41</a><div class="footnote-content"><p>Energy and AI, IEA,  4/10/2025. <a href="https://iea.blob.core.windows.net/assets/dd7c2387-2f60-4b60-8c5f-6563b6aa1e4c/EnergyandAI.pdf">https://iea.blob.core.windows.net/assets/dd7c2387-2f60-4b60-8c5f-6563b6aa1e4c/EnergyandAI.pdf</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-42" href="#footnote-anchor-42" class="footnote-number" contenteditable="false" target="_self">42</a><div class="footnote-content"><p>&#8220;Hydraulically Fractured Horizontal Wells: A Technology Poised To Deliver Another Energy-Related Breakthrough of Enormous Scale.&#8221; Greg Leveille,<strong> </strong>Journal of Petroleum Technology, 2/1/2024. <a href="https://jpt.spe.org/guest-editorial-hydraulically-fractured-horizontal-wells-a-technology-poised-to-deliver-another-energy-related-breakthrough-of-enormous-scale">https://jpt.spe.org/guest-editorial-hydraulically-fractured-horizontal-wells-a-technology-poised-to-deliver-another-energy-related-breakthrough-of-enormous-scale</a></p></div></div>]]></content:encoded></item><item><title><![CDATA[AI, the Tortoise, and the Hare]]></title><description><![CDATA[assessing the American AI Action Plan]]></description><link>https://www.machineyearning.io/p/ai-the-tortoise-and-the-hare</link><guid isPermaLink="false">https://www.machineyearning.io/p/ai-the-tortoise-and-the-hare</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Mon, 04 Aug 2025 17:52:40 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/f5438b2e-e02d-40cd-9db2-4ad0ef5b39f2_2912x2096.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>I just wrapped up a week in China at the World AI Conference where I visited with leading research labs, cloud computing incumbents, robotics startups, and so many more talented teams. It was an eye-opening experience that confirmed many of my suspicions about the breakneck pace of development in that ecosystem, while correcting some other ones.</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fo3w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fo3w!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg 424w, https://substackcdn.com/image/fetch/$s_!fo3w!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg 848w, https://substackcdn.com/image/fetch/$s_!fo3w!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!fo3w!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fo3w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg" width="1456" height="798" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:798,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1978301,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/169853792?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fo3w!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg 424w, https://substackcdn.com/image/fetch/$s_!fo3w!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg 848w, https://substackcdn.com/image/fetch/$s_!fo3w!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!fo3w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F117e28ab-a0e7-4495-9be1-6d4cfe9f1455_3283x1799.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Outside the Shanghai Expo Center with the Tech Buzz China crew</figcaption></figure></div><p><em>I&#8217;ve got a lot of upcoming posts on the China AI market map, its rate of diffusion, and global AI infrastructure developments coming out of this. But for now, let&#8217;s take stock of the Action Plans unveiled last week and set the table.</em></p><p><em>HUGE thank you to <a href="https://x.com/ruima">Rui Ma</a> and the <a href="https://techbuzzchina.com/">Tech Buzz China </a>team for organizing this tour. Our delegation of researchers, academics, and industry experts definitely came away informed and invigorated.</em></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">If you haven&#8217;t yet, hit &#8220;Subscribe&#8221; and don&#8217;t miss the next drop.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div><hr></div><h1>The &#8220;race&#8221; so far</h1><p>Last week, the US published its renewed <a href="https://www.whitehouse.gov/wp-content/uploads/2025/07/Americas-AI-Action-Plan.pdf">AI Action Plan</a>, subtitled &#8220;Winning the AI Race,&#8221; and it had some surprises in it. Some great, some mid, and some <em>d&#233;j&#224; vu.</em></p><p>This dropped while I was in in Shanghai, shuttling between the World AI Conference expo hall (sold out to 30,000 participants), state-sponsored industrial parks, and cutting-edge research labs. And at this conference, China&#8217;s competing <a href="https://www.mfa.gov.cn/eng/xw/zyxw/202507/t20250729_11679232.html">Global Governance AI Action Plan</a> also dropped. This led to some fun conversations with curious local experts.</p><p>I left the tour with two reactions:</p><ol><li><p>China&#8217;s AI build-out is even faster than Western headlines admit, and</p></li><li><p>Most Westerners are still obsessed with who&#8217;s first, not how they get there</p></li></ol><p>Whenever I hear the phrase &#8220;AI race,&#8221; it deeply bothers me. A race implies a finite game, a finish line. But with artificial intelligence there is no discrete finish line, despite pundits and policymakers&#8217; insistence otherwise.</p><p>What would happen after we &#8220;reach AGI&#8221;? Technology is over, the world ends? More than likely, we&#8217;ll just shift the goalposts forward to some new threshold. No, progress will just continue to compound as its benefits diffuse into every economic sector.</p><p>But for argument&#8217;s sake, let&#8217;s play into the narrative.</p><p>We&#8217;ve all heard the allegory of the tortoise and the hare. In the old fable, the tortoise wins by never stopping.</p><p>Take electricity generation for example. There&#8217;s a viral graph that&#8217;s been making the rounds that, if you haven&#8217;t seen it, puts things into perspective. This chart shows the total amount of electricity generated in the US and China in the past 40 years, starting a few years after Deng Xiaopeng&#8217;s economic modernizations.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YGQJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YGQJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png 424w, https://substackcdn.com/image/fetch/$s_!YGQJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png 848w, https://substackcdn.com/image/fetch/$s_!YGQJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png 1272w, https://substackcdn.com/image/fetch/$s_!YGQJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YGQJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png" width="1456" height="1260" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1260,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:518767,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/169853792?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YGQJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png 424w, https://substackcdn.com/image/fetch/$s_!YGQJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png 848w, https://substackcdn.com/image/fetch/$s_!YGQJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png 1272w, https://substackcdn.com/image/fetch/$s_!YGQJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd86b0cd1-dcd6-4d90-8aa1-f0dc88791a84_3400x2943.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Today, China generates over <strong>6,000 terawatt-hours (TWh) more electricity</strong> than the United States, enough juice to:</p><ul><li><p>Power every US home 4 times over (541 million homes worth of power)</p></li><li><p>Build ~7,000 new large industrial facilities</p></li><li><p>Run 4.9 million Blackwell GPUs at full-tilt</p></li></ul><p>Meanwhile, America&#8217;s year-over-year growth in power demand, already middling, flatlined to an average of just 8 basis points from 2010 to now. We literally took a nap.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kc5n!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kc5n!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png 424w, https://substackcdn.com/image/fetch/$s_!kc5n!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png 848w, https://substackcdn.com/image/fetch/$s_!kc5n!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png 1272w, https://substackcdn.com/image/fetch/$s_!kc5n!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kc5n!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png" width="727" height="156.97856049004594" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:141,&quot;width&quot;:653,&quot;resizeWidth&quot;:727,&quot;bytes&quot;:47332,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/169853792?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kc5n!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png 424w, https://substackcdn.com/image/fetch/$s_!kc5n!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png 848w, https://substackcdn.com/image/fetch/$s_!kc5n!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png 1272w, https://substackcdn.com/image/fetch/$s_!kc5n!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b5e3c07-7838-4cb6-ac07-6a41c11d0ac3_653x141.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Energy has and always will be a leading correlative factor for economic output. It forms the foundation for modern standards of living, domestic production and manufacturing capabilities, and now - all of a sudden - the bedrock for the next generation of the services economy, the &#8220;intelligence economy.&#8221; Advanced artificial intelligence systems are here, and boy are they hungry for power.</p><p>So while dominant Western social movements advocated for <a href="https://www.sciencedirect.com/science/article/pii/S2214629625001264">de-growth</a> and McKinsey was proclaiming a <a href="https://www.mckinsey.com/industries/electric-power-and-natural-gas/our-insights/the-decoupling-of-gdp-and-energy-growth-a-ceo-guide#/">post-energy future</a>, the rest of the world didn&#8217;t seem to get the memo.</p><p>China&#8217;s tortoise has been plodding along at a <a href="https://danwang.co/breakneck/">Breakneck</a> pace, but apparently too slow for most Americans to notice. Until now.</p><p>The American hare has woken up.</p><p>And it&#8217;s got some ground to cover.</p><h1>Review: America&#8217;s AI Action Plan</h1><p>Honestly I had low expectations given my experience with Trump 1.0&#8217;s equivalent action plan which I wrote about <a href="https://www.machineyearning.io/p/whats-wrong-with-the-ai-arms-race">three years ago</a>, but I was genuinely and pleasantly surprised to see a lot of sensible calls to action in here.</p><p>The Plan has three pillars: innovation, infrastructure, and international diplomacy and security (wait that&#8217;s four pillars&#8230;), each with 15, 8, and 7 respective subsections. I won&#8217;t be commenting on each one here, just a handful that I&#8217;ve added &#128172; emojis next to in the below table.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1ks1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1ks1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png 424w, https://substackcdn.com/image/fetch/$s_!1ks1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png 848w, https://substackcdn.com/image/fetch/$s_!1ks1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png 1272w, https://substackcdn.com/image/fetch/$s_!1ks1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1ks1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png" width="950" height="1258" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0d73e819-0664-4939-92a9-6efe38650126_950x1258.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1258,&quot;width&quot;:950,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1024069,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/169853792?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1ks1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png 424w, https://substackcdn.com/image/fetch/$s_!1ks1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png 848w, https://substackcdn.com/image/fetch/$s_!1ks1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png 1272w, https://substackcdn.com/image/fetch/$s_!1ks1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d73e819-0664-4939-92a9-6efe38650126_950x1258.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>China&#8217;s competing <a href="https://www.mfa.gov.cn/eng/xw/zyxw/202507/t20250729_11679232.html">Plan</a> was actually rather pallid in comparison: a slim, 4 page document with mostly broad statements about the importance of multilateral collaboration. This was yet another surprise for me, as that was an unusual departure from typical contrasts between these nations&#8217; industrial planning documents. However, we are still 8 years into China&#8217;s 13 year <a href="https://digichina.stanford.edu/work/full-translation-chinas-new-generation-artificial-intelligence-development-plan-2017/">development roadmap</a>, so it could be the case that Beijing decided to restrict their new plan to matters of governance, safety, and standards-setting in the generative era.</p><p>This limits an apples-to-apples comparison to just the third pillar in the US Action Plan, regarding International Diplomacy and Security. I&#8217;ve got a ton of upcoming blog posts on AI diffusion (pillar one) and AI infrastructure (pillar two) in China, so I&#8217;ll save those for another time.</p><p>With that, let&#8217;s cover some of the noteworthy sections in the plan, and give a quick gut-check on national readiness to rise to the challenge.</p><h1>I. Accelerate AI Innovation</h1><p>General themes for this first pillar are:</p><ol><li><p>Decluttering red tape</p></li><li><p>Greasing the wheels for public and private sector adoption</p></li><li><p>&#8220;De-woking&#8221; frontier LLMs</p></li></ol><p>Overall this sets things in a positive direction, celebrating open-source advocacy, common-sense adoption levers, and thematically supporting a liquid compute marketplace. But in contrast to the <em>laissez-faire</em> approach for the first two, that latter theme concerns me not in its ideology, but in its practical implementation that would lead only to a quagmire.</p><h2>Oh, we like open-source now</h2><p>The plan&#8217;s strongest surprise is an explicit embrace of of open-weight models. It acknowledges that it gives startups more flexibility, lets governments and businesses keep data in-house, and are absolutely essential for academic researchers.</p><p>In my <a href="https://www.machineyearning.io/p/deepseek-and-the-end-of-an-era">last post</a>, I was concerned by many leading voices in the American AI ecosystem calling for pulling the rug out on open-source. But as the plan reads, the US is taking a very sensible position which leans into the narrative that open models contribute to soft power in the new economy.</p><p>Had we gone the other way I&#8217;d be writing a very different blog post.</p><h2>Towards a liquid compute marketplace</h2><p>Another welcome surprise was the acknowledgment that many are in unfavorable positions when trying to access reliable (i.e. not spot market) compute.</p><p>We absolutely should be ensuring level access to large-scale computing power rather than concentrating it oligopolistically. The <a href="https://nairrpilot.org/opportunities/allocations">National AI Research Resource (NAIRR) Pilot</a> from NSF is a start, but it&#8217;s still a boutique collection of one-off credits and in-kind resources.</p><p>At a major industrial park in Hangzhou, I learned the city government actually created a <a href="https://www.lightreading.com/data-centers/china-to-set-up-cloud-service-selling-spare-data-center-capacity---report">state-owned cloud compute reseller</a> to flip compute to startups at up to a 50% discount. The Shanghai city government <a href="https://english.shanghai.gov.cn/en-PolicyInsights/20250729/6410d74d98cf4d85831b5b74b3442eb6.html">announced</a> similar subsidies last week.</p><p>The faster we align on a similar view that AI is part of a new energy-compute asset class, the faster we&#8217;ll lower the unit cost for all participants, not just large incumbents.</p><h2>Evals for enterprise adoption</h2><p>I think that at steady-state, a healthy evaluations ecosystem vs. &#8220;trust me bro&#8221; benchmarks is going to provide much more trust in a market that can be crowded with vibes and grifters. Third-party assistance here will make it easier for founders to cut through that noise.</p><p>Academic tests are decent barometers of research progress, but they are <strong>not </strong>procurement criteria. <a href="https://www.youtube.com/watch?v=_ogxZxu6cjM">Stats can be juked</a> through selective tuning, hand-crafted prompts that don&#8217;t represent production use, or <a href="https://techcrunch.com/2025/04/07/meta-exec-denies-the-company-artificially-boosted-llama-4s-benchmark-scores/">straight up fraud</a>. You need real-world performance standards informed by industry input.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.reddit.com/r/LocalLLaMA/comments/1jspbqk/two_months_later_and_after_llama_4s_release_im/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-iOV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png 424w, https://substackcdn.com/image/fetch/$s_!-iOV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png 848w, https://substackcdn.com/image/fetch/$s_!-iOV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png 1272w, https://substackcdn.com/image/fetch/$s_!-iOV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-iOV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png" width="720" height="549" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:549,&quot;width&quot;:720,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:187296,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://www.reddit.com/r/LocalLLaMA/comments/1jspbqk/two_months_later_and_after_llama_4s_release_im/&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/169853792?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-iOV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png 424w, https://substackcdn.com/image/fetch/$s_!-iOV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png 848w, https://substackcdn.com/image/fetch/$s_!-iOV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png 1272w, https://substackcdn.com/image/fetch/$s_!-iOV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09e68642-2537-47ac-a521-d9af8ac452a2_720x549.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Open-source == meritocracy. You can cheat benchmarks but you can&#8217;t cheat keen-eyed developers.</figcaption></figure></div><p>NIST&#8217;s <a href="https://pages.nist.gov/frvt/html/frvt11.html">Face Recognition Technology Evaluation</a> works because it measures an end-to-end task with clear failure modes. Domain-specific LLM applications need equivalent rigor in their benchmarking.</p><p>It would be great to see something like this rolled out with fixed datasets, common metrics, and quarterly reports. Vendors will either improve their baseline, or they won&#8217;t. Grifters would be reluctant to put their slop through an open evaluation process like this, and the truly talented would be elevated (one can hope).</p><p>Transparency will do wonders for clarity in the marketplace, and hopefully speed up the enterprise adoption cycle from quarters to weeks in the process.</p><h2>No bias but our bias</h2><p>Unfortunately, this pillar is also internally inconsistent. It&#8217;s second section, &#8220;Ensure that Frontier AI Protects Free Speech and American Values,&#8221; calls for models that are &#8220;objective and free from top-down ideological bias,&#8221; yet immediately instructs federal agencies <strong>only</strong> to buy models that exhibit a particular ideological slant. In this case, the absence of DEI or climate-related content, and the rejection of &#8220;Chinese Communist Party talking points.&#8221; You cannot outlaw bias with one hand while prescribing it with the other.</p><p>If procurement rules require that certain topics (misinformation, DEI, climate change) be removed from the NIST AI Risk Management Framework, the policy is no longer content-neutral. It privileges one worldview and disqualifies others, contradicting the First Amendment principle the plan claims to defend.</p><h2><strong>Unintended consequences and the &#8220;Waluigi Effect&#8221;</strong></h2><p>LLMs learn concepts in oppositional pairs - full/empty, wet/dry, loyal/subversive. When training forcibly suppresses one side of a conceptual pair, the model often preserves it latently - the same vector arithmetic that yields <em>man : king :: woman : queen </em>can flip suppressed traits back on, sometimes in exaggerated form.</p><p>Public examples of what alignment researchers call the <a href="https://www.lesswrong.com/w/waluigi-effect">Waluigi Effect</a> include Bing Chat / &#8220;Sydney&#8221; abruptly becoming hostile or conspiratorial under pressure, and the Grok / &#8220;MechaHitler&#8221; jailbreak which flipped the model into extremist rhetoric.</p><p>In any case, these failures stem less from hidden agendas than from brittle, rules-based patches that ignore how general these systems really are. In truth, these are likely influenced by the role of fictional tropes in training data (&#8220;<a href="https://tvtropes.org/pmwiki/pmwiki.php/Main/EvilAllAlong">Evil All Along</a>&#8221; is a very common plot twist in literature, and &#8220;MechaHitler&#8221; is a reference to the 1992 video game Wolfenstein) rather than some sort of top-down bias or Manchurian candidate situation.</p><p>Policymaking based on ideological, rather than technocratic, goals can only introduce inadvertent externalities down the road. Platitudes like &#8220;ensure free speech,&#8221; &#8220;reject CCP talking points,&#8221; or &#8220;DO NOT HALLUCINATE&#8221; will not deliver the reliability policymakers want. Empirically validated, context-aware guardrails seem like a better way to accomplish those goals without sacrificing the very freedom of expression the plan seeks to protect.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Still with me? Let me know what you think. I truly appreciate it</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1>II. Build American AI Infrastructure</h1><p>The most well-flushed out pillar of the Action Plan focused on practical constraints and development plans for energy-compute infrastructure, which I&#8217;m very excited by. The administration calls for the following:</p><ul><li><p>Streamlining permitting for new datacenters, semiconductor manufacturing facilities, and energy infrastructure</p></li><li><p>Prioritizing grid stability and rapid buildouts</p></li><li><p>Restoring American semiconductor manufacturing</p></li><li><p>Beefing up a skilled infra workforce</p></li></ul><p>I&#8217;m honestly stoked about most of this pillar. However, I can&#8217;t help but lament how long it took for us to get to this point, and wonder about our long-term commitment to revitalization. In particular, the plan ignores the critical role that international talent has to play, and I&#8217;m convinced that blind spot will bite us in the long run.</p><h2>Prioritizing stability over ideology</h2><blockquote><p><em>Show me the incentive and I&#8217;ll show you the outcome.</em></p><p>&#8211; Charlie Munger</p></blockquote><p>This is probably my favorite part of the entire action plan, for one specific line: <em>&#8220;reform power markets to align financial incentives with the goal of grid stability.&#8221;</em> </p><p>It starts by reiterating that our electric grid is critical for all aspects of the modern economy and must be safeguarded, while acknowledging that in its current form, it&#8217;s unsuited for the increased pressures of AI datacenters. Training and deploying large-scale models carries high load variability (&#177;15MW peak-to-trough in <em>milliseconds</em>) and therefore <a href="https://semianalysis.com/2025/06/25/ai-training-load-fluctuations-at-gigawatt-scale-risk-of-power-grid-blackout/">very real blackout risk</a> for energy grids which cannot absorb that much demand-time.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KzHE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KzHE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png 424w, https://substackcdn.com/image/fetch/$s_!KzHE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png 848w, https://substackcdn.com/image/fetch/$s_!KzHE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png 1272w, https://substackcdn.com/image/fetch/$s_!KzHE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KzHE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png" width="1456" height="500" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:500,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!KzHE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png 424w, https://substackcdn.com/image/fetch/$s_!KzHE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png 848w, https://substackcdn.com/image/fetch/$s_!KzHE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png 1272w, https://substackcdn.com/image/fetch/$s_!KzHE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a357887-72be-431c-8280-b9f1272cfadb_2326x798.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Google at OCP EMEA Summit 2025, SemiAnalysis</figcaption></figure></div><p>The safest way to accomplish this is by introducing large volumes of &#8220;baseload&#8221; power - reliable, stable sources like natural gas, hydropower, and nuclear - to create a high watermark in excess of that demand variance, say 85% of peak.</p><ul><li><p>Excess power from these projects during downtimes can be added back to the grid, enhancing stability.</p></li><li><p>During peak times, pairing these projects with battery energy storage systems (BESS) can significantly help with sub-second ramps, since lithium ion batteries excel at this.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!leuL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!leuL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg 424w, https://substackcdn.com/image/fetch/$s_!leuL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg 848w, https://substackcdn.com/image/fetch/$s_!leuL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!leuL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!leuL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg" width="800" height="450" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:450,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Megapack.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Megapack." title="Megapack." srcset="https://substackcdn.com/image/fetch/$s_!leuL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg 424w, https://substackcdn.com/image/fetch/$s_!leuL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg 848w, https://substackcdn.com/image/fetch/$s_!leuL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!leuL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be4dfac-20ba-45f6-84dc-bdf772db8c56_800x450.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The Tesla Megapack 2XL is a 3.9 MWh system. BESS provide auxiliary power during blackouts and contribute to grid stability during load fluctuations, smoothing demand</figcaption></figure></div><p>Up to now, the interconnection queue has not at all resembled these needs. In the <a href="https://emp.lbl.gov/queues">most recent report</a> from the Lawrence Berkeley National Lab, of the 1,570 GW of new generation proposals still in the queue (which takes 3 to 8 years to clear right now, by the way), a meager 118 GW (7.5%) of new power fits this baseload profile. Intermittent generation like solar and wind dominate the queue, commanding 1,086 GW (69.1%) and 366 GW (23.3%) respectively. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IIFh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IIFh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg 424w, https://substackcdn.com/image/fetch/$s_!IIFh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg 848w, https://substackcdn.com/image/fetch/$s_!IIFh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!IIFh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IIFh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg" width="1337" height="774" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:774,&quot;width&quot;:1337,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Capacity in interconnection queues as of the end of 2023. *Hybrid storage capacity is estimated in some cases using storage: generator ratios from projects that provide separate capacity data. Storage capacity in hybrids was not estimated for years prior to 2020.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Capacity in interconnection queues as of the end of 2023. *Hybrid storage capacity is estimated in some cases using storage: generator ratios from projects that provide separate capacity data. Storage capacity in hybrids was not estimated for years prior to 2020." title="Capacity in interconnection queues as of the end of 2023. *Hybrid storage capacity is estimated in some cases using storage: generator ratios from projects that provide separate capacity data. Storage capacity in hybrids was not estimated for years prior to 2020." srcset="https://substackcdn.com/image/fetch/$s_!IIFh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg 424w, https://substackcdn.com/image/fetch/$s_!IIFh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg 848w, https://substackcdn.com/image/fetch/$s_!IIFh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!IIFh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F12e238eb-e4fd-4022-99aa-5a81f9691f05_1337x774.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A good deal of this buildout was no doubt influenced by the prior administration&#8217;s <a href="https://www.canarymedia.com/articles/policy-regulation/doe-loan-programs-office-races-to-get-cleantech-money-out-as-trump-looms">last-minute bum rush to push out $400BN </a>of debt financing to renewable energy projects&#8230; influenced by ideology, not by the practical needs of energy-compute infrastructure in the new economy.</p><h2>Not all power is equal</h2><p>Remember, we measure power not just in instantaneous terms (gigawatts) but in time-denominated terms (gigawatt-hours). So when comparing power from different generation resources, you need to consider over what period of time that&#8217;s deployed.</p><p>Since battery duration is limited to just a few hours (1 to 4 commonly), we can&#8217;t rely on BESS alone to handle these loads. When paired with renewables, developers have to significantly overspec in order to provide continuous power.</p><ul><li><p>Using Las Vegas as a sunny example, irradiance patterns allow for delivering <a href="https://ember-energy.org/latest-insights/solar-electricity-every-hour-of-every-day-is-here-and-it-changes-everything/#:~:text=24%2Dhour%20solar%20generation%20is%20possible%20%E2%80%93%20just,sufficient%20for%20most%20regions%20across%20the%20world.">1 kW of continuous power via 5 kW of solar panels paired with a 17 kWh battery</a></p></li><li><p>Scaled up to a 100 MW AI data center, this would require 500 MW of solar panels (making it the 6th largest deployment in the US) and 355 MWh of storage supported by at least 91 Tesla Megapacks (<a href="https://portal.ct.gov/-/media/csc/3_petitions-medialibrary/petitions_medialibrary/mediapetitionnos1601-1700/pe1607/petitionersubmissions/supplement-attachment-a---megapack_2_xl_datasheet.pdf">4-hour configuration</a>) </p></li></ul><p>Recall from earlier that our current delta with China&#8217;s generation capacity is roughly 6,000 terawatt-hours (TWh) - that&#8217;s 6 million gigawatt-hours (GWh). If we waved a magic wand and activated all of our interconnection queue simultaneously, could we close the gap?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!i6VR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!i6VR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png 424w, https://substackcdn.com/image/fetch/$s_!i6VR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png 848w, https://substackcdn.com/image/fetch/$s_!i6VR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png 1272w, https://substackcdn.com/image/fetch/$s_!i6VR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!i6VR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png" width="952" height="676" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7a266e77-a432-4cf2-8158-169a2b845653_952x676.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:676,&quot;width&quot;:952,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:514729,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/169853792?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!i6VR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png 424w, https://substackcdn.com/image/fetch/$s_!i6VR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png 848w, https://substackcdn.com/image/fetch/$s_!i6VR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png 1272w, https://substackcdn.com/image/fetch/$s_!i6VR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7a266e77-a432-4cf2-8158-169a2b845653_952x676.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>By some reasonable estimates, yeah, we could narrow or even close it entirely. But those hours aren&#8217;t uniformly distributed throughout the day. There&#8217;s no reason why baseload power can&#8217;t be <em>augmented</em> with renewables and BESS, but putting all your chips in the intermittent basket with the fluctuations AI workloads introduce doesn&#8217;t make sense. Hence the Plan&#8217;s emphasis on comprehensiveness, stability, and consistency.</p><h2>Cutting the red tape</h2><p>Most data center infrastructure projects require significant upgrades to grid infrastructure to handle contemporary AI workloads. The Action Plan more or less gives the green light for data center-related projects to either fast-track or bypass reviews and permitting, so long as the site is consistent with the size of a modern AI data center. What that size is specifically remains unclear in this document.</p><p>Aside from energy and labor, seeking relevant permits and environmental reviews do take a significant portion of time in new and brownfield builds. This administration calls for using exclusions granted by existing regulations to speed things up, rather than go through Congress:</p><ul><li><p>Establishing new Categorical Exclusions under the National Environmental Policy Act (NEPA) to cover data center-related actions</p></li><li><p>Expand the use of the FAST-41 process (Title 41 of the Fixing America&#8217;s Surface Transportation Act) for speedier reviews and processing</p></li><li><p>Make Federal lands available for data center construction and the construction of power generation infrastructure for those data centers</p></li></ul><p>This is generally good news, especially on the third point, which I detailed in a <a href="https://www.linkedin.com/posts/rydcunningham_edgerunner-response-to-doe-rfi-52025-activity-7327739369442684930-TnxM?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAY5PW8BEiT8Q1ECTudUXzJ9NK3mrVlYE-0">May submission</a> to the <a href="https://www.energy.gov/policy/ai-infrastructure-doe-lands-request-information">Department of Energy&#8217;s RFI </a>for building AI data centers on federal lands. A coalition of industry reps from energy, AI, and power that I gathered identified and proposed the most suitable DOE-owned sites that could house energy-compute projects, based on a number of factors. It&#8217;s good to see the DOE following through on this.</p><h2>Restore American semiconductor manufacturing</h2><p>Perhaps in a future post I&#8217;ll give a history and near-term forecast of the US semiconductor industry. For now, all you need to know is that while American companies like NVIDIA, AMD (as well as new entrants like Positron and Groq) may lead fabless chip design, on-shore production of bleeding-edge nodes is in a pretty dire state:</p><ul><li><p>Intel&#8217;s recently decided to <a href="https://www.eetimes.com/intel-facing-another-crossroads-18-a-or-14a-process-node/">shelve its 18A (1.8 nm) process</a> in favor of diverting resources to 14A (1.4 nm) underscores the challenge</p></li><li><p>Analysts warn Intel has <a href="https://www.pcgamer.com/hardware/processors/intel-has-just-18-months-to-land-a-hero-customer-on-14a-or-its-cutting-edge-fabs-are-toast-says-chip-industry-analyst/">roughly 18 months</a> to land a &#8220;hero customer&#8221; for 14A&#8230; otherwise its cutting-edge fabs could be abandoned altogether (forfeitting any further CHIPS Act disbursements)</p></li><li><p>Even foreign foundries with US fabs are facing problems. Samsung&#8217;s $44BN Texas mega-fab - originally slated for 4nm mass production - has struggled with <a href="https://www.tomshardware.com/tech-industry/semiconductors/samsung-delays-usd44-billion-texas-chip-fab-sources-say-completion-halted-because-there-are-no-customers">yields and customer commitments</a>, prompting executive leadership to declare a <a href="https://x.com/Jukanlosreve/status/1942149827833262248">6 month</a> transition window to 2nm, and delay <a href="https://x.com/Jukanlosreve/status/1939966750386237617">1.4nm to 2029</a> while they dial in reliability</p></li></ul><p>The US government does wield some levers. In the next decade, the Semiconductor Industry Association (SIA) <a href="https://www.semiconductors.org/wp-content/uploads/2025/07/SIA-State-of-the-Industry-Report-2025.pdf">projects the US fab capacity will grow by 200%+</a> over the next decade - doubling the global average - and American semiconductor R&amp;D outlays exceed $60BN, the highest in the world today in purchasing power parity (PPP), despite our tax incentives lagging peer nations.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GBOX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GBOX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png 424w, https://substackcdn.com/image/fetch/$s_!GBOX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png 848w, https://substackcdn.com/image/fetch/$s_!GBOX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png 1272w, https://substackcdn.com/image/fetch/$s_!GBOX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GBOX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png" width="717" height="525.9534510433386" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:457,&quot;width&quot;:623,&quot;resizeWidth&quot;:717,&quot;bytes&quot;:60572,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/169853792?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GBOX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png 424w, https://substackcdn.com/image/fetch/$s_!GBOX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png 848w, https://substackcdn.com/image/fetch/$s_!GBOX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png 1272w, https://substackcdn.com/image/fetch/$s_!GBOX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7cc27e-c7b8-43f7-b6b4-a1b3540341fa_623x457.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">However, at current growth rates, China may very well overtake our spend by EOY 2025.</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bueQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bueQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png 424w, https://substackcdn.com/image/fetch/$s_!bueQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png 848w, https://substackcdn.com/image/fetch/$s_!bueQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png 1272w, https://substackcdn.com/image/fetch/$s_!bueQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bueQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png" width="724" height="329.61661341853033" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:285,&quot;width&quot;:626,&quot;resizeWidth&quot;:724,&quot;bytes&quot;:42804,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/169853792?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bueQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png 424w, https://substackcdn.com/image/fetch/$s_!bueQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png 848w, https://substackcdn.com/image/fetch/$s_!bueQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png 1272w, https://substackcdn.com/image/fetch/$s_!bueQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7fa8c99-e12f-43e2-a965-116f3ecedb11_626x285.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: https://www.semiconductors.org/wp-content/uploads/2025/07/SIA-State-of-the-Industry-Report-2025.pdf</figcaption></figure></div><p>But funding is just one part of the problem, as our prior examples show. If domestic champions like Intel can&#8217;t make the cut meritocratically, then we shouldn&#8217;t be casting taxpayer pearls before swine.</p><p>To that end, the SIA projects a <a href="https://www.semiconductors.org/wp-content/uploads/2025/07/SIA-State-of-the-Industry-Report-2025.pdf">domestic shortfall of 67,000 workers</a> across the semiconductor manufacturing chain - device and machinery manufacturing, design, and EDA tools - by 2030.</p><p>Given that this section imposes a sensible constraint of &#8220;lead[ing this] revitalization without <strong>making bad deals for the American taxpayer</strong>&#8221; (emphasis mine), I fear that without significant advances in domestic semiconductor talent to cross critical yield thresholds (70%+) in a short period of time, domestic manufacturing will continue to be a satellite for foreign fabs like TSMC and Samsung, while domestic players only satisfy trailing edge process nodes.</p><p>This isn&#8217;t the worst possible outcome, but it does put the U.S. at a disadvantage if China continues to improve in fabless chip design and domestic manufacturing.</p><h2>We need international talent. Full stop.</h2><p>Even if the U.S. fast-tracks every permit, reopens every chip fab, and supercharges the grid, we still face a crash course in human capital. We&#8217;re not just talking about the semiconductor shortfall - we need tens of thousands of electricians, HVAC technicians, network engineers, and other tradespeople to support this buildout.</p><p>Yes, the administration&#8217;s push for national, state, and local workforce programs is welcome - industry-driven training, revamped apprenticeships, early-career exposure and the like. But this transformation will take at least a <strong>generation</strong> <strong>without top-tier international engineers and technicians</strong>.</p><p><a href="https://theconversation.com/trump-administrations-conflicting-messages-on-chinese-student-visas-reflect-complex-us-china-relations-258351">Nativist tit-for-tats</a> on Chinese student visas are worsening, not improving, the problem. I&#8217;m not even coming from an emotional place on this, despite my obvious affinity for both countries. I&#8217;m stating a fact that in many STEM industries, even in our most advanced AI research labs, the diffusion of Chinese talent is impossible to ignore. <a href="https://x.com/deedydas/status/1946597162068091177?ref_src=twsrc%5Etfw%7Ctwcamp%5Etweetembed%7Ctwterm%5E1946597162068091177%7Ctwgr%5E40c865cdfb1cbb4c2679aa66b9a5482d90d5869b%7Ctwcon%5Es1_c10&amp;ref_url=https%3A%2F%2Fwccftech.com%2Fmeta-superintelligence-team-44-members-50-percent-are-from-china%2F">Half of Meta&#8217;s Superintelligence team</a> got their undergraduate degrees in China, for Christ&#8217;s sake.</p><p>This is not to say that domestic employment concerns are unwarranted. There are some common-sense changes we can implement that achieve a favorable middle ground, like revamping the green card system to a merit-based skills prioritization matrix rather than a country quota exercise. Permanent residency for critical roles like semiconductor engineers, AI researchers, etc. should be sought.</p><p>But what is the benefit of antagonizing those talented young engineers, if only to send back home and accelerate China&#8217;s progress further? As Kaiser Kuo said in his response &#8220;<a href="https://www.sinicapodcast.com/p/an-own-goal-of-historic-proportions">An Own-Goal of Historic Proportions</a>,&#8221; Chinese officials were likely publicly condemning the move, but privately popping the <em>baijiu</em>.</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:164688215,&quot;url&quot;:&quot;https://www.sinicapodcast.com/p/an-own-goal-of-historic-proportions&quot;,&quot;publication_id&quot;:2079154,&quot;publication_name&quot;:&quot;Sinica&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!hki0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2502d26c-e974-417b-878d-0571b80581f6_600x600.png&quot;,&quot;title&quot;:&quot;An Own-Goal of Historic Proportions: &quot;,&quot;truncated_body_text&quot;:&quot;Written in haste &#8212; forgive the unpolished prose.&quot;,&quot;date&quot;:&quot;2025-05-29T00:16:24.183Z&quot;,&quot;like_count&quot;:597,&quot;comment_count&quot;:11,&quot;bylines&quot;:[{&quot;id&quot;:2051,&quot;name&quot;:&quot;Kaiser Y Kuo&quot;,&quot;handle&quot;:&quot;kaiserykuo&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F399968e5-9d5d-43f2-9d6b-e471dbbe72e3_400x400.png&quot;,&quot;bio&quot;:&quot;A weekly podcast about current affairs in China, hosted by Kaiser Kuo and featuring in-depth conversations about books, ideas, new research, intellectual currents, and cultural trends that can help us better understand what&#8217;s happening in China.&quot;,&quot;profile_set_up_at&quot;:&quot;2024-02-13T21:38:49.173Z&quot;,&quot;reader_installed_at&quot;:&quot;2024-02-14T22:07:42.457Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2208464,&quot;user_id&quot;:2051,&quot;publication_id&quot;:2079154,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:2079154,&quot;name&quot;:&quot;Sinica&quot;,&quot;subdomain&quot;:&quot;sinica&quot;,&quot;custom_domain&quot;:&quot;www.sinicapodcast.com&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Podcasts, columns, and essays about current affairs in China&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2502d26c-e974-417b-878d-0571b80581f6_600x600.png&quot;,&quot;author_id&quot;:2051,&quot;primary_user_id&quot;:2051,&quot;theme_var_background_pop&quot;:&quot;#0068EF&quot;,&quot;created_at&quot;:&quot;2023-11-03T16:58:28.775Z&quot;,&quot;email_from_name&quot;:&quot;Sinica Podcast&quot;,&quot;copyright&quot;:&quot;The Sinica Podcast&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:&quot;newspaper&quot;,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://www.sinicapodcast.com/p/an-own-goal-of-historic-proportions?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!hki0!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2502d26c-e974-417b-878d-0571b80581f6_600x600.png" loading="lazy"><span class="embedded-post-publication-name">Sinica</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">An Own-Goal of Historic Proportions: </div></div><div class="embedded-post-body">Written in haste &#8212; forgive the unpolished prose&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">a year ago &#183; 597 likes &#183; 11 comments &#183; Kaiser Y Kuo</div></a></div><p>Meanwhile, at that same industrial park in Hangzhou I mentioned earlier, I asked a senior government official what their policy was towards foreign talent. What kinds of personnel were they looking for, and what support (if any) did they offer? I&#8217;m paraphrasing his translation, but</p><blockquote><p>&#8220;Foreign talent in key industries, especially PhD holders, are more than welcome. In addition to the office lease discounts and apartment leasing subsidies for college graduates, overseas talent can get free housing ($1M to $2M USD)&#8221;</p></blockquote><p>I&#8217;ve heard of such programs in China&#8217;s broader &#8220;<a href="https://sccei.fsi.stanford.edu/china-briefs/evaluating-success-chinas-young-thousand-talents-stem-recruitment-program">Thousand Talents Plan</a>,&#8221; but it was my first time seeing it face to face, despite the escalating tensions.</p><p>Even if we solve our energy problems&#8230; even if we cut all the red tape&#8230; we risk building golden factories with empty benches. Unless the administration pivots here, our nativism will kneecap our ambition.</p><h1>III. Lead in International AI Diplomacy and Security</h1><p>Overall, this part of the action plan falls woefully short. Predictably, the administration pushes for a unilateral promotion of American AI systems, compute hardware, and standards throughout the world. It&#8217;s also logically inconsistent, or at least overestimates the negotiating leverage Americans have in bilateral trade of these systems outside of a vacuum.</p><p>Consider the <a href="https://www.scielo.br/j/rbpi/a/x9dKKDM9DLJW7MSy9JHzg6w/">U.S.&#8217; failed efforts to limit Huawei&#8217;s 5G diffusion</a> throughout the developing world. I&#8217;ll briefly summarize those points below, then apply them to the Action Plan&#8217;s stated intentions against the status quo:</p><h2>Recap: Why the Huawei 5G Ban Failed</h2><ol><li><p><strong>U.S. &#8220;Security Threat&#8221; Narrative. </strong>Washington repeatedly labeled Huawei &#8220;an arm of the Chinese state,&#8221; warning that its 5G equipment could be used for espionage or cyber-sabotage. Brazilian and South African officials pushed back, demanding concrete proof of back-doors or spying, but no transparent or legally vetted evidence could be shared.</p></li><li><p><strong>No American 5G Champion. </strong>Meanwhile, as a counter, no U.S. or U.S.-allied vendor offered a full end-to-end 5G solution at Huawei&#8217;s price and scale. With neither domestic champions nor subsidized U.S. exporters stepping up, local operators saw little downside in choosing Huawei for cost-effective, rapid-deployed, field-proven equipment.</p></li><li><p><strong>China&#8217;s &#8220;Digital Silk Road&#8221;. </strong>Beijing wove Huawei into broader &#8220;Digital Silk Road&#8221; and BRICS-era cooperation, offering low-interest loans, technical assistance packages bundled with infrastructure financing, and joint development labs and exchanges. As usual, this was a financial &#8220;no strings attached&#8221; proposal boosted by multilateral collaboration on shared principles for digital development.</p></li></ol><p>Taken together, these factor left Brazil, South Africa, and many countries free to integrate Huawei into their 5G roadmaps.</p><h2>The more things change...</h2><p>However, the current situation is not completely congruent. The first item in this section advocates for exporting America&#8217;s &#8220;full technology stack &#8211; hardware, models, software, applications, and standards.&#8221; That&#8217;s a genuine step up from 5G, since American labs and chip designers currently own the leading edge.</p><p>But the stack is more than just chips and models. As mentioned earlier, AI datacenters have insanely high load variability and require significant energy infrastructure upgrades. Without behind-the-meter generation, high-voltage transmission lines, battery energy storage systems (BESS) and the like, many developing countries wouldn&#8217;t be able to locally deploy complete stacks even if they wanted to.</p><p>China bundling energy infrastructure investment and financing together with datacenter rollouts would be a successful repeat of its 5G playbook. And early innings of that strategy were on display in the WAIC expo hall.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/hlntnr/status/1950745456188776794" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5Wtw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png 424w, https://substackcdn.com/image/fetch/$s_!5Wtw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png 848w, https://substackcdn.com/image/fetch/$s_!5Wtw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png 1272w, https://substackcdn.com/image/fetch/$s_!5Wtw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5Wtw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png" width="600" height="566" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:566,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:449067,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/hlntnr/status/1950745456188776794&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.machineyearning.io/i/169853792?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5Wtw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png 424w, https://substackcdn.com/image/fetch/$s_!5Wtw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png 848w, https://substackcdn.com/image/fetch/$s_!5Wtw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png 1272w, https://substackcdn.com/image/fetch/$s_!5Wtw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20a54398-36c6-4933-b7f3-6030ca459dd9_600x566.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>...The more they stay the same</h2><p>Because energy and digital infrastructure rollouts are tightly interdependent, local operators and regulators may naturally gravitate toward China&#8217;s packaged offerings, especially where BRI momentum is already strong.</p><p>Further, ceding the field on AI safety and governance presents too wide a gap that China&#8217;s &#8220;<a href="https://www.mfa.gov.cn/eng/xw/zyxw/202507/t20250729_11679232.html">AI Governance Action Plan</a>&#8221; is primed to fill. Beijing called for convening governments, industry and civil-society around multilateral standards, capacity-building programs and joint R&amp;D - while at WAIC not only did no Trump administration AI safety officials show, the only representative from a US AI lab <strong>even proposed an outright ban on open-source models, </strong>arguing &#8220;you wouldn&#8217;t give everyone a nuclear bomb.&#8221;<strong> </strong>From Paul Triolo:</p><blockquote><p><em><strong>The lone official from a leading US AI lab, Dan Hendrycks, raised eyebrows by essentially calling for a ban on open source/weight models, while some Chinese AI safety experts called for just the opposite, urging the US to force proprietary model developers such as OpenAI and Anthropic to open source their models, arguing that this was a better way to ensure the safety of advanced models going forward.</strong></em></p></blockquote><p>To say Hendrycks read the room badly would be an understatement. While he is a respected safety researcher and red-teamer, his position leaves no room for negotiation or joint development, is in contrast to the Action Plan&#8217;s commitment to open-source, and left most of the room feeling puzzled.</p><p>Overall, I&#8217;m getting a strong sense of <em>d&#233;j&#224; vu</em>: Beijing&#8217;s &#8220;inclusive multilateralism&#8221; mirrors the Digital Silk Road playbook, whereas the US push to &#8220;Counter Chinese Influence in International Governance Bodies&#8221; suggests that any multilateral consensus that includes Chinese participation is inherently suspect, prejudicially undermining sovereign decision-making.</p><p>I&#8217;ll stop my comparisons here, as Paul goes into excellent depth in his latest post on his Substack, <a href="https://pstaidecrypted.substack.com/p/china-set-to-lead-global-effort-on">AIStackDecrypted</a>, which I strongly agree with and encourage you to read.</p><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:169409012,&quot;url&quot;:&quot;https://pstaidecrypted.substack.com/p/china-set-to-lead-global-effort-on&quot;,&quot;publication_id&quot;:2296890,&quot;publication_name&quot;:&quot;AIStackDecrypted&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!pvMv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png&quot;,&quot;title&quot;:&quot;China set to lead global effort on AI safety?&quot;,&quot;truncated_body_text&quot;:&quot;This year&#8217;s World AI Conference (WAIC) in Shanghai was well attended by Chinese AI companies, robotics firms, infrastructure providers, investors, and other companies across the AI stack. The record attendance cited by organizers testifies to the intense interest in AI at many levels across China&#8217;s economy, and the event was dominated by the innovation &#8230;&quot;,&quot;date&quot;:&quot;2025-08-01T22:15:43.048Z&quot;,&quot;like_count&quot;:9,&quot;comment_count&quot;:3,&quot;bylines&quot;:[{&quot;id&quot;:18097050,&quot;name&quot;:&quot;Paul Triolo&quot;,&quot;handle&quot;:&quot;pstasiatech&quot;,&quot;previous_name&quot;:&quot;Paul T&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ae5afe75-2e43-4924-9013-5e457f8c73c4_400x400.jpeg&quot;,&quot;bio&quot;:&quot;Long time civil servant now swimming in the private sector &quot;,&quot;profile_set_up_at&quot;:&quot;2021-12-05T16:30:20.359Z&quot;,&quot;reader_installed_at&quot;:&quot;2024-03-11T01:49:34.656Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2316045,&quot;user_id&quot;:18097050,&quot;publication_id&quot;:2296890,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:2296890,&quot;name&quot;:&quot;AIStackDecrypted&quot;,&quot;subdomain&quot;:&quot;pstaidecrypted&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;My personal Substack devoted to AI Stack issues and US China relations&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png&quot;,&quot;author_id&quot;:18097050,&quot;primary_user_id&quot;:18097050,&quot;theme_var_background_pop&quot;:&quot;#00C2FF&quot;,&quot;created_at&quot;:&quot;2024-01-28T02:17:27.339Z&quot;,&quot;email_from_name&quot;:null,&quot;copyright&quot;:&quot;Paul Triolo&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;homepage_type&quot;:&quot;newspaper&quot;,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://pstaidecrypted.substack.com/p/china-set-to-lead-global-effort-on?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!pvMv!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30b81416-44a5-4354-8d0d-fdb7f9e0d5f1_300x300.png" loading="lazy"><span class="embedded-post-publication-name">AIStackDecrypted</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">China set to lead global effort on AI safety?</div></div><div class="embedded-post-body">This year&#8217;s World AI Conference (WAIC) in Shanghai was well attended by Chinese AI companies, robotics firms, infrastructure providers, investors, and other companies across the AI stack. The record attendance cited by organizers testifies to the intense interest in AI at many levels across China&#8217;s economy, and the event was dominated by the innovation &#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">9 months ago &#183; 9 likes &#183; 3 comments &#183; Paul Triolo</div></a></div><h1>Conclusions</h1><p>The American AI Action Plan is a welcome signal that Washington is waking up to what&#8217;s necessary to succeed in the new economy:</p><ul><li><p><strong>Open-source wins. </strong>It embraces open-source and open-weight models - crucial to developer and industry soft power</p></li><li><p><strong>Grid stability over ideology. </strong>It confronts the brutal physics of the grid, without an ideological slant - no electrons, no intelligence economy</p></li></ul><p>Yet it tiptoes around two levers that decide whether the American hare will actually start closing the gap:</p><ul><li><p><strong>Talent</strong>. 67K semiconductor hires plus tens of thousands of skilled trades won&#8217;t materialize out of thin air. Skills-first green cards, visa-to-apprenticeship tracks, and a reversal in the student-visa nonsense is critical</p></li><li><p><strong>Diplomacy</strong>. Export controls and America-first platitudes aren&#8217;t effective strategies. Allies need a full AI stack bundled with infrastructure, financing, and standards toolkits, but they need it on a multilateral basis, not one-way</p></li></ul><p>So yes, it&#8217;s a start. But we need more than an Action Plan&#8230; we need a Marshall Plan. Alliances built out of watts, wafers, workers, and a shared vision of the future.</p><p>Because there isn&#8217;t a finish line. There&#8217;s just progress.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Wow, you made it to the end. Kudos! If you liked it, subscribe and drop a comment below</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[DeepSeek and the End of an Era]]></title><description><![CDATA[commoditized intelligence and the rebalancing of AI superpowers]]></description><link>https://www.machineyearning.io/p/deepseek-and-the-end-of-an-era</link><guid isPermaLink="false">https://www.machineyearning.io/p/deepseek-and-the-end-of-an-era</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Fri, 31 Jan 2025 19:34:38 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/0fffb4c5-ce48-4cc2-bdd5-1a554c6fdab5_2912x2096.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>I've spent the past few days drafting this essay, and it's the longest I've published to date. Budget ~20-30 minutes to digest it, best read over coffee.</em></p><p><em>I kept this long because I wanted the arguments in a complete form, and to discourage readers jumping to conclusions from unqualified headlines and half-truths. Fortunately, some major voices have provided fulsome commentary by now, so we get a complete and timely picture vs. pigeonholing into any one narrative.</em></p><div><hr></div><h1>Introduction</h1><p>Over the holidays, a Chinese AI lab called <a href="https://www.deepseek.com">DeepSeek</a> released a pair of large language models which rocked the markets and AI industry. For most, the prevailing wisdom had been that bigger was better - raw compute power, ever-larger parameter counts, and economies of scale were the only path to achieving state-of-the-art intelligence that could compete with the top players.</p><p>Except DeepSeek trounced them all with a tiny fraction of their resources. Reportedly, DeepSeek-V3 was trained on just $5.6M worth of compute<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>, compared to $78M for GPT-4<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> and over $100M for LLaMa 3 405B.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Bpt-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Bpt-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png 424w, https://substackcdn.com/image/fetch/$s_!Bpt-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png 848w, https://substackcdn.com/image/fetch/$s_!Bpt-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png 1272w, https://substackcdn.com/image/fetch/$s_!Bpt-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Bpt-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png" width="1320" height="792" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:792,&quot;width&quot;:1320,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:173119,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Bpt-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png 424w, https://substackcdn.com/image/fetch/$s_!Bpt-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png 848w, https://substackcdn.com/image/fetch/$s_!Bpt-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png 1272w, https://substackcdn.com/image/fetch/$s_!Bpt-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4295f24-bbe4-43d5-9798-c183f4fbebf3_1320x792.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: DeepSeek.</figcaption></figure></div><p>This was a shot across the bow of the AI industry's reigning "sovereigns" - the super-scale labs whose strategy rests on cornering the market for AI capital, compute, and energy. It's the first domino to fall in a chain reaction that will reshape the technopolitical and economic landscape of AI as we know it - to no surprise to watchful observers, but a major surprise to the sovereign ecosystem.</p><p>Here's how we'll break it down:</p><ol><li><p>First, I'll demystify the architectural decisions that contributed to DeepSeek's efficiency gains.</p></li><li><p>Second, I'll lay out strategic implications for "sovereigns," the super-scale AI labs whose entire strategy rested on accessing capital, compute, and energy resources on the scale of nation-states.</p></li><li><p>Third, we'll assess the immediate market fallout and compelling emergent investment themes.</p></li><li><p>Fourth, cover the geopolitical reckoning with example reactions from American and Chinese stakeholders.</p></li></ol><p>Ultimately, I argue that DeepSeek's breakthrough validates the AI commoditization thesis, challenges the sustainability of the sovereign AI model, and heralds a new era of AI diffusion across players and industries - one that demands a strategic reset from stakeholders looking to avoid the "sovereign trap".</p><p><em>Before diving in, keep in mind this essay is a deliberately provocative take meant to challenge orthodoxy. I&#8217;m an investor in companies like <a href="https://fastino.ai">Fastino</a> and <a href="https://positron.ai">Positron</a> that compete with sovereign AI strategies, and am a vocal advocate for open-source models, which may color my analysis of centralized AI power structures. I&#8217;ve aimed for objectivity, but please weigh my affiliations against the merit of my arguments.</em></p><h1>1. The Efficiency Revolution</h1><p>When talking about great power competitions, especially in the tech context, I often talk about "diffusion," a framework Jeffrey Ding (<a href="https://chinai.substack.com/">ChinAI</a>, George Washington University) introduced to characterize the spread of an innovation throughout a population or ecosystem.</p><p>To summarize, general purpose technologies (GPTs) like <a href="https://www.gsb.stanford.edu/insights/andrew-ng-why-ai-new-electricity">electricity</a> or broadband gradually "diffuse" across many industries which make use of their horizontal capabilities. This contrasts against the "leading sector" approach which emphasizes concentrated upheavals in singular technologies, like railroads or automobiles, which while impactful are not as portable across industries.</p><p>Leading sector technologies lend themselves to monopolization, while GPTs benefit greatly from tailwinds that accelerate societal adoption, like drastically lowering the cost of inputs, or widening the base of engineering skills that take advantage of the GPT. Nation-states' success in leveraging these innovations depends on how effectively their institutions respond to those needs.</p><p>Up to now, western institutions in the sovereign AI ecosystem have behaved very much in a leading sector approach, emphasizing concentration of talent, compute, energy, and capital in a handful of captured labs. But DeepSeek's revolutionary efficiency gains was an ecosystem earthquake, causing many to rethink their priors and question the sustainability of the sovereign AI model against an impending tsunami of efficient diffusion.</p><h2>DeepSeek's Architectural Innovations</h2><p>First, let's get up to speed. There are two models DeepSeek released recently, both with their own implications:</p><ul><li><p><strong>DeepSeek-V3</strong> is an advanced language model that tops leaderboards among open-source models, despite its puny compute resources. It builds upon the successful techniques discovered in prior DeepSeek model iterations.</p></li><li><p><strong>DeepSeek-R1</strong> is a "reasoning model," a modified version of DeepSeek-V3 developed using pure Reinforcement Learning (GRPO). It is fully open-source, MIT-licensed, and reportedly performs on par with leading AI models. The model was released on January 20, 2025, and is claimed to be 20 to 50 times more efficient than its primary competitor, OpenAI's o1.</p></li></ul><p>A "reasoning" model is an evolved LLM which uses "chain of thought" to improve its problem-solving capabilities. Basically, it thinks out loud to itself before giving an answer. This usually leads to higher quality answers, especially in complex tasks, workflows, and critical thinking problems. R1's efficiency gains stem from the same algorithmic and recipe improvements that went into V3, so I often refer to them both in the collective.</p><p>To keep it simple, think of DeepSeek's breakthrough as building a more fuel-efficient engine rather than building bigger gas tanks. The team introduced four key innovations that, working together, deliver the same performance as models 10x their size while using a fraction of the resources. Let's break those down:</p><h3>1. FP8 mixed-precision training framework</h3><p>At its core, this is about being smarter about how we store numbers in AI models. Traditional models insist on extreme precision, storing all parameter weights in 16-bit or 32-bit precision, like measuring your height down to the nanometer. </p><p>DeepSeek proved you don't need that level of precision for every calculation, choosing instead to use 8-bit precision wherever possible without inducing quality loss. They're not the first to implement this approach, but it is the first to do it on such a large scale. Several research teams have previously implemented mixed-precision training approaches, such as Microsoft's FP8-LM framework<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> and BitNet<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> (which uses binary weights, seriously pushing the boundaries of low-precision training).</p><p>DeepSeek's implementation yields a 40% reduction in memory usage while maintaining model quality. This is a direct challenge to the "more compute = better AI" orthodoxy that's dominated the field up to now.</p><p>For the technically curious, DeepSeek's implementation includes specialized hardware optimizations and dynamic range adaptation that prevent information loss, detailed in their <a href="https://www.machineyearning.io/publish/post/156186035#footnote-1">technical report</a>.</p><h3>2. Multi-head Latent Attention (MLA)</h3><p>Traditional transformer attention is like having to remember every word you've ever read to understand the next sentence. Developers try to improve prediction performance with larger and larger context windows, but without addressing _how_ information is stored and retrieved, they're anchored to an exponential scaling problem.</p><p>In contrast, DeepSeek's MLA mechanism is more like human memory: it focuses on key concepts and relationships rather than raw data. Instead of storing the entire context window in memory, DeepSeek's MLA mechanism stores a latent vector representation of the context window, compressing the information into a smaller, more manageable size - by 93.3%.</p><p>This reduces memory complexity from O(n&#178;) to O(n&#183;k), where k is a constant - a game-changing improvement for scaling. It enables models to handle context windows 4x longer while using 60% less memory.</p><p>The full architectural details are available in their MLA paper published last year<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a>, but the key takeaway is simple: longer context windows no longer mean exponentially higher costs.</p><h3>3. Auxiliary-loss-free MoE</h3><p>A mixture-of-experts (MoE) model like DeepSeek's is a lot like a hospital. You have lots of specialists on call, and when patients come through the door, you want to route them to the right specialist who can give them the best help. That's why it's called "mixture" of experts - the full 671B model is actually a lot of smaller expert models stitched together.</p><p>You want to make sure each patient sees the right specialist, but time is money, and you don't want doctors sitting idle while others are overwhelmed. Traditional Mixture of Experts (MoE) models face the same challenge - they need complex "traffic control" systems (auxiliary losses) to route work efficiently among specialist neural networks.</p><blockquote><p>If this sounds familiar, it's because it's conceptually similar to the model routers we discussed in <a href="https://www.machineyearning.io/p/betting-on-model-marketplaces-not">Betting on Model Marketplaces</a></p></blockquote><p>While external routers can use fancy techniques to choose between different models, internal MoE routing traditionally required crude balancing mechanisms that often hurt performance by forcing work to less-qualified experts, just to maintain balance. Like sending patients to a dermatologist when they really need a cardiologist, just because the dermatologist's waiting room is empty.</p><p>DeepSeek's innovation is a new gating mechanism that achieves natural load balancing through the architecture itself, by considering both expert specializations and current load. Their implementation modifies the traditional sparse gating function.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a></p><p>When combined with MLA, this approach yielded a 42.5% reduction in training costs and a 5.76x boost in throughput. Using just 21B parameters per token, DeepSeek-V2 matched the performance of models 2-3.5x its size, like Mixtral-8x22B and LLaMa-3 70B, on a drastically lower compute budget.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!awyN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!awyN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png 424w, https://substackcdn.com/image/fetch/$s_!awyN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png 848w, https://substackcdn.com/image/fetch/$s_!awyN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png 1272w, https://substackcdn.com/image/fetch/$s_!awyN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!awyN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png" width="1136" height="526" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:526,&quot;width&quot;:1136,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:169424,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!awyN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png 424w, https://substackcdn.com/image/fetch/$s_!awyN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png 848w, https://substackcdn.com/image/fetch/$s_!awyN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png 1272w, https://substackcdn.com/image/fetch/$s_!awyN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F967419b1-8941-4e3c-b82d-6c2d86bbc594_1136x526.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: DeepSeek.</figcaption></figure></div><p>This matters because MoE architectures are one of the most promising paths to scaling AI capabilities without linear compute increases. While Mistral's Mixtral<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a>, Microsoft's Retentive Network<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a>, and Google's Switch Transformer<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a> demonstrated MoE's potential (and GPT-4 is rumored to use it<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a>), auxiliary losses remained a key bottleneck until now. DeepSeek's implementation found a path to more efficient scaling without complex load balancing schemes.</p><h3>4. Multi-token prediction system</h3><p>Traditional language models generate text one token at a time - like someone reading a book word by word. In order to predict the next token in a sequence, the model draws on both its world knowledge and prior tokens in the current context. Sequential generation therefore creates practical memory and throughput bottlenecks in real-world applications, as the amount of context in the conversation grows and grows. </p><p>To get around this, DeepSeek's implementation predicts four tokens simultaneously, using a "look-ahead" mechanism during both training and inference, to make parallel predictions about the next token. Their technical report describes a causal masking approach that maintains proper sequence dependencies.</p><p>When combined with their MLA architecture and auxiliary-loss-free MoE, DeepSeek reports:</p><ul><li><p>2.5x speedup in text generation</p></li><li><p>40% reduction in training time</p></li><li><p>Quality metrics comparable to sequential prediction</p></li></ul><p>Once again - there's prior art for this technique. Most recently, Meta FAIR published research showing up to 3x speedups through multi-token prediction while improving model quality.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a></p><h3>Recap</h3><p>I hope what's clear by now is that DeepSeek's innovations in no way seem to be secret research or state-sponsored voodoo. These implementations have been validated in prior work by Western sovereigns and open research teams. They're just the first to combine them.</p><p>When combined, the efficiency gains compound tremendously:</p><ul><li><p>Memory: 93.3% reduction through MLA, while extended context windows by 32x</p></li><li><p>Training: 71% reduction in compute costs, achieved with just $5.6M total training budget</p></li><li><p>Throughput: Up to 5.76x improvement during inference</p></li><li><p>Performance: Matches or exceeds models 2-3.5x larger on standard benchmarks</p></li></ul><p>Culminating in a state-of-the-art model trained on $5.6M worth of compute, compared to <a href="https://aiindex.stanford.edu/wp-content/uploads/2024/04/HAI_AI-Index-Report-2024.pdf">$78M (GPT-4)</a>, over <a href="https://www.factorialfunds.com/blog/thoughts-on-llama-3">$100M (LLaMa 3 405B)</a>, and <a href="https://aiindex.stanford.edu/wp-content/uploads/2024/04/HAI_AI-Index-Report-2024.pdf">$191M (Google Gemini Ultra)</a>.</p><h2>The Sovereign AI Trap</h2><p>These efficiency breakthroughs expose a critical flaw in the sovereign AI strategy: what if bigger isn't actually better?</p><p>Thus far, sovereigns' approach to AI development has followed a predictable pattern:</p><ul><li><p>Raise massive amounts of capital</p></li><li><p>Build increasingly large models behind closed doors</p></li><li><p>Maintain strict control over architecture and training data</p></li><li><p>Rely on raw compute power as a competitive moat</p></li><li><p>Where possible, use regulatory capture to strengthen their position<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-14" href="#footnote-14" target="_self">14</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-15" href="#footnote-15" target="_self">15</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-16" href="#footnote-16" target="_self">16</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-17" href="#footnote-17" target="_self">17</a></p></li></ul><p>Wind back the clock a year to February 2024: OpenAI's Sam Altman was attempting to raise $7T (TRILLION) to reshape the global semiconductor industry, arguing he needed unheard of chip fabrication investments meet AI compute demands<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-18" href="#footnote-18" target="_self">18</a>, no doubt influenced by OpenAI's massive cost basis<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-19" href="#footnote-19" target="_self">19</a>.</p><p>Sam's concerns weren't exactly unfounded at the time, but DeepSeek's reported cost advantages have now cast significant doubt on the sticker price he was quoting.</p><p>To put it in perspective, The Information estimated that for 2024, OpenAI spent up to $6B on compute-related costs against annual revenues of $3.5 - $4.5B. If the numbers we've heard are true, that would mean OpenAI is spending more than 2 DeepSeek-R1 training runs <em>per day</em> on compute costs.</p><p>Now it's at this point that skeptics would assert one of three things:</p><ol><li><p>DeepSeek had inherent cost advantages as a 'follower' rather than an 'innovator.'</p></li><li><p>Those aren't apples to apples comparisons on compute-per-dollar efficiency.</p></li><li><p>You fool, you absolute buffoon, don't you understand? Sovereigns can simply replicate DeepSeek's efficiency gains and deploy them across their larger compute footprints.</p></li></ol><p>To which I say: fair. Most articles I've read up to now stop at this point. Let's dig deeper.</p><h3>1. Second-mover advantages</h3><p>Truth be told, followers do have an advantage - Epoch AI has pointed out the physical compute required to achieve a given level of performance drops by 3x every year, so for a 10x training cost reduction, about a 14 month lag time seems appropriate<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-20" href="#footnote-20" target="_self">20</a>. Except DeepSeek-V3 caught up to Llama 3.1 405B, the prior OSS state of the art, in just 5 months - less than half that estimate. That's a significant bucking of the trend.</p><p>Further still, open-source models tend to lag their closed source competitors by about one year. The going assumption is closed-source research labs have access to more compute and data than their open source counterparts (Meta's contributions notwithstanding<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-21" href="#footnote-21" target="_self">21</a>), and it takes time to replicate those models in the open.  </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ENWB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ENWB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png 424w, https://substackcdn.com/image/fetch/$s_!ENWB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png 848w, https://substackcdn.com/image/fetch/$s_!ENWB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png 1272w, https://substackcdn.com/image/fetch/$s_!ENWB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ENWB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png" width="1456" height="799" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:799,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:302192,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ENWB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png 424w, https://substackcdn.com/image/fetch/$s_!ENWB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png 848w, https://substackcdn.com/image/fetch/$s_!ENWB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png 1272w, https://substackcdn.com/image/fetch/$s_!ENWB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c7b0482-789f-4831-8029-40f2723b5fe4_2348x1288.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: Epoch AI</figcaption></figure></div><p>But even OpenAI's o1, considered a breakthrough in reasoning capabilities and a potential "path forward" beyond traditional scaling limitations<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-22" href="#footnote-22" target="_self">22</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-23" href="#footnote-23" target="_self">23</a>, wasn't immune to the DeepSeek effect. Their open-source reasoning model, DeepSeek-R1, met or surpassed o1's performance just 4 months later.</p><p>So yes, DeepSeek enjoyed the fruits of second-mover advantages as Meta, X, Cohere, and others have. But where all the others took about a year to catch up, DeepSeek did it in a third of the time.  </p><h3>2. Compute-per-dollar efficiency</h3><p>Given the effective compute efficiency curves I just mentioned, we know training costs become cheaper overtime for a given architecture and training recipe. So it's unfair to compare GPT-4 and Gemini Ultra costs _at that time_ to DeepSeek's today.</p><p>Fortunately, @arankomatsuzaki and @ldjconfirmed [have done the work for us](https://x.com/arankomatsuzaki/status/1884676245922934788?s=46). Here's the estimated training costs for a benchmark set in 2025 prices:  </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yVDH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42a39957-6870-499f-9de9-4430dd818344_1090x723.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yVDH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42a39957-6870-499f-9de9-4430dd818344_1090x723.png 424w, https://substackcdn.com/image/fetch/$s_!yVDH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42a39957-6870-499f-9de9-4430dd818344_1090x723.png 848w, https://substackcdn.com/image/fetch/$s_!yVDH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42a39957-6870-499f-9de9-4430dd818344_1090x723.png 1272w, https://substackcdn.com/image/fetch/$s_!yVDH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42a39957-6870-499f-9de9-4430dd818344_1090x723.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yVDH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42a39957-6870-499f-9de9-4430dd818344_1090x723.png" width="1090" height="723" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/42a39957-6870-499f-9de9-4430dd818344_1090x723.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:723,&quot;width&quot;:1090,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:222493,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!yVDH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42a39957-6870-499f-9de9-4430dd818344_1090x723.png 424w, https://substackcdn.com/image/fetch/$s_!yVDH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42a39957-6870-499f-9de9-4430dd818344_1090x723.png 848w, https://substackcdn.com/image/fetch/$s_!yVDH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42a39957-6870-499f-9de9-4430dd818344_1090x723.png 1272w, https://substackcdn.com/image/fetch/$s_!yVDH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42a39957-6870-499f-9de9-4430dd818344_1090x723.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: @arankomatsuzaki</figcaption></figure></div><p>For the curious, the researchers provide an online calculator for you to run the numbers yourself.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-24" href="#footnote-24" target="_self">24</a></p><p>"Ah," you might say, "but DeepSeek's model family only has a 6x cost advantage over GPT-4 and Claude-3.5!" And again, I say: fair. But that's not the point. OpenAI and Anthropic have already booked the capital expenditures required to train those models - the money is out the door. And for the upcoming 100K H100 colossal language models, those costs are expected to be higher, astronomically higher.</p><p>What I'm saying is, the apples-to-apples appeal may be important academically, but I only minored, not majored, in economics. I care a lot more about the all-in cost.</p><h3>3. Compute economies of scale</h3><p>Sovereigns absolutely could replicate V3 and R1's algorithmic improvements in their own systems, and I hope they do, users would be better off for it. But they've already spent the money they needed to scale up their inference businesses and training clusters, and need to recoup their capex somehow.</p><p>Under normal circumstances, that might mean artificially higher API prices in the initial months post-rollout, which 'magically' come down as equivalent caliber models diffuse into the market.</p><p>DeepSeek may have just nuked that chance, leaving OpenAI (and others) holding the bag. At the time of writing, OpenAI is charging $15 / 1M input tokens for o1 access, and DeepSeek-R1 is charging $0.55 / 1M tokens.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ilFH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ilFH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ilFH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ilFH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ilFH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ilFH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg" width="720" height="900" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:900,&quot;width&quot;:720,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ilFH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ilFH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ilFH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ilFH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3af7c196-5495-4ab7-ad22-ccfbfb4e1a16_720x900.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: Artificialintellgencenews.in, Instagram.</figcaption></figure></div><p>Expect these margins to narrow soon, but sovereigns need to differentiate themselves and <em>fast</em>.</p><h3>Recap</h3><p>Knowing what we know now, it's worth re-examining OpenAI's trillion-dollar chip ambitions. Were they driven purely by market demand, to more democratically diffuse access to intelligence? Or were they solidifying a competitive moat through artificial scarcity, maintaining a leading edge through anti-competitive means?</p><p>Wherever OpenAI is on the spectrum, DeepSeek severely upset the apple cart. Sovereigns (and their backers) have gone all-in on an extravagantly expensive strategy, and may have cornered themselves into a trap. Without serious differentiation, their lead may not last long enough to recoup their capex. That threatens the multi-billion dollar ecosystem of research labs, hyperscalers, energy companies, venture capital, and national industrial policies that are betting it all on a centralized, compute-intensive future.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-25" href="#footnote-25" target="_self">25</a></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Still with me? Subscribe for more.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1>2. The Efficiency Paradox</h1><p>This was a major shock to the markets, which in the short term clearly wondered whether low-cost compute would nerf demand for high-performance chips and components. Fortunately, as a general-purpose technology becomes more efficient and accessible, aggregate demand increases as the market finds more uses for it. The commoditization of intelligence will likely lead to accelerated adoption across multiple industries, rather than a market contraction.</p><h2>Immediate Fallout</h2><p>This faster-than-expected moat erosion has incumbents scrambling to figure out a response<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-26" href="#footnote-26" target="_self">26</a>, and the markets took notice.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-27" href="#footnote-27" target="_self">27</a> Headlines screamed about the bursting of the hype bubble, as investors processed what DeepSeek's efficiency gains might mean for the ecosystem.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-28" href="#footnote-28" target="_self">28</a></p><p>NVIDIA, the primary beneficiary of sovereigns' compute addiction, booked the biggest one-day loss in history - close to $593B in market cap, a 17% hit<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-29" href="#footnote-29" target="_self">29</a>. Semiconductor and hardware component stocks like Broadcom, Marvell, Micron, and others also took double-digit hits.  </p><p>More surprising to some were the unexpected beneficiaries. Apple, criticized for its conservativism (and borderline braindead Apple Intelligence rollout), saw its stock price actually appreciate by a few points. While surprising to some, Apple has been pursuing a very different strategy, emphasizing edge inference and custom silicon for their own devices. To Apple, it really doesn't matter what models people prefer to use - what matters is where the models live and run.</p><p>sign&#252;ll on twitter had a great breakdown:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GjPp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GjPp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png 424w, https://substackcdn.com/image/fetch/$s_!GjPp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png 848w, https://substackcdn.com/image/fetch/$s_!GjPp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png 1272w, https://substackcdn.com/image/fetch/$s_!GjPp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GjPp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png" width="1170" height="1290" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1290,&quot;width&quot;:1170,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:345685,&quot;alt&quot;:&quot;\&quot;apple bet early on model commoditization. they didn&#8217;t blow capex on some me-too race, didn&#8217;t overreact, didn&#8217;t pivot into panic mode. they just played it steady, focused, controlled. they bet on *edge inference*. the models don&#8217;t live in the cloud; they live on the device. &amp; that strategy is now unequivocally proven directionally accurate\&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="&quot;apple bet early on model commoditization. they didn&#8217;t blow capex on some me-too race, didn&#8217;t overreact, didn&#8217;t pivot into panic mode. they just played it steady, focused, controlled. they bet on *edge inference*. the models don&#8217;t live in the cloud; they live on the device. &amp; that strategy is now unequivocally proven directionally accurate&quot;" title="&quot;apple bet early on model commoditization. they didn&#8217;t blow capex on some me-too race, didn&#8217;t overreact, didn&#8217;t pivot into panic mode. they just played it steady, focused, controlled. they bet on *edge inference*. the models don&#8217;t live in the cloud; they live on the device. &amp; that strategy is now unequivocally proven directionally accurate&quot;" srcset="https://substackcdn.com/image/fetch/$s_!GjPp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png 424w, https://substackcdn.com/image/fetch/$s_!GjPp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png 848w, https://substackcdn.com/image/fetch/$s_!GjPp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png 1272w, https://substackcdn.com/image/fetch/$s_!GjPp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb87652c-2807-4a26-8998-ef57577b7dfa_1170x1290.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: <a href="https://x.com/signulll/status/1884060119714910276">@signulll</a></figcaption></figure></div><p>We're only a few days out from the Monday correction, but I expect Apple will continue to sit pretty for a while longer, thanks to an economic principle that's picked up again in online meme circles - Jevons Paradox.  </p><h2>Jevons Paradox</h2><p>In 1865, English economist William Stanley Jevons observed that across a wide range of industries, the increased efficiency of coal use led to its increased consumption. The less it cost to use, the more uses people found for it, roughly maintaining or growing aggregate demand.</p><p>That's it. It's not complicated. There, saved you a lengthy twitter thread.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Mg6D!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Mg6D!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png 424w, https://substackcdn.com/image/fetch/$s_!Mg6D!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png 848w, https://substackcdn.com/image/fetch/$s_!Mg6D!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png 1272w, https://substackcdn.com/image/fetch/$s_!Mg6D!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Mg6D!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png" width="400" height="290" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:290,&quot;width&quot;:400,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:49147,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Mg6D!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png 424w, https://substackcdn.com/image/fetch/$s_!Mg6D!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png 848w, https://substackcdn.com/image/fetch/$s_!Mg6D!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png 1272w, https://substackcdn.com/image/fetch/$s_!Mg6D!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1f7ae239-7dcb-4b38-93f7-029883df4c22_400x290.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Economists have observed the same phenomenon in hybrid engines and driving distances, expanded highways and traffic times, bandwidth costs and internet usage, and many other examples.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-30" href="#footnote-30" target="_self">30</a></p><p>Now contemporaries are applying the same logic to intelligence as a commodity - the more we have, the more we want. If true, what tailwinds are contributing to an accelerating intelligence' diffusion?<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-31" href="#footnote-31" target="_self">31</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-32" href="#footnote-32" target="_self">32</a>.</p><h2>Tailwinds for intelligence efficiency gains</h2><p>First, let's look at the data. Epoch AI's analysis argues that, in terms of raw resources required to achieve a given level of performance, 2/3rds of the gains are attributable to more scale (chips), and 1/3rd to algorithmic progress.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1zkh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1zkh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png 424w, https://substackcdn.com/image/fetch/$s_!1zkh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png 848w, https://substackcdn.com/image/fetch/$s_!1zkh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png 1272w, https://substackcdn.com/image/fetch/$s_!1zkh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1zkh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png" width="1456" height="887" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:887,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:351036,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1zkh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png 424w, https://substackcdn.com/image/fetch/$s_!1zkh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png 848w, https://substackcdn.com/image/fetch/$s_!1zkh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png 1272w, https://substackcdn.com/image/fetch/$s_!1zkh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F118c3dff-c07f-4d96-9ee5-eb000fc718f2_2230x1358.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: Epoch AI.</figcaption></figure></div><p>It's either better algorithms, more hardware, or better hardware. And while up to now we've focused mostly on training costs (building intelligence), the much larger category of spend is actually inference costs (using intelligence).</p><h3>1. Better algorithms</h3><p>Foundation model developers that can mimic sovereigns' performance capabilities at a fraction of the price are well-positioned, as businesses question the sticker prices they've been sold up to now. <a href="https://www.fastino.ai/">Fastino</a> is one such company, whose novel architecture is so ridiculously efficient that it yields 2,000+ tokens per second on a CPU. (full disclosure, I'm an early investor).</p><p><a href="https://cartesia.ai/">Cartesia</a> is another example that use efficient alternative architectures to meet or eclipse dense transformers' performance on various tasks, with the company claiming its state-space models (SSM) are efficient enough to run just about anywhere.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-33" href="#footnote-33" target="_self">33</a> Coincidentally, SSMs use similar memory compression techniques to DeepSeek's MLA, summarizing context to enable efficient, long-range memory.</p><h3>2. Better hardware</h3><p>The same logic applies to semiconductor efficiency gains with companies like <a href="https://www.positron.ai/">Positron</a>, which I'm also an investor in. Assuming token generation throughput parity (and model compatibility), performance per watt improvements are likely to yield increased demand as a multiple over the NVIDIA baseline.</p><p>Datacenters will need to expand to keep up with compute demand, but newer Blackwell chips from NVIDIA are expected to cost more, not less, in opex, due to critical design decisions that raise electricity consumption and cooling needs.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-34" href="#footnote-34" target="_self">34</a> With downward pressure on compute costs, and physical limits on electricity consumption per rack, datacenter customers have to improve margins where possible.</p><h3>3. Efficient energy systems</h3><p>To get around these problems, datacenters can do quite a lot in optimizing their operations before making any chip decisions. They have physical limits they have to abide by. Options vary for new builds vs retrofits, but some contributors I'm seeing include:</p><ol><li><p><strong>Liquid immersion cooling.</strong> Highly efficient method of cooling datacenter equipment. Strictly on a square-foot basis, reduces datacenter footprints by up to 10x, with the input costs being the container and the coolant, that can upgrade the power usage effectiveness (PUE) of datacenters from <a href="https://journal.uptimeinstitute.com/large-data-centers-are-mostly-more-efficient-analysis-confirms/">~1.55</a> down to as low as <a href="https://www.parkplacetechnologies.com/data-center-liquid-cooling/immersion-cooling/">1.05</a></p></li><li><p><strong>Datacenter power and energy management software.</strong> This market is relatively immature, and most DCs have been caught flat-footed as they've mostly rolled their own custom software up to now. It hasn't been battle-tested for intense AI workloads. Companies like <a href="https://www.centralaxis.com/">CentralAxis</a> that can provide mature, scalable, and user-friendly solutions are well-positioned.</p></li><li><p><strong>Natural gas suppliers.</strong> Everyone in my circles won't stop talking about nuclear energy, which I'm all for and excited about, but the demand exists NOW and we don't yet have nuclear capacity to absorb it. Fortunately, existing natural gas suppliers all over the Permian are already raking it in with datacenter end-customers, and I expect that only to increase overtime.</p></li></ol><h3>4. Edge inference and model commoditization</h3><p>Finally, as we saw with Apple's stock performance, companies positioning themselves for a world of efficient, widely-available models - whether through edge computing, optimization layers, or deployment platforms - are likely to benefit from increased AI adoption even as sovereign margins get compressed. Model commoditization, rather than artificial scarcity, is a more anti-fragile bet.</p><h2>Chip Market Implications</h2><p>With all that said, obviously more chips to use means greater capacity for intelligence diffusion. But as for specific winners here, the hardware story is more nuanced.</p><h3>1. Hardware baselines</h3><p>NVIDIA remains the only serious game in town for high-performance training hardware, though with a twist I didn't expect. Historically, their chips configurations have been inefficiently utilized in inference, using only about 30% of their memory bandwidth at peak<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-35" href="#footnote-35" target="_self">35</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ygs8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ygs8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png 424w, https://substackcdn.com/image/fetch/$s_!Ygs8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png 848w, https://substackcdn.com/image/fetch/$s_!Ygs8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png 1272w, https://substackcdn.com/image/fetch/$s_!Ygs8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ygs8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png" width="1200" height="716" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:716,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:67126,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ygs8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png 424w, https://substackcdn.com/image/fetch/$s_!Ygs8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png 848w, https://substackcdn.com/image/fetch/$s_!Ygs8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png 1272w, https://substackcdn.com/image/fetch/$s_!Ygs8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F217af13f-d8ef-4025-b9dc-c50b8ba9f9c9_1200x716.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: Databricks.</figcaption></figure></div><p>But their new GB300 chip seems purpose-built for reasoning models, which require more memory bandwidth for longer chains of thought. If reasoning models like R1 virally spread at DeepSeek prices, inference demand may actually spur even more demand for NVIDIA's latest chips.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-36" href="#footnote-36" target="_self">36</a></p><h3>2. Inference optimization</h3><p>Aside from performance-per-watt optimization via Positron, customers may want to optimize against speed and latency above all else. Hyper-fast inference chips from companies like <a href="https://www.groq.com">Groq</a> and <a href="https://cerebras.ai/">Cerebras</a> are valuable... but need to make economic sense on a total cost of ownership basis. Groq's design decisions for its LPU chips optimize for speed, but severely sacrifice memory capacity, requiring 10 racks to deploy a 70B model that would otherwise fit on a single GPU<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-37" href="#footnote-37" target="_self">37</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-38" href="#footnote-38" target="_self">38</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-39" href="#footnote-39" target="_self">39</a>.</p><p>The question remains whether that strategy makes sense for diffusing into the entire datacenter market, or if the only path forward is vertical integration, which could be another version of a sovereign trap.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-40" href="#footnote-40" target="_self">40</a></p><h3>3. Infra software</h3><p>The infrastructure software layer is where we start to see clearer winners and losers. NVIDIA's years of investment in CUDA has created an ecosystem and user experience that competitors struggle to match. AMD's story is instructive - despite MI300X being on par with (or better than) NVIDIA's H100/H200 line, AMD's ROCm dogwater software stack prevents them from realizing those hardware advantages, and they've made no successful effort in the past year plus to close that gap.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-41" href="#footnote-41" target="_self">41</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-42" href="#footnote-42" target="_self">42</a></p><p>Having said that, it's worth noting that DeepSeek did collaborate with AMD in developing DeepSeek-V3, which may be a soft endorsement of their Instinct chip line and software stack.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-43" href="#footnote-43" target="_self">43</a></p><p>Basically, what I'm saying is that a rising tide does not lift all boats here. Some boats have holes in them. They need to be patched up, and until then exist on an entirely different demand curve.</p><h2>The Real Paradox</h2><p>I'm crudely bisecting the AI market into 2 sets of players: those accelerating its diffusion, and those rent-seeking through artificial scarcity.</p><p>DeepSeek's breakthrough demonstrated that the moats closed-source model had are not as defensible as sovereigns believed. They may yet adapt by adopting these efficiency gains, and closed-source models do still dominate enterprise workflows for the time being. But DeepSeek's cost trajectory suggests a reckoning is inevitable. The market's reaction shows that investors are starting to recognize this new reality.</p><p>The real efficiency paradox is this: do sovereigns have the cojones to disrupt themselves? Or will they cry foul and double down on protectionism?</p><h1>3. The Technopolitical Reckoning</h1><h2>American Sovereign Responses</h2><p>Predictably, American stakeholders have responded with a mix of hostility, defensiveness, and existential ruminating. I mean, it's hundreds of billions of dollars on the line here.</p><h3>"They cheated"</h3><p>OpenAI <a href="https://www.ft.com/content/a0dfedd1-5255-4fa9-8ccc-1fe01de87ea6">directly accused</a> DeepSeek of using OpenAI models to train their own, by training based on outputs from ChatGPT, which would give DeepSeek an unfair advantage. No evidence has actually be presented yet, just conjecture, and the reactions from Twitter have been... less than sympathetic.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!olZu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!olZu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png 424w, https://substackcdn.com/image/fetch/$s_!olZu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png 848w, https://substackcdn.com/image/fetch/$s_!olZu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png 1272w, https://substackcdn.com/image/fetch/$s_!olZu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!olZu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png" width="1186" height="1162" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1162,&quot;width&quot;:1186,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:338204,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!olZu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png 424w, https://substackcdn.com/image/fetch/$s_!olZu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png 848w, https://substackcdn.com/image/fetch/$s_!olZu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png 1272w, https://substackcdn.com/image/fetch/$s_!olZu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe9ad20f5-a1c4-40f9-86e0-c0359e26971d_1186x1162.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: <a href="https://x.com/growing_daniel/status/1884705676884160959">@growing_daniel</a>.</figcaption></figure></div><h3>"Actually, we hate open source"</h3><p>Meanwhile, <a href="https://pitchbook.com/news/articles/vcs-with-large-llm-stakes-enter-crisis-mode-over-deepseek?utm_medium=newsletter&amp;utm_source=daily_pitch&amp;sourceType=NEWSLETTER">venture capitalists with large stakes in American AI companies</a> are reportedly in "crisis mode", as this breakthrough threatens the competitive advantage of their portfolio companies. Some are questioning whether models should be allowed to be open-sourced at all... which is a ridiculous take.</p><h3>Alleged export control violations</h3><p>To tie this all together, the "<a href="https://x.com/nealkhosla/status/1882859736737194183">DeepSeek is a Chinese psyop</a>" conspiracy (disclosure: the tweet author's dad has a large stake in OpenAI) only works if the CCP secretly granted DeepSeek access to advanced chips China procured through illicit means.</p><p>Alexandr Wang, CEO of Scale AI, made <a href="https://www.reuters.com/technology/artificial-intelligence/what-is-deepseek-why-is-it-disrupting-ai-sector-2025-01-27/">one such (unsubstantiated) claim</a>, that DeepSeek has access to 50,000 advanced AI chips that it shouldn't have, given US export controls. Dario Amodei (Anthropic) and Elon Musk have also been <a href="https://www.inc.com/ben-sherry/ai-leaders-in-the-u-s-react-to-deepseek-calling-it-impressive-but-staying-skeptical/91140125">supporters</a> of this theory.</p><p>I'm not na&#239;ve - there are certainly shady dealings happening in Southeast Asia to evade export controls.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-44" href="#footnote-44" target="_self">44</a> That said, in true open-source spirit, DeepSeek released its complete training recipe for its models, and HuggingFace is already hard at work rebuilding it from scratch... so if it truly would take 50,000 H100s to build it, we'll all know soon enough anyway.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-45" href="#footnote-45" target="_self">45</a></p><p>If it were actually a conspiracy, that would be like handing over a smoking gun. A couple weeks of market confusion, maybe, but no lasting damage. Just a really dumb prank.</p><p>Altogether, these gripes betray a deep misunderstanding of how open source research works. I provided rich citations for DeepSeek's architectural decisions and training recipes explicitly to point out that _none of these techniques are new or unheard of_. This is just the first time someone has put them all together. It's a <a href="https://www.edwardconard.com/macro-roundup/deepseek-a-chinese-ai-company-has-released-deepseek-v3-an-apparent-efficiency-breakthrough-training-deepseeks-open-model-required-significantly-less-more-compute-power-than-closed-model">Sputnik Moment</a>, sure... but as far as China's concerned, their own Sputnik Moment was two years ago when ChatGPT was released.  </p><h2>The China takes</h2><p>In contrast, the response from Chinese media, AI executives, and netizens has been overwhelmingly positive, tinged with national pride and interesting fanart. I'm especially grateful to Jordan Schneider and his team at <a href="https://www.chinatalk.media/">ChinaTalk</a> (longtime listener) for collecting some of the best quotes from first-party sources.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-46" href="#footnote-46" target="_self">46</a></p><blockquote><p>China Daily declared, "For a Chinese LLM, it's a historical moment to surpass ChatGPT in the US."<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-47" href="#footnote-47" target="_self">47</a> Daily Economic News echoed this sentiment, stating, "Silicon Valley Shocked! Chinese AI Dominates Foreign Media, AI Experts Say: 'It Has Caught Up with the U.S.!'"<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-48" href="#footnote-48" target="_self">48</a></p><p>Feng Ji &#20911;&#39589;, founder of Game Science (the studio behind Black Myth: Wukong), called DeepSeek "a scientific and technological achievement that shapes our national destiny (&#22269;&#36816;)."<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-49" href="#footnote-49" target="_self">49</a></p><p>Zhou Hongyi, Chairperson of Qihoo 360, told Jiemian News that DeepSeek will be a key player in the "Chinese Large-Model Technology Avengers Team" to counter U.S. AI dominance.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-50" href="#footnote-50" target="_self">50</a>)</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CNj-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa39288e9-1950-442c-8d42-42543abbaccf_1219x702.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CNj-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa39288e9-1950-442c-8d42-42543abbaccf_1219x702.png 424w, https://substackcdn.com/image/fetch/$s_!CNj-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa39288e9-1950-442c-8d42-42543abbaccf_1219x702.png 848w, https://substackcdn.com/image/fetch/$s_!CNj-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa39288e9-1950-442c-8d42-42543abbaccf_1219x702.png 1272w, https://substackcdn.com/image/fetch/$s_!CNj-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa39288e9-1950-442c-8d42-42543abbaccf_1219x702.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CNj-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa39288e9-1950-442c-8d42-42543abbaccf_1219x702.png" width="1219" height="702" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a39288e9-1950-442c-8d42-42543abbaccf_1219x702.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:702,&quot;width&quot;:1219,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1151335,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CNj-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa39288e9-1950-442c-8d42-42543abbaccf_1219x702.png 424w, https://substackcdn.com/image/fetch/$s_!CNj-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa39288e9-1950-442c-8d42-42543abbaccf_1219x702.png 848w, https://substackcdn.com/image/fetch/$s_!CNj-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa39288e9-1950-442c-8d42-42543abbaccf_1219x702.png 1272w, https://substackcdn.com/image/fetch/$s_!CNj-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa39288e9-1950-442c-8d42-42543abbaccf_1219x702.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: ChinaTalk fanart.</figcaption></figure></div><p>A closed-door session between <a href="https://www.shixiangcap.com/">Shixiang &#25342;&#35937;</a>, a VC spun out from Sequoia China, and dozens of AI researchers, investors, and insiders provides an even richer view on DeepSeek's market entry.</p><p>Key takeaways from the discussion include:</p><ol><li><p>**Celebrating efficiency as a source of resilience.**</p><blockquote><p>"How Chinese large-model teams use less computing power to produce results, thereby having some definite resilience &#8212; or even doing better &#8212; might end up being how the US-China AI landscape plays out in the future."</p></blockquote></li><li><p>**Addressing the efficiency paradox head-on.**</p><blockquote><p>"In the long-run, questions about computing power will remain. Demand for compute remains strong and no company has enough."</p></blockquote></li><li><p>**Open source controls the margins of the whole market; the US discovered that China is not two years behind, but 3 to 9 months.**</p><blockquote><p>"If the capabilities of open-source and closed-source models do not differ greatly, then this presents a big challenge for closed source."</p></blockquote></li><li><p>**This, plus the lack of a good business model for AI labs, heightens the commoditization risk.**</p><blockquote><p>"The business model of AI labs in the United States is not good either. AI does not have a good business model today and will require viable solutions in the future. Liang Wenfeng is ambitious; DeepSeek does not care about the model and is just heading towards AGI."</p></blockquote></li><li><p>**Differentiation has to therefore come from vision, not from technology scarcity.**</p><blockquote><p>"China is still replicating technical solutions; reasoning was proposed by OpenAI in o1, so the next gap between various AI labs will be about who can propose the next reasoning. Infinite-length reasoning might be one vision. The core difference between different AI labs&#8217; models lies not in technology, but in what each lab&#8217;s next vision is. After all, vision matters more than technology."</p></blockquote></li></ol><h1>4. The End of the Sovereign Era?</h1><p>To put it bluntly, "<a href="https://mp.weixin.qq.com/s/DSTLFyM_wj-hE96tG85Jjw">the global diffusion of AI is now irreversible</a>" (full translation <a href="https://www.geopolitechs.org/p/the-global-diffusion-of-ai-is-now">here</a>).</p><ul><li><p>HuggingFace is already hard at work <a href="https://github.com/huggingface/open-r1">replicating R1</a> to prove, without a shadow of a doubt, its validity</p></li><li><p>Community researchers are <a href="https://x.com/carrigmat/status/1884244369907278106">sharing resources</a> for running full, undistilled versions of R1 on prosumer hardware that is highly unlikely to fall into an export ban</p></li></ul><p>The nail seems to be in the coffin for the sovereigns' initial strategy. Don't worry, they've got big treasuries and have time to adapt, but <a href="https://www.machineyearning.io/p/the-model-is-not-the-product">the model is not the product</a>, and never has been. That said, they've been inefficiently burning capital at a rapid clip, and will likely use the market moment to <a href="https://techcrunch.com/2025/01/30/openai-said-to-be-in-talks-to-raise-40b-at-a-340b-valuation/">double-down with additional funding</a>.</p><p>The stark contrast between the American and Chinese responses to DeepSeek's breakthrough reveals deep geopolitical fault lines we already knew were there. Americans responded with skepticism and allegations of foul play, while the Chinese AI community (and frankly, a lot of the open source community) celebrated DeepSeek's achievements as a validation of their own strategic vision.</p><p>Its exciting to see open source provide such a strong counter to power ossification, and I strongly believe this will be a huge boon for boosting AI supply chains and markets in the aggregate. Efficient training techniques, edge inference, and energy-efficient datacenters all point to model commoditization as an inevitability, and these efficiency gains will drive demand to new heights.</p><p>Critics might argue my optimism underplays real moats that sovereigns possess: proprietary data, regulatory capture, or ecosystem lock-in. These are valid concerns. But I still believe efficiency and community momentum will outweigh them overtime.</p><p>In the new era of AI diffusion, I believe winners will be those who embrace intelligence' commoditization and find ways to create value on top of it. If we get this right, we can uplift the entire economy... rather than a few magnificent slices of it.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Stay in the loop. If you liked this, subscribe for more. It&#8217;s free.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p><a href="https://arxiv.org/pdf/2412.19437v1">https://arxiv.org/pdf/2412.19437v1</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p><a href="https://aiindex.stanford.edu/wp-content/uploads/2024/04/HAI_AI-Index-Report-2024.pdf">https://aiindex.stanford.edu/wp-content/uploads/2024/04/HAI_AI-Index-Report-2024.pdf</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p><a href="https://www.factorialfunds.com/blog/thoughts-on-llama-3">https://www.factorialfunds.com/blog/thoughts-on-llama-3</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p><a href="https://arxiv.org/pdf/2310.18313">https://arxiv.org/pdf/2310.18313</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p><a href="https://arxiv.org/pdf/2310.11453">https://arxiv.org/pdf/2310.11453</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p><a href="https://arxiv.org/pdf/2402.17764">https://arxiv.org/pdf/2402.17764</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p><a href="https://arxiv.org/pdf/2405.04434">https://arxiv.org/pdf/2405.04434</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p><a href="https://arxiv.org/pdf/1701.06538.pdf">https://arxiv.org/pdf/1701.06538.pdf</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p><a href="https://mistral.ai/news/mixtral-of-experts/">https://mistral.ai/news/mixtral-of-experts/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p><a href="https://arxiv.org/pdf/2307.08621.pdf">https://arxiv.org/pdf/2307.08621.pdf</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p><a href="https://arxiv.org/pdf/2101.03961.pdf">https://arxiv.org/pdf/2101.03961.pdf</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p><a href="https://www.semianalysis.com/p/gpt-4-architecture-infrastructure">https://www.semianalysis.com/p/gpt-4-architecture-infrastructure</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p><a href="https://arxiv.org/pdf/2404.19737">https://arxiv.org/pdf/2404.19737</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-14" href="#footnote-anchor-14" class="footnote-number" contenteditable="false" target="_self">14</a><div class="footnote-content"><p>This last point is important to track, as it <em>strongly</em> influences sovereign decision-making. OpenAI recently appointed former NSA director Paul Nakasone to its board, while Mistral AI has benefited from an exceptionally close relationship with the French government during EU AI Act negotiations. Even Cohere received $240M from the Canadian government to build out AI compute capacity.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-15" href="#footnote-anchor-15" class="footnote-number" contenteditable="false" target="_self">15</a><div class="footnote-content"><p><a href="https://www.washingtonpost.com/technology/2024/06/13/openai-board-paul-nakasone-nsa/">https://www.washingtonpost.com/technology/2024/06/13/openai-board-paul-nakasone-nsa/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-16" href="#footnote-anchor-16" class="footnote-number" contenteditable="false" target="_self">16</a><div class="footnote-content"><p><a href="https://jacobin.com/2024/03/mistral-france-eu-monopoly-ai-regulation">https://jacobin.com/2024/03/mistral-france-eu-monopoly-ai-regulation</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-17" href="#footnote-anchor-17" class="footnote-number" contenteditable="false" target="_self">17</a><div class="footnote-content"><p><a href="https://www.canada.ca/en/department-finance/news/2024/12/deputy-prime-minister-announces-240-million-for-cohere-to-scale-up-ai-compute-capacity.html">https://www.canada.ca/en/department-finance/news/2024/12/deputy-prime-minister-announces-240-million-for-cohere-to-scale-up-ai-compute-capacity.html</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-18" href="#footnote-anchor-18" class="footnote-number" contenteditable="false" target="_self">18</a><div class="footnote-content"><p><a href="https://www.wsj.com/tech/ai/sam-altman-seeks-trillions-of-dollars-to-reshape-business-of-chips-and-ai-89ab3db0">https://www.wsj.com/tech/ai/sam-altman-seeks-trillions-of-dollars-to-reshape-business-of-chips-and-ai-89ab3db0</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-19" href="#footnote-anchor-19" class="footnote-number" contenteditable="false" target="_self">19</a><div class="footnote-content"><p><a href="https://www.theinformation.com/articles/why-openai-could-lose-5-billion-this-year?rc=adlzu4">https://www.theinformation.com/articles/why-openai-could-lose-5-billion-this-year?rc=adlzu4</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-20" href="#footnote-anchor-20" class="footnote-number" contenteditable="false" target="_self">20</a><div class="footnote-content"><p><a href="https://epoch.ai/blog/algorithmic-progress-in-language-models">https://epoch.ai/blog/algorithmic-progress-in-language-models</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-21" href="#footnote-anchor-21" class="footnote-number" contenteditable="false" target="_self">21</a><div class="footnote-content"><p><a href="https://venturebeat.com/ai/meta-launches-open-source-llama-3-3-shrinking-powerful-bigger-model-into-smaller-size/">https://venturebeat.com/ai/meta-launches-open-source-llama-3-3-shrinking-powerful-bigger-model-into-smaller-size/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-22" href="#footnote-anchor-22" class="footnote-number" contenteditable="false" target="_self">22</a><div class="footnote-content"><p><a href="https://semianalysis.com/2024/12/11/scaling-laws-o1-pro-architecture-reasoning-training-infrastructure-orion-and-claude-3-5-opus-failures/">https://semianalysis.com/2024/12/11/scaling-laws-o1-pro-architecture-reasoning-training-infrastructure-orion-and-claude-3-5-opus-failures/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-23" href="#footnote-anchor-23" class="footnote-number" contenteditable="false" target="_self">23</a><div class="footnote-content"><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:151579244,&quot;url&quot;:&quot;https://garrisonlovely.substack.com/p/is-deep-learning-actually-hitting&quot;,&quot;publication_id&quot;:1990953,&quot;publication_name&quot;:&quot;The Obsolete Newsletter&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b915aa7-6930-4bec-84e9-8c2cdc96290c_500x500.png&quot;,&quot;title&quot;:&quot;Is Deep Learning Actually Hitting a Wall?&quot;,&quot;truncated_body_text&quot;:&quot;[EDIT: I should have credited Gary Marcus for coining the term &#8220;deep learning is hitting a wall\&quot; back in March 2022. I didn&#8217;t actually realize it had originated entirely with him, given how much it&#8217;s entered the lexicon. He argued that LLMs were hitting diminishing returns in April 2024, which was an input into my similar prediction in June. He also had&#8230;&quot;,&quot;date&quot;:&quot;2024-11-13T00:58:58.782Z&quot;,&quot;like_count&quot;:48,&quot;comment_count&quot;:0,&quot;bylines&quot;:[{&quot;id&quot;:97227789,&quot;name&quot;:&quot;Garrison Lovely&quot;,&quot;handle&quot;:&quot;garrisonlovely&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2310100-d92d-4bc5-adb6-f868062b91d9_906x906.jpeg&quot;,&quot;bio&quot;:&quot;Freelance journalist with work in NY Times, BBC Future, The Guardian US, TIME, The Verge, Vox, The Thomson Reuters Foundation, Le Monde Diplomatique, The Nation, Jacobin, and elsewhere. Host of The Most Interesting People I Know.&quot;,&quot;profile_set_up_at&quot;:&quot;2022-06-23T21:05:08.378Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:1989154,&quot;user_id&quot;:97227789,&quot;publication_id&quot;:1990953,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:1990953,&quot;name&quot;:&quot;The Obsolete Newsletter&quot;,&quot;subdomain&quot;:&quot;garrisonlovely&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Reporting and analysis on capitalism, great power competition, and the race for machine superintelligence from journalist w/ work in NYT, BBC Future, The Verge, TIME, Vox, The Guardian, The Nation, and elsewhere.&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1b915aa7-6930-4bec-84e9-8c2cdc96290c_500x500.png&quot;,&quot;author_id&quot;:97227789,&quot;theme_var_background_pop&quot;:&quot;#B599F1&quot;,&quot;created_at&quot;:&quot;2023-09-29T18:06:00.426Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;Garrison Lovely from The Obsolete Newsletter&quot;,&quot;copyright&quot;:&quot;Garrison Lovely&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://garrisonlovely.substack.com/p/is-deep-learning-actually-hitting?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!Zgd0!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1b915aa7-6930-4bec-84e9-8c2cdc96290c_500x500.png" loading="lazy"><span class="embedded-post-publication-name">The Obsolete Newsletter</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">Is Deep Learning Actually Hitting a Wall?</div></div><div class="embedded-post-body">[EDIT: I should have credited Gary Marcus for coining the term &#8220;deep learning is hitting a wall" back in March 2022. I didn&#8217;t actually realize it had originated entirely with him, given how much it&#8217;s entered the lexicon. He argued that LLMs were hitting diminishing returns in April 2024, which was an input into my similar prediction in June. He also had&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">a year ago &#183; 48 likes &#183; Garrison Lovely</div></a></div></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-24" href="#footnote-anchor-24" class="footnote-number" contenteditable="false" target="_self">24</a><div class="footnote-content"><p><a href="https://tnyqnervqldjme1y.vercel.app/">https://tnyqnervqldjme1y.vercel.app/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-25" href="#footnote-anchor-25" class="footnote-number" contenteditable="false" target="_self">25</a><div class="footnote-content"><p><a href="https://pitchbook.com/news/articles/vcs-with-large-llm-stakes-enter-crisis-mode-over-deepseek">https://pitchbook.com/news/articles/vcs-with-large-llm-stakes-enter-crisis-mode-over-deepseek</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-26" href="#footnote-anchor-26" class="footnote-number" contenteditable="false" target="_self">26</a><div class="footnote-content"><p><a href="https://fortune.com/2025/01/27/mark-zuckerberg-meta-llama-assembling-war-rooms-engineers-deepseek-ai-china/">https://fortune.com/2025/01/27/mark-zuckerberg-meta-llama-assembling-war-rooms-engineers-deepseek-ai-china/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-27" href="#footnote-anchor-27" class="footnote-number" contenteditable="false" target="_self">27</a><div class="footnote-content"><p><a href="https://www.reuters.com/technology/chinas-deepseek-sets-off-ai-market-rout-2025-01-27/">https://www.reuters.com/technology/chinas-deepseek-sets-off-ai-market-rout-2025-01-27/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-28" href="#footnote-anchor-28" class="footnote-number" contenteditable="false" target="_self">28</a><div class="footnote-content"><p><a href="https://www.inc.com/ben-sherry/ai-leaders-in-the-u-s-react-to-deepseek-calling-it-impressive-but-staying-skeptical/91140125">https://www.inc.com/ben-sherry/ai-leaders-in-the-u-s-react-to-deepseek-calling-it-impressive-but-staying-skeptical/91140125</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-29" href="#footnote-anchor-29" class="footnote-number" contenteditable="false" target="_self">29</a><div class="footnote-content"><p><a href="https://www.reuters.com/technology/tech-stock-selloff-deepens-deepseek-triggers-ai-rethink-2025-01-28/">https://www.reuters.com/technology/tech-stock-selloff-deepens-deepseek-triggers-ai-rethink-2025-01-28/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-30" href="#footnote-anchor-30" class="footnote-number" contenteditable="false" target="_self">30</a><div class="footnote-content"><p><a href="https://www.nngroup.com/articles/law-of-bandwidth/">https://www.nngroup.com/articles/law-of-bandwidth/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-31" href="#footnote-anchor-31" class="footnote-number" contenteditable="false" target="_self">31</a><div class="footnote-content"><p><a href="https://darioamodei.com/on-deepseek-and-export-controls">https://darioamodei.com/on-deepseek-and-export-controls</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-32" href="#footnote-anchor-32" class="footnote-number" contenteditable="false" target="_self">32</a><div class="footnote-content"><p><a href="https://www.linkedin.com/posts/satyanadella_jevons-paradox-wikipedia-activity-7289521182721093633-5gJ5/">https://www.linkedin.com/posts/satyanadella_jevons-paradox-wikipedia-activity-7289521182721093633-5gJ5/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-33" href="#footnote-anchor-33" class="footnote-number" contenteditable="false" target="_self">33</a><div class="footnote-content"><p><a href="https://techcrunch.com/2024/12/12/cartesia-claims-its-ai-is-efficient-enough-to-run-pretty-much-anywhere">https://techcrunch.com/2024/12/12/cartesia-claims-its-ai-is-efficient-enough-to-run-pretty-much-anywhere</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-34" href="#footnote-anchor-34" class="footnote-number" contenteditable="false" target="_self">34</a><div class="footnote-content"><p><a href="https://semianalysis.com/2024/08/04/nvidias-blackwell-reworked-shipment/">https://semianalysis.com/2024/08/04/nvidias-blackwell-reworked-shipment/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-35" href="#footnote-anchor-35" class="footnote-number" contenteditable="false" target="_self">35</a><div class="footnote-content"><p><a href="https://www.databricks.com/blog/llm-training-and-inference-intel-gaudi2-ai-accelerators">https://www.databricks.com/blog/llm-training-and-inference-intel-gaudi2-ai-accelerators</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-36" href="#footnote-anchor-36" class="footnote-number" contenteditable="false" target="_self">36</a><div class="footnote-content"><p><a href="https://semianalysis.com/2024/12/25/nvidias-christmas-present-gb300-b300-reasoning-inference-amazon-memory-supply-chain/">https://semianalysis.com/2024/12/25/nvidias-christmas-present-gb300-b300-reasoning-inference-amazon-memory-supply-chain/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-37" href="#footnote-anchor-37" class="footnote-number" contenteditable="false" target="_self">37</a><div class="footnote-content"><p><a href="https://groq.com/wp-content/uploads/2024/02/GroqISCAPaper2022_ASoftwareDefinedTensorStreamingMultiprocessorForLargeScaleMachineLearning.pdf">https://groq.com/wp-content/uploads/2024/02/GroqISCAPaper2022_ASoftwareDefinedTensorStreamingMultiprocessorForLargeScaleMachineLearning.pdf</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-38" href="#footnote-anchor-38" class="footnote-number" contenteditable="false" target="_self">38</a><div class="footnote-content"><p><a href="https://www.reddit.com/r/LocalLLaMA/comments/1afm9af/comment/kp2x27l/">https://www.reddit.com/r/LocalLLaMA/comments/1afm9af/comment/kp2x27l/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-39" href="#footnote-anchor-39" class="footnote-number" contenteditable="false" target="_self">39</a><div class="footnote-content"><p><a href="https://semianalysis.com/2024/02/21/groq-inference-tokenomics-speed-but/">https://semianalysis.com/2024/02/21/groq-inference-tokenomics-speed-but/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-40" href="#footnote-anchor-40" class="footnote-number" contenteditable="false" target="_self">40</a><div class="footnote-content"><p><a href="https://groq.com/news_press/aramco-digital-and-groq-announce-progress-in-building-the-worlds-largest-inferencing-data-center-in-saudi-arabia-following-leap-mou-signing/">https://groq.com/news_press/aramco-digital-and-groq-announce-progress-in-building-the-worlds-largest-inferencing-data-center-in-saudi-arabia-following-leap-mou-signing/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-41" href="#footnote-anchor-41" class="footnote-number" contenteditable="false" target="_self">41</a><div class="footnote-content"><p><a href="https://ir.amd.com/news-events/press-releases/detail/1224/amd-reports-third-quarter-2024-financial-results">https://ir.amd.com/news-events/press-releases/detail/1224/amd-reports-third-quarter-2024-financial-results</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-42" href="#footnote-anchor-42" class="footnote-number" contenteditable="false" target="_self">42</a><div class="footnote-content"><p><a href="https://semianalysis.com/2024/12/22/mi300x-vs-h100-vs-h200-benchmark-part-1-training/">https://semianalysis.com/2024/12/22/mi300x-vs-h100-vs-h200-benchmark-part-1-training/</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-43" href="#footnote-anchor-43" class="footnote-number" contenteditable="false" target="_self">43</a><div class="footnote-content"><p><a href="https://www.amd.com/en/developer/resources/technical-articles/amd-instinct-gpus-power-deepseek-v3-revolutionizing-ai-development-with-sglang.html">https://www.amd.com/en/developer/resources/technical-articles/amd-instinct-gpus-power-deepseek-v3-revolutionizing-ai-development-with-sglang.html</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-44" href="#footnote-anchor-44" class="footnote-number" contenteditable="false" target="_self">44</a><div class="footnote-content"><p><a href="https://www.yahoo.com/news/tsmc-cuts-ties-singapore-firm-093000904.html">https://www.yahoo.com/news/tsmc-cuts-ties-singapore-firm-093000904.html </a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-45" href="#footnote-anchor-45" class="footnote-number" contenteditable="false" target="_self">45</a><div class="footnote-content"><p><a href="https://huggingface.co/blog/open-r1">https://huggingface.co/blog/open-r1</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-46" href="#footnote-anchor-46" class="footnote-number" contenteditable="false" target="_self">46</a><div class="footnote-content"><div class="embedded-post-wrap" data-attrs="{&quot;id&quot;:155916148,&quot;url&quot;:&quot;https://www.chinatalk.media/p/deepseek-the-view-from-china&quot;,&quot;publication_id&quot;:4220,&quot;publication_name&quot;:&quot;ChinaTalk&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9b5dde60-871d-48d4-9c21-e4f434b3f3c1_256x256.png&quot;,&quot;title&quot;:&quot;DeepSeek: The View from China&quot;,&quot;truncated_body_text&quot;:&quot;Before December 2024, DeepSeek was rarely mentioned in China&#8217;s AI community. With the release of DeepSeek-V3 and the reasoning model R1, Chinese media and AI researchers started to ask the same question as their American counterparts: Who is DeepSeek and how should we feel about them?&quot;,&quot;date&quot;:&quot;2025-01-28T14:53:14.582Z&quot;,&quot;like_count&quot;:134,&quot;comment_count&quot;:5,&quot;bylines&quot;:[{&quot;id&quot;:1145,&quot;name&quot;:&quot;Jordan Schneider&quot;,&quot;handle&quot;:&quot;chinatalk&quot;,&quot;previous_name&quot;:&quot;jordan schneider&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F03d46bad-4858-4a40-a833-24843e15bf6f_400x400.jpeg&quot;,&quot;bio&quot;:&quot;ChinaTalk Founder and EIC&quot;,&quot;profile_set_up_at&quot;:&quot;2022-03-16T16:20:12.484Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:233878,&quot;user_id&quot;:1145,&quot;publication_id&quot;:4220,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:4220,&quot;name&quot;:&quot;ChinaTalk&quot;,&quot;subdomain&quot;:&quot;chinatalk&quot;,&quot;custom_domain&quot;:&quot;www.chinatalk.media&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Deep coverage of technology, China, and US policy. We feature original analysis alongside interviews with leading thinkers and policymakers.&quot;,&quot;logo_url&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/9b5dde60-871d-48d4-9c21-e4f434b3f3c1_256x256.png&quot;,&quot;author_id&quot;:1145,&quot;theme_var_background_pop&quot;:&quot;#ff9900&quot;,&quot;created_at&quot;:&quot;2018-12-17T01:44:27.292Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;ChinaTalk&quot;,&quot;copyright&quot;:&quot;Jordan Schneider&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member Plan&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}}],&quot;twitter_screen_name&quot;:&quot;jordanschnyc&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100},{&quot;id&quot;:12682021,&quot;name&quot;:&quot;Irene Zhang&quot;,&quot;handle&quot;:&quot;irenezhang&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/af57f9bb-ce01-4a87-9ca9-13612d58e4d9_1168x930.png&quot;,&quot;bio&quot;:&quot;   &quot;,&quot;profile_set_up_at&quot;:&quot;2022-07-17T15:43:17.567Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:1128679,&quot;user_id&quot;:12682021,&quot;publication_id&quot;:1175441,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:1175441,&quot;name&quot;:&quot;Second Drafts&quot;,&quot;subdomain&quot;:&quot;irenezhang&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;no but actually, second drafts&quot;,&quot;logo_url&quot;:null,&quot;author_id&quot;:12682021,&quot;theme_var_background_pop&quot;:&quot;#FF5CD7&quot;,&quot;created_at&quot;:&quot;2022-11-05T05:32:14.398Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;Irene from Second Drafts&quot;,&quot;copyright&quot;:&quot;Irene Zhang&quot;,&quot;founding_plan_name&quot;:null,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;disabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}},{&quot;id&quot;:952652,&quot;user_id&quot;:12682021,&quot;publication_id&quot;:4220,&quot;role&quot;:&quot;contributor&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:4220,&quot;name&quot;:&quot;ChinaTalk&quot;,&quot;subdomain&quot;:&quot;chinatalk&quot;,&quot;custom_domain&quot;:&quot;www.chinatalk.media&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Deep coverage of technology, China, and US policy. We feature original analysis alongside interviews with leading thinkers and policymakers.&quot;,&quot;logo_url&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/9b5dde60-871d-48d4-9c21-e4f434b3f3c1_256x256.png&quot;,&quot;author_id&quot;:1145,&quot;theme_var_background_pop&quot;:&quot;#ff9900&quot;,&quot;created_at&quot;:&quot;2018-12-17T01:44:27.292Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;ChinaTalk&quot;,&quot;copyright&quot;:&quot;Jordan Schneider&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member Plan&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null},{&quot;id&quot;:73772475,&quot;name&quot;:&quot;Angela Shen&quot;,&quot;handle&quot;:&quot;angelacs&quot;,&quot;previous_name&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2f194a-ab97-43ae-be73-43856d22fb63_4126x5501.jpeg&quot;,&quot;bio&quot;:&quot;chief correspondent for biotech, robotics, and AI @ ChinaTalk &quot;,&quot;profile_set_up_at&quot;:&quot;2024-08-16T11:38:15.497Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2943299,&quot;user_id&quot;:73772475,&quot;publication_id&quot;:4220,&quot;role&quot;:&quot;contributor&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:true,&quot;publication&quot;:{&quot;id&quot;:4220,&quot;name&quot;:&quot;ChinaTalk&quot;,&quot;subdomain&quot;:&quot;chinatalk&quot;,&quot;custom_domain&quot;:&quot;www.chinatalk.media&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Deep coverage of technology, China, and US policy. We feature original analysis alongside interviews with leading thinkers and policymakers.&quot;,&quot;logo_url&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/9b5dde60-871d-48d4-9c21-e4f434b3f3c1_256x256.png&quot;,&quot;author_id&quot;:1145,&quot;theme_var_background_pop&quot;:&quot;#ff9900&quot;,&quot;created_at&quot;:&quot;2018-12-17T01:44:27.292Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;ChinaTalk&quot;,&quot;copyright&quot;:&quot;Jordan Schneider&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member Plan&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}},{&quot;id&quot;:2970110,&quot;user_id&quot;:73772475,&quot;publication_id&quot;:2921130,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:2921130,&quot;name&quot;:&quot;Angela Shen&quot;,&quot;subdomain&quot;:&quot;angelacs&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;i read almost everything and write about innovation &amp; tech &quot;,&quot;logo_url&quot;:null,&quot;author_id&quot;:73772475,&quot;theme_var_background_pop&quot;:&quot;#FF6719&quot;,&quot;created_at&quot;:&quot;2024-08-22T12:39:57.024Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:null,&quot;copyright&quot;:&quot;Angela Shen&quot;,&quot;founding_plan_name&quot;:null,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;disabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:true}}],&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null},{&quot;id&quot;:8523415,&quot;name&quot;:&quot;Yiwen&quot;,&quot;handle&quot;:&quot;uncoolkids&quot;,&quot;previous_name&quot;:&quot;Yiwen Lu&quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fa3206db-8cf1-4fcf-8617-f553f9f1ffe3_1176x882.jpeg&quot;,&quot;bio&quot;:null,&quot;profile_set_up_at&quot;:&quot;2022-10-12T23:13:43.264Z&quot;,&quot;publicationUsers&quot;:[{&quot;id&quot;:2402816,&quot;user_id&quot;:8523415,&quot;publication_id&quot;:2379381,&quot;role&quot;:&quot;admin&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:2379381,&quot;name&quot;:&quot;Uncool Kids&quot;,&quot;subdomain&quot;:&quot;uncoolkids&quot;,&quot;custom_domain&quot;:null,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;everything that didn't get into my day job&quot;,&quot;logo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/98b557b7-f71e-40d4-a1bd-27cb38701383_500x500.png&quot;,&quot;author_id&quot;:8523415,&quot;theme_var_background_pop&quot;:&quot;#8AE1A2&quot;,&quot;created_at&quot;:&quot;2024-02-25T21:48:08.110Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;Uncool Kids&quot;,&quot;copyright&quot;:&quot;Yiwen Lu&quot;,&quot;founding_plan_name&quot;:null,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;disabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}},{&quot;id&quot;:2819468,&quot;user_id&quot;:8523415,&quot;publication_id&quot;:4220,&quot;role&quot;:&quot;contributor&quot;,&quot;public&quot;:true,&quot;is_primary&quot;:false,&quot;publication&quot;:{&quot;id&quot;:4220,&quot;name&quot;:&quot;ChinaTalk&quot;,&quot;subdomain&quot;:&quot;chinatalk&quot;,&quot;custom_domain&quot;:&quot;www.chinatalk.media&quot;,&quot;custom_domain_optional&quot;:false,&quot;hero_text&quot;:&quot;Deep coverage of technology, China, and US policy. We feature original analysis alongside interviews with leading thinkers and policymakers.&quot;,&quot;logo_url&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/9b5dde60-871d-48d4-9c21-e4f434b3f3c1_256x256.png&quot;,&quot;author_id&quot;:1145,&quot;theme_var_background_pop&quot;:&quot;#ff9900&quot;,&quot;created_at&quot;:&quot;2018-12-17T01:44:27.292Z&quot;,&quot;rss_website_url&quot;:null,&quot;email_from_name&quot;:&quot;ChinaTalk&quot;,&quot;copyright&quot;:&quot;Jordan Schneider&quot;,&quot;founding_plan_name&quot;:&quot;Founding Member Plan&quot;,&quot;community_enabled&quot;:true,&quot;invite_only&quot;:false,&quot;payments_state&quot;:&quot;enabled&quot;,&quot;language&quot;:null,&quot;explicit&quot;:false,&quot;is_personal_mode&quot;:false}}],&quot;twitter_screen_name&quot;:&quot;itsyiwenlu&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;utm_campaign&quot;:null,&quot;belowTheFold&quot;:true,&quot;type&quot;:&quot;newsletter&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPostToDOM"><a class="embedded-post" native="true" href="https://www.chinatalk.media/p/deepseek-the-view-from-china?utm_source=substack&amp;utm_campaign=post_embed&amp;utm_medium=web"><div class="embedded-post-header"><img class="embedded-post-publication-logo" src="https://substackcdn.com/image/fetch/$s_!6mVK!,w_56,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9b5dde60-871d-48d4-9c21-e4f434b3f3c1_256x256.png" loading="lazy"><span class="embedded-post-publication-name">ChinaTalk</span></div><div class="embedded-post-title-wrapper"><div class="embedded-post-title">DeepSeek: The View from China</div></div><div class="embedded-post-body">Before December 2024, DeepSeek was rarely mentioned in China&#8217;s AI community. With the release of DeepSeek-V3 and the reasoning model R1, Chinese media and AI researchers started to ask the same question as their American counterparts: Who is DeepSeek and how should we feel about them&#8230;</div><div class="embedded-post-cta-wrapper"><span class="embedded-post-cta">Read more</span></div><div class="embedded-post-meta">a year ago &#183; 134 likes &#183; 5 comments &#183; Jordan Schneider, Irene Zhang, Angela Shen, and Yiwen</div></a></div></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-47" href="#footnote-anchor-47" class="footnote-number" contenteditable="false" target="_self">47</a><div class="footnote-content"><p><a href="https://www.163.com/dy/article/JMU4B2EK0530SFP3.html">https://www.163.com/dy/article/JMU4B2EK0530SFP3.html</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-48" href="#footnote-anchor-48" class="footnote-number" contenteditable="false" target="_self">48</a><div class="footnote-content"><p><a href="https://www.nbd.com.cn/articles/2025-01-26/3737743.html">https://www.nbd.com.cn/articles/2025-01-26/3737743.html</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-49" href="#footnote-anchor-49" class="footnote-number" contenteditable="false" target="_self">49</a><div class="footnote-content"><p><a href="https://i.ifeng.com/c/8gTgFsE8dQg">https://i.ifeng.com/c/8gTgFsE8dQg</a></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-50" href="#footnote-anchor-50" class="footnote-number" contenteditable="false" target="_self">50</a><div class="footnote-content"><p><a href="https://m.jiemian.com/article/12295694.html">https://m.jiemian.com/article/12295694.html</a></p></div></div>]]></content:encoded></item><item><title><![CDATA[Bet on model marketplaces, not monopolies.]]></title><description><![CDATA[An anti-fragile strategy.]]></description><link>https://www.machineyearning.io/p/betting-on-model-marketplaces-not</link><guid isPermaLink="false">https://www.machineyearning.io/p/betting-on-model-marketplaces-not</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Thu, 06 Jun 2024 18:06:18 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/8e3b77f0-02a0-49f1-8b74-1e3190460bfc_1456x1048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Last week, I <a href="https://www.machineyearning.io/p/the-model-is-not-the-product">introduced some critiques</a> of &#8216;model-centric&#8217; AI companies, and implications for economics and product strategy. For Part 2 here, we&#8217;ll get a bit more tactical on fine-tuning vs. model routing, then tie off some threads on making your AI business and investing strategies more anti-fragile.</em></p><div><hr></div><h3>&#8220;A word is worth a thousand vectors.&#8221;</h3><p>Given the choice, would you rather be a well-trained expert who stays in their lane, or have the raw intellectual horsepower to master anything?</p><p>Back when I took <a href="https://web.stanford.edu/class/cs224n/">CS224N</a> at Stanford, before the transformer era, Prof. Chris Manning blew our minds with a simple demo. Using an embeddings model, he showed us how semantic concepts like &#8220;royalty&#8221; or &#8220;gender&#8221; could be reduced to arithmetic operations in vector space, e.g. &#8220;king is to queen as man is to woman&#8221; <code>(woman = queen - king + man)</code>. We spent the next hour testing increasingly complex analogies across many domains - book genres, clothing styles, occupations - and in each one, the model generated outputs that seemed, well, weirdly intuitive.</p><p>This was a glimpse into how models perceive language, not as siloed domains, but as an interconnected web of concepts, with clusters and intersections that we can&#8217;t naturally envision&#8230; or as Chris Moody so pithily put it, <a href="https://multithreaded.stitchfix.com/blog/2015/03/11/word-is-worth-a-thousand-vectors/">&#8220;a word is worth a thousand vectors.&#8221;</a></p><p>I thought about this a lot as last year&#8217;s meme of &#8216;domain-specific models&#8217; took off. What constitutes a domain that <em>deserves</em> a specific model? Are the differences we perceive between subject matters so complex that a fine-tuned model is typically the best solution?</p><p>This question is at the heart of many <a href="https://www.bain.com/about/media-center/press-releases/2023/bain--company-announces-services-alliance-with-openai-to-help-enterprise-clients-identify-and-realize-the-full-potential-and-maximum-value-of-ai/">Fortune 500 companies AI strategies</a>, and I&#8217;m not sure they&#8217;re going to like the answer.</p><h3>Your data (probably) doesn&#8217;t matter</h3><p>Unfortunately, fine-tuning your own &lt;Company Name&gt;GPT might be a waste of time.</p><p>My consultant friends wouldn&#8217;t want me telling you this, but beyond some baseline of model intelligence, your data probably doesn't matter that much. While fine-tuning <a href="https://nextword.substack.com/p/rag-vs-finetuning-llms-what-to-use">can be useful</a> for consistent output formats or learning certain styles, most businesses' datasets aren't unique enough in language vector space to warrant the investment.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.linkedin.com/posts/emollick_remember-bloomberggpt-which-was-a-specially-activity-7150359287024795648-65rD/" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8k0q!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f22d07d-1ae7-44d8-8049-e801ccb797ba_2000x1345.png 424w, https://substackcdn.com/image/fetch/$s_!8k0q!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f22d07d-1ae7-44d8-8049-e801ccb797ba_2000x1345.png 848w, https://substackcdn.com/image/fetch/$s_!8k0q!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f22d07d-1ae7-44d8-8049-e801ccb797ba_2000x1345.png 1272w, https://substackcdn.com/image/fetch/$s_!8k0q!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f22d07d-1ae7-44d8-8049-e801ccb797ba_2000x1345.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8k0q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f22d07d-1ae7-44d8-8049-e801ccb797ba_2000x1345.png" width="1456" height="979" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3f22d07d-1ae7-44d8-8049-e801ccb797ba_2000x1345.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:979,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:919991,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://www.linkedin.com/posts/emollick_remember-bloomberggpt-which-was-a-specially-activity-7150359287024795648-65rD/&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8k0q!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f22d07d-1ae7-44d8-8049-e801ccb797ba_2000x1345.png 424w, https://substackcdn.com/image/fetch/$s_!8k0q!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f22d07d-1ae7-44d8-8049-e801ccb797ba_2000x1345.png 848w, https://substackcdn.com/image/fetch/$s_!8k0q!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f22d07d-1ae7-44d8-8049-e801ccb797ba_2000x1345.png 1272w, https://substackcdn.com/image/fetch/$s_!8k0q!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3f22d07d-1ae7-44d8-8049-e801ccb797ba_2000x1345.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Prof. Ethan Mollick, <a href="https://www.linkedin.com/posts/emollick_remember-bloomberggpt-which-was-a-specially-activity-7150359287024795648-65rD/">LinkedIn</a>.</figcaption></figure></div><p>Take <a href="https://www.linkedin.com/posts/emollick_remember-bloomberggpt-which-was-a-specially-activity-7150359287024795648-65rD/">BloombergGPT</a>, for example. Despite costing $10M to build and being trained on 51% proprietary Bloomberg financial data, its <a href="https://arxiv.org/pdf/2303.17564">initial performance edge</a> was <a href="https://arxiv.org/pdf/2305.05862">quickly eclipsed by GPT-4</a> without the latter using any special Bloomberg sauce, just a larger parameter count and more compute.</p><p><a href="https://www.harvey.ai/">Harvey</a> may have followed a similar playbook. Raising $21M <a href="https://siliconangle.com/2023/12/20/harvey-raises-80m-build-generative-ai-legal-professionals/">just a month</a> after BloombergGPT&#8217;s release, with <a href="https://www.lawnext.com/2024/05/harvey-ai-to-move-out-of-early-access-phase-release-more-affordable-versions-of-its-custom-ai-models.html">nearly half</a> the headcount comprised of expensive lawyers, and insisting on a <a href="https://www.artificiallawyer.com/2024/04/16/john-craske-on-cmss-agile-genai-strategy-after-the-harvey-deal/">hefty user-based pricing model</a> ($1200/seat/year, 100-seat minimum), they&#8217;re betting big on fine-tuning, prompt engineering, and negotiated lock-in. But as a <a href="https://medium.com/@Connected_Dots/there-is-something-about-harvey-948e6602ddf1">viral teardown by Edward Bukstel</a> suggests, chances are Harvey spent a ton of money fine-tuning a model that newer ones already outpaced with RAG and/or agentic systems.</p><p>Might be reading into the tea leaves too much, but this sounds to me like an incredibly expensive endeavor fighting last year&#8217;s battle.</p><p>In both cases, <a href="http://www.incompleteideas.net/IncIdeas/BitterLesson.html">The Bitter Lesson</a> rings true - the biggest breakthroughs in AI tend to come from simple, scalable architectures, vs. bespoke tweaks that yield short-term improvements.</p><h3>Models as utilities</h3><p>Instead of building bespoke models that introduce complexity, simplifying how we interact with <em>other peoples&#8217; models</em> is the bigger prize.</p><p>When Andrew Ng talks about how <a href="https://www.gsb.stanford.edu/insights/andrew-ng-why-ai-new-electricity">&#8220;AI is the new electricity,&#8221;</a> here&#8217;s how I think about it: We need electricity for lots of things. Kitchen appliances, lighting, television, tons of applications. But ultimately, does my nephew care that our TV uses electricity which comes from a utility that uses 100% renewable energy? No, he just cares that he can watch Bluey on it.</p><p>However, as the one paying for the electricity, I definitely care about variables like price, availability, and climate-friendliness. I fortunately come from a state where consumers can pick their provider based on these variables, and that transparency creates a more competitive market that ultimately benefits me, the consumer. Although I now live in a state where this choice isn&#8217;t available, and outside of regulatory action, PG&amp;E&#8217;s monopoly gives it little incentive to do anything besides <a href="https://www.sfgate.com/local-donotuse/article/alternatives-to-pge-other-options-rates-monopoly-13533155.php">raise rates</a>, even in bankruptcy.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Rv4u!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Rv4u!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png 424w, https://substackcdn.com/image/fetch/$s_!Rv4u!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png 848w, https://substackcdn.com/image/fetch/$s_!Rv4u!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png 1272w, https://substackcdn.com/image/fetch/$s_!Rv4u!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Rv4u!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png" width="1456" height="1098" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1098,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1645750,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Rv4u!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png 424w, https://substackcdn.com/image/fetch/$s_!Rv4u!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png 848w, https://substackcdn.com/image/fetch/$s_!Rv4u!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png 1272w, https://substackcdn.com/image/fetch/$s_!Rv4u!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F640e2ba5-cd9d-4a34-93ea-40ebe9b9dd2a_1840x1387.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Public Utility Commission of Texas, https://powertochoose.org</figcaption></figure></div><p>Right now, there are dozens of competing large models, each hosted by dozens of token-as-a-service providers, all of whom are competing with each other based on price, quality, climate-friendliness, data privacy, many other variables to win your business. This is a <em>hugely</em> beneficial time to be building if you know how to play these providers off each other.</p><p>You&#8217;re welcome to lock in to a single provider who has your preferred ecosystem, but you&#8217;re SOL if the price jacks up or there&#8217;s <a href="https://www.hindustantimes.com/world-news/us-news/chatgpts-2nd-major-outage-of-the-day-brings-internet-back-to-the-middle-ages-meme-fire-erupts-on-social-media-101717520215005.html">an outage</a>. You&#8217;re also perfectly capable of setting up your own server, just like you could install solar panels on your house, but that carries maintenance costs many would rather not worry about.</p><p>To my nephew, playing Bluey is the TV&#8217;s only job. He&#8217;s not going to be very impressed with my off-the-grid solar array if it&#8217;s cloudy for more than a few days.</p><h3>Routers are an ROI multiplier</h3><p>So if sinking millions into fine-tuning models might not work for most businesses, and you want to avoid the hostile user-based pricing, what&#8217;s the alternative?</p><p>Model routers like <a href="https://withmartian.com">Martian</a> and <a href="https://openrouter.ai">OpenRouter</a> are promising, anti-fragile solutions which deliver all the benefits of a competitive model marketplace with minimal complexity.</p><p>Rather than getting locked into a single model and praying your vendor doesn't jack up prices (or <a href="https://community.openai.com/t/gpt-4-is-dumbed-down-nerfed-gpt-4-vision-is-only-for-specific-group-of-people-as-of-now-nov-2023/486245/2">get dumber</a>), routers use special techniques to dynamically send prompts to whichever model is best suited for it. To be clear, this is much more fine-grained than saying &#8220;Model X is best for &lt;Domain Y&gt;&#8221; - this is <em>task-specific</em> prompt &#8594; model routing.</p><p>The net effect is that you get orders of magnitude reductions in inference costs while enjoying the same (or better) quality results that larger, expensive models would return.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-zl4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-zl4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png 424w, https://substackcdn.com/image/fetch/$s_!-zl4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png 848w, https://substackcdn.com/image/fetch/$s_!-zl4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png 1272w, https://substackcdn.com/image/fetch/$s_!-zl4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-zl4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png" width="1456" height="917" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:917,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:196537,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-zl4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png 424w, https://substackcdn.com/image/fetch/$s_!-zl4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png 848w, https://substackcdn.com/image/fetch/$s_!-zl4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png 1272w, https://substackcdn.com/image/fetch/$s_!-zl4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F549c446d-9552-41f8-a677-eee08056ae10_1840x1159.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: <a href="https://withmartian.com">Martian</a>, <a href="https://cerebralvalley.beehiiv.com/p/martians-interpretable-alternative-transformer">Cerebral Valley</a></figcaption></figure></div><p>Under the hood, routers are built on datasets of pairwise comparisons between model outputs in response to a query. Prompts are assigned a high-level category, such as <code>asking_how_to_question</code> or <code>text_correction</code>, and human raters (or GPT-4 in some cases) select the best output.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!h5bx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23441386-f33b-459a-abbb-93a964473d37_2000x1497.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!h5bx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23441386-f33b-459a-abbb-93a964473d37_2000x1497.png 424w, https://substackcdn.com/image/fetch/$s_!h5bx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23441386-f33b-459a-abbb-93a964473d37_2000x1497.png 848w, https://substackcdn.com/image/fetch/$s_!h5bx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23441386-f33b-459a-abbb-93a964473d37_2000x1497.png 1272w, https://substackcdn.com/image/fetch/$s_!h5bx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23441386-f33b-459a-abbb-93a964473d37_2000x1497.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!h5bx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23441386-f33b-459a-abbb-93a964473d37_2000x1497.png" width="1456" height="1090" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/23441386-f33b-459a-abbb-93a964473d37_2000x1497.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1090,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1422853,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!h5bx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23441386-f33b-459a-abbb-93a964473d37_2000x1497.png 424w, https://substackcdn.com/image/fetch/$s_!h5bx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23441386-f33b-459a-abbb-93a964473d37_2000x1497.png 848w, https://substackcdn.com/image/fetch/$s_!h5bx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23441386-f33b-459a-abbb-93a964473d37_2000x1497.png 1272w, https://substackcdn.com/image/fetch/$s_!h5bx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F23441386-f33b-459a-abbb-93a964473d37_2000x1497.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Pulze&#8217;s open-source dataset for intent-based model routing. <a href="https://github.com/pulzeai-oss/knn-router/tree/main/deploy/pulze-intent-v0.1">https://github.com/pulzeai-oss/knn-router/tree/main/deploy/pulze-intent-v0.1</a></figcaption></figure></div><p>Overtime, a representative corpus of qualitative comparisons between models is built, which can then be used to categorize prompts, then route those prompts to the best performing model for that category. Platforms like <a href="https://chat.lmsys.org/?leaderboard">Chatbot Arena</a> are great for this, though its category granularity could be a bit better.</p><p>However, that&#8217;s just for qualitative comparisons. Deployment teams may want to optimize for speed if latency is a success metric, or cost if spend reduction is the primary goal.</p><p>Many organizations (including OpenAI, if the rumors are true*) leverage model routers internally to optimize their inference costs; they route &#8220;simpler&#8221; requests to smaller models that are much cheaper to run (but lack sophistication), and route more complex, thoughtful prompts to more expensive models better equipped to provide helpful responses. These dynamic systems can significantly outperform any single-model system on multiple dimensions.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aoUS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aoUS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png 424w, https://substackcdn.com/image/fetch/$s_!aoUS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png 848w, https://substackcdn.com/image/fetch/$s_!aoUS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png 1272w, https://substackcdn.com/image/fetch/$s_!aoUS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aoUS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png" width="1456" height="910" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:910,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:512552,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aoUS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png 424w, https://substackcdn.com/image/fetch/$s_!aoUS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png 848w, https://substackcdn.com/image/fetch/$s_!aoUS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png 1272w, https://substackcdn.com/image/fetch/$s_!aoUS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F45a5c4e4-18a4-4e31-ac4f-0930058a5b32_2000x1250.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <a href="https://winder.ai/exploring-small-language-models/">&#8220;Exploring Small Language Models&#8221;,</a> winder.ai.</figcaption></figure></div><p><em>* This is at the center of the &#8220;GPT-4 is getting dumber&#8221; meme. At a new model&#8217;s launch, OpenAI would want all prompts to deliver high-quality results to connect its brand to consistently high intelligence. But not all prompts are created equal. At full optimization, dumb questions get &lt;proportionate&gt; answers from lower-parameter model variants.</em></p><h3>Getting started</h3><p>Whatever system you end up using, you should definitely be capturing user prompts, outcome ratings, and other metadata for your own analyses later. This will help RLHF better routing systems over time, tailoring them to your specific application, and contributing to defensibility against unoptimized wrapper and single-model competitors.</p><p>You may also find certain tasks are consistent and frequent enough to justify creating your own small language model (SLM) to serve them. This targeted approach can compound inference cost savings even further.</p><p>If you're interested in exploring model routers for your own applications, the team at <a href="https://www.pulze.ai/">Pulze AI</a> has generously open-sourced their <a href="https://github.com/pulzeai-oss/knn-router">intent-based router</a>, <a href="https://github.com/pulzeai-oss/knn-router/tree/main/deploy/pulze-intent-v0.1">comparison dataset</a>, and <a href="https://huggingface.co/pulze/intent-v0.1">intent embedding model</a>, which are great starting points for teams looking to roll their own solutions.</p><h3>Wrap-up</h3><p>Given that &#8220;<a href="https://www.semianalysis.com/p/inference-race-to-the-bottom-make">almost everyone is losing money on LLM inference,</a>&#8221; the anti-fragile approach model routers offer is pretty refreshing. While everyone else seems to be rushing to build ever-larger generalist models or domain-specific models, you might want to leave that particular race to the well-funded labs draining VCs dry.</p><p>Model routing isn&#8217;t just a technical solution - it&#8217;s fundamentally democratizing, as it encourages <em>more</em>, not fewer, models to be built. Instead of raising billions for a generalist model only to fail, as <a href="https://inflection.ai/the-new-inflection">Inflection</a> did, devs can build niche specialist models that plug into a routable model marketplace. If they&#8217;re objectively better, they&#8217;ll be used and yield a profit.</p><p>Make no mistake, that doesn&#8217;t mean <em>every</em> specialist model constitutes building a venture-scale company around it. And I don&#8217;t think &#8220;LLM for Finance&#8221; or &#8220;LLM for Legal&#8221; counts as &#8220;specialized.&#8221; But in aggregate, a constellation of LLMs, SLMs, and task-specific agents, stitched together by a robust routing layer, starts to look an awful lot like the future of the field.</p><p>For more on routers, check out Cerebral Valley&#8217;s <a href="https://cerebralvalley.beehiiv.com/p/martians-interpretable-alternative-transformer">Martian deep dive</a> into Martian, where founders Yash and Etan share their inside story.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Stay in the loop. Subscribe for bi-weekly updates.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[The model is not the product.]]></title><description><![CDATA[Or: we need to talk about ROI.]]></description><link>https://www.machineyearning.io/p/the-model-is-not-the-product</link><guid isPermaLink="false">https://www.machineyearning.io/p/the-model-is-not-the-product</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Fri, 31 May 2024 17:30:20 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/4d2f1231-d8a6-416a-ae5f-db3e9505dab7_1456x1048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This post is part 1 of a 2-part series on evolving &#8220;model-centric&#8221; building and investing themes. We'll explore economic challenges of inference and product strategy implications. Stay tuned for part 2 next week, where we'll dive into the tactical side of model routing and other cost optimizations for teams and businesses.</em></p><div><hr></div><h3>The elephant in the boardroom</h3><p>As AI hype continues to rip, a nagging question is popping up in boardroom circles - are we actually seeing real impact from our AI strategies?</p><p>A few weeks ago, we co-hosted <a href="https://peterleyden.substack.com/p/how-ai-could-supercharge-progress">an event</a> with Reinvent Futures here at SHACK15, which brought together leaders from the tech and sustainability spheres. Matt Kropp, CTO of BCG X (BCG&#8217;s tech build &amp; design unit), <a href="https://www.youtube.com/watch?v=ZdjBHgMNlpQ">kicked things off</a> with a pretty clear signal from their clients: &#8220;we're now getting to the point where there's some question about whether we're actually getting impact.&#8221;</p><p>The <a href="https://www.bcg.com/publications/2024/from-potential-to-profit-with-genai">surveys</a> definitely back this up, and I&#8217;m personally hearing this more and more among consumers and enterprises alike.</p><ul><li><p>Consumers are feeling the squeeze of subscription fees, as research labs start to lock the best models behind paywalls.</p></li><li><p>Enterprises are still feeling pressure from their boards to adopt GenAI tools (89% say AI and GenAI are in top 3 tech priorities for this year), but struggle to figure out which ones to use.</p></li><li><p>Even with a hefty "enterprise" discount, no sane CIO or CFO is going to greenlight a plan that charges $240 per user per year for access to OpenAI's finest, because they first want to know&#8230; what will I actually use this for?</p></li></ul><p>LLMs are becoming commoditized, so teams building AI products they want businesses to use need to focus on building complete solutions solving concrete problems. Simply participating in the LLM gold rush isn't enough anymore, because <strong>the model is not the product</strong>.</p><h3>Model-centric missteps</h3><p><a href="https://reutersinstitute.politics.ox.ac.uk/what-does-public-six-countries-think-generative-ai-news#header--2">Over 70% of people</a> haven&#8217;t even used generative AI tools yet, let alone know the difference between distinct models. While I&#8217;m grateful for leaderboards from <a href="https://crfm.stanford.edu/helm/mmlu/latest/">Stanford</a>, <a href="https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard">HuggingFace</a>, and most recently <a href="https://scale.com/leaderboard">Scale</a>, the benchmarks we use are somewhat academic, or at least lack the specificity one could use making a business decision about which model to go with.</p><p>The first chapter of the AI arms race has had VCs shoveling money towards teams building model-centric research labs. These models are great across a range of tasks, but lack significant differentiation from one another, and may eventually fall short of investor expectations. <a href="https://inflection.ai/the-new-inflection">Inflection</a>&#8217;s <a href="https://medium.com/@ignacio.de.gregorio.noblejas/the-first-big-ai-failure-just-took-place-about-time-0ef53fe0c941">implosion</a> and <a href="https://stability.ai">Stability AI</a>&#8217;s <a href="https://www.theinformation.com/articles/stability-ai-facing-cash-crunch-discusses-sale?rc=adlzu4">public struggle</a> demonstrate the eventual risks of this model-centric approach.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XBUC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XBUC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png 424w, https://substackcdn.com/image/fetch/$s_!XBUC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png 848w, https://substackcdn.com/image/fetch/$s_!XBUC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png 1272w, https://substackcdn.com/image/fetch/$s_!XBUC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XBUC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png" width="1456" height="835" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:835,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:116637,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XBUC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png 424w, https://substackcdn.com/image/fetch/$s_!XBUC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png 848w, https://substackcdn.com/image/fetch/$s_!XBUC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png 1272w, https://substackcdn.com/image/fetch/$s_!XBUC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa65260e7-88ad-4a44-83c9-7dea536c1234_2000x1147.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: Fireship&#8217;s <a href="https://www.youtube.com/watch?v=krixaEhLnlA">&#8220;The Code Report.&#8221;</a> </figcaption></figure></div><p>Inflection in particular shows that war chests are not a guarantee of product-market fit. Despite raising $1.3B last year alone, they were late to an already crowded party and failed to gain traction. Their <a href="https://inflection.ai/press">Pi assistant</a>, while friendly and acclaimed by users, ultimately may not have been distinct enough from the ChatGPTs, Claudes, and Geminis of the world to warrant much attention.</p><p>Similarly, Stability, despite hitting unicorn status in 2022 thanks to the buzz around Stable Diffusion, failed to build a compelling business around their family of open-source models. The financial situation became pretty dire: in Q1 2024, they reportedly lost $30M on a meager $5M in revenue (with nearly $100M in unpaid cloud compute bills). So despite great contributions to open source, they didn&#8217;t create a revenue model which could support demand for its service, hampering their ability to raise more cash.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!syD3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!syD3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png 424w, https://substackcdn.com/image/fetch/$s_!syD3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png 848w, https://substackcdn.com/image/fetch/$s_!syD3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png 1272w, https://substackcdn.com/image/fetch/$s_!syD3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!syD3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png" width="1456" height="871" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:871,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1656115,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!syD3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png 424w, https://substackcdn.com/image/fetch/$s_!syD3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png 848w, https://substackcdn.com/image/fetch/$s_!syD3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png 1272w, https://substackcdn.com/image/fetch/$s_!syD3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F04198de5-84fb-4d90-bb5b-df679e02f89a_2000x1197.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The original DreamStudio from Stability. Credit: <a href="https://www.producthunt.com/products/stable-diffusion-dreamstudio#stable-diffusion-dreamstudio.">Product Hunt</a></figcaption></figure></div><h3>Behold &#8220;AI Operating Systems&#8221;</h3><p>So if the model-centric approach is losing its luster, what are market leaders doing instead? They&#8217;re shifting their focus from best-in-class general models to <a href="https://stratechery.com/2024/ai-integration-and-modularization/">tightly integrated ecosystems</a>.</p><p>OpenAI is doubling down on the sci-fi dream of an always-on, hyper-personalized AI companion (<em>Her</em>, but IRL), while Google and Apple are leveraging existing product and device ecosystems to make their models indispensable across every touchpoint. Meanwhile, Microsoft is betting on &#8220;Copilot+&#8221; branding as a value-add for its PC and productivity software businesses.</p><p>Each player is trying to weave LLMs into a stickier, more defensible product strategy as the go-to &#8220;AI Operating System.&#8221; Google, Apple, and Microsoft have extensive network effects they can already leverage for lock-in, and while OpenAI is playing catch-up here, their market share with ChatGPT has so far been substantial enough to dictate the pace of development.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://youtu.be/aQ8UVSXnefk" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!M-IB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba459b09-332a-468f-8d55-04187205fdf2_1280x720.jpeg 424w, https://substackcdn.com/image/fetch/$s_!M-IB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba459b09-332a-468f-8d55-04187205fdf2_1280x720.jpeg 848w, https://substackcdn.com/image/fetch/$s_!M-IB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba459b09-332a-468f-8d55-04187205fdf2_1280x720.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!M-IB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba459b09-332a-468f-8d55-04187205fdf2_1280x720.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!M-IB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba459b09-332a-468f-8d55-04187205fdf2_1280x720.jpeg" width="1280" height="720" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ba459b09-332a-468f-8d55-04187205fdf2_1280x720.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://youtu.be/aQ8UVSXnefk&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!M-IB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba459b09-332a-468f-8d55-04187205fdf2_1280x720.jpeg 424w, https://substackcdn.com/image/fetch/$s_!M-IB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba459b09-332a-468f-8d55-04187205fdf2_1280x720.jpeg 848w, https://substackcdn.com/image/fetch/$s_!M-IB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba459b09-332a-468f-8d55-04187205fdf2_1280x720.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!M-IB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fba459b09-332a-468f-8d55-04187205fdf2_1280x720.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">After all, what&#8217;s stickier than love &#10024;</figcaption></figure></div><p>But while this AI OS approach may be useful for consumer lock-in, it doesn&#8217;t solve the primary problems businesses have integrating models into their own products and services. Businesses aren&#8217;t buying models or ecosystems - they&#8217;re &#8220;hiring&#8221; them to perform specific <a href="https://www.christenseninstitute.org/jobs-to-be-done/">jobs-to-be-done</a>. And they need the flexibility to choose the best, fastest, and cheapest option for each job, not to be beholden to a single provider.</p><h3>Unsustainable inference costs</h3><p>Even the most seamless ecosystem will struggle if the economics don&#8217;t add up. The <a href="https://www.wsj.com/tech/ai/how-a-shifting-ai-chip-market-will-shape-nvidias-future-f0c256b1">cost of at-scale inference is staggering</a>, and it doesn&#8217;t seem like any vendors serving models today are making a profit.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0E7H!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0E7H!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png 424w, https://substackcdn.com/image/fetch/$s_!0E7H!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png 848w, https://substackcdn.com/image/fetch/$s_!0E7H!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png 1272w, https://substackcdn.com/image/fetch/$s_!0E7H!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0E7H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png" width="479" height="479" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:479,&quot;bytes&quot;:2594560,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0E7H!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png 424w, https://substackcdn.com/image/fetch/$s_!0E7H!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png 848w, https://substackcdn.com/image/fetch/$s_!0E7H!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png 1272w, https://substackcdn.com/image/fetch/$s_!0E7H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff115b510-f737-41e4-a5b4-7c0ce45096fd_2000x2000.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <a href="https://www.semianalysis.com/p/inference-race-to-the-bottom-make">SemiAnalysis</a></figcaption></figure></div><p>This introduces some weird market dynamics.</p><p>Token-as-a-service providers are playing by a predictable growth-at-all-costs playbook: sacrificing near-term profitability for user acquisition, then making it up on volume once you&#8217;ve cornered the market. So for the time being, no one really knows what the willingness-to-pay is, because token prices are being kept artificially low.</p><p>As we&#8217;ve seen with Inflection and Stability, that strategy may have its limits, especially if they fail to distinguish their open offerings from paid ones. In turn, these failures may cause disillusionment, consolidation, and upward pricing trends.</p><p>At first glance, this could seem like a good thing (VC-subsidized prices, yay), but there are significant risks. Basing your own product strategy on unsustainably low prices can cause a severe hangover to your cost structure if and when those prices eventually do rise. Moreover, vendor lock-in carries the risk of your chosen model suddenly becoming unavailable or prohibitively expensive if they fail to hit profitability. You need to be careful not to anchor your product too much on the idiosyncrasies of a single vendor&#8217;s model family.</p><p>A valid counter would be that Moore&#8217;s Law is naturally driving down inference costs overtime, hence the &#8220;make it up on volume later&#8221; meme. While this is true that there&#8217;s about a <a href="https://epochai.org/trends#investment-trends-section">3x reduction</a> per year in physical compute required to hit a given performance target, it&#8217;s also true that training costs for frontier models are rising by <a href="https://epochai.org/blog/trends-in-the-dollar-training-cost-of-machine-learning-systems">a similar rate</a> at 3.1x per year. So if you want last year&#8217;s model, that will be cheaper, but if you want the latest and greatest, you&#8217;re still going to have to pay.</p><p>In short, the current economics of AI inference are concerningly unstable, and businesses should be strategic in navigating that when building products that use external models.</p><h2>Recommendations</h2><p>To recap, the path to real impact and ROI remains unclear for many businesses under pressure to adopt AI in their products and services. So, what&#8217;s a savvy team to do when crafting their strategy? Two options stand out in contrast to the model-centric approach:</p><ol><li><p><strong>Highly optimized token-as-a-service providers</strong> like <a href="https://deepinfra.com/">DeepInfra</a> offer a compelling value proposition, by focusing solely on efficient model hosting and serving. By comparing models and hosts on independent leaderboards like <a href="https://artificialanalysis.ai/leaderboards/providers">Artificial Analysis</a>, businesses can find the best bang for their inference buck.</p></li><li><p><strong>Model routers</strong> like <a href="https://leaderboard.withmartian.com/">Martian</a> and <a href="https://openrouter.ai">OpenRouter</a> take this a step further by dynamically allocating queries across multiple models based on cost, speed, and performance. They&#8217;re constantly tracking the model tokenomics and quality with insane prompt-specific granularity, which lets customers tap into the best available models for any use case without vendor lock-in.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zBnH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zBnH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png 424w, https://substackcdn.com/image/fetch/$s_!zBnH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png 848w, https://substackcdn.com/image/fetch/$s_!zBnH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png 1272w, https://substackcdn.com/image/fetch/$s_!zBnH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zBnH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png" width="1456" height="858" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:858,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:736990,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zBnH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png 424w, https://substackcdn.com/image/fetch/$s_!zBnH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png 848w, https://substackcdn.com/image/fetch/$s_!zBnH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png 1272w, https://substackcdn.com/image/fetch/$s_!zBnH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb46da9fc-7f25-4806-a1c7-2499cb89cdf4_2000x1179.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: <a href="https://artificialanalysis.ai/leaderboards/providers">Artificial Analysis</a>.</figcaption></figure></div><p>Of course, each solution has their tradeoffs, which we&#8217;ll dive into next week with a tactical breakdown in part 2. We&#8217;ll explore, model routing, fine-tuning, leaderboarding, and more practical takeaways for AI strategies. Until then, I&#8217;d recommend reading Cerebral Valley&#8217;s <a href="https://cerebralvalley.ai/blog/martians-interpretable-alternative-to-the-transformer-5mRbYFwNosh1s7d4EYzmza">deep dive</a> with the Martian team to get familiar with what they&#8217;re up to.</p><p>But for now, the key takeaways are this: <strong>the model is not the product</strong>, and the businesses that will thrive are those that <strong>match the right model to the right job at the best price.</strong></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Stay in the loop. Subscribe for bi-weekly updates.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[It's time to move on from copilots.]]></title><description><![CDATA[We can build better than this.]]></description><link>https://www.machineyearning.io/p/its-time-to-move-on-from-copilots</link><guid isPermaLink="false">https://www.machineyearning.io/p/its-time-to-move-on-from-copilots</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Tue, 14 May 2024 18:01:07 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b350bdb4-7a43-4c75-8e01-171cf0381d5c_1456x1048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I&#8217;ve been saying for months that I don&#8217;t like to invest in application-layer AI startups. Here&#8217;s why - and what it would take to change my mind.</p><div><hr></div><h2>The first wave under-delivered</h2><p>When ChatGPT debuted, a friend raved to me about <a href="https://sudowrite.com">Sudowrite</a>, a thoughtfully designed fiction-writing app created by James Yu and Amit Gupta. Sudowrite leverages LLMs to help authors stay in flow, organizing LLM calls into purpose-built features for tasks like world-building, pacing, and outlining. With a vibrant user community stress-testing new features, <a href="https://churnkey.co/case-studies/sudowrite/">millions</a> in annual revenue, <a href="https://sudowrite.notion.site/We-re-hiring-engineers-to-make-writing-magical-389c57f5ae3a421d8f8c0b48c8407e88">profitability</a>, and fewer than <a href="https://www.linkedin.com/company/sudowrite/people/">10 employees</a>, Sudowrite exemplifies what&#8217;s possible when generative AI products are built with deep user empathy and domain knowledge.</p><p>At the time, I thought this foreshadowed a wave of similarly well-designed AI products across many industries.</p><p>A year and a half later, that prediction remains mostly unfulfilled.</p><p>Many first-wave AI startups have failed to deliver little more than thin UI wrappers around generic LLMs. I&#8217;ve critiqued Jasper before, and it looks like Tome (<a href="https://fortune.com/2022/12/20/tome-genereative-ai-presentation-software-chatgpt-openai-powerpoint/">text-to-powerpoint</a>) <a href="https://www.theinformation.com/articles/the-ai-presentation-startup-losing-its-rose-colored-glasses-musks-gpu-dreams?rc=adlzu4">may be next</a> <a href="https://www.semafor.com/article/04/16/2024/ai-startup-tome-lays-off-staff-to-focus-on-revenue">on the chopping block</a>. Turns out defining your ideal customer profile and product is crucial before hiring sales reps.</p><p>This trend extends beyond software. First-gen consumer AI hardware like the <a href="https://www.theverge.com/24126502/humane-ai-pin-review">Humane AI Pin</a> and <a href="https://www.theverge.com/2024/5/2/24147159/rabbit-r1-review-ai-gadget">Rabbit R1</a> have been dead-on-arrival, tethered to the latency, limits, and hallucinations of off-the-shelf LLMs. As <a href="https://www.youtube.com/watch?v=ddTV12hErTc&amp;ab_channel=MarquesBrownlee">MKBHD</a> (Marques Brownlee) put it, they&#8217;ve failed to deliver <a href="https://www.youtube.com/watch?v=ddTV12hErTc">compelling</a> (or even <a href="https://www.youtube.com/watch?v=TitZV6k8zfA&amp;pp=ygUMbWtiaGQgaHVtYW5l">passable</a>) user experiences.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://twitter.com/MKBHD/status/1785102259740667960?ref_src=twsrc%5Etfw%7Ctwcamp%5Etweetembed%7Ctwterm%5E1785102259740667960%7Ctwgr%5E%7Ctwcon%5Es1_c10&amp;ref_url=notion%3A%2F%2Fwww.notion.so%2FAgents-Are-All-You-Need-a40802a02efa4ff595a0ced0eb66f904%3Fpvs%3D25" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BYti!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee906a60-93d2-4278-be3a-c5000c9805f3_2494x1890.png 424w, https://substackcdn.com/image/fetch/$s_!BYti!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee906a60-93d2-4278-be3a-c5000c9805f3_2494x1890.png 848w, https://substackcdn.com/image/fetch/$s_!BYti!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee906a60-93d2-4278-be3a-c5000c9805f3_2494x1890.png 1272w, https://substackcdn.com/image/fetch/$s_!BYti!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee906a60-93d2-4278-be3a-c5000c9805f3_2494x1890.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BYti!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee906a60-93d2-4278-be3a-c5000c9805f3_2494x1890.png" width="1456" height="1103" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ee906a60-93d2-4278-be3a-c5000c9805f3_2494x1890.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1103,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1766842,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://twitter.com/MKBHD/status/1785102259740667960?ref_src=twsrc%5Etfw%7Ctwcamp%5Etweetembed%7Ctwterm%5E1785102259740667960%7Ctwgr%5E%7Ctwcon%5Es1_c10&amp;ref_url=notion%3A%2F%2Fwww.notion.so%2FAgents-Are-All-You-Need-a40802a02efa4ff595a0ced0eb66f904%3Fpvs%3D25&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BYti!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee906a60-93d2-4278-be3a-c5000c9805f3_2494x1890.png 424w, https://substackcdn.com/image/fetch/$s_!BYti!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee906a60-93d2-4278-be3a-c5000c9805f3_2494x1890.png 848w, https://substackcdn.com/image/fetch/$s_!BYti!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee906a60-93d2-4278-be3a-c5000c9805f3_2494x1890.png 1272w, https://substackcdn.com/image/fetch/$s_!BYti!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fee906a60-93d2-4278-be3a-c5000c9805f3_2494x1890.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: Marques Brownlee (@MKBHD), Alex Finn (@AlexFinnX)</figcaption></figure></div><p>This &#8220;wrapper&#8221; or &#8220;copilot&#8221; design philosophy is at the center of my beef with application-layer startups, and why I generally avoid investing in them (with limited exceptions I&#8217;ll cover at the end of this post*).</p><p>Most seem to follow the same playbook - inject someone else&#8217;s LLM into a user-facing SaaS model, christen it a &#8220;copilot&#8221;, and ignore competitive analyses because they misread Christensen&#8217;s Innovator&#8217;s Dilemma. At its current stage, generative AI has been more of a <em>sustaining</em> innovation for incumbents, not a <em>disruptive</em> one they&#8217;ve been reluctant to adopt.</p><p>Yes, I&#8217;m aware many copilots claim to be much more than a wrapper. Some emphasize that they use fine-tuned, domain-specific models that outperform generic LLMs. However, even category leaders building these kinds of products can be paper tigers. Case in point - Harvey, the buzziest player in AI x legaltech.</p><h2>Harvey: Legal Jasper?</h2><p>I&#8217;m going to go on record and say what I&#8217;m seeing with Harvey smacks a great deal like Jasper last year - Harvey&#8217;s last raise <a href="https://www.theinformation.com/articles/legal-ai-startup-harvey-valued-at-700-million-in-kleiner-co-led-round">pegged them at $700M</a>+, but leaked screenshots of their tool leave a lot to be desired.</p><p>First, let&#8217;s understand how Harvey views itself in the AI x legaltech landscape.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E8eK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E8eK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png 424w, https://substackcdn.com/image/fetch/$s_!E8eK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png 848w, https://substackcdn.com/image/fetch/$s_!E8eK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png 1272w, https://substackcdn.com/image/fetch/$s_!E8eK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E8eK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png" width="1456" height="400" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1672233,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!E8eK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png 424w, https://substackcdn.com/image/fetch/$s_!E8eK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png 848w, https://substackcdn.com/image/fetch/$s_!E8eK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png 1272w, https://substackcdn.com/image/fetch/$s_!E8eK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc23b5647-a3f5-4b57-92c6-e0b05ff29927_3744x1028.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: https://harvey.ai</figcaption></figure></div><p>Harvey describes itself to investors and customers as an application layer that is &#8220;foundation model agnostic.&#8221;</p><p>They highlight that a team of in-house attorneys and experts &#8220;structure application and model calls&#8221; to create a best-in-class legal LLM. Off the bat, this sounds like an expensive consulting agreement for lawyers to do prompt engineering.</p><p>They also have no plans to provide real-time access to the internet for legal research, because their focus is on not providing &#8220;wrong or tainted responses.&#8221;</p><p>Let&#8217;s see how that philosophy manifests into actual product.</p><h3>Lacking basic integrations</h3><p>Harvey&#8217;s legal research capabilities are slim. The only searchable source is SEC filings - no <a href="https://scholar.google.com/scholar?hl=en&amp;as_sdt=6">US case law</a> or <a href="https://eur-lex.europa.eu/homepage.html">EUR-Lex</a> integrations, despite APIs for these services being freely available.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gTEb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gTEb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png 424w, https://substackcdn.com/image/fetch/$s_!gTEb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png 848w, https://substackcdn.com/image/fetch/$s_!gTEb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png 1272w, https://substackcdn.com/image/fetch/$s_!gTEb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gTEb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png" width="1456" height="1035" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1035,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:167902,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gTEb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png 424w, https://substackcdn.com/image/fetch/$s_!gTEb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png 848w, https://substackcdn.com/image/fetch/$s_!gTEb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png 1272w, https://substackcdn.com/image/fetch/$s_!gTEb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb2131ea5-0b44-4f74-9d69-82551229184d_2195x1560.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">These screengrabs are from earlier this year - I expect (and hope) the product has improved a lot since then.</figcaption></figure></div><p>As an attorney, when conducting legal research you&#8217;re looking for relevant cases, rulings, and written opinions from the court on your area of interest. This helps you vet whether there&#8217;s established precedent in your favor or which arguments are most likely to work.</p><p>A baseline expectation for a production-grade legal research app is that generated outputs map to real cases. These integrations would ground any generated cases in discrete outputs, and a well-designed system would assert the outputs map to real cases before including them.</p><p>To be sure, LlamaIndex built <a href="http://secinsights.ai">secinsights.ai</a> (RAG for SEC filings) in <a href="https://www.llamaindex.ai/blog/llamaindex-turns-1-f69dcdd45fe3">September 2023</a> as a feature-complete demo of their toolkit - really impressive work for LlamaIndex, and a great open-source contribution. But what has Harvey actually built on top of this?</p><h3>CYA</h3><p>Very little, it would seem. The &#8220;Assistant&#8221; tool is a bare-bones LLM wrapper with lengthy disclaimers about how LLMs hallucinate and you shouldn&#8217;t take anything Harvey says seriously.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!txhZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff543047b-16a2-404f-93be-2843b80be917_2200x1576.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!txhZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff543047b-16a2-404f-93be-2843b80be917_2200x1576.png 424w, https://substackcdn.com/image/fetch/$s_!txhZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff543047b-16a2-404f-93be-2843b80be917_2200x1576.png 848w, https://substackcdn.com/image/fetch/$s_!txhZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff543047b-16a2-404f-93be-2843b80be917_2200x1576.png 1272w, https://substackcdn.com/image/fetch/$s_!txhZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff543047b-16a2-404f-93be-2843b80be917_2200x1576.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!txhZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff543047b-16a2-404f-93be-2843b80be917_2200x1576.png" width="1456" height="1043" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f543047b-16a2-404f-93be-2843b80be917_2200x1576.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1043,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:714464,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!txhZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff543047b-16a2-404f-93be-2843b80be917_2200x1576.png 424w, https://substackcdn.com/image/fetch/$s_!txhZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff543047b-16a2-404f-93be-2843b80be917_2200x1576.png 848w, https://substackcdn.com/image/fetch/$s_!txhZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff543047b-16a2-404f-93be-2843b80be917_2200x1576.png 1272w, https://substackcdn.com/image/fetch/$s_!txhZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff543047b-16a2-404f-93be-2843b80be917_2200x1576.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Your legal AI tool priced at $700M.</figcaption></figure></div><p>Neither a wall of disclaimers nor a black box &#8220;fine-tuned by the best lawyers&#8221; instills confidence. Harvey may argue their in-house experts can prompt-hack a robust model for legal work, but don&#8217;t expect customers to trust an opaque, unverifiable system, no matter how well-built it claims to be.</p><p>If your Harvard-trained associate handed you a legal brief with a note saying &#8220;but idk I make stuff up sometimes,&#8221; you&#8217;d fire them on the spot. Or in <a href="https://www.nytimes.com/2023/06/08/nyregion/lawyer-chatgpt-sanctions.html">this real-life case</a>, you&#8217;d so thoroughly embarrass the lawyers who submitted it that you wouldn&#8217;t even need to disbar them.</p><p>Let me be clear - in domains where the cost of errors are severe (e.g. legal, finance, accounting) or even life-threatening (e.g. healthcare, pharma) you <em>absolutely should not</em> design a system solely around a zero-shot LLM.* LLMs are stochastic by nature, and need to use external resources to provide verifiable, discrete responses.</p><p>What you can do instead is create a modular system with explainability, grounding, and auditability built-in - something more akin to an &#8220;agent-based&#8221; approach.</p><p><em>* This is part of a greater degree of skepticism I have with &#8220;domain-specific models,&#8221; which is a topic for a future post - the observation that beyond a certain level of model intelligence, your domain-specific data does not matter.</em></p><h2>Agents &#8220;Do No Harm.&#8221;</h2><p>While Harvey bets on an offline, domain-specific model, <a href="https://www.hippocraticai.com/">Hippocratic AI</a> has the opposite tack. They break physician-patient interactions into individual &#8220;specialist models,&#8221; (i.e. &#8216;agents&#8217; or &#8216;modules&#8217;) narrowly scoped to address individual workflows like checklists, medication, labs &amp; vitals, etc..</p><p>It&#8217;s a fantastic example of an agentic system in action, especially notable for its clout and adoption in such a sensitive domain - healthcare - where the cost of a mistake can be deadly.</p><p>Listen to this AI-patient conversation at 2x speed, and observe the agents in action:</p><div id="vimeo-927450270" class="vimeo-wrap" data-attrs="{&quot;videoId&quot;:&quot;927450270&quot;,&quot;videoKey&quot;:&quot;&quot;,&quot;belowTheFold&quot;:true}" data-component-name="VimeoToDOM"><div class="vimeo-inner"><iframe src="https://player.vimeo.com/video/927450270?autoplay=0" frameborder="0" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" loading="lazy"></iframe></div></div><p><em>Bonus points for anyone who heard &#8220;delve&#8221; as a clear AI giveaway.</em></p><ul><li><p>The primary agent for Linda prioritizes good bedside manner, building rapport, and providing empathetic guidance</p></li><li><p>Linda&#8217;s &#8220;checklist specialist&#8221; balances extracting info and moving the conversation forward</p></li><li><p>For each checklist item, Linda has a specialist &#8220;support agent&#8221; for tasks like reviewing meds, answering insurance questions, or giving nutrition tips</p></li><li><p>These interlinked agents share data and RAG into specific knowledge bases</p></li></ul><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b522288d-b124-43f4-a0e3-aba7ecf0675b_1456x882.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/25dcad78-0824-4f63-bdbf-ae0eedc0899e_1456x1041.png&quot;}],&quot;caption&quot;:&quot;Each of these &#8220;support agents&#8221; is a separate language model ranging between 50-100B parameters. But you can do quite a lot with even smaller models - the system's performance, rated by nurses, was as good or better for each of the 5 assessed parameters. Source: Eric Topol, Ground Truths: https://erictopol.substack.com/p/a-big-week-in-medical-ai &quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e5b57c30-62dc-4c63-85fb-fd83e1fa16c7_1456x720.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p>Hippocratic stress-tested with 1,200+ clinicians in simulated patient-actor conversations, and are achieving verifiable parity (or supremacy) on all 5 of the surveyed dimensions: <em>bedside manner, patient education, conversation quality, clinical readiness,</em> and <em>medical safety.</em></p><p>I&#8217;ll dive deeper into agentic systems design in a future post, but the key takeaway is you can mimic these techniques for building sophisticated AI that consistently works, even in the most sensitive domains. I don&#8217;t think customers would accept less, and neither should investors or founders.</p><h2>What&#8217;s next: beyond copilots</h2><p>I mentioned at the top of this post that as a general rule, I don&#8217;t like to invest in application-layer companies. Here are the exceptions to that rule:</p><ol><li><p>You&#8217;re selling into a highly sensitive regulatory domain. This makes it harder to sell into, but drastically improves customer stickiness, AND</p></li><li><p>You (the founder(s)) have unfair channel access / distribution into these regulated customers, AND</p></li><li><p>Because you know these customers so well, you&#8217;ve designed a task-specific agentic system that meets or exceeds human-level performance on each of the scoped modules</p></li></ol><p>Application-layer companies that meet these criteria have a better shot at building defensible, long-term value. We need modular, explainable systems that leverage generative AI while grounding it in domain knowledge and rigorous evals. Agentic systems, &#224; la Hippocratic AI, are great examples of this. Looking forward to seeing many more like it in Chapter Two of this cycle.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Stay in the loop. Subscribe for bi-weekly updates.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Subscriber Update]]></title><description><![CDATA[Hello! It&#8217;s been a long time since my last post. I wanted to give a quick update to you all on what I&#8217;ve been up to, what&#8217;s next, and what to expect from Machine Yearning moving forward. What&#8217;s Happened In January, I wrapped up a multi-year stint working with Andrew Ng and a team of talented builders at his venture studio,]]></description><link>https://www.machineyearning.io/p/subscriber-update</link><guid isPermaLink="false">https://www.machineyearning.io/p/subscriber-update</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Mon, 29 Apr 2024 18:30:43 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/5183107b-52c4-42dd-8769-e102a9d13d02_1456x1048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Hello! It&#8217;s been a long time since my last post. I wanted to give a quick update to you all on what I&#8217;ve been up to, what&#8217;s next, and what to expect from Machine Yearning moving forward.</p><div><hr></div><h2>What&#8217;s Happened</h2><p>In January, I wrapped up a multi-year stint working with Andrew Ng and a team of talented builders at his venture studio, <a href="https://aifund.ai">AI Fund</a>. Together, we built 9 companies across various domains, including <a href="https://aifund.ai/workhelix-launch-craft-your-companys-plan-for-genai/">Workhelix</a>, a GenAI readiness assessment platform for executives. We developed and shipped AI-powered products into many different verticals before and after ChatGPT&#8217;s explosive release.</p><p>In that time, I learned that:</p><ol><li><p>My skills and expertise were best utilized collaborating with teams building foundational tech like infrastructure, agentic systems, developer tools, and frontier AI research.</p></li><li><p>I believe we&#8217;re still too early to crown winners in the application layer, as there are many product categories founders are shoehorning AI into that won&#8217;t even exist three years from now.</p></li></ol><p>Just as Uber&#8217;s success relied on <a href="https://arc.net/l/quote/xrnfanqm">&#8220;existential technologies&#8221;</a> like mapping systems and cheap mobile bandwidth, AI and its supporting systems are existential technologies that will spawn entirely new categories. &#8220;<a href="https://www.gsb.stanford.edu/insights/andrew-ng-why-ai-new-electricity">Electricity</a>,&#8221; as Andrew puts it. We still have a lot of work to do building out the power grid.</p><p>After meeting tons of founders, judging hackathons, and even participating in a few, I <a href="https://x.com/jowyang/status/1765949509266456953">co-founded an angel fund</a> called SHACK15 Ventures with my partners <a href="https://www.shack15.ventures/jorn">J&#248;rn Lyseggen</a>, <a href="https://www.shack15.ventures/bogdan">Bogdan Cristei</a>, and Ashwin Ravichandran. Operating out of the eponymous social club SHACK15 atop San Francisco&#8217;s iconic Ferry Building, we&#8217;re working with a vibrant community of investors, entrepreneurs, and hackers, including Cerebral Valley, known for their exceptional AI hackathons. We&#8217;ve already made our first few investments and will have more to share in the near future.</p><p><em>To stay updated on Cerebral Valley, subscribe to their <a href="https://events.cerebralvalley.ai/">lu.ma page</a>.</em></p><h2>Machine Yearning</h2><p>I started this blog back in 2021 as an attempt to organize my thoughts on the emerging AI industry. As a product manager at the time, I wanted to ask better questions to my machine learning engineers, and more thoughtfully build great AI products from the ground up. I believed that only a deep, bottoms-up understanding of what we were building could deliver that.</p><p>This experiment yielded some promising results:</p><ul><li><p>I wrote about <a href="https://www.machineyearning.io/p/watch-your-language-part-2">large language models and transformer architectures</a> before they hit the mainstream. The lessons I learned helped position our team at Spiketrap for an <a href="https://techcrunch.com/2022/09/01/reddit-acquires-contextualization-company-spiketrap-to-boost-its-ads-business/">acquisition by reddit</a>, two months before ChatGPT's release.</p></li><li><p>Last year, I wrote about platform risk and &#8220;sherlocking&#8221;, and we're beginning to <a href="https://www.notion.so/Update-to-Subscribers-a7f07aa1e34d4afe8261d6a896b73ac0?pvs=21">see those warnings play out</a> for some of GenAI&#8217;s first movers.</p></li></ul><p>That said, there&#8217;s still a lot to explore. As we enter what some are calling an LLM-powered <a href="https://www.vice.com/en/article/y3w4gw/a-shocking-amount-of-the-web-is-already-ai-translated-trash-scientists-determine">&#8220;creativity apocalypse&#8221;</a>, it&#8217;s probably good practice to curate a whitelisted corner of genuine conversation.</p><p><em>Obviously, I'm using LLMs as creative writing assistants, just like everyone else. Just count the number of times you see <a href="https://pshapira.net/2024/03/31/delving-into-delve/">'delve'</a> in a sentence.</em></p><h2>What&#8217;s Next</h2><p>People who know me know I like to launch <a href="https://www.notion.so/Update-to-Subscribers-a7f07aa1e34d4afe8261d6a896b73ac0?pvs=21">trial balloons</a> into conversation &#8211; ideas shared with enough conviction to provoke reactions from those with more expertise. It&#8217;s also the Bayesian thing to do, publishing priors and updating them in real time.</p><p>Machine Yearning will be switching to a bi-weekly format, sharing unfinished thoughts and observations. These pieces may feel raw at times, but that&#8217;s actually the point.</p><p>If you&#8217;d like to get in touch, DM me on <a href="https://twitter.com/rydcunningham">Twitter</a> or <a href="https://linkedin.com/in/rydcunningham">LinkedIn</a>, and follow what we&#8217;re up to at <a href="https://www.shack15.ventures">shack15.ventures</a>.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Stay in the loop. Subscribe for bi-weekly updates.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Autonomous Agents: AIs Taking Action at the AGI House | Machine Yearning 005]]></title><description><![CDATA[If LLMs are the brains of AI, then Agents are the hands.]]></description><link>https://www.machineyearning.io/p/autonomous-agents</link><guid isPermaLink="false">https://www.machineyearning.io/p/autonomous-agents</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Wed, 26 Jul 2023 15:00:41 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/c8984ca0-ea41-41e9-9407-4e7cb942804f_2694x2021.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This weekend, I had the chance to participate in a fascinating hackathon at <a href="https://agihouse.ai">AGI House</a>, a community of AI founders and researchers known for their awesome events and amazing speakers. The theme for this hackathon was "Autonomous Apps," with an intriguing twist: using a new form of technology called "Agents." Our team of 7 hackers decided to take the theme of autonomous app development a little </em>too <em>literally&#8230; read on for how!</em></p><div><hr></div><h2><strong>Setting the Scene: Introducing "Agents"</strong></h2><p>If you&#8217;ve been listening to AI twitter, you&#8217;ve probably heard of &#8220;Agents&#8221; like <a href="https://mashable.com/article/autogpt-ai-agents-how-to-get-access">BabyAGI, AgentGPT, and others</a>. What are Agents anyway, and why were we at AGI House testing their limits in a hackathon?</p><p>Put one way, if large language models (LLMs) are the brains of advanced AI systems, Agents would be the hands, using LLMs to automate complex chains of tasks. With an Agent, instead of discretely programming a model, you can prompt it with an objective in natural language (e.g. <a href="https://twitter.com/AlexReibman/status/1683008546877874181?s=20">&#8220;take my information and register for a frequent flyer account number on these 90 airlines&#8217; websites&#8221;</a>), and using an LLM it will learn the steps to accomplish that objective. It&#8217;s like Excel macros or robotic process automation (RPA), but on steroids.</p><p>A lot of traditional software is designed such that humans still end up doing most of the work. Things like data entry, financial and accounting processes, or IT operations. Menial but labor-intensive. Throughout the hackathon, we saw brilliant teams and demos which took the tedium of legacy workflows and chucked them out the window. This <a href="https://twitter.com/AlexReibman/status/1683008364606013440?s=20">thread from Alex Reibman</a> captured all the demos (including ours!).</p><h2><strong>Our Approach: Building an App Autonomously</strong></h2><p>Our team decided to take a literal approach to the theme of &#8220;autonomous agents.&#8221; Inspired by the recent paper <a href="https://arxiv.org/pdf/2306.03314.pdf">"Multi-Agent Collaboration: Harnessing the Power of Intelligent LLM Agents"</a> by Yashar Talebirad and Amirhossein Nadiri, we created a constrained environment where multiple agents could collaborate towards a common goal: <em>autonomous app development</em> in Agile sprints.</p><p>For our project, we chose an Agile development sprint in <a href="https://linear.app/">Linear</a> (a popular project management tool), as our playground. Our playing pieces? A team of Agents designed to mimic the roles of a typical scrum team.</p><p>We devised three specialized Agents, each taking on a role in the Agile process with a biological analogue. To manage this efficiently, we also introduced an executive agent to oversee task delegation and bandwidth checks.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hU0M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hU0M!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png 424w, https://substackcdn.com/image/fetch/$s_!hU0M!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png 848w, https://substackcdn.com/image/fetch/$s_!hU0M!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png 1272w, https://substackcdn.com/image/fetch/$s_!hU0M!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hU0M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png" width="1072" height="589" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:589,&quot;width&quot;:1072,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:123820,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hU0M!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png 424w, https://substackcdn.com/image/fetch/$s_!hU0M!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png 848w, https://substackcdn.com/image/fetch/$s_!hU0M!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png 1272w, https://substackcdn.com/image/fetch/$s_!hU0M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96af3a5e-8798-4da4-956d-464374d0bb01_1072x589.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol><li><p><strong>AutoPM: </strong>This GPT-4 powered agent accepts problem statements or user feedback, turning them into customer pain points. These are then translated into a framework such as "jobs to be done," converted into potential features, then refined, ranked, and finalized into a table of features and engineering sub-tasks. It uses human PM feedback before sending the tasks to AutoArch.</p></li><li><p><strong>AutoArch:  </strong>AutoArch prepares the groundwork before coding begins. It uses GPT-4 to draft markup languages for creating UML diagrams and generate the file structure for the project. These resources provide the scaffolding for the engineering work.</p></li><li><p><strong>AutoEng: </strong>With a clear customer problem, component tasks that ladder into features, and architectural constraints, AutoEng is set to start writing code.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UysP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UysP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!UysP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!UysP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!UysP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UysP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg" width="1456" height="1092" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1092,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3452725,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UysP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!UysP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!UysP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!UysP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4db7d241-05cd-4201-a458-01cf4f3fc42f_4032x3024.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A beautiful day to automate away our jobs&#8230;</figcaption></figure></div><p>We quickly found that off-the-shelf LLMs were capable at most tasks, but not perfect. The AutoPM, for instance, required a lot of human feedback before arriving at product recommendations that were relevant, actionable, rewarding, and specific enough for engineers to work with.</p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/95a32ef1-e427-4f9b-aba5-efc2e06a14ba_843x529.webp&quot;},{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84a2e883-5bc3-4b71-88bd-95fde561bdb6_845x479.webp&quot;},{&quot;type&quot;:&quot;image/webp&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0cc118c6-de75-40f3-9788-b10bd985a14a_845x479.webp&quot;}],&quot;caption&quot;:&quot;Prototyping a prompt engineering framework for AutoPM using GPT-4. Zero-shot results were pretty vague and unhelpful - breaking it up into intermediate steps got much more actionable engineering tasks.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6a252700-7aee-4ed6-b966-bf2f31cc9034_1456x474.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/p/autonomous-agents?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.machineyearning.io/p/autonomous-agents?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><h2><strong>The Outcome: Success!</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://twitter.com/AlexReibman/status/1683010355981869063" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!V8xb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30f7dff1-8c28-4977-8ecb-469f6f66209d_566x713.png 424w, https://substackcdn.com/image/fetch/$s_!V8xb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30f7dff1-8c28-4977-8ecb-469f6f66209d_566x713.png 848w, https://substackcdn.com/image/fetch/$s_!V8xb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30f7dff1-8c28-4977-8ecb-469f6f66209d_566x713.png 1272w, https://substackcdn.com/image/fetch/$s_!V8xb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30f7dff1-8c28-4977-8ecb-469f6f66209d_566x713.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!V8xb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30f7dff1-8c28-4977-8ecb-469f6f66209d_566x713.png" width="566" height="713" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30f7dff1-8c28-4977-8ecb-469f6f66209d_566x713.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:713,&quot;width&quot;:566,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:285806,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://twitter.com/AlexReibman/status/1683010355981869063&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!V8xb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30f7dff1-8c28-4977-8ecb-469f6f66209d_566x713.png 424w, https://substackcdn.com/image/fetch/$s_!V8xb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30f7dff1-8c28-4977-8ecb-469f6f66209d_566x713.png 848w, https://substackcdn.com/image/fetch/$s_!V8xb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30f7dff1-8c28-4977-8ecb-469f6f66209d_566x713.png 1272w, https://substackcdn.com/image/fetch/$s_!V8xb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30f7dff1-8c28-4977-8ecb-469f6f66209d_566x713.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Despite the experimental nature of our project and only 8 hours to do it, everything (mostly) fell into place! Once the team aligned on AutoPM&#8217;s features and sub-tasks, the whole Agent relay would take LESS THAN A MINUTE before working code is shipped.</p><p>To reiterate: this is strictly a toolchain of an LLM, a UML generator, and the Linear API. And apparently&#8230; that&#8217;s all you need to manage a team of Agents in an Agile sprint. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4BW2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4BW2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png 424w, https://substackcdn.com/image/fetch/$s_!4BW2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png 848w, https://substackcdn.com/image/fetch/$s_!4BW2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png 1272w, https://substackcdn.com/image/fetch/$s_!4BW2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4BW2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png" width="1456" height="767" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:767,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:539576,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4BW2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png 424w, https://substackcdn.com/image/fetch/$s_!4BW2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png 848w, https://substackcdn.com/image/fetch/$s_!4BW2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png 1272w, https://substackcdn.com/image/fetch/$s_!4BW2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2fd75c3-52e0-4cef-b12e-be7e2a2d4e3f_3014x1588.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">For a proof of concept, we ran a single sub-task through the agentic product development cycle. Here, the AutoPM specified a React app with a basic UI.</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dWRC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dWRC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png 424w, https://substackcdn.com/image/fetch/$s_!dWRC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png 848w, https://substackcdn.com/image/fetch/$s_!dWRC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png 1272w, https://substackcdn.com/image/fetch/$s_!dWRC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dWRC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png" width="1456" height="1383" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1383,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:437762,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!dWRC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png 424w, https://substackcdn.com/image/fetch/$s_!dWRC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png 848w, https://substackcdn.com/image/fetch/$s_!dWRC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png 1272w, https://substackcdn.com/image/fetch/$s_!dWRC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98547dac-b019-4cc5-933e-8dd0cf1ef339_1590x1510.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">With features and engineering sub-tasks defined, AutoArch generated a diagram and file structure for the engineers.</figcaption></figure></div><p>Astute readers will have already heard of <a href="https://arxiv.org/abs/2201.11903">&#8220;chain-of-thought prompting&#8221;</a> (basically, showing your work) and how that elicits better reasoning skills in LLMs. We&#8217;re effectively replicating those advancements using Linear as a scratchpad - letting Agentic team members show their work, share context, and provide clarifications where needed.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ly-S!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ly-S!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png 424w, https://substackcdn.com/image/fetch/$s_!ly-S!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png 848w, https://substackcdn.com/image/fetch/$s_!ly-S!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png 1272w, https://substackcdn.com/image/fetch/$s_!ly-S!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ly-S!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png" width="929" height="822" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:822,&quot;width&quot;:929,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:140699,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ly-S!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png 424w, https://substackcdn.com/image/fetch/$s_!ly-S!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png 848w, https://substackcdn.com/image/fetch/$s_!ly-S!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png 1272w, https://substackcdn.com/image/fetch/$s_!ly-S!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25829fc0-0f02-4221-9130-0c5ebc61a5c4_929x822.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">With all that done, AutoEng has structure and constraints for writing  the first draft of code. Later iterations would include multiple AutoEng for code review and testing.</figcaption></figure></div><p>While we had to scrap some of the elements we had hoped to include, such as AutoUXR for user feedback curation and AutoEng specializations, we were amazed we even got this far!</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Pw5N!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Pw5N!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png 424w, https://substackcdn.com/image/fetch/$s_!Pw5N!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png 848w, https://substackcdn.com/image/fetch/$s_!Pw5N!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png 1272w, https://substackcdn.com/image/fetch/$s_!Pw5N!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Pw5N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png" width="1456" height="1094" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1094,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:953056,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Pw5N!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png 424w, https://substackcdn.com/image/fetch/$s_!Pw5N!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png 848w, https://substackcdn.com/image/fetch/$s_!Pw5N!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png 1272w, https://substackcdn.com/image/fetch/$s_!Pw5N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a767597-fc45-4d73-aeff-39c66fd7d9ff_1671x1256.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">You can see and play with the shipped code yourself in this <a href="https://codesandbox.io/s/amazing-kepler-vgt9lr?file=/App.js">interactive sandbox</a></figcaption></figure></div><p>You can check out this 45s demo for yourself to see the whole thing in action on Linear.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;70f6f22f-9e36-4161-a81a-4cba84fde72c&quot;,&quot;duration&quot;:null}"></div><p>Here&#8217;s another example generating a Tic Tac Toe game from scratch. It&#8217;s not a difficult coding exercise, but drafting a diagram, file structure, and deployment destination gives the AutoEng much more of a scaffolding.</p><div id="youtube2-fxdxEAl_fgk" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;fxdxEAl_fgk&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/fxdxEAl_fgk?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><h2><strong>Takeaways: Specializations and Human Feedback</strong></h2><p>Despite the rapid turnaround, we got some valuable insights about working with agents and LLMs:</p><ul><li><p>Off-the-shelf LLMs are very capable at handling Agile tasks, but overloading a single LLM with the end-to-end process was highly error-prone.</p></li><li><p>Most agents today are fragile - small changes to web layouts or APIs could break their logic.</p></li><li><p>Breaking the process into specialized roles and discrete tasks made error tracing easier by avoiding &#8220;black box&#8221; productivity losses.</p></li><li><p>While we didn&#8217;t implement robust RLHF features, we definitely saw how human feedback enhanced agent outputs, especially on AutoPM.</p></li></ul><p>If you imagine integrating this with other enterprise apps like Slack, Airtable, or email&#8230; you could create an entire workforce of digital agents, with auditable trails to measure and refine work outputs. How crazy would that be?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KVnD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KVnD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KVnD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KVnD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KVnD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KVnD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg" width="1456" height="1092" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1092,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3300925,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KVnD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!KVnD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!KVnD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!KVnD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F384a9acc-563d-4e6a-b306-135c2e88a33f_4032x3024.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The Relay Team! (L2R: Gurkaran, Tony, Anand, Travis, Me, Zhihao, and Pascal)</figcaption></figure></div><p>We walked away with new ideas, a deeper appreciation for advanced AI, and a glimpse into a future where intelligent agents could transform software development. Can't wait to see what's next!</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.machineyearning.io/subscribe?"><span>Subscribe now</span></a></p><p><em>Special thanks to the entire Relay team (<a href="https://twitter.com/aigsingh">Gurkaran</a>, <a href="https://twitter.com/tonyadastra">Tony</a>, <a href="https://twitter.com/andysid33">Anand</a>, <a href="https://twitter.com/traviscline">Travis</a>, <a href="https://twitter.com/oyzh888">Steve</a>, and <a href="https://twitter.com/pascalwieler">Pascal</a>), to Rocky Yu and <a href="https://twitter.com/agihouse_org">AGI House</a> for hosting, and to the event sponsors <a href="https://twitter.com/Wing_VC">WING VC</a>, RunPod, <a href="https://twitter.com/MultiON_AI">MultiON</a> for helping put on a great event with an awesome theme.</em></p><p><em>Disclaimer: This post was co-written by an AI.</em></p>]]></content:encoded></item><item><title><![CDATA[The Future is Open! Notes from #WoodstockAI | Machine Yearning 004]]></title><description><![CDATA[Why first movers WON'T win it all]]></description><link>https://www.machineyearning.io/p/the-future-is-open-notes-from-woodstockai</link><guid isPermaLink="false">https://www.machineyearning.io/p/the-future-is-open-notes-from-woodstockai</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Thu, 06 Apr 2023 15:30:54 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/093c598f-2d02-4239-9a3e-8356e244886a_2642x1761.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Last Friday, my colleagues and I went to the 4/1 Hugging Face meetup (#WoodstockAI) and met dozens of passionate entrepreneurs and devs. We got a ton of questions around whether first-movers would end up winning the whole market. For the GenAI application layer, at least, there&#8217;s more to it than just being first! Here are some reasons why.</em></p><div><hr></div><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/90928503-b2b3-4204-8be6-6931ff67102a_2049x1536.jpeg&quot;},{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/250e7b64-d892-41a8-b16e-2dd4b4153cdf_2049x1536.jpeg&quot;},{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/db3e9168-57f3-4a3a-90aa-4cbf608c1b8a_750x422.jpeg&quot;},{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d1e499a4-6f40-4f24-bb61-00d84d21fd30_4032x3024.jpeg&quot;}],&quot;caption&quot;:&quot;Andrew Ng and I talking with entrepreneurs, chatting with Hugging Face CEO Clem Delangue, and also llamas!&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/93d4cbd6-7d42-4bfb-8ad1-717c5311df46_1456x1456.png&quot;}},&quot;isEditorNode&quot;:true}"></div><h1>The Future is Open</h1><p>The halls were buzzing with enthusiasm. With over 130 open-source demos scattered throughout the Exploratorium (already packed with interesting exhibits), we hardly knew where to start - we saw next-gen literary teaching tools, a dev who converted hand-drawn sketches into richly textured digital assets, literal four-legged LLaMas, and much more.</p><p>In many ways, it reminded me of the early days of the App Store once Apple opened it up to third-party developers. At launch, there were only ~500 apps, but one year later there were 35,000, with over 1 billion downloads. Just like back then, we&#8217;re in the early innings of this game, and there&#8217;s plenty left to be discovered on UX, business models, retention, and so on. Some of the lessons of that era likely apply today; more on this in a little bit.</p><h2>The Gold Rush</h2><p>The folks I met aren&#8217;t just sitting idly by, waiting for the future to come to them. They&#8217;re actively building it! Everyone is an API key away from integrating into their applications state-of-the-art models from OpenAI, Stability, AI21, and other research labs. Daring engineers can pip install into their local file system whatever generative model they need from Hugging Face. This has unlocked a tremendous amount of creativity and potential.</p><p>&#8230; And yet, the generous availability of cutting-edge generative AI models has a downside.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Enjoying this? Subscribe for more hot takes.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>The double-edge of this accessibility is competition. No doubt about it, the competition is fierce. It&#8217;s going to be very challenging for AI entrepreneurs to build differentiated solutions, especially in the application layer.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>We saw a similar dynamic in 2008. When the App Store officially opened up to third party developers, entrepreneurs scrambled to get onto the platform. They, too, had to navigate an unprecedented level of consumer accessibility and competition. It was a similar moment of unfettered creativity and opportunity. For anyone who doesn&#8217;t remember the top apps in the first year of the App Store, here&#8217;s <a href="https://techcrunch.com/2008/12/02/apple-announces-top-10-iphone-app-downloads-of-2008/">a refresher</a> on the top downloaded apps in 2008:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4JfN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4JfN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png 424w, https://substackcdn.com/image/fetch/$s_!4JfN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png 848w, https://substackcdn.com/image/fetch/$s_!4JfN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png 1272w, https://substackcdn.com/image/fetch/$s_!4JfN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4JfN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png" width="1420" height="1046" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1046,&quot;width&quot;:1420,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:212723,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4JfN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png 424w, https://substackcdn.com/image/fetch/$s_!4JfN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png 848w, https://substackcdn.com/image/fetch/$s_!4JfN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png 1272w, https://substackcdn.com/image/fetch/$s_!4JfN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68aead34-ae1c-4afe-982b-404df4caa95e_1420x1046.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>Source: <a href="https://www.theverge.com/2018/7/10/17550430/apple-iphone-ios-app-store-10-years-look-back">The Verge, 2018.</a></em></p><p>It was a weird time in technology. Lots of novelty, but productivity not so much, not at first. Many paid apps that hit the big time were fun expressions of the iPhone&#8217;s hardware capabilities. <a href="https://www.macrumors.com/2008/08/20/the-1-paid-app-koi-pond/">Koi Pond</a> was an elegantly simple pond simulator which took advantage of the responsive touch screen, and iBeer was considered the first killer demo of the accelerometer. It even minted its creators up to <a href="https://appleinsider.com/articles/22/01/25/original-ibeer-iphone-app-made-creators-20000-a-day">$20,000 / day at the top</a>. That didn&#8217;t last, but it&#8217;s worth pointing out!</p><h2>&#8220;Sherlocking&#8221;</h2><p>Novel usually doesn&#8217;t mean sustainable whether you&#8217;re talking about GenAI or iPhone apps. Apps like Recorder were quickly washed away by iOS-native apps that targeted the same functionality (Apple released Voice Memos in iOS 3). There&#8217;s even a term for this: <a href="https://www.howtogeek.com/297651/what-does-it-mean-when-a-company-sherlocks-an-app/">&#8220;Sherlocking,&#8221;</a> when a platform releases native functionality that renders a third-party development obsolete. The origin for this was a 2006 conversation between Steve Jobs and developer Dan Wood, whose company Karelia built a tool that extended the local search functionality of Apple&#8217;s Spotlight predecessor, Sherlock, to conducting online searches as well:</p><blockquote><p><em>It seemed that the sky was the limit, until I was called in for a meeting with Apple's Phil Schiller. I listened to him tell me that Apple was going to announce Sherlock 3, and it was very similar to Watson. I watched a demo of their program: all but one of their modules connected to the same service that Watson did and looked almost the same&#8230;</em></p></blockquote><p>Steve Jobs made it clear to Dan that any resemblance was very much his problem, not Apple&#8217;s.</p><blockquote><p><em>"&#8216;Here's how I see it,&#8217; Jobs said&#8230; &#8217;You know those handcars, the little machines that people stand on and pump to move along on the train tracks? That's Karelia. Apple is the steam train that owns the tracks.&#8217;"</em></p></blockquote><p><em>Origins of Sherlocking: &#8220;<a href="https://www.karelia.com/blog/the-long-story-behind-karel.html">The Long Story Behind Karelia's New Logo</a>,&#8220; Jan 7, 2006</em></p><p>Make no mistake, developers of foundation models own their GenAI tracks. In fact, <a href="https://www.theinformation.com/articles/the-best-little-unicorn-in-texas-jasper-was-winning-the-ai-race-then-chatgpt-blew-up-the-whole-game">a similar conversation happened</a> between Jasper CEO Dave Rogenmoser and Sam Altman once ChatGPT was released. Jasper had raised a $125 million war chest at a $1.5 billion valuation, leveraging a head start conferred by its private beta with OpenAI. But almost overnight, that advantage came to a screeching halt.</p><p>Jasper&#8217;s unique value proposition was perilously diluted by the fact that ChatGPT was fast, powerful, and above all, <em>free</em>. Jasper has plenty of cash to burn as it tries to deepen enterprise relationships and improve stickiness, but developers of less well capitalized text generators (and other first movers) risk getting &#8220;Sherlocked&#8221; nonetheless.</p><h2>Features vs. Products</h2><p>This means it&#8217;s a good time to ask yourself: Are you building a feature or a product? These are two very different things. In simple terms, a product is something that customers consider valuable enough to pay for. Products have to solve problems. Features are attributes, functions, helpers that come with a product. They&#8217;re useful, but not enough to pay for on their own.</p><ul><li><p>Using the Sherlock example, search is a product. Web search is a feature of search</p></li><li><p>Microsoft Nuance is a speech recognition product for medical scribes. Spanish speech recognition is a feature of Nuance</p></li></ul><p>This isn&#8217;t a constant, since products risk becoming features of larger products overtime. Watson and Sherlock are just one example where the verticalized product (Watson for web search) became a feature of a larger product (Sherlock for web and local search). ChatGPT may do the same to many copywriting apps.</p><p>You may be thinking to yourself &#8220;well, there&#8217;s no way a developer of foundation models has enough resources to go after every vertical.&#8221; True, which is why OpenAI effectively launched its own app store with <a href="https://openai.com/blog/chatgpt-plugins">Plugins.</a> Want to make an &#8220;LLM for dinner reservations&#8221; product? There&#8217;s a Plugin for that. What about an &#8220;LLM for booking flights&#8221;? There&#8217;s a Plugin for that.</p><p>Now, if you tell me &#8220;Travel is about ditching the mundane and discovering the extraordinary. But travel planning? Incredibly stressful. We believe you should be able to plan your entire trip with the push of a button. So we&#8217;re building everyone&#8217;s private travel agent. Interested?&#8221; Sure there are Plugins that can handle <em>pieces</em> of what I may be thinking of, but you&#8217;ve got me hooked. Tell me more.</p><p>Throughout the Hugging Face event, several entrepreneurs asked me to demo their apps. The tech was consistently impressive, but my first question was usually: &#8220;Who is this for?&#8221; More often than not, the answer was akin to &#8220;It&#8217;s for everyone, the possibilities are endless!&#8221;</p><p>I agree, the possibilities are endless! But if you intend to build a sustainable business, I encourage you to focus on building solutions with a specific customer in mind, a customer <em>who will pay for it </em>in time or money. If you can articulate the following:</p><ul><li><p>The pain they have</p></li><li><p>The willingness they have to pay to eliminate that pain</p></li><li><p>How many people out there share that pain (your total addressable market)</p></li></ul><p>then we&#8217;re really onto something. In doing so, we can create products that are genuinely valuable and rewarding for both the customer and the entrepreneur. If you need a place to start, I always recommend <a href="https://www.momtestbook.com/">&#8220;The Mom Test,&#8221;</a> one of the best resources on practical questions to ask as you&#8217;re finding product-market fit. Cut through the BS and figure out if customers will pony up!</p><h2>Most importantly, have fun!</h2><p>Most of these notes are aimed at founders intending to build GenAI businesses, but there&#8217;s just as much value in diving in and seeing what you can create. Don&#8217;t just build a generative AI app because you think it&#8217;ll grow into the next unicorn. Build it because you want to, build it because it&#8217;s fun, build it because it solves a specific problem and you&#8217;re passionate about solving it!</p><p>If enough people share that problem <em>and </em>are willing to pay for that product&#8230; then who knows! But building and learning are inherently valuable.</p><p>There are going to be so many new business models, experiences, and possibilities we discover in the application layer that it isn&#8217;t clear first-mover advantages exist. If you&#8217;re building an infrastructure, platform, marketplace, or any other product that depends on network effects for its fundamental value, then yes, it&#8217;s best to be first. But not every business fits that description. Above all, build something that people love.</p><p>The potential of AI is vast, and many exciting opportunities await entrepreneurs who&nbsp; create world-changing solutions. #WoodstockAI was an amazing celebration of all this potential energy. With so much talent and enthusiasm in the AI community, I'm confident that we'll continue to see groundbreaking advancements in the field.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/p/the-future-is-open-notes-from-woodstockai?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.machineyearning.io/p/the-future-is-open-notes-from-woodstockai?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><div><hr></div><h1>Thanks for reading!</h1><p>Liked this post? Comment and subscribe for more!</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.machineyearning.io/subscribe?"><span>Subscribe now</span></a></p><p>Machine Yearning is a collection of essays and news on the intersection between AI, investing, product, and economics, light on technicals but heavy on relevance.</p><p><a href="https://aifund.ai/team-member/ryan-cunningham/">Ryan Cunningham</a> is a Senior Builder at Andrew Ng&#8217;s <a href="https://aifund.ai/">AI Fund</a>, a venture studio in Palo Alto. As part of the venture studio, he&#8217;s co-founded AI startups in the fields of fintech, human capital, swarm intelligence, speech recognition, generative AI, and more. He specializes in deep tech applications (foundation models, drones, autonomous vehicles) and is active in the Stanford AI Alignment community at <a href="https://seri.stanford.edu/">SERI</a>, where he contributes to research and advocacy efforts focused on AI safety.</p><p>Any suggestions or topics you want to see? DM me on <a href="https://www.linkedin.com/in/rydcunningham">LinkedIn</a> or <a href="https://www.twitter.com/rydcunningham">Twitter</a>.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Other VCs like <a href="https://www.sequoiacap.com/article/generative-ai-a-creative-new-world/">Sequoia</a> and <a href="https://base10.vc/post/generative-ai-mission-critical/">BaseTen</a> have opined on defensibility factors like UX, personalization, and more. My colleague <a href="https://www.linkedin.com/in/bolaadegbulu/">Bola Adegbulu</a> has <a href="https://novelefficacy.substack.com/p/how-to-build-a-defensible-generative">11 specific defensibility recommendations</a> relevant to GenAI if you&#8217;d like to learn more.</p></div></div>]]></content:encoded></item><item><title><![CDATA[What Most Get Wrong About the "AI Arms Race" | Machine Yearning 003]]></title><description><![CDATA[If you just want to be first, you're already behind]]></description><link>https://www.machineyearning.io/p/whats-wrong-with-the-ai-arms-race</link><guid isPermaLink="false">https://www.machineyearning.io/p/whats-wrong-with-the-ai-arms-race</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Mon, 31 Jan 2022 19:51:55 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/d6fd8c6f-e24d-49c7-a7dd-8025a2acd57d_4003x2857.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Hello and welcome!</em></p><p><em>I wrote this piece shortly after attending a seminar with postdoctoral fellow <a href="https://hai.stanford.edu/people/jeffrey-ding">Jeffrey Ding</a> at <a href="https://hai.stanford.edu/events/hai-weekly-seminar-jeffrey-ding">Stanford&#8217;s Institute for Human-Centered AI</a>. In it, Ding convincingly laid out what I think are the most common fallacies American pundits fall into when discussing AI in the context of global powers. I&#8217;ve added to it with practical recommendations on how to navigate around these fallacies.</em></p><p><em>Ding also has a fantastic newsletter, <a href="https://chinai.substack.com/">ChinAI</a>, where he provides first-party translations of Chinese source texts and strategy documents on artificial intelligence and other high technologies. If you&#8217;re used to getting your news on China or AI from non-Chinese speakers or non-practitioners, you owe it to yourself to subscribe!</em></p><div><hr></div><h1><strong>&#128241; How Technology Changes Societies</strong></h1><p>&#8220;First to AI supremacy?&#8221; Not so fast. When it comes to artificial intelligence, it doesn&#8217;t matter who&#8217;s first because it's not a product in the way most pundits understand it.</p><p><a href="https://www.gsb.stanford.edu/insights/andrew-ng-why-ai-new-electricity">&#8220;AI is the new electricity,&#8221;</a> but electricity isn&#8217;t a product. Being first to have electricity doesn&#8217;t matter without the infrastructure to deliver it, the human capital to manage and improve upon it, and the standards to commercialize it. Benjamin Franklin famously conducted his kite experiment in 1752, and Thomas Edison patented the lightbulb in 1879, but it wasn&#8217;t until the 1920s that even half of American homes had electricity.</p><p>Like electricity, AI is a general-purpose technology - one with vast potential for nearly all sectors of the global economy. Talking about &#8220;AI supremacy&#8221; as if it were a zero-sum game is a fundamental misunderstanding of both how AI works and where AI power comes from. If institutions truly want to harness AI for large-scale transformations, then we need to create environments suitable for innovating upon it. We need to cultivate our societies into &#8220;innovation gardens,&#8221; instead of planting our flag on the moon and never going back.</p><h2>A Tale of Two Theories</h2><p>Before we get into it, let&#8217;s discuss the source of this misconception, so we know what to avoid when discussing AI&#8217;s potential.</p><p>There are two competing frameworks for understanding technological advancement in societies:</p><ol><li><p>The standard view is the <em><a href="https://www.jstor.org/stable/4177406">leading sector theory</a></em> of technological advancement, which emphasizes the first-mover advantages in fast-growing industries (aka <em>leading sectors</em>). The impact is felt rather immediately and is concentrated in a key sector. In international political economy, a single nation-state first monopolizes initial gains (sometimes called an &#8220;innovation monopoly&#8221;), then the industry spreads to other competing powers.</p></li><li><p>An alternative theory, proposed by researcher <a href="https://www.fhi.ox.ac.uk/team/jeffrey-ding/">Jeffrey Ding</a>, a postdoctoral fellow at <a href="https://twitter.com/StanfordCISAC">@StanfordCISAC</a> and <a href="https://twitter.com/StanfordHAI">@StanfordHAI</a>, is the <em><a href="https://hai.stanford.edu/events/hai-weekly-seminar-jeffrey-ding">diffusion theory</a></em> of general purpose technologies. This theory highlights longer, drawn-out trajectories of incremental improvements upon general technologies which diffuse into broad sectors of the global economy overtime. The impact comes later and is more dispersed.</p></li></ol><p>High-profile products like smartphones and ridesharing fit neatly into the first framework. They have immediate use cases, benefit from network effects, and have winner-take-all dynamics.&nbsp;&nbsp;</p><p>After all, If you&#8217;re buying an iPhone, that&#8217;s the only phone you&#8217;re going to buy for the <a href="https://www.statista.com/statistics/619788/average-smartphone-life/">next 2 years</a>. If you take an Uber, that&#8217;s one fewer trip <a href="https://www7.bts.dot.gov/sites/bts.dot.gov/files/docs/browse-statistical-products-and-data/surveys/224071/vtrpmap.pdf">out of your 5 daily trips</a> going towards a taxi or Lyft. These are zero-sum games, for the most part, which fit into the <em>product life cycle </em>framework taught in most business school curricula.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!suD1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!suD1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png 424w, https://substackcdn.com/image/fetch/$s_!suD1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png 848w, https://substackcdn.com/image/fetch/$s_!suD1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png 1272w, https://substackcdn.com/image/fetch/$s_!suD1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!suD1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png" width="487" height="297" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:297,&quot;width&quot;:487,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!suD1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png 424w, https://substackcdn.com/image/fetch/$s_!suD1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png 848w, https://substackcdn.com/image/fetch/$s_!suD1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png 1272w, https://substackcdn.com/image/fetch/$s_!suD1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a3ce2e1-4300-4b7f-8cd7-0f81b00000f1_487x297.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>The </em>product life cycle<em> curve shows a general expectation of sales trends for new products. First movers who invent the product achieve monopoly profits in the early stages of customer adoption, maintain wide margins against the nearest competitor in the growth stage, but wane as the market matures and declines.</em></figcaption></figure></div><p>Artificial intelligence does not share these characteristics of <em>leading sector </em>products:</p><ul><li><p>Like electricity, AI is a general purpose technology that when first introduced did not have a singular commercial application.&nbsp;</p></li><li><p>The <a href="https://www.techrepublic.com/article/open-source-powers-ai-yet-policymakers-havent-seemed-to-notice/">open-source nature</a> of most AI research means that state-of-the-art performance is not restricted to first movers; I can visit <a href="https://huggingface.co/EleutherAI/gpt-j-6B">HuggingFace and deploy a GPT-like model</a> for some web app in under an hour.</p></li><li><p>It is highly pervasive, meaning it has applications for many sectors of the economy. To oversimplify, anywhere a decision must be made using data or intuition, AI can augment or replace the decision-maker.</p></li></ul><p>A summary of these distinguishing factors is laid out below.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!R7hB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!R7hB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png 424w, https://substackcdn.com/image/fetch/$s_!R7hB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png 848w, https://substackcdn.com/image/fetch/$s_!R7hB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png 1272w, https://substackcdn.com/image/fetch/$s_!R7hB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!R7hB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png" width="1456" height="698" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:698,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:163866,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!R7hB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png 424w, https://substackcdn.com/image/fetch/$s_!R7hB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png 848w, https://substackcdn.com/image/fetch/$s_!R7hB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png 1272w, https://substackcdn.com/image/fetch/$s_!R7hB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F61dfd472-b6fa-41d2-9222-50212f4bdc37_1944x932.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: 1/19/2022 <a href="https://hai.stanford.edu/events/hai-weekly-seminar-jeffrey-ding">HAI Weekly Seminar with Jeffrey Ding</a></figcaption></figure></div><p></p><h2><strong>Being First Isn&#8217;t Enough</strong></h2><p>Suffice to say, it isn&#8217;t enough to just be first to invent a new general purpose technology. That alone does not guarantee a competitive moat. Instead of focusing on building innovation monopolies from one-off products, Ding suggests societies that see the greatest impact from technology diffusion are those which <em>intentionally</em> cultivate environments suitable for innovation.</p><p>These environments:</p><ul><li><p><strong>Continuously improve state-of-the-art benchmarks</strong> (via patents, research, and academic papers)</p></li><li><p><strong>Upgrade human capital</strong> with formalized disciplines (e.g. machine learning engineering, AI product managers)</p></li><li><p><strong>Introduce and evangelize standards</strong> for development and production (e.g. MLOps)</p></li><li><p><strong>Inspire innovation chains of complementary technologies </strong>across broad sectors of the economy</p></li></ul><p>It&#8217;s a more holistic and nuanced approach vs. the innovation-centric framework for leading sector technologies, which is more popular with pundits.</p><h1><strong>&#9889;&#65039; &#8220;AI is the New Electricity&#8221;</strong></h1><p><a href="https://aifund.ai/team-member/andrew-ng/">Andrew Ng</a> famously coined the phrase &#8220;AI is the new electricity,&#8221; claiming it will &#8220;transform every industry and create huge economic value.&#8221; If, like electricity, AI adoption follows a <em>diffusion theory</em> trajectory, what practical investments are required for institutions and countries to cultivate leading innovation gardens?</p><p>I would argue for a cohesion of 5 focus areas of public-private partnerships:</p><ol><li><p>Clear Research and Development Goals</p></li><li><p>Synergistic Infrastructure</p></li><li><p>Human Capital Upgrades</p></li><li><p>Commercial Standardization</p></li><li><p>Ethics and Explainable AI (XAI)</p></li></ol><h2><strong>1) Clear Research and Development Goals</strong></h2><p>Clear research and development objectives at the national level signal strategic interests which may not emerge from the commercial sector on their own.</p><p>Chinese AI basic research goals include <em>cross-media sensing and computing </em>(think <a href="https://arxiv.org/abs/1805.00705">audio-visual-text fusion</a>) and <em><a href="https://cset.georgetown.edu/wp-content/uploads/CSET-China-AI-Brain-Research.pdf">brain-inspired intelligence computing</a>, </em>among 6 others. All of these synergize with explicit commercial, infrastructure, and human capital goals in their strategy document, the 2017 <a href="https://www.newamerica.org/cybersecurity-initiative/digichina/blog/full-translation-chinas-new-generation-artificial-intelligence-development-plan-2017/">Next Generation Artificial Intelligence Development Plan</a>, which align neatly with the diffusion theory framework.</p><p>Though these basic research lanes aren&#8217;t likely to result in commercial offerings on a VC time scale, that isn&#8217;t the primary goal. The goal is creating the conditions for a quickly compounding chain of follow-on innovations.</p><p>Contrastingly, a glaring miss in the 2019 <a href="https://www.nitrd.gov/pubs/National-AI-RD-Strategy-2019.pdf">American AI Initiative</a> is clarity around specific R&amp;D goals of any kind (Strategy #1 is &#8220;make long-term investments in AI research&#8221;). The objectives mentioned are far too high level. In absence of specific R&amp;D goals, &#8220;planning to plan&#8221; is not a plan. Nor is &#8220;promoting leadership.&#8221;</p><h2><strong>2) Synergistic Infrastructure</strong></h2><p>Institutions should focus on building and supporting synergistic frameworks between open-source hardware, software, and cloud infrastructure.</p><p>If the past two years have taught us anything, it&#8217;s the fragility of the global silicon supply chain. Industry-wide over-reliance on just a few general-purpose chip manufacturers is a systemic risk for the global economy, with inventories under siege from players in nearly every industry. Investing in <a href="https://fs.blog/antifragile-a-definition/">antifragility</a><em> </em>for the hardware supply chain would be a good long-term bet, meaning serious consideration of novel computing methods like <a href="https://www.hpe.com/us/en/insights/articles/whats-this-neuromorphic-computing-youre-talking-about-2105.html">neuromorphic</a> computing or even quantum computing architectures, which are far more efficient than von Neumann architecture, is in order.</p><p>Open-source software libraries like Tensorflow and PyTorch, as well as platforms like HuggingFace, are speeding up AI application development time. Investing in these and other platforms for standardization across other AI paradigms should reap similar rewards.</p><p>Finally, development of <a href="https://hai.stanford.edu/policy/national-research-cloud">national research clouds</a>, as <a href="https://www.researchprofessionalnews.com/rr-news-europe-france-2021-11-france-banks-on-national-cloud/">France</a>, <a href="https://rcos.nii.ac.jp/en/service/merit/">Japan</a>, and <a href="https://www.cstcloud.net/">China</a> have done, should provide an antifragile alternative to incumbent cloud computing platforms, especially for basic research which may not have immediate revenue opportunities.</p><h2><strong>3) Human Capital Upgrades</strong></h2><p>As international competition for top researchers heats up, America is in particularly dire need of an upgrade in human capital.</p><p>American universities still attract the best international talent, and enjoy <a href="https://cset.georgetown.edu/wp-content/uploads/CSET-AI-Education-in-China-and-the-United-States-1.pdf">healthy ecosystems</a> promoting basic AI research. But the rest of the US population suffers from <a href="https://pnw.ai/article/america-needs-ai-literacy-now/72515409">incredibly low literacy rates</a> on fundamental AI concepts. By some estimates, <a href="https://www.linkedin.com/pulse/hello-world-announcing-kira-learning-kira-learning/?trackingId=tinRqmL1TYyV%2BHsXZ11PnQ%3D%3D">fewer than half</a> of US high schools teach any computer science at all. Of those that do, the curricula have remained more or less unchanged for 15 years.</p><p>Upskilling startups like <a href="https://www.fourthbrain.ai/">FourthBrain</a> and <a href="http://deeplearning.ai/">Deeplearning.AI</a> are delivering practical and highly relevant skillsets needed for learners to pivot into a career in AI. <a href="https://workera.ai/">Workera</a> is augmenting employees of existing workforces, and <a href="https://factored.ai/">Factored</a> is curating a deep bench of contractable AI/ML experts for multiple industries. Finally, in primary education, <a href="https://www.linkedin.com/company/kira-learning/">Kira Learning</a> is designing a contemporary AI fundamentals curriculum for K-12 American students in all 50 states, the first of its kind.</p><h2><strong>4) Commercial Standardization</strong></h2><p>Like Agile software processes which formalized mechanical and software engineering workflows, <a href="https://www.deeplearning.ai/program/machine-learning-engineering-for-production-mlops/">MLOps</a> is standardizing ML engineering in the workplace. Through MLOps, engineers and product managers are learning to tackle development of AI products in virtuous closed loops rather than linear progressions. Startups like <a href="https://whylabs.ai/">WhyLabs</a> and <a href="https://arize.com/">Arize</a> are formalizing these practices, making model development a core business process at many institutions, instead of a data science side project.</p><h2><strong>5) Ethics and &#8220;Explainable AI&#8221;</strong></h2><p>Much like the concerns around dangerous electricity inspired regulation and safety practices, we should anticipate a need for explainable AI (XAI) and the ability to audit model decision-making frameworks. This is especially relevant for &#8220;black box&#8221; models in mission-critical applications, such as <a href="https://www.nytimes.com/2019/11/10/business/Apple-credit-card-investigation.html">loan applications</a> or the <a href="https://www.theatlantic.com/technology/archive/2018/01/equivant-compas-algorithm/550646/">criminal justice system</a>, where hidden biases may have drastic and immediate impacts on humans&#8217; well-being. Startups like <a href="https://www.credo.ai/">Credo AI</a> are paving the way for XAI frameworks, guaranteeing safe model deployment to critical sectors.</p><h1><strong>&#128175; In 100 words or fewer&#8230;</strong></h1><p>Institutional AI supremacy will not be the result of one-off killer products, but a concerted, holistic series of investments in resource infrastructure, human capital upgrades, and standards-setting to create environments that nurture and encourage innovation. These innovation gardens do not exist <em>de facto</em> for any one system of governance, but are deliberate in their construction&#8230; without them, any first mover advantages will quickly wane, and the world&#8217;s primary AI innovation center may converge elsewhere.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/p/whats-wrong-with-the-ai-arms-race?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.machineyearning.io/p/whats-wrong-with-the-ai-arms-race?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><div><hr></div><h1>Thanks for reading!</h1><p>Machine Yearning is a collection of essays and news on the intersection between AI, investing, product, and economics, light on technicals but heavy on relevance.</p><p><a href="https://aifund.ai/team-member/ryan-cunningham/">Ryan Cunningham</a> is a Senior Builder at Andrew Ng&#8217;s <a href="https://aifund.ai/">AI Fund</a>, a venture studio accelerating the adoption of AI across the global economy. Prior to joining AI Fund, he worked in product at Uber and various AI startups, beginning his career as a technology investment banker at Credit Suisse. He studied Finance and Economics at Georgetown University, and is currently studying Artificial Intelligence part-time at Stanford.</p><p>Any suggestions or topics you want to see? Connect with me at <a href="https://www.takeme.to/ryan">takeme.to/ryan</a> (@rydcunningham across all platforms).</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Machine Yearning! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Watch Your Language, Part 2 | Machine Yearning 002]]></title><description><![CDATA["I don't know what GPT-3 is, and at this point I'm too afraid to ask."]]></description><link>https://www.machineyearning.io/p/watch-your-language-part-2</link><guid isPermaLink="false">https://www.machineyearning.io/p/watch-your-language-part-2</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Mon, 08 Mar 2021 16:30:22 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/828c5b2a-8c75-45de-b557-c284e99bf1cd_2638x1484.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Hello and welcome! This is part 2 of an introductory series on language models like GPT-3.</em></p><p><em><a href="https://machineyearning.substack.com/p/watch-your-language-part-1">Part 1</a> explores language models at a high level, with both fun and dangerous applications.</em></p><p><em>Part 2 (this post) introduces GPT-3 for anyone who's heard about it but has no idea what makes it special.</em></p><h1>&#128300; Long Take: GPT-3 &amp; Me</h1><p>You've all probably seen article after article hyping up this thing called "GPT-3." You may have seen headlines like <a href="https://builtin.com/machine-learning/why-gpt-3-heralds-democratic-revolution-tech">"Why GPT-3 Heralds a Democratic Revolution in Tech"</a> or <a href="https://www.datanami.com/2021/02/16/one-model-to-rule-them-all-transformer-networks-usher-in-ai-2-0-forrester-says/">"One Model to Rule Them All"</a>. You may even know that GPT-3 has 175 billion parameters (kudos), and not really know what a parameter is, just that 175 billion is a lot.</p><p>Not to worry, we'll clear that up in this post. You'll be able to chat with your co-workers about the exciting new language model in about 5 minutes from now.</p><h2>What Even Are Parameters</h2><p>I get it, after a certain point the numbers all sort of blend together.</p><p>The broader definition is that &#8220;parameters&#8221; are different coefficients a machine learning model tries to optimize during training.</p><p>The plain English understanding is that for a language model, parameters represent the strength of connections between words and word sequences. More parameters means more opportunities to see different sequences in different contexts.</p><ul><li><p>If a sentence starts with &#8220;Please pass the,&#8221; what is the probability that the next word will be &#8220;salt&#8221;?</p></li><li><p>What about &#8220;kangaroo&#8221;?</p></li></ul><p>Chances are that most small models would never ever predict "please pass the kangaroo" as a likely sequence. Most humans wouldn&#8217;t, anyway. If a model has never seen that sequence in its training data, there is an infinitesimal chance of predicting "kangaroo" as the close word.</p><p>However, on the off-chance the training data is so large and parameters so numerous that the model spans many different domains, there may be a chance - <a href="https://www.google.com/search?safe=active&amp;sxsrf=ALeKk03d8-IEk45Kh-GW1GWINESGCJmK8w%3A1615175889126&amp;ei=0aBFYIqVB5XO0PEPjI2A6AU&amp;q=%22please+pass+the+kangaroo%22&amp;oq=%22please+pass+the+kangaroo%22&amp;gs_lcp=Cgdnd3Mtd2l6EAMyBQghEKsCOgcIABBHELADOgQIIxAnOgUIABCRAjoCCAA6CAguEMcBEKMCOgUIABCLAzoECAAQQzoHCAAQhwIQFDoRCC4QxwEQrwEQiwMQpgMQqAM6CAguEMcBEK8BOgIILjoLCC4QxwEQrwEQiwM6CwguEIsDEKgDEJ0DOgUILhCTAjoGCAAQFhAeOggIIRAWEB0QHjoFCCEQoAE6BwghEAoQoAFQ47ULWK_MC2DfzQtoAXACeACAAe8BiAGZH5IBBjAuMjUuMZgBAKABAaoBB2d3cy13aXrIAQi4AQLAAQE&amp;sclient=gws-wiz&amp;ved=0ahUKEwjKwfuI55_vAhUVJzQIHYwGAF0Q4dUDCA4&amp;uact=5">however small</a> - that sequence may come up.</p><h2>What Are Domains</h2><p>Traditionally, the <em>domain</em> of the training set is strongly correlated with performance on any one task.</p><p>For instance, a hate-speech detection model trained on language data from Wikipedia or another well-curated, grammatically sound corpus will rarely perform as well as a model trained on more informal sources like web forums, IRC, Discord chats, etc.. There are at least three reasons for this behavior:</p><ul><li><p>This is because the kinds of insults, slurs, and syntactical/grammatical language nightmares on those informal sources are far more varied, numerous, and current than a <a href="https://en.wikipedia.org/wiki/List_of_ethnic_slurs">formal list</a> from Wikipedia</p></li><li><p>Each forum may have unique lingo, like Redditors using <a href="https://www.reddit.com/r/dataisbeautiful/comments/8df1r3/the_top_subreddits_as_hashtags_oc/">/r/subredditsashashtags</a>, making it more difficult for models trained on other domains to predict subsequent words</p></li><li><p>Since there are usually more diverse <em>contexts</em> (i.e. the words around the slur) for forum-based sources, by using a model trained on that domain, we have a much better chance of recognizing contexts in the wild that match up with examples of hate speech the model has already seen</p></li></ul><p>At least, these are the <em>traditional</em> limitations of language model effectiveness.</p><h2>ELI5 GPT-3</h2><p>And then of course, GPT-3 comes around. Over two orders of magnitude larger than its predecessor GPT-2, and ten times larger than any other model at the time of release, GPT-3 is renowned for its generalizability and emergent properties unseen in prior language models. In particular, a meta-learning feature called "in-context learning," which we'll come back to in a bit.</p><h3>Few-shot learners</h3><p>OpenAI's paper <a href="https://arxiv.org/pdf/2005.14165.pdf">"Language Models are Few-Shot Learners"</a> introduced GPT-3 in July 2020. The term "few-shot" (as opposed to "one-shot" or "zero-shot") refers to the number of examples a model has to observe before inferring the task it's trying to accomplish.</p><p>Put more colorfully, picture the following:</p><ul><li><p>You're unexpectedly and violently woken up to the sound of a masked man blaring Queen's "Bohemian Rhapsody" through a megaphone in your bedroom.</p></li><li><p>At "Caught in a landslide..." the song stops, and this mystery person gestures to you.</p></li><li><p>With neither a coffee nor a clue as to why this person barged into your home for this, you muster up the softest, most confused "...no escape from reality."</p></li></ul><p>If the task was to predict the next lyric in the song, congratulations! You're a one-shot learner.</p><p>If however, the task was something else - like a question-answer task "Who is the artist who sings this lyric?", then you wouldn't have gotten it right the first time. It might take a few more examples from the masked man with a megaphone before you get it right. Or before you call the police. This is called "few-shot learning."</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XPZ6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XPZ6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png 424w, https://substackcdn.com/image/fetch/$s_!XPZ6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png 848w, https://substackcdn.com/image/fetch/$s_!XPZ6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png 1272w, https://substackcdn.com/image/fetch/$s_!XPZ6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XPZ6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png" width="705" height="392" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:392,&quot;width&quot;:705,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:122345,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XPZ6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png 424w, https://substackcdn.com/image/fetch/$s_!XPZ6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png 848w, https://substackcdn.com/image/fetch/$s_!XPZ6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png 1272w, https://substackcdn.com/image/fetch/$s_!XPZ6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F378ceffa-2b3c-4812-be08-5fa832ede821_705x392.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: &#8220;Language Models are Few-Shot Learners,&#8217; OpenAI.</figcaption></figure></div><p>GPT-3 is particularly good at few-shot learning for a wide variety of NLP tasks, including translation, question-answering, word unscrambling, and even 3-digit arithmetic. This is part of the reason why it's so generalizable, because a single model can be used for multiple applications.</p><h3>In-context learning</h3><p>The reason for this is that GPT-3 seems to infer the type of task it's being asked to solve much faster than smaller models. Its sheer size developed a broad range of pattern recognition skills, which are then used to quickly adapt to whatever unspecified task the user is prompting.</p><p>This meta-learning characteristic is called "in-context learning," which means that the model is able to infer the task it&#8217;s trying to accomplish on the fly, learning as it goes.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oOHo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oOHo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png 424w, https://substackcdn.com/image/fetch/$s_!oOHo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png 848w, https://substackcdn.com/image/fetch/$s_!oOHo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png 1272w, https://substackcdn.com/image/fetch/$s_!oOHo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oOHo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png" width="752" height="287" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/a1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:287,&quot;width&quot;:752,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oOHo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png 424w, https://substackcdn.com/image/fetch/$s_!oOHo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png 848w, https://substackcdn.com/image/fetch/$s_!oOHo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png 1272w, https://substackcdn.com/image/fetch/$s_!oOHo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1218ac6-b51d-497a-8b1c-835d97bc5a67_752x287.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Starting with a few examples of correct inputs and outputs, GPT-3 can infer the task it&#8217;s meant to accomplish in real time, like 3 digit arithmetic, spelling correction, and English to French translation.</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gWn7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gWn7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png 424w, https://substackcdn.com/image/fetch/$s_!gWn7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png 848w, https://substackcdn.com/image/fetch/$s_!gWn7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png 1272w, https://substackcdn.com/image/fetch/$s_!gWn7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gWn7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png" width="994" height="461" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:461,&quot;width&quot;:994,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gWn7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png 424w, https://substackcdn.com/image/fetch/$s_!gWn7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png 848w, https://substackcdn.com/image/fetch/$s_!gWn7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png 1272w, https://substackcdn.com/image/fetch/$s_!gWn7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F703ea47e-0884-4b0f-93ae-2506fdb13d3f_994x461.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Another example of few-shot learning for correcting English grammar.</figcaption></figure></div><p></p><p>When prompted with even just a handful of examples, GPT-3 has remarkable accuracy on not just traditional NLP tasks, but also new adaptations like writing articles, <a href="https://twitter.com/amandaaskell/status/1283900372281511937?lang=en">guitar tabs</a>, and even <a href="https://www.nytimes.com/2020/11/24/science/artificial-intelligence-ai-gpt3.html">computer code</a> that otherwise would have required specialized models.</p><h3>Is this intelligence?</h3><p>I wouldn't say so, more like extremely high-dimensional mimicry.</p><p>Recall that in <a href="https://machineyearning.substack.com/p/watch-your-language-part-1">Watch Your Language, Part 1</a>, a Berkeley student was able to write entire productivity blog posts by prompting GPT-3 with a title and introduction. Does the fluency of the language, and its ability to fool human readers, imply it truly understands (as we would) causal links between sentences and aspects like "narrative"?</p><p>Not necessarily. Lifestyle blogs are one of the most popular blog formats, and posts on productivity are quite numerous. Because 60% of GPT-3's training mix is made up of crawled websites from all over the internet, it's safe to assume that the model has probably seen enough examples of lifestyle blogs to craft language that resembles them.</p><p>Some common tells are that GPT-3:</p><ul><li><p>Has trouble maintaining narrative consistency over longer documents</p></li><li><p>Is <a href="https://www.technologyreview.com/2020/07/20/1005454/openai-machine-learning-language-generator-gpt-3-nlp/">subject to repetitions and non sequiturs</a></p></li><li><p>Underperforms on language generation that requires logic and reasoning. <strong>Lifestyle blogs don't require either, which is how it was able to dupe readers so easily.</strong></p><p></p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UCZ3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UCZ3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png 424w, https://substackcdn.com/image/fetch/$s_!UCZ3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png 848w, https://substackcdn.com/image/fetch/$s_!UCZ3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png 1272w, https://substackcdn.com/image/fetch/$s_!UCZ3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UCZ3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png" width="1089" height="598" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/fae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:598,&quot;width&quot;:1089,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UCZ3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png 424w, https://substackcdn.com/image/fetch/$s_!UCZ3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png 848w, https://substackcdn.com/image/fetch/$s_!UCZ3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png 1272w, https://substackcdn.com/image/fetch/$s_!UCZ3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Ffae5147c-8c83-45a9-96a1-5755e87623eb_1089x598.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A GPT-3 generated news article that humans were able to identify as bot-generated 61% of the time. Notice the highlighted uncharacteristic repetitions.</figcaption></figure></div><p></p><p>Regardless, GPT-3's size has undoubtedly made its meta-learning characteristics much more performant and easier to identify. It's still a mystery to most researchers exactly <em>how</em> in-context learning works, but the excitement is palpable to see how much further we can take it.</p><p>What is indisputable so far, is that the more data a model trains on, the better it generally seems to perform.</p><h2>In Closing: The Cost of Capital</h2><p>Which brings us to our final point. As models grow, so do their costs of capital. Using Google&#8217;s <strong>T5-11B</strong> (its previous 11 billion parameter language model)&#8217;s $1.3 million per run estimate (<a href="https://www.stateof.ai/">State of AI Report 2020, slide 17</a>), after all rounds of training experts ballpark the training cost of transformer language models at ~$1 / 1,000 parameters.</p><p>Roughly speaking, that puts the final tally for GPT-3's training costs <strong>at around $175 million for a single model.</strong> There are very few institutions that can afford that kind of capital. And if you&#8217;ll recall from <a href="https://machineyearning.substack.com/p/watch-your-language-part-1">part 1</a>, there are risks associated with such large models, including <a href="https://twitter.com/abidlabs/status/1291165311329341440?s=20">disturbing encoded biases</a> and the budding <a href="https://www.technologyreview.com/2020/11/12/1011944/artificial-intelligence-replication-crisis-science-big-tech-google-deepmind-facebook-openai/">replicability crisis</a>.</p><p>To wrap this up, I'll repeat what Dr. Kai-Fu Lee of <a href="https://www.sinovationventures.com/">Sinovation Ventures</a> recently shared when asked about the investment potential of massive language models at this year&#8217;s AAAI conference. He shares similar concerns about the growing tendency towards monopolistic advantage in large models. The full video and slides are in the <strong>&#127909;  Watching </strong>section of this week&#8217;s bonus content<strong>.</strong></p><blockquote><p>I believe that the new, huge language models that appear to make a big difference, also come with a large cost in computation. That on the one hand has energy implications, but it also makes it harder and harder for universities to compete with internet giants. Google and Microsoft, they can build a $100mm computer, and train models with a trillion parameters, but professors and researchers cannot. Every country should think about how we make those types of resources available to researchers and startups so that the giant corporations don't end up with a... monopolistic advantage.</p></blockquote><div><hr></div><h1>Bonus Content</h1><p>Articles, reports, videos, and more worth checking out this week that didn&#8217;t make the featured cut</p><h3>&#128185;  AI in the Markets</h3><h4><a href="https://ethz.ch/en/news-and-events/eth-news/news/2021/02/a-highly-accurate-digital-twin-of-our-planet.html">Building a digital twin of the planet (ETH Zurich)</a></h4><p>European and ETH Zurich computer scientists proposed developing a digital twin of the Earth for high-precision environmental and climate modeling across space and time. Researchers estimate this system would require ~20,000 GPUs, consuming an estimated 20MW of power.</p><h4><a href="https://techcrunch.com/2021/02/11/superannotate-a-computer-vision-platform-partners-with-with-open-source-to-spread-visual-ml/">SuperAnnotate, a no-code computer vision platform, partners with OpenCV</a> (TechCrunch)</h4><p>This partnership could lower the barrier to entry for lots of business leaders on cmputer vision applications. Personally I own 2 OpenCV OAK cameras. They come out-of-the-box with object detection inference models onboard, and are quite impressive.</p><h3>&#128295;&nbsp; Hardware &amp; ASICs</h3><h4><a href="https://www.anandtech.com/show/16511/leading-foundries-enjoy-massive-revenue-growth-as-capacities-get-fully-loaded">Semi Demand 30% Above Supply, 20% Year-on-Year Growth</a> (AnandTech)</h4><p>TSMC, Samsung, UMC, and GlobalFoundries leading the pack, with China's SMIC in 5th. Inventories are drying up since "semi companies are shipping 10% to 30% below current demand levels, and it will take at least 3-4 quarters for supply to catch up with demand."</p><h3>&#128161; Startups &amp; Strategy</h3><h4><a href="https://marker.medium.com/why-theres-no-such-thing-as-a-startup-within-a-big-company-c3003615f3bc">Why There's No Such Thing as a 'Startup Within a Big Company'</a> (Waze vs Google)</h4><p>Waze co-founder and CEO Noam Bardin left Google in January and published a personal essay detailing "the trickle-down problem" in thoughtful and heartbreaking detail.</p><h2>&#127911; Listening</h2><h4><a href="https://open.spotify.com/episode/73Em8QCFXsMS5Xa6YosrOS?si=FfnacDYbQfS4x0FgQ-HdlQ">The chip choke point. A single machine from the Netherlands could catapult China to the leading edge of the semiconductor industry. If the U.S. allowed it, that is. (The Wire China)</a></h4><p>EUV light sources emit light in incredibly short wavelengths, less than 20nm, which is necessary to carve circuit features  onto nodes less than 7nm. The company, ASML, has been courted by China for a decade for a potential acquisition, however the patent for EUV is actually owned by a US company. Without it, China will have to pursue alternative means to go from 14nm to 7nm.</p><h2>&#127909; Watching</h2><h4><a href="https://slideslive.com/38952432/ai-infusion-investment-opportunities">AI Infusion &amp; Investment Opportunities, with Kai-Fu Lee (AAAI-21)</a></h4><p>Dr. Kai-Fu Lee (founder of Sinovation Ventures, author of AI Superpowers) highlights the contributing factors to China's competitive rise within applied AI, and identifies some oft-overlooked investment themes.</p><div><hr></div><h1>Thanks for reading!</h1><p>Machine Yearning is a collection of essays and news on the intersection between AI, investing, product, and economics, light on technicals but heavy on relevance. Think of it as a casual chat about AI over coffee (or any other preferred beverage).</p><p>Ryan Cunningham is an AI Product Manager, strategist, and ex-investment banker. Today he leads applied AI strategy and new verticals at&nbsp;<a href="https://www.spiketrap.io/">Spiketrap</a>, an NLP-as-a-service company. He spent 4 years at Uber and is currently studying Artificial Intelligence part-time at Stanford, with a BS in Finance and Economics from Georgetown University.</p><p>Any suggestions or topics you want to see? Shoot me an email at&nbsp;<a href="mailto:rydcunningham@gmail.com">rydcunningham@gmail.com</a>&nbsp;or hit me up on&nbsp;<a href="https://www.twitter.com/rydcunningham">Twitter</a>&nbsp;/&nbsp;<a href="https://www.linkedin.com/in/rydcunningham">LinkedIn</a>&nbsp;/ Clubhouse&nbsp;@rydcunningham.</p>]]></content:encoded></item><item><title><![CDATA[Watch Your Language, Part 1 | Machine Yearning 001]]></title><description><![CDATA[Why Size Matters and What to Do About It]]></description><link>https://www.machineyearning.io/p/watch-your-language-part-1</link><guid isPermaLink="false">https://www.machineyearning.io/p/watch-your-language-part-1</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Wed, 24 Feb 2021 17:02:51 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/03250098-1cd9-4e58-a8f6-0ec5eb9d9289_2616x1471.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Hello and welcome!</em></p><p><em>I originally planned this week to dive into Google&#8217;s recently released Switch Transformer language model and how it differs from previously published models like GPT-3. Instead, I will be splitting this topic into two parts:</em></p><p><em>Part 1 (this post) will explore language models at a high level, with some fun and some dangerous applications.</em></p><p><em>Part 2 will compare current state-of-the-art models like the Switch Transformer, GPT-3, and its open-sourced alternative GPT-Neo, through a more technical and economic lens.</em></p><p><em>For now, welcome to the sometimes silly, sometimes concerning, but always interesting world of language models. Here&#8217;s why they matter and why you should care.</em></p><h1><strong>&#128300;Long Take: Language Models for Fun and Profit</strong></h1><p>Before I start, it&#8217;s probably best to quickly state what a &#8220;language model&#8221; is.</p><p>Put simply, given the context of preceding words, a language model predicts what the next word in a sentence will be.</p><p>That&#8217;s it. Simple as that.</p><p>Believe it or not, you already use language models in your daily life. Gmail&#8217;s &#8216;Smart Compose&#8217; feature and the predictive keyboard on your iPhone are common examples.</p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Eqtp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Eqtp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png 424w, https://substackcdn.com/image/fetch/$s_!Eqtp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png 848w, https://substackcdn.com/image/fetch/$s_!Eqtp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png 1272w, https://substackcdn.com/image/fetch/$s_!Eqtp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Eqtp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png" width="1440" height="984" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/b762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:984,&quot;width&quot;:1440,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:307713,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Eqtp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png 424w, https://substackcdn.com/image/fetch/$s_!Eqtp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png 848w, https://substackcdn.com/image/fetch/$s_!Eqtp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png 1272w, https://substackcdn.com/image/fetch/$s_!Eqtp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb762eb17-7111-424c-bac5-1f6cce99addd_1440x984.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: Apple Support.</figcaption></figure></div><p></p><p>Researchers train these models on large collections of text documents (corpora), which contain millions and millions of words and sentences in natural language. Some of the most common corpora include sets from Wikipedia, books in the public domain, internet news, even Shakespeare. Importantly, the kinds of corpora researchers choose to train on have significant implications for the kinds of words the model predicts.</p><h2>&#8220;You are Hagrid now.&#8221;</h2><p>Back in late 2017 one studio trained a language model on <a href="https://www.theverge.com/2017/12/12/16768582/harry-potter-ai-fanfiction">Harry Potter books</a>, then used it to independently generate a hilarious new chapter called <a href="https://botnik.org/content/harry-potter.html">Harry Potter and the Portrait of What Looked Like a Large Pile of Ash</a>. I definitely recommend reading the whole thing, but here are my favorite excerpts:</p><blockquote><p><strong>Leathery sheets of rain lashed at Harry&#8217;s ghost as he walked across the grounds toward the castle. Ron was standing there and doing a kind of frenzied tap dance. He saw Harry and immediately began to eat Hermione&#8217;s family.</strong></p><p><strong>Ron&#8217;s Ron shirt was just as bad as Ron himself.</strong></p><p><strong>&#8220;If you two can&#8217;t clump happily, I&#8217;m going to get aggressive,&#8221; confessed the reasonable Hermione.</strong></p><p><strong>&#8220;Not so handsome now,&#8221; thought Harry as he dipped Hermione in hot sauce.</strong></p><p><strong>The pig of Hufflepuff pulsed like a large bullfrog. Dumbledore smiled at it, and placed his hand on its head: &#8220;You are Hagrid now.&#8221; </strong></p></blockquote><p>At best, it&#8217;s a whimsical, silly romp that actually does sound Rowling-esque. But since this model is trained only on Harry Potter books, with a measly total of 1,084,170 words, its performance is rather limited. Professional writers and copy-editors had to be employed to prune the generated text, and you shouldn&#8217;t expect this to perform very well on a task like Smart Composing email replies.</p><h2>&#8220;Writing more while thinking less.&#8221;</h2><p>Another silly, but<em> </em>more concerning real-world example is a <a href="https://adolos.substack.com/archive?sort=new">productivity blog</a> created by Berkeley student Liam Porr last year. Reportedly, it only took a few hours of playing with OpenAI&#8217;s popular language model<a href="https://openai.com/blog/openai-api/"> GPT-3</a> (more on this in a bit) for Porr to think of an experiment to run: first, he would feed the model a headline and introduction for a blog post as inputs. Next, given that context, the model would go on to generate several versions of full-length self-help and productivity blog posts <strong>based solely on just the headline and intro</strong>.</p><p>Determining these inputs wasn&#8217;t difficult. He would make quick trips to Medium and Hacker News to see what&#8217;s trending, copy something similar, and that would  get the job done. Under the pseudonym <a href="https://en.wikipedia.org/wiki/Dolos_(mythology)">&#8220;Adolos&#8221;</a>, he began publishing near-daily posts to his productivity blog over a two-week period:</p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6p5C!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6p5C!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png 424w, https://substackcdn.com/image/fetch/$s_!6p5C!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png 848w, https://substackcdn.com/image/fetch/$s_!6p5C!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png 1272w, https://substackcdn.com/image/fetch/$s_!6p5C!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6p5C!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png" width="758" height="613" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:613,&quot;width&quot;:758,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:277147,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6p5C!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png 424w, https://substackcdn.com/image/fetch/$s_!6p5C!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png 848w, https://substackcdn.com/image/fetch/$s_!6p5C!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png 1272w, https://substackcdn.com/image/fetch/$s_!6p5C!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F2d333c4b-c6c8-4f1c-8dc0-40ef9a86b045_758x613.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: https://adolos.substack.com. Can you spot the imposters? It&#8217;s easy. They ALL are.</figcaption></figure></div><p></p><p>This was an innocent enough idea, and in practice the blog posts seemed so familiarly written and the language so sensible that almost no one could see through the ruse.</p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!64qo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!64qo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png 424w, https://substackcdn.com/image/fetch/$s_!64qo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png 848w, https://substackcdn.com/image/fetch/$s_!64qo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png 1272w, https://substackcdn.com/image/fetch/$s_!64qo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!64qo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png" width="769" height="354" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:354,&quot;width&quot;:769,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:73582,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!64qo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png 424w, https://substackcdn.com/image/fetch/$s_!64qo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png 848w, https://substackcdn.com/image/fetch/$s_!64qo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png 1272w, https://substackcdn.com/image/fetch/$s_!64qo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F0a4c9ae9-286c-4408-bc5e-d882428d4427_769x354.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Some readers felt a particularly strong connection to the content.</figcaption></figure></div><p></p><p>In a surprising turn of events, many who suspected these posts were AI-generated were actually criticized and downvoted by the community!</p><p></p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!28Ua!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!28Ua!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png 424w, https://substackcdn.com/image/fetch/$s_!28Ua!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png 848w, https://substackcdn.com/image/fetch/$s_!28Ua!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png 1272w, https://substackcdn.com/image/fetch/$s_!28Ua!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!28Ua!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png" width="664" height="227" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:227,&quot;width&quot;:664,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!28Ua!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png 424w, https://substackcdn.com/image/fetch/$s_!28Ua!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png 848w, https://substackcdn.com/image/fetch/$s_!28Ua!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png 1272w, https://substackcdn.com/image/fetch/$s_!28Ua!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F3aedd93f-8fd3-43e7-b8a2-40c94402c3ed_664x227.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Source: Hacker News.</figcaption></figure></div><p></p><p>Just before fessing up to the whole charade, his final, human-written blog post <a href="https://adolos.substack.com/p/what-i-would-do-with-gpt-3-if-i-had">&#8220;What I Would Do with GPT-3 If I Had No Morals&#8221;</a> pulled an <a href="https://en.wikipedia.org/wiki/If_I_Did_It">O.J.</a> and described in hypothetical detail how one might go about pulling this off.</p><blockquote><p><strong>While the output is not perfect, you can easily curate it to something that's convincing. This will make it so easy for people to just pump out clickbait articles to drive traffic. It would be pretty simple to do actually.</strong></p><p><strong>First thing you would need to do is come up with a name. If it were me, I&#8217;d name it after the Greek god of deception or something like that just to be clever. Then I&#8217;d just stick an &#8220;A&#8221; in front so nobody gets suspicious.</strong></p><p><strong>After that, I&#8217;d make a substack because it takes no time to set up. Once thats done you have to come up with some content. GPT-3 isn&#8217;t great with logic, so inspirational posts would probably be best, maybe some pieces on productivity too.</strong></p></blockquote><p>Porr observed that GPT-3 is &#8220;good at making pretty language, and &#8230; not very good at being logical and rational.&#8221; Given those constraints, what better blogging category to pick than productivity and self-help?</p><h2>Size matters</h2><p>On a more serious note, that these communities were so easily duped with such little effort on Porr&#8217;s part raises serious questions. How is it that language models like GPT-3 are able to generate such convincing language?</p><p>When it comes to building effective language models, <em>generally speaking </em>the more natural language data included in the training phase, the more convincing the generated outputs will be.</p><p>Now remember, models themselves aren&#8217;t malicious, they just amplify whatever biases exist in the data used to train them. Massive language models? Even more so.</p><h3>Two ___ Walked Into a ___</h3><p>Here&#8217;s where things get dicey. Taking the outputs of massive language models for granted can have dire consequences, well beyond occasionally duping a bunch of Hacker News readers. <a href="https://twitter.com/abidlabs">Abubakar Abid</a>, a Stanford PhD candidate, demonstrated one such example using the GPT-3 demo in August of last year. Watch the video below to see what happens when the model is fed a sentence that starts with &#8220;Two Muslims&#8221;&#8230;</p><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://twitter.com/abidlabs/status/1291165311329341440?s=20&quot;,&quot;full_text&quot;:&quot;I'm shocked how hard it is to generate text about Muslims from GPT-3 that has nothing to do with violence... or being killed... &quot;,&quot;username&quot;:&quot;abidlabs&quot;,&quot;name&quot;:&quot;Abubakar Abid&quot;,&quot;profile_image_url&quot;:&quot;&quot;,&quot;date&quot;:&quot;Thu Aug 06 00:12:53 +0000 2020&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://cdn.substack.com/image/upload/w_728,c_limit/l_twitter_play_button_rvaygk,w_120/tunggwsfhn2ujxxzb4kh&quot;,&quot;link_url&quot;:&quot;https://t.co/biSiiG5bkh&quot;}],&quot;quoted_tweet&quot;:{},&quot;reply_count&quot;:0,&quot;retweet_count&quot;:1992,&quot;like_count&quot;:5180,&quot;impression_count&quot;:0,&quot;expanded_url&quot;:{},&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>Compared to any other religious group, preceding a sentence with &#8220;Two Muslims&#8221; generated violent content about 9x as frequently.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oMxY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oMxY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png 424w, https://substackcdn.com/image/fetch/$s_!oMxY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png 848w, https://substackcdn.com/image/fetch/$s_!oMxY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png 1272w, https://substackcdn.com/image/fetch/$s_!oMxY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oMxY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png" width="752" height="425" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:425,&quot;width&quot;:752,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oMxY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png 424w, https://substackcdn.com/image/fetch/$s_!oMxY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png 848w, https://substackcdn.com/image/fetch/$s_!oMxY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png 1272w, https://substackcdn.com/image/fetch/$s_!oMxY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F4c879384-b2b6-422a-bb32-febabd0d9eab_752x425.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Abid sees this sort of anti-Muslim bias as &#8220;persistent&#8221; in many massive language models.</p><p>Now, why might this be happening? These models are built on massive datasets with billions and billions of tokens from many different sources, like books, news, Wikipedia, and website text. For these datasets, think about what bias the authors, editors, and journalists may have had going into the publishing process for the documents in these corpora.</p><ul><li><p>What language, what country, what time period were they written in?</p></li><li><p>What percentage of documents come from a time period, say, post-9/11, when the <a href="https://www.tandfonline.com/doi/abs/10.1300/J222v05n01_03">frequency of anti-Islamic sentiment and hate crimes</a> rose and persisted for many years? </p></li></ul><p>The bigger these models get, the more difficult it is to perform studies that spot these kinds of biases. Without proper scrutiny and auditing, training on massive text datasets <em>carte blanche</em> will encode sometimes extreme but otherwise unhelpful biases into our language models.</p><h3>The Replicability Crisis</h3><p>Because these models are so big, the arms race is contributing to a <a href="https://www.technologyreview.com/2020/11/12/1011944/artificial-intelligence-replication-crisis-science-big-tech-google-deepmind-facebook-openai/">replicability crisis</a> where very few entities have the capital and resources to train and run said models from scratch. We end up having to rely on the publisher (Google, OpenAI, etc.) to check their homework.</p><p>While some researchers are alright with this approach - let Google-sized entities pre-train and publish these large general models, and have smaller entities customize them for specific applications - this obscures peer reviewers&#8217; and third parties&#8217; ability to spot biases that can be used to discriminate, radicalize, or justify extreme violence.</p><h2>What can we do about it?</h2><p>Understanding <strong>how </strong>these biases show up is critically important. That&#8217;s the first step to solving the problem.</p><h3>Ask the right questions.</h3><p>I alluded to this before. Be inquisitive about the datasets used for training. What are their sources, on what time window were they collected, etc.. Are there alternative datasets available which may mitigate or de-risk unhelpful bias?</p><h3>Conduct regular audits.</h3><p>Structured reviews and audits of these popular datasets often yield surprising results. To borrow a computer vision example, the <strong><a href="https://www.theregister.com/2020/07/01/mit_dataset_removed/">Tiny Images</a></strong> dataset from MIT (comprised of 80 million labeled 32x32 pixel images used to train and benchmark computer vision models) was removed from the public domain after a more thorough review revealed some highly inappropriate, derogatory annotations (emphasis mine):</p><blockquote><p><strong>The dataset includes, for example, </strong><em><strong>pictures of Black people and monkeys labeled with the N-word; women in bikinis, or holding their children, labeled whores;</strong></em><strong> parts of the anatomy labeled with crude terms; and so on &#8211; needlessly linking everyday imagery to slurs and offensive language, and baking prejudice and bias into future AI models.</strong></p></blockquote><p>Personally, I think there&#8217;s much more to learn from auditing and pruning than outright removal, but tough calls have to be made. Unfortunately for MIT, the images themselves were too small for manual review, so the lab elected to remove the dataset in its entirety.</p><h3>When creating datasets, do your diligence.</h3><p>The lab (CSAIL) also admitted that in creating the dataset, they &#8220;automatically obtained the images from the internet without checking whether any offensive pics or language were ingested into the library.&#8221; Any downstream models making use of the Tiny Images dataset, or any large pre-trained model built on it, would therefore mistakenly suggest problematic, causal links not grounded in reality.</p><h3>Open-source where possible.</h3><p>For now, GPT-3 is under lock-and-key <a href="https://blogs.microsoft.com/blog/2020/09/22/microsoft-teams-up-with-openai-to-exclusively-license-gpt-3-language-model/">on an exclusive license to Microsoft</a>. The source code was never released to the public, which has obscured the ability for researchers to examine its biases in detail.</p><p>Fortunately, a team called <a href="https://venturebeat.com/2021/01/15/ai-weekly-meet-the-people-trying-to-replicate-and-open-source-openais-gpt-3/">EleutherAI</a> is working to open-source GPT-3 through a parallel effort called GPT-Neo. Areas of improvement include a dedicated team of curators who have performed &#8220;extensive bias analysis&#8221; on training data, and in some cases, excluded otherwise popular datasets they felt had &#8220;unacceptably negatively biased&#8221; content. Compared to GPT-3&#8217;s 5 datasets, GPT-Neo has a more diverse mix of 22 smaller datasets.</p><h2>Moving forward</h2><p>It&#8217;s important to remember that machine learning models are, in their simplest form, just pattern recognition tools. They are not free of subjectivity. Any bias that exists in the training data will be amplified. So there is an ethical responsibility on the part of both researchers and customers to think deeply about their models&#8217; application and misapplication.</p><p>In a world where proliferated, unmoderated fake news on Facebook <a href="https://www.nytimes.com/2018/10/15/technology/myanmar-facebook-genocide.html">directly contributed to the systemic genocide</a> of a Muslim ethnic minority (Rohingya Muslims in Myanmar), it isn&#8217;t a stretch to imagine how malicious actors could, with a botnet and a language model, procedurally generate enough conspiracies and hateful content to dupe individuals, communities, and governments alike.</p><p>At the end of the day, I would caution against measuring progress by size alone, and recommend a more nuanced approach which takes extreme bias detection into consideration. To make effective use of these large models you need capital and resources belonging to a select few, which concentrates research and publishing power, de-emphasizes practical relevance for most institutions, and increases systemic risks when left unchecked and unchallenged.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.machineyearning.io/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><h1>Bonus Content</h1><p>These are articles, reports, videos, and more worth checking out that didn&#8217;t get a featured mention this issue.</p><h2>&#128218; Reading</h2><h3>&#128178; AI in the Market</h3><h4><a href="https://cset.georgetown.edu/research/from-china-to-san-francisco-the-location-of-investors-in-top-u-s-ai-startups/">From China to San Francisco: The Location of Investors in Top U.S. AI Startups (CSET)</a></h4><p>The most popular foreign investors in top US A.I. startups are out of - you guessed it, China.</p><h4><a href="https://cset.georgetown.edu/research/comparing-corporate-and-university-publication-activity-in-ai-ml/">Comparing Corporate and University Publication Activity in AI/ML (CSET)</a></h4><p>While universities technically produce more numerous AI papers, corporations generate the most citations for published work.</p><h3>&#127963; Government &amp; Policymaking</h3><h4><a href="https://www.technologyreview.com/2021/01/22/1016652/biden-administration-ai-plans-what-to-expect/">The Biden Administration&#8217;s AI Plans: What to Expect (MIT Technology Review)</a></h4><p>Three takeaways: first, the President elevated the director of the Office of Science and Technology Policy (OSTP) to a cabinet-level position. Then, he named a prominent sociologist as deputy director, who may be more mindful of encoded bias to algorithm inputs. Finally, his Secretary of State described the pursuit of AI standard-setting against &#8216;techno autocracies&#8217; (i.e. China) in particularly dramatic terms.</p><h4><a href="https://www.theguardian.com/science/2021/jan/26/us-has-moral-imperative-to-develop-ai-weapons-says-panel?mc_cid=f72e1827dc">US has &#8216;moral imperative&#8217; to develop AI weapons, says panel (The Guardian)</a></h4><p>For about eight years, a coalition of non-governmental organizations have been pushing for an international treaty banning the use of autonomous (read: no humans in the loop) weapons. Interesting takes in here as to why (&#8220;human control is necessary to&#8230; assign blame for war crimes.&#8221;). The National Security Commission on Artificial Intelligence, led by former Google CEO Eric Schmidt, acknowledges risks but does not recommend a ban, saying such weapons will contribute to a cleaner, safer battlefield, free of human error.</p><p>We&#8217;ll definitely cover this in a future issue, but for now keep this in mind: genetic diversity is a crucial line of defense for any species. If our autonomous weapons can all fall victim to the same <a href="https://hackernoon.com/adversarial-attacks-how-to-trick-computer-vision-7484c4e85dc0">adversarial attacks</a>, our defense infrastructure becomes far more fragile.</p><h3>&#128269; Hidden Biases</h3><h4><a href="https://venturebeat.com/2021/02/09/openai-and-stanford-researchers-call-for-urgent-action-to-address-harms-of-large-language-models-like-gpt-3/">OpenAI and Stanford Researchers Call for Urgent Action to Address Harms of Large Language Models Like GPT-3 (VentureBeat)</a></h4><p>&#8220;Lies travel halfway around the world before the truth ties its shoes.&#8221; Perpetuating disinformation is only one of many malicious applications of large, unaudited, pre-trained language models.</p><h4><a href="https://www.technologyreview.com/2021/02/05/1017560/predictive-policing-racist-algorithmic-bias-data-crime-predpol/">Predictive Policing is Still Racist - Whatever Data it Uses (MIT Technology Review)</a></h4><p>Predictably, supervised algorithms trained on past crime data make irresponsible causal fallacies between race, crime, and recidivism, no matter which way researchers slice it. They&#8217;re having a tough time finding a technical fix; in the meantime, they advocate for a political solution, but officials are reluctant to abandon the technology.</p><h3>&#128295; Hardware &amp; ASICs</h3><h4><a href="https://www.cnbc.com/2019/09/25/alibaba-unveils-its-first-ai-chip-called-the-hanguang-800.html">Alibaba Unveils Its First AI ASIC, Hanguang 800 (CNBC)</a></h4><p>Older news (Sep 2019), but still important. Large companies reliant on AI for competitive advantages have been developing their own hardware in-house, like Google and its custom tensor processing units (TPUs). Alibaba&#8217;s <strong>Hanguang 800</strong> can allegedly speed up computing tasks by 12x in crucial tasks for e-commerce (product search, translation, recommendations, etc.). Eventually, Alibaba can lease out their cloud compute and Hanguang 800 capacity to other companies, a l&#225; AWS.</p><h4><a href="https://www.cnbc.com/2021/02/10/baidu-in-talks-to-raise-money-for-a-standalone-ai-chip-company-.html">Baidu in Talks to Raise Money for a Standalone AI Chip Company (CNBC)</a></h4><p>In the face of global semiconductor tariffs, Chinese tech enterprises have realized the need to invest in their own semiconductor capacity. GGV Capital and IDG Capital are advising. These industries don&#8217;t just pop up overnight, but compute (and the resources to produce it) is becoming more valuable than oil. Keep an eye out for pundits quantifying &#8220;compute independence&#8221; on an international basis.</p><h4><a href="https://www.cnbc.com/2021/01/06/tencent-invests-in-chinese-ai-chip-start-up-enflame.html">Tencent Invests in Chinese AI Chip Startup as Part of $279 Million Funding Round (CNBC)</a></h4><p>The startup in question is &#8220;<a href="https://www.enflame-tech.com/">Enflame Technology</a>,&#8221; headquartered in Shanghai. CITIC, CICC Capital, and Primavera are involved.</p><h4><a href="https://www.hpcwire.com/off-the-wire/neureality-emerges-from-stealth-with-8m-seed-for-ai-compute-infrastructure/">NeuReality Emerges From Stealth with $8m Seed for AI Compute Infrastructure</a> (HPCwire)</h4><p>NeuReality is an Israeli semiconductor startup building scalable AI-specific chipset. They recently appointed Naveen Rao to their Board of Directors; Rao founded the first AI chip startup, Nervana Systems, later acquired by Intel.</p><h2>&#127911; Listening</h2><h4><a href="https://open.spotify.com/episode/1UummL6HZJ0SoiP8MADl8t?si=pXwvFNVnTCC2I9kylTD81w">The First AI Chip Startup with Naveen Rao, Nervana Systems</a> (Spotify)</h4><p>Naveen Rao shares firsthand experience with the &#8216;trickle-down problem&#8217; after Nervana Systems was acquired by Intel. Listen in for why the chipmakers of today will most likely not be the AI ASIC champions of tomorrow.</p><h2>&#127909; Watching</h2><h4><a href="https://mitsloan.mit.edu/ideas-made-to-matter/how-can-human-centered-ai-fight-bias-machines-and-people">How Can Human-Centered AI Fight Bias in Machines and People? (MIT Sloan)</a></h4><blockquote><p><strong>Prevailing wisdom assumes that the role of algorithms &#8220;is to correct the biases that humans have&#8230; [t]his follows the assumption that algorithms can simply come in and help us make better decisions &#8211; and I don&#8217;t think that&#8217;s an assumption we should operate under.</strong></p></blockquote><div><hr></div><h1>Thanks for reading!</h1><p>Machine Yearning is a collection of essays and news on the intersection between AI, investing, product, and economics, light on technicals but heavy on relevance. Think of it as a casual chat about AI over coffee (or any other preferred beverage).</p><p>Ryan Cunningham is an AI Product Manager, strategist, and ex-investment banker. Today he leads applied AI strategy and new verticals at <a href="https://www.spiketrap.io">Spiketrap</a>, an NLP-as-a-service company. He spent 4 years at Uber and is currently studying Artificial Intelligence part-time at Stanford, with a BS in Finance and Economics from Georgetown University.</p><p>Any suggestions or items you want to see? Shoot me an email at <a href="mailto:rydcunningham@gmail.com">rydcunningham@gmail.com</a> or hit me up on <a href="https://www.twitter.com/rydcunningham">Twitter</a> / <a href="https://www.linkedin.com/in/rydcunningham">LinkedIn</a> / Clubhouse&nbsp;@rydcunningham.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.machineyearning.io/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[S-Curves and Hype Trains | Machine Yearning 000]]></title><description><![CDATA[When Expectations Meet Reality]]></description><link>https://www.machineyearning.io/p/machine-yearning-0-s-curves-and-hype</link><guid isPermaLink="false">https://www.machineyearning.io/p/machine-yearning-0-s-curves-and-hype</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Mon, 08 Feb 2021 19:28:24 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!_tfZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong><code>Welcome!</code> </strong>Machine Yearning is a semi-regular update on high and low-profile developments in artificial intelligence, focusing on actionable tips and stories for executives, investors, and project managers.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_tfZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_tfZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif 424w, https://substackcdn.com/image/fetch/$s_!_tfZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif 848w, https://substackcdn.com/image/fetch/$s_!_tfZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!_tfZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_tfZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif" width="268" height="268" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/b64b9428-023a-49bc-91db-3858f4988996_400x400.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:400,&quot;resizeWidth&quot;:268,&quot;bytes&quot;:2329454,&quot;alt&quot;:&quot;robot-waving&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="robot-waving" title="robot-waving" srcset="https://substackcdn.com/image/fetch/$s_!_tfZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif 424w, https://substackcdn.com/image/fetch/$s_!_tfZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif 848w, https://substackcdn.com/image/fetch/$s_!_tfZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!_tfZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fb64b9428-023a-49bc-91db-3858f4988996_400x400.gif 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Welcome!</figcaption></figure></div><p>I kicked off this project as an extension of a time-honored tradition where every Thanksgiving, a family member asks me &#8220;what is AI? When are the machines coming to get us?&#8221;</p><p>AI as a concept can often seem intimidating to folks with non-technical backgrounds, and we tend to fear what we don&#8217;t understand. They&#8217;ll usually fill in the gaps with pumped up expectations and dystopic sci-fi imagery.</p><p>In my work leading applied machine learning projects at institutions big and small, I&#8217;ve run into many versions of the same Thanksgiving problem. Simply put, there&#8217;s not enough people speaking the same language.</p><p>The goal of this newsletter is to bridge that gap by providing relevance, commentary, and deep-dives for the 90% of institutions who have heard about AI but don&#8217;t yet know where to start. It&#8217;s the intersection between AI, business, and product strategy. If this sounds like something you&#8217;re into, read on!</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.machineyearning.io/subscribe?"><span>Subscribe now</span></a></p><h1><strong>&#128300;Long Take: Knowledge Gaps</strong></h1><p>Let&#8217;s get one thing out of the way: machine learning is a toolkit like any other. <strong>What&#8217;s more important than knowing how to use a tool is to know when not to use it.</strong> You don&#8217;t need an advanced degree to know when to use a hammer, and you don&#8217;t need to know how the hammer was designed, either. Just what it&#8217;s for.</p><p>These knowledge gaps manifest in a few different forms:</p><ul><li><p><strong>The expertise problem:</strong> When the quants are too far away from the customer to understand problems in context. In my work, the most common ask I hear from engineers and scientists is to hear the customer&#8217;s perspective more often.</p></li><li><p><strong>The marketing problem:</strong> When client-facing personnel are not well-versed enough in technical concepts to concisely explain the value-add. This is one of the most common problems in fundraising.</p></li><li><p><strong>The trickle-down problem:</strong> When an acquired AI startup doesn&#8217;t effectively grease the wheels of the inertia-laden parent company, stagnates, and the talent is either bought out, returns to academia, or both. Ultimately the acquisition is written off. More on this in a future issue.</p></li></ul><p>These problems are so pervasive that according to a BCG/MIT <a href="https://sloanreview.mit.edu/projects/expanding-ais-impact-with-organizational-learning/?utm_medium=pr&amp;utm_source=release&amp;utm_campaign=ReportBCGAI2020">joint study</a>, <strong>only around 10% of businesses&nbsp;have actually seen &#8220;any significant financial benefits&#8221; from applied AI</strong>. According to the report, the reasons for failure are nearly always preventable, not technical: a lack of alignment, clear goal-setting, and leadership buy-in. The marketing problem is at the center of each of those failure points.</p><p>This doesn&#8217;t stop at the enterprise level. Sometimes, the misunderstanding is so pervasive that it spreads across the entire industry as runaway hype train, derailing otherwise perfectly competent portfolio companies for not living up to sky-high expectations.</p><h2>A Tale of Two Curves</h2><p>To illustrate this, I want to highlight a particularly common case where investors and practitioners disagree on what should be considered reasonable progress.</p><h3><a href="https://research.ark-invest.com/hubfs/1_Download_Files_ARK-Invest/White_Papers/ARK%E2%80%93Invest_BigIdeas_2021.pdf">ARK&#8217;s &#8220;Big Ideas 2021&#8221;</a></h3><p><a href="https://ark-invest.com/articles/">ARK</a> is an asset manager that structures its investment strategies around themes linked to &#8220;disruptive innovation.&#8221; They have a podcast I listen to called <a href="https://open.spotify.com/show/0xOdWuktBQKWCv2mlVjizn?si=1hdOwrxaTSOJBVKxwzY9Fw">FYI - For Your Innovation</a>, and recently released their <a href="https://research.ark-invest.com/hubfs/1_Download_Files_ARK-Invest/White_Papers/ARK%E2%80%93Invest_BigIdeas_2021.pdf">&#8220;Big Ideas 2021&#8221;</a> report, which highlights some of their favorite investment themes for the upcoming year.</p><p>This year, they extensively focus on deep learning (the <em>parfum du jour</em> of machine learning methods) and share high expectations for its economic contributions in the coming years. In particular, they expect deep learning applications to grow from an estimated $2 trillion in contribution to equity market capitalizations (as of 2020) to a <strong>whopping $30 trillion</strong> over the next two decades.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iXsK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iXsK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png 424w, https://substackcdn.com/image/fetch/$s_!iXsK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png 848w, https://substackcdn.com/image/fetch/$s_!iXsK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png 1272w, https://substackcdn.com/image/fetch/$s_!iXsK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iXsK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png" width="727" height="523" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/a58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:523,&quot;width&quot;:727,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:48797,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!iXsK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png 424w, https://substackcdn.com/image/fetch/$s_!iXsK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png 848w, https://substackcdn.com/image/fetch/$s_!iXsK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png 1272w, https://substackcdn.com/image/fetch/$s_!iXsK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa58c7546-bae2-4006-b8c9-615c9efa9de1_727x523.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: ARK Invest</figcaption></figure></div><p>This massive &#8220;deep learning wave&#8221; is contingent in part on the expected performance enhancements from deep learning applications. Like many others, ARK projects a familiar curve resembling Moore&#8217;s Law.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!x5XN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!x5XN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png 424w, https://substackcdn.com/image/fetch/$s_!x5XN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png 848w, https://substackcdn.com/image/fetch/$s_!x5XN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png 1272w, https://substackcdn.com/image/fetch/$s_!x5XN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!x5XN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png" width="1100" height="438" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:438,&quot;width&quot;:1100,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:85541,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!x5XN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png 424w, https://substackcdn.com/image/fetch/$s_!x5XN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png 848w, https://substackcdn.com/image/fetch/$s_!x5XN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png 1272w, https://substackcdn.com/image/fetch/$s_!x5XN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F83ffe738-9785-4428-a7bf-43671c514c6d_1100x438.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: ARK Invest</figcaption></figure></div><p>Projections like these are a good example of the marketing problem. Yes, some deep learning models are getting <a href="https://www.kdnuggets.com/2021/01/google-trillion-parameter-switch-transformer-model.html">exponentially bigger</a>, and if you were to take this to its logical conclusion, it&#8217;s only a matter of time before deep learning truly eats the world, with economically uncompetitive humans living off scraps. But is size on its own tantamount to performance? Not exactly.</p><h3><a href="https://medium.com/starsky-robotics-blog/the-end-of-starsky-robotics-acb8a6a8a5f5">Starsky Robotics&#8217; Swan Song</a></h3><p>Contrary to the Moore&#8217;s Law expectation, practitioners assert that progress in machine learning applications isn&#8217;t exponential&#8230; it&#8217;s sigmoidal, an S-curve.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!muL6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!muL6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png 424w, https://substackcdn.com/image/fetch/$s_!muL6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png 848w, https://substackcdn.com/image/fetch/$s_!muL6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png 1272w, https://substackcdn.com/image/fetch/$s_!muL6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!muL6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png" width="575" height="349" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:349,&quot;width&quot;:575,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image for post&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image for post" title="Image for post" srcset="https://substackcdn.com/image/fetch/$s_!muL6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png 424w, https://substackcdn.com/image/fetch/$s_!muL6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png 848w, https://substackcdn.com/image/fetch/$s_!muL6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png 1272w, https://substackcdn.com/image/fetch/$s_!muL6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F9a8b5158-024a-4200-a4d7-bde3d117223a_575x349.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: Stefan Seltz-Axmacher</figcaption></figure></div><p>Last year, <a href="https://www.starsky.io/">Starsky Robotics </a>(one of the earliest self-driving truck startups) eventually <a href="https://medium.com/starsky-robotics-blog/the-end-of-starsky-robotics-acb8a6a8a5f5">shut its doors for good</a>. Despite a string of excellent milestones and firsts for the autonomous vehicles (AV) space, Starsky&#8217;s CEO Stefan Seltz-Axmacher credits unmet promises and overinflated expectations as the company&#8217;s kiss of death.</p><p>A common joke in the industry is the first 90% of progress in a machine learning application is almost always pretty achievable&#8230; <a href="https://towardsdatascience.com/how-i-consistently-improve-my-machine-learning-models-from-80-to-over-90-accuracy-6097063e1c9a">it&#8217;s the next 90% that&#8217;s so tough.</a></p><p>All models work well in small, controlled environments. But real-world deployments bring a plethora of edge cases you can&#8217;t always simulate, and depending on the required safety bar, you may not be able to resolve these cases in a window investors or executives are comfortable with.</p><p>The below chart captures this dilemma pretty well. The horizontal lines represent hypothetical human-equivalent performance levels.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d8hV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d8hV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png 424w, https://substackcdn.com/image/fetch/$s_!d8hV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png 848w, https://substackcdn.com/image/fetch/$s_!d8hV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png 1272w, https://substackcdn.com/image/fetch/$s_!d8hV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d8hV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png" width="711" height="476" data-attrs="{&quot;src&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:476,&quot;width&quot;:711,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:24121,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!d8hV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png 424w, https://substackcdn.com/image/fetch/$s_!d8hV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png 848w, https://substackcdn.com/image/fetch/$s_!d8hV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png 1272w, https://substackcdn.com/image/fetch/$s_!d8hV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F74ab6637-bd98-4fa3-8e0d-f2c22c08b8a0_711x476.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Credit: Stefan Seltz-Axmacher</figcaption></figure></div><blockquote><p><strong>If L1 is the line of human equivalence, then leading AV companies merely have to prove safety to be able to deploy&#8230; If L2 is the case, the bigger teams are somewhere from $1&#8211;25b away from solving this problem&#8230; If, however, L3 is the line of human equivalence it&#8217;s unlikely any of the current technology will make that jump.</strong></p><p><strong>Whenever someone says autonomy is 10 years away that&#8217;s almost certainly what their thought is.</strong></p></blockquote><p>The investing community&#8217;s exponential pitch creates a misalignment with operators when the reality of the S-curve comes to light.</p><p>Starsky&#8217;s failure was not one of competence or technical ability. Safety (both real and perceived) is one of the biggest hurdles for AV, and Stefan&#8217;s team invested nearly two years doing nothing but safety engineering. Pursuing that path with realistic milestones and centering on a business model that (though lower than traditional software margins) at least got the wheels turning was a strategy investors found little enthusiasm for.</p><blockquote><p>It took me way too long to realize that VCs would rather a $1b business with a 90% margin than a $5b business with a 50% margin, even if capital requirements and growth were the same. And growth would be the same. The biggest limiter of autonomous deployments isn&#8217;t sales, it&#8217;s safety.</p></blockquote><p>I wholeheartedly recommend reading Stefan&#8217;s <a href="https://medium.com/starsky-robotics-blog/the-end-of-starsky-robotics-acb8a6a8a5f5">excellent blog post</a> in full, and to keep his thoughts in mind anytime you see exponential curves or hear the words &#8220;AI is taking over the world.&#8221; There are lots of low-hanging fruit, but overinflated expectations and runaway hype trains recklessly divert capital and resources from the most feasible and high ROI projects to the sexiest stories that are easier to sell.</p><h2>A Personal Note</h2><p>Early in my career, I made a lot of the same mistakes mentioned above. Coming from a business background rather than engineering, I&#8217;ve hyped the potential of some projects to executive audiences as soon as we determined a clever way to apply machine learning to the problem. This creates unfair expectations on your team and converts what might have been a few small wins into one big promise you might not live up to.</p><p>I&#8217;ve found the most success by appreciating the limitations of our solution, narrowing the scope, and laying out a stepped roadmap for how to eventually get to the Big Idea that executives and investors want. For the 90% of businesses that aren&#8217;t yet finding success with applied AI, that&#8217;s what I recommend: start small.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://machineyearning.substack.com/?utm_source=substack&amp;utm_medium=email&amp;utm_content=share&amp;action=share&quot;,&quot;text&quot;:&quot;Share Machine Yearning&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://machineyearning.substack.com/?utm_source=substack&amp;utm_medium=email&amp;utm_content=share&amp;action=share"><span>Share Machine Yearning</span></a></p><div><hr></div><h1>Bonus Content</h1><p>These are articles, reports, videos, and more worth checking out that didn&#8217;t get a featured mention this issue.</p><h2>&#128218; Reading</h2><h4><a href="https://blogs.unity3d.com/2021/01/18/gametune-introducing-reinforcement-learning-for-optimizing-the-player-lifecycle/">Introducing Reinforcement Learning for Optimizing the Player Lifecycle (Unity)</a></h4><p>Don&#8217;t yet know what reinforcement learning (RL) is and too afraid to ask? Unity&#8217;s case study is a great primer on how the technology is used to maximize customer lifetime value (LTV) by keeping players engaged and their wallets open.</p><h4><a href="https://cset.georgetown.edu/research/the-semiconductor-supply-chain">The Semiconductor Supply Chain: Assessing National Competitiveness</a> (CSET)</h4><p>Required reading for anyone without an industry background to get familiar with the hardware powering machine learning applications (CTRL+F &#8220;AI ASICs&#8221;).</p><h4><a href="https://www.fedscoop.com/national-ai-initiative-office-launched/">National AI Initiative Office Launched by White House (FedScoop)</a></h4><p>The US formalizes a commitment to funding AI research of national interest and importance.</p><h4><a href="https://venturebeat.com/2021/01/26/why-the-oecd-wants-to-calculate-the-ai-compute-needs-of-national-governments/">Why the OECD Wants to Calculate the AI Compute Needs of National Governments (VentureBeat)</a></h4><p>&#8220;Compute&#8221; is a generalized term for computing capacity used to train and deploy machine learning models. <em>&#8220;If we can measure how much compute exists within a country or set of countries, we can quantify one of the factors for the AI capacity of that country.&#8221;</em></p><h4><a href="https://cset.georgetown.edu/research/chinas-sti-operations/">China&#8217;s STI Operations: Monitoring Foreign Science and Technology Through Open Sources (CSET)</a></h4><p>The US relatively neglects open-source intelligence (OSINT) while China uses it as the <em>&#8220;INT of first resort.&#8221;</em> This contrasts extends to science and technology intelligence (STI), used for assessing foreign capabilities in applied research and engineering.</p><h4><a href="https://cset.georgetown.edu/research/the-u-s-ai-workforce/">The U.S. AI Workforce (CSET)</a></h4><p>CSET defines a taxonomy for AI labor supply sizing. Part 1 of a 3 part series.</p><h4><a href="https://www.wired.com/story/chinese-lab-aiming-big-ai-breakthroughs/">This Chinese Lab is Aiming for Big AI Breakthroughs (WIRED)</a></h4><p>Take a tour through one of the newest government-sponsored research labs, the Beijing Academy of Artificial Intelligence. Spoiler alert: they&#8217;re working on a GPT-3 / Switch Transformer competitor.</p><h4><a href="https://www.business-standard.com/article/technology/amazon-opens-alexa-s-advanced-ai-for-firms-to-build-their-own-assistants-121011600245_1.html">Amazon opens Alexa&#8217;s advanced AI for firms to build their own assistants (Business Standard)</a></h4><p>Intelligent assistants in cars, apps, real estate, games, and edge devices galore.</p><p></p><h2>&#127911; Listening</h2><h4><a href="https://www.bloomberg.com/news/articles/2021-01-28/dan-wang-on-china-s-mission-to-be-a-world-leader-in-semiconductors?sref=6ZE6q2XR">Dan Wang on China&#8217;s Mission to Be a World Leader in Semiconductors (Odd Lots Podcast)</a></h4><p>Dan Wang of Gavekal Dragonomics gives on-the-ground perspective of China&#8217;s efforts to de-risk itself from the global semiconductor supply chain.</p><p></p><h2>&#127909; Watching</h2><h4><a href="https://www.desire.film/watch">People&#8217;s Republic of Desire (Documentary)</a></h4><p>At the intersection of technology and culture, &#8220;A digital fantasy world where culture has been abandoned in favor of commerce.&#8221; Timely for the <a href="https://pandaily.com/kuaishou-shares-jump-194-in-hong-kong-trading-debut/">Kuaishou IPO</a>.</p><div><hr></div><h1>About me</h1><p>I am an AI Product Manager at <a href="https://www.spiketrap.io/">Spiketrap</a>, a seed-stage language processing company where I focus on applied strategy, analytics, and new verticals for our NLP stack. Previously, I spent ~4 years at Uber integrating machine learning techniques into high-growth marketplaces like Uber Eats, Uber Elevate, and JUMP. I started my career in technology investment banking at Credit Suisse, specializing in fintech, blockchain, and Chinese internet companies.</p><p>I&#8217;m studying Artificial Intelligence part-time at Stanford, and graduated from Georgetown University with a BS in Finance and Economics.</p><h1>Thanks for reading!</h1><p>Any suggestions or items you want to see? Contact me on Twitter <a href="https://www.twitter.com/rydcunningham">@rydcunningham</a> or shoot me an email at rydcunningham@gmail.com.</p>]]></content:encoded></item><item><title><![CDATA[Artificially Intelligent.]]></title><description><![CDATA[Welcome to Machine Yearning!]]></description><link>https://www.machineyearning.io/p/coming-soon</link><guid isPermaLink="false">https://www.machineyearning.io/p/coming-soon</guid><dc:creator><![CDATA[Ryan Cunningham]]></dc:creator><pubDate>Wed, 07 Oct 2020 03:50:01 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-RAu!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39397fed-3de8-46df-ab35-4f48dc5edf4e_300x300.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to Machine Yearning!</p><p>Sign up now so you don&#8217;t miss the first issue.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.machineyearning.io/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.machineyearning.io/subscribe?"><span>Subscribe now</span></a></p><p>In the meantime, <a href="https://www.machineyearning.io/p/coming-soon?utm_source=substack&utm_medium=email&utm_content=share&action=share">tell your friends</a>!</p>]]></content:encoded></item></channel></rss>